Graph Analytics: Beyond the small circle of friends
|
|
- Hester Poole
- 7 years ago
- Views:
Transcription
1 Graph Analytics: Beyond the small circle of friends An Ovum white paper for YarcData SUMMARY Catalyst Ovum view Big Data has raised the bar for analytics: organizations need no longer make the tradeoff between depth of analysis and breadth of data ( rich vs. reach ). New platforms and frameworks are making possible the ability to solve new kinds of problems, such as analyzing real world scenarios involving networks of people, places, or things connected by many-to-many relationships that are impacted by events, economics, natural phenomena, or other trends. Graph analytics have emerged to help organizations make sense of these trends, yielding 360-degree views of all the relationships involving an entity and the ability to explore complex patterns in those relationships. Yet early approaches to graph computing have largely been restricted to the intelligence community or social network analysis because their compute-intensive nature has limited them to piecemeal, batch processes. If performance and capacity limitations to graph solutions could be overcome the benefit of graph analytics could be applied to w wider variety of scenarios as capital markets risk management, life sciences research, healthcare delivery, smart infrastructure management, and other areas. Fast Data the velocity component of Big Data is important to unlocking the value of graph analytics. Because most real world scenarios are dynamic, graph computing solutions must be sufficiently powerful to handle the entire problem without breaking it into multiple parts, and must be able to compute the entire problem without partitioning it, and deliver results faster than batch mode because of the changing nature of the scenarios being studied. Speed is also critical for handling the continuous flow of new data, because the world, and all the relationships that exist, are constantly in changing. That in turn makes it essential to perform rapid querying so that the results do not fall out of date. New, optimized system or appliance oriented approaches could help break the graph computing bottleneck, enabling these solutions to address a far wider range of business, infrastructure, and political scenarios. Key messages Big Data and Fast Data are changing expectations around analytics Page 1
2 New analytic approaches, such as graph analysis, are emerging to address new kinds of problems not otherwise possible with conventional data warehousing platforms Graph analytics is a new approach for analyzing the complex networks of many-tomany relationships that pervade the real world Graph analytics is a prime example of the type of analytic problem that benefits from a Fast Data approach New, optimized system or appliance oriented approaches could help break the graph computing bottleneck BIG DATA AND FAST DATA RAISES EXPECTATIONS A new analytics norm is emerging Big Data has raised the bar for analytics: organizations need no longer make the tradeoff between rich and reach. They no longer need to limit reach (sample size) when running rich, sophisticated analytics, and they can now extend analytics to other forms of data beyond the structured data that have populated enterprise data warehouses. Big Data defined by Ovum as embodying the 4 Vs including volume, variety, velocity, and value is now fair game for analytics. By expanding the range of addressable data, Big Data has raised expectations for delivering superior outcomes for analytics aimed at use cases such as improving customer retention, managing public infrastructure, protecting the nation s security, and optimizing healthcare delivery. Big Data has also raised the appetite for Fast Data, which addresses data velocity. While there have always been specialized applications for consuming real-time data feeds, growing bandwidth, memory density, and continuing trends with Moore s Law are in turn feeding expectations for realtime analytic solutions that not only improve performance, but allow greater adaptability and flexibility. New data, new analytic approaches While Big Data has improved the fidelity of analytics with many familiar problems, new platforms and frameworks are clearing the way for solving new sets of problems that were traditionally beyond the capabilities of the largest and fastest conventional data warehousing platforms. There is the need for organizations, not simply to explore data, but gain a better understanding of the environment in which they operate. Typically, the real world is characterized by many-to-many relationships; examples include: People A person has relationships, not just with a single person, but with different groups and things. For instance, a person who buys fertilizer might not be of interest unless he shares a post office box with another person who owns a truck and was recently detected by video cameras loitering around Times Square. Products -- The success of a product depends, not only on the perception of people in its target market, but the introduction of other products, the ability to make delivery to Page 2
3 specific regions, changes in consumer buying power, and news events that cannot always be anticipated. Trading Risk When complex instruments are traded in capital markets, buyers and sellers must manage counterparty risk. In today s fast-paced financial markets requires an approach that assumes that relationships between parties and counterparties are continually changing. Modeling and analyzing the complex, many-to-many relationships that are present in the everyday world proved an extremely difficult problem for existing relational database platforms. At minimum, computing such problems would have required multiple join operations exacting huge overhead on performance. In practice, analytics involving complex webs of relationships was broken up into multiple discrete runs, each of them number crunching portions of the problem, yielding vague, approximate conclusions at best. INTRODUCING GRAPH ANALYTICS The wave of innovation surrounding Big Data provided early answers to this puzzle. A number of open source graph databases have emerged, that are often run to analyze social tribes that populate social networks; arguably, many of them cannot handle the volumes or velocities of Big Data. In effect, these analytics apply the technique of analyzing focus groups to the real world: identify key people in a demographic and identify the groups where they belong and the thought leaders who influence them. Unlike relational databases that represent data in tables that resemble spreadsheets, graph databases depict entities as nodes with unique properties and pointers to related elements that in aggregate form interconnected networks. As such, an entity or node has its own attributes and can have relationships with multiple other entities, and vice versa. A simplified representation of a graph model for a social group or tribe is shown in Figure 1. Yet, the well-known example of mapping social tribes with graph analytics represents only the tip of the iceberg of what s possible. Until now, the challenge with graph analytics that has largely limited it to rudimentary uses such as social networking is that they are extremely computeintensive, or extended use cases such as mapping homeland security threats. With the softwareonly solutions that have been available to date, these computations have had to be processed in batch mode. Furthermore, capacity issues forced the need to partition graph processing to different nodes in a cluster. Page 3
4 Figure 1. Basic graph model for social group YarcData s systems approach to graph Big Data analytics emerged initially with scaled-out clusters containing commodity hardware. Such approaches have been inefficient for graph analytics because they were optimized for sequential file access computing patterns. Graph computing requires, not only the ability to massively parallel processing, but also the ability to pool large, un-partitioned problems over large multiprocessor architectures and shared memory that is linked through high speed interconnects. YarcData adopted a systems-oriented approach to address graph computing bottlenecks. Its solution, urika, includes specialized hardware and software. The hardware divides the workloads between specialized accelerator nodes that are optimized for graph processing, and service nodes that handle core systems housekeeping chores such as appliance management, database management, and visualization. The system, which can accommodate up to 8,192 graph accelerator processors, can run jobs un-partitioned thanks to access to up to 512 TB of shared memory. Because processors and memory have different throughput rates, the system has been optimized to overcome latency through massive multithreaded techniques that expand effective processing bandwidth. On the back end, urika packages a parallel file system that can scale to exabytes. urika is tuned to run a software stack built on industry standards that is compatible with the front end of the open source Apache Jena graph database. Jena was designed to use RDF (Resource Page 4
5 Description Framework), a W3C metadata modeling standard for exchange of data on the web; and SPARQL, a protocol for remotely triggering RDF queries. The back end was optimized for YarcData s accelerator processors. Figure 2. Map of interactions between proteins Source: Institute for Systems Biology, courtesy of YarcData urika is the first appliance specially designed for applying a Fast Data approach to graph processing. As a result, urika opens graph processing to a wider range of potential use cases. Example: Improving healthcare delivery A federal science and technology contractor has been exploring graph analytics for addressing a wide range of problems, such as improving telecommunications network resiliency or accelerating life science discoveries. Recently, the organization has explored the potential of implementing the urika appliance to help optimize the prescription of drugs for evidence-based healthcare delivery. The challenge is that the information must be pieced together from multiple online sources including electronic patient records that list the disease and treatments; Medicare or other databases that track care delivery; databases that describe drugs; and in some cases, separate databases that track clinical trials. The scenario is a classic linked network with interdependencies between each of the nodes. Variables could include patient age and gender, the outcomes with specific drugs (or combinations of medications), specific dosages, stage of disease when administered, and potential drug interactions. Computing such a problem would be practically Page 5
6 impossible with conventional SQL databases because of the need for deeply nested joins, while traditional software-only graph implementations could not deliver the required performance. The organization is looking to urika to connect the dots interactively, so practitioners can make smart, evidence-based prescriptions on the spot. RECOMMENDATIONS FOR ENTERPRISES Author The public emergence of Big Data has raised the bar of expectations for what is possible with analytics. That encompasses the ability to make smart decisions in the real world, where many-tomany relationships are the norm. The surface is only just being scratched regarding the potential of graph analytics to help organizations make smart decisions in a complex world where dynamic, many-to-many relationships are the rule. Most of the analytics to date have been relatively static, requiring extended batch processing runs. Fast Data approaches are critical to expanding the range of graph processing beyond niche tool for social tribal analytics. The possibilities are especially broad as Fast Data approaches to graph processing are emerging to make this form of analytics responsive to a changing world. There will be a learning curve as the technologies and practices are new. But for organizations that are seeking a competitive edge in addressing so-called intractable problems of the real world, now is the time to start learning. Tony Baer, Principal Analyst, Ovum Enterprise Solutions tony.baer@ovum.com Ovum Consulting Disclaimer We hope that this analysis will help you make informed and imaginative business decisions. If you have further requirements, Ovum s consulting team may be able to help you. For more information about Ovum s consulting capabilities, please contact us directly at consulting@ovum.com. All Rights Reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted in any form by any means, electronic, mechanical, photocopying, recording, or otherwise, without the prior permission of the publisher, Ovum (an Informa business). The facts of this report are believed to be correct at the time of publication but cannot be guaranteed. Please note that the findings, conclusions, and recommendations that Ovum delivers will be based on information gathered in good faith from both primary and secondary sources, whose accuracy we are not always in a position to guarantee. As such Ovum can accept no liability whatever for actions taken based on any information that may subsequently prove to be incorrect. Page 6
Cray: Enabling Real-Time Discovery in Big Data
Cray: Enabling Real-Time Discovery in Big Data Discovery is the process of gaining valuable insights into the world around us by recognizing previously unknown relationships between occurrences, objects
More informationComplexity and Scalability in Semantic Graph Analysis Semantic Days 2013
Complexity and Scalability in Semantic Graph Analysis Semantic Days 2013 James Maltby, Ph.D 1 Outline of Presentation Semantic Graph Analytics Database Architectures In-memory Semantic Database Formulation
More informationThe Fusion of Supercomputing and Big Data. Peter Ungaro President & CEO
The Fusion of Supercomputing and Big Data Peter Ungaro President & CEO The Supercomputing Company Supercomputing Big Data Because some great things never change One other thing that hasn t changed. Cray
More informationHow In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time
SCALEOUT SOFTWARE How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time by Dr. William Bain and Dr. Mikhail Sobolev, ScaleOut Software, Inc. 2012 ScaleOut Software, Inc. 12/27/2012 T wenty-first
More informationWhy enterprise data archiving is critical in a changing landscape
Why enterprise data archiving is critical in a changing landscape Ovum white paper for Informatica SUMMARY Catalyst Ovum view The most successful enterprises manage data as strategic asset. They have complete
More informationMake the Most of Big Data to Drive Innovation Through Reseach
White Paper Make the Most of Big Data to Drive Innovation Through Reseach Bob Burwell, NetApp November 2012 WP-7172 Abstract Monumental data growth is a fact of life in research universities. The ability
More informationTooling is starting to tame Hadoop
Tooling is starting to tame Hadoop Reference Code: IT015 001716 Publication Date: 21 Jun 2012 Author: Tony Baer THIS IS A CHAPTER EXTRACT FROM PUBLISHED OVUM RESEARCH. THE FULL REPORT IS AVAILABLE ON THE
More informationHigh Performance Computing and Big Data: The coming wave.
High Performance Computing and Big Data: The coming wave. 1 In science and engineering, in order to compete, you must compute Today, the toughest challenges, and greatest opportunities, require computation
More informationYarcData urika Technical White Paper
YarcData urika Technical White Paper 2012 Cray Inc. All rights reserved. Specifications subject to change without notice. Cray is a registered trademark, YarcData, urika and Threadstorm are trademarks
More informationBig Data Processing: Past, Present and Future
Big Data Processing: Past, Present and Future Orion Gebremedhin National Solutions Director BI & Big Data, Neudesic LLC. VTSP Microsoft Corp. Orion.Gebremedhin@Neudesic.COM B-orgebr@Microsoft.com @OrionGM
More informationMicrosoft SQL Server 2008 R2 Enterprise Edition and Microsoft SharePoint Server 2010
Microsoft SQL Server 2008 R2 Enterprise Edition and Microsoft SharePoint Server 2010 Better Together Writer: Bill Baer, Technical Product Manager, SharePoint Product Group Technical Reviewers: Steve Peschka,
More informationBIG DATA TECHNOLOGY. Hadoop Ecosystem
BIG DATA TECHNOLOGY Hadoop Ecosystem Agenda Background What is Big Data Solution Objective Introduction to Hadoop Hadoop Ecosystem Hybrid EDW Model Predictive Analysis using Hadoop Conclusion What is Big
More informationWindows Embedded Security and Surveillance Solutions
Windows Embedded Security and Surveillance Solutions Windows Embedded 2010 Page 1 Copyright The information contained in this document represents the current view of Microsoft Corporation on the issues
More informationBig Data must become a first class citizen in the enterprise
Big Data must become a first class citizen in the enterprise An Ovum white paper for Cloudera Publication Date: 14 January 2014 Author: Tony Baer SUMMARY Catalyst Ovum view Big Data analytics have caught
More informationW H I T E P A P E R. Deriving Intelligence from Large Data Using Hadoop and Applying Analytics. Abstract
W H I T E P A P E R Deriving Intelligence from Large Data Using Hadoop and Applying Analytics Abstract This white paper is focused on discussing the challenges facing large scale data processing and the
More informationBy Evan Quinn, Senior Principal Analyst. This ESG White Paper was commissioned by YarcData and is distributed under license from ESG.
White Paper Discovering Big Data s Value with Graph Analytics By Evan Quinn, Senior Principal Analyst April 2013 This ESG White Paper was commissioned by YarcData and is distributed under license from
More informationHigh-Volume Data Warehousing in Centerprise. Product Datasheet
High-Volume Data Warehousing in Centerprise Product Datasheet Table of Contents Overview 3 Data Complexity 3 Data Quality 3 Speed and Scalability 3 Centerprise Data Warehouse Features 4 ETL in a Unified
More informationHadoopTM Analytics DDN
DDN Solution Brief Accelerate> HadoopTM Analytics with the SFA Big Data Platform Organizations that need to extract value from all data can leverage the award winning SFA platform to really accelerate
More informationbigdata Managing Scale in Ontological Systems
Managing Scale in Ontological Systems 1 This presentation offers a brief look scale in ontological (semantic) systems, tradeoffs in expressivity and data scale, and both information and systems architectural
More informationBig Data Analytics. with EMC Greenplum and Hadoop. Big Data Analytics. Ofir Manor Pre Sales Technical Architect EMC Greenplum
Big Data Analytics with EMC Greenplum and Hadoop Big Data Analytics with EMC Greenplum and Hadoop Ofir Manor Pre Sales Technical Architect EMC Greenplum 1 Big Data and the Data Warehouse Potential All
More informationVirtual Data Warehouse Appliances
infrastructure (WX 2 and blade server Kognitio provides solutions to business problems that require acquisition, rationalization and analysis of large and/or complex data The Kognitio Technology and Data
More informationBig Data Maximizing the Flow
Technology Insight Paper Big Data Maximizing the Flow By John Webster August 15, 2012 Enabling you to make the best technology decisions Big Data Maximizing the Flow 1 Big Data Maximizing the Flow 2 The
More informationThe Next Wave of Data Management. Is Big Data The New Normal?
The Next Wave of Data Management Is Big Data The New Normal? Table of Contents Introduction 3 Separating Reality and Hype 3 Why Are Firms Making IT Investments In Big Data? 4 Trends In Data Management
More informationThe 4 Pillars of Technosoft s Big Data Practice
beyond possible Big Use End-user applications Big Analytics Visualisation tools Big Analytical tools Big management systems The 4 Pillars of Technosoft s Big Practice Overview Businesses have long managed
More informationConverged, Real-time Analytics Enabling Faster Decision Making and New Business Opportunities
Technology Insight Paper Converged, Real-time Analytics Enabling Faster Decision Making and New Business Opportunities By John Webster February 2015 Enabling you to make the best technology decisions Enabling
More informationORACLE OLAP. Oracle OLAP is embedded in the Oracle Database kernel and runs in the same database process
ORACLE OLAP KEY FEATURES AND BENEFITS FAST ANSWERS TO TOUGH QUESTIONS EASILY KEY FEATURES & BENEFITS World class analytic engine Superior query performance Simple SQL access to advanced analytics Enhanced
More informationHow A V3 Appliance Employs Superior VDI Architecture to Reduce Latency and Increase Performance
How A V3 Appliance Employs Superior VDI Architecture to Reduce Latency and Increase Performance www. ipro-com.com/i t Contents Overview...3 Introduction...3 Understanding Latency...3 Network Latency...3
More informationManaging Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database
Managing Big Data with Hadoop & Vertica A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database Copyright Vertica Systems, Inc. October 2009 Cloudera and Vertica
More informationBig Data and Healthcare Payers WHITE PAPER
Knowledgent White Paper Series Big Data and Healthcare Payers WHITE PAPER Summary With the implementation of the Affordable Care Act, the transition to a more member-centric relationship model, and other
More informationUsing Big Data for Smarter Decision Making. Colin White, BI Research July 2011 Sponsored by IBM
Using Big Data for Smarter Decision Making Colin White, BI Research July 2011 Sponsored by IBM USING BIG DATA FOR SMARTER DECISION MAKING To increase competitiveness, 83% of CIOs have visionary plans that
More informationAssociate Professor, Department of CSE, Shri Vishnu Engineering College for Women, Andhra Pradesh, India 2
Volume 6, Issue 3, March 2016 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Special Issue
More informationUnderstanding traffic flow
White Paper A Real-time Data Hub For Smarter City Applications Intelligent Transportation Innovation for Real-time Traffic Flow Analytics with Dynamic Congestion Management 2 Understanding traffic flow
More informationInternational Journal of Advanced Engineering Research and Applications (IJAERA) ISSN: 2454-2377 Vol. 1, Issue 6, October 2015. Big Data and Hadoop
ISSN: 2454-2377, October 2015 Big Data and Hadoop Simmi Bagga 1 Satinder Kaur 2 1 Assistant Professor, Sant Hira Dass Kanya MahaVidyalaya, Kala Sanghian, Distt Kpt. INDIA E-mail: simmibagga12@gmail.com
More informationAn Oracle White Paper June 2012. High Performance Connectors for Load and Access of Data from Hadoop to Oracle Database
An Oracle White Paper June 2012 High Performance Connectors for Load and Access of Data from Hadoop to Oracle Database Executive Overview... 1 Introduction... 1 Oracle Loader for Hadoop... 2 Oracle Direct
More informationAffordable, Scalable, Reliable OLTP in a Cloud and Big Data World: IBM DB2 purescale
WHITE PAPER Affordable, Scalable, Reliable OLTP in a Cloud and Big Data World: IBM DB2 purescale Sponsored by: IBM Carl W. Olofson December 2014 IN THIS WHITE PAPER This white paper discusses the concept
More informationThe Modern Data Warehouse: Agile, Automated, Adaptive
The Modern Data Warehouse: Agile, Automated, Adaptive Produced by David Loshin and Abie Reifer from DecisionWorx, LLC in collaboration with The Bloor Group December 2015 Sponsored by: 1 Table of Contents
More informationSQL Server 2012 Performance White Paper
Published: April 2012 Applies to: SQL Server 2012 Copyright The information contained in this document represents the current view of Microsoft Corporation on the issues discussed as of the date of publication.
More informationEMC XtremSF: Delivering Next Generation Storage Performance for SQL Server
White Paper EMC XtremSF: Delivering Next Generation Storage Performance for SQL Server Abstract This white paper addresses the challenges currently facing business executives to store and process the growing
More informationHow To Handle Big Data With A Data Scientist
III Big Data Technologies Today, new technologies make it possible to realize value from Big Data. Big data technologies can replace highly customized, expensive legacy systems with a standard solution
More informationBIG DATA-AS-A-SERVICE
White Paper BIG DATA-AS-A-SERVICE What Big Data is about What service providers can do with Big Data What EMC can do to help EMC Solutions Group Abstract This white paper looks at what service providers
More informationDell One Identity Manager Scalability and Performance
Dell One Identity Manager Scalability and Performance Scale up and out to ensure simple, effective governance for users. Abstract For years, organizations have had to be able to support user communities
More informationThe big data revolution
The big data revolution Friso van Vollenhoven (Xebia) Enterprise NoSQL Recently, there has been a lot of buzz about the NoSQL movement, a collection of related technologies mostly concerned with storing
More informationDell* In-Memory Appliance for Cloudera* Enterprise
Built with Intel Dell* In-Memory Appliance for Cloudera* Enterprise Find out what faster big data analytics can do for your business The need for speed in all things related to big data is an enormous
More informationOracle Engineered Systems and Triple Point Technology
Oracle Engineered Systems and Triple Point Technology SuccESSful volatility management for commodities 2 commodity Xl for Oil Extreme commodity volatility and increased supply chain complexity are the
More informationFast, Low-Overhead Encryption for Apache Hadoop*
Fast, Low-Overhead Encryption for Apache Hadoop* Solution Brief Intel Xeon Processors Intel Advanced Encryption Standard New Instructions (Intel AES-NI) The Intel Distribution for Apache Hadoop* software
More informationInteractive data analytics drive insights
Big data Interactive data analytics drive insights Daniel Davis/Invodo/S&P. Screen images courtesy of Landmark Software and Services By Armando Acosta and Joey Jablonski The Apache Hadoop Big data has
More informationDetecting Anomalous Behavior with the Business Data Lake. Reference Architecture and Enterprise Approaches.
Detecting Anomalous Behavior with the Business Data Lake Reference Architecture and Enterprise Approaches. 2 Detecting Anomalous Behavior with the Business Data Lake Pivotal the way we see it Reference
More informationSix Days in the Network Security Trenches at SC14. A Cray Graph Analytics Case Study
Six Days in the Network Security Trenches at SC14 A Cray Graph Analytics Case Study WP-NetworkSecurity-0315 www.cray.com Table of Contents Introduction... 3 Analytics Mission and Source Data... 3 Analytics
More informationHigh-Performance Business Analytics: SAS and IBM Netezza Data Warehouse Appliances
High-Performance Business Analytics: SAS and IBM Netezza Data Warehouse Appliances Highlights IBM Netezza and SAS together provide appliances and analytic software solutions that help organizations improve
More informationReal-Time Big Data Analytics SAP HANA with the Intel Distribution for Apache Hadoop software
Real-Time Big Data Analytics with the Intel Distribution for Apache Hadoop software Executive Summary is already helping businesses extract value out of Big Data by enabling real-time analysis of diverse
More informationUnderstanding the Value of In-Memory in the IT Landscape
February 2012 Understing the Value of In-Memory in Sponsored by QlikView Contents The Many Faces of In-Memory 1 The Meaning of In-Memory 2 The Data Analysis Value Chain Your Goals 3 Mapping Vendors to
More informationWith DDN Big Data Storage
DDN Solution Brief Accelerate > ISR With DDN Big Data Storage The Way to Capture and Analyze the Growing Amount of Data Created by New Technologies 2012 DataDirect Networks. All Rights Reserved. The Big
More informationOn the Radar: Esri UK
On the Radar: Esri UK Geographic information reveals the determinants of better health Reference Code: IT011 000316 Publication Date: 30 May 2013 Author: Cornelia Wels Maug SUMMARY Catalyst The adoption
More informationHPC & Big Data THE TIME HAS COME FOR A SCALABLE FRAMEWORK
HPC & Big Data THE TIME HAS COME FOR A SCALABLE FRAMEWORK Barry Davis, General Manager, High Performance Fabrics Operation Data Center Group, Intel Corporation Legal Disclaimer Today s presentations contain
More informationHow To Make Data Streaming A Real Time Intelligence
REAL-TIME OPERATIONAL INTELLIGENCE Competitive advantage from unstructured, high-velocity log and machine Big Data 2 SQLstream: Our s-streaming products unlock the value of high-velocity unstructured log
More informationSQLstream Blaze and Apache Storm A BENCHMARK COMPARISON
SQLstream Blaze and Apache Storm A BENCHMARK COMPARISON 2 The V of Big Data Velocity means both how fast data is being produced and how fast the data must be processed to meet demand. Gartner The emergence
More informationHur hanterar vi utmaningar inom området - Big Data. Jan Östling Enterprise Technologies Intel Corporation, NER
Hur hanterar vi utmaningar inom området - Big Data Jan Östling Enterprise Technologies Intel Corporation, NER Legal Disclaimers All products, computer systems, dates, and figures specified are preliminary
More informationHadoopRDF : A Scalable RDF Data Analysis System
HadoopRDF : A Scalable RDF Data Analysis System Yuan Tian 1, Jinhang DU 1, Haofen Wang 1, Yuan Ni 2, and Yong Yu 1 1 Shanghai Jiao Tong University, Shanghai, China {tian,dujh,whfcarter}@apex.sjtu.edu.cn
More informationOn the Radar: Alation harnesses crowdsourcing and machine learning to speed data access
On the Radar: Alation harnesses crowdsourcing and machine learning to speed data access Summary Catalyst As organizations widen their net and analyze more data sources, it becomes all too easy for business
More informationHadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, 2012. Viswa Sharma Solutions Architect Tata Consultancy Services
Hadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, 2012 Viswa Sharma Solutions Architect Tata Consultancy Services 1 Agenda What is Hadoop Why Hadoop? The Net Generation is here Sizing the
More informationHARDWARE ACCELERATION IN FINANCIAL MARKETS. A step change in speed
HARDWARE ACCELERATION IN FINANCIAL MARKETS A step change in speed NAME OF REPORT SECTION 3 HARDWARE ACCELERATION IN FINANCIAL MARKETS A step change in speed Faster is more profitable in the front office
More information2013 ICT Enterprise Insights in the Life Sciences Industry
2013 ICT Enterprise Insights in the Life Sciences Industry Key findings from the 2013 survey results Reference Code: IT010-000185 Publication Date: 03 Oct 2013 Author: Andrew Brosnan SUMMARY Catalyst The
More informationDriving IBM BigInsights Performance Over GPFS Using InfiniBand+RDMA
WHITE PAPER April 2014 Driving IBM BigInsights Performance Over GPFS Using InfiniBand+RDMA Executive Summary...1 Background...2 File Systems Architecture...2 Network Architecture...3 IBM BigInsights...5
More informationOn the Radar: Tamr. Applying machine learning to integrating Big Data. Publication Date: Sept. 2014 Product code: IT0014-002934.
Applying machine learning to integrating Big Data Publication Date: Sept. 2014 Product code: IT0014-002934 Tony Baer Summary Catalyst Traditional data integration approaches may not scale for Big Data.
More informationOn the Radar: Tessella
On the Radar: Tessella Creating an archive for the long-term preservation of digital content Reference Code: IT014-002789 Publication Date: 04 Sep 2013 Author: Sue Clarke SUMMARY Catalyst Ensuring that
More informationCHAPTER - 5 CONCLUSIONS / IMP. FINDINGS
CHAPTER - 5 CONCLUSIONS / IMP. FINDINGS In today's scenario data warehouse plays a crucial role in order to perform important operations. Different indexing techniques has been used and analyzed using
More informationScaling Objectivity Database Performance with Panasas Scale-Out NAS Storage
White Paper Scaling Objectivity Database Performance with Panasas Scale-Out NAS Storage A Benchmark Report August 211 Background Objectivity/DB uses a powerful distributed processing architecture to manage
More informationColumnstore Indexes for Fast Data Warehouse Query Processing in SQL Server 11.0
SQL Server Technical Article Columnstore Indexes for Fast Data Warehouse Query Processing in SQL Server 11.0 Writer: Eric N. Hanson Technical Reviewer: Susan Price Published: November 2010 Applies to:
More informationPowering Cutting Edge Research in Life Sciences with High Performance Computing
A Point of View Powering Cutting Edge Research in Life Sciences with High Performance Computing High performance computing (HPC) is the foundation of pioneering research in life sciences. HPC plays a vital
More informationPut the R Back in CRM with a Customer Experience Platform
Put the R Back in CRM with a Customer Experience Platform An Ovum White Paper Sponsored by Publication Date: October 2015 Introduction In today s highly competitive market, businesses need to have their
More informationHadoop: Embracing future hardware
Hadoop: Embracing future hardware Suresh Srinivas @suresh_m_s Page 1 About Me Architect & Founder at Hortonworks Long time Apache Hadoop committer and PMC member Designed and developed many key Hadoop
More informationSpecializing in visualizing and analyzing clinical trials data
ON THE RADAR Comprehend Systems Specializing in visualizing and analyzing clinical trials data Reference Code: OI00193-012 Publication Date: February 2012 Author: Andrew Brosnan and Cornelia Wels-Maug
More informationBig Data: Are You Ready? Kevin Lancaster
Big Data: Are You Ready? Kevin Lancaster Director, Engineered Systems Oracle Europe, Middle East & Africa 1 A Data Explosion... Traditional Data Sources Billing engines Custom developed New, Non-Traditional
More informationBig Data With Hadoop
With Saurabh Singh singh.903@osu.edu The Ohio State University February 11, 2016 Overview 1 2 3 Requirements Ecosystem Resilient Distributed Datasets (RDDs) Example Code vs Mapreduce 4 5 Source: [Tutorials
More informationCOULD VS. SHOULD: BALANCING BIG DATA AND ANALYTICS TECHNOLOGY WITH PRACTICAL OUTCOMES
COULD VS. SHOULD: BALANCING BIG DATA AND ANALYTICS TECHNOLOGY The business world is abuzz with the potential of data. In fact, most businesses have so much data that it is difficult for them to process
More informationWHITE PAPER. Written by: Michael Azoff. Published Mar, 2015, Ovum
Unlocking systems of record with Web and mobile front-ends CA App Services Orchestrator for creating contemporary APIs Written by: Michael Azoff Published Mar, 2015, Ovum CA App Services Orchestrator WWW.OVUM.COM
More informationInge Os Sales Consulting Manager Oracle Norway
Inge Os Sales Consulting Manager Oracle Norway Agenda Oracle Fusion Middelware Oracle Database 11GR2 Oracle Database Machine Oracle & Sun Agenda Oracle Fusion Middelware Oracle Database 11GR2 Oracle Database
More informationAccelerating High-Speed Networking with Intel I/O Acceleration Technology
White Paper Intel I/O Acceleration Technology Accelerating High-Speed Networking with Intel I/O Acceleration Technology The emergence of multi-gigabit Ethernet allows data centers to adapt to the increasing
More informationSoftware-defined Storage Architecture for Analytics Computing
Software-defined Storage Architecture for Analytics Computing Arati Joshi Performance Engineering Colin Eldridge File System Engineering Carlos Carrero Product Management June 2015 Reference Architecture
More informationHigh Performance Computing OpenStack Options. September 22, 2015
High Performance Computing OpenStack PRESENTATION TITLE GOES HERE Options September 22, 2015 Today s Presenters Glyn Bowden, SNIA Cloud Storage Initiative Board HP Helion Professional Services Alex McDonald,
More informationSWOT Assessment: FireMon Security Manager Suite v7.0
SWOT Assessment: FireMon Security Manager Suite v7.0 Analyzing the strengths, weaknesses, opportunities, and threats Reference Code: IT017-004174 Publication Date: 12 Aug 2013 Author: Andrew Kellett SUMMARY
More informationKey Attributes for Analytics in an IBM i environment
Key Attributes for Analytics in an IBM i environment Companies worldwide invest millions of dollars in operational applications to improve the way they conduct business. While these systems provide significant
More informationServer Consolidation with SQL Server 2008
Server Consolidation with SQL Server 2008 White Paper Published: August 2007 Updated: July 2008 Summary: Microsoft SQL Server 2008 supports multiple options for server consolidation, providing organizations
More informationIII JORNADAS DE DATA MINING
III JORNADAS DE DATA MINING EN EL MARCO DE LA MAESTRÍA EN DATA MINING DE LA UNIVERSIDAD AUSTRAL PRESENTACIÓN TECNOLÓGICA IBM Alan Schcolnik, Cognos Technical Sales Team Leader, IBM Software Group. IAE
More informationUsing In-Memory Computing to Simplify Big Data Analytics
SCALEOUT SOFTWARE Using In-Memory Computing to Simplify Big Data Analytics by Dr. William Bain, ScaleOut Software, Inc. 2012 ScaleOut Software, Inc. 12/27/2012 T he big data revolution is upon us, fed
More informationBanking On A Customer-Centric Approach To Data
Banking On A Customer-Centric Approach To Data Putting Content into Context to Enhance Customer Lifetime Value No matter which company they interact with, consumers today have far greater expectations
More informationLeading the way with Information-Led Transformation. Mark Register, Vice President Information Management Software, IBM AP
Leading the way with Information-Led Transformation Mark Register, Vice President Information Management Software, IBM AP 1 Today s Topics Our Smarter Planet and the Information Challenge Accelerating
More informationIBM AND NEXT GENERATION ARCHITECTURE FOR BIG DATA & ANALYTICS!
The Bloor Group IBM AND NEXT GENERATION ARCHITECTURE FOR BIG DATA & ANALYTICS VENDOR PROFILE The IBM Big Data Landscape IBM can legitimately claim to have been involved in Big Data and to have a much broader
More informationRackspace Cloud Databases and Container-based Virtualization
Rackspace Cloud Databases and Container-based Virtualization August 2012 J.R. Arredondo @jrarredondo Page 1 of 6 INTRODUCTION When Rackspace set out to build the Cloud Databases product, we asked many
More informationEinsatzfelder von IBM PureData Systems und Ihre Vorteile.
Einsatzfelder von IBM PureData Systems und Ihre Vorteile demirkaya@de.ibm.com Agenda Information technology challenges PureSystems and PureData introduction PureData for Transactions PureData for Analytics
More informationORACLE BUSINESS INTELLIGENCE SUITE ENTERPRISE EDITION PLUS
Oracle Fusion editions of Oracle's Hyperion performance management products are currently available only on Microsoft Windows server platforms. The following is intended to outline our general product
More informationINCREASING EFFICIENCY WITH EASY AND COMPREHENSIVE STORAGE MANAGEMENT
INCREASING EFFICIENCY WITH EASY AND COMPREHENSIVE STORAGE MANAGEMENT UNPRECEDENTED OBSERVABILITY, COST-SAVING PERFORMANCE ACCELERATION, AND SUPERIOR DATA PROTECTION KEY FEATURES Unprecedented observability
More informationG-Cloud Big Data Suite Powered by Pivotal. December 2014. G-Cloud. service definitions
G-Cloud Big Data Suite Powered by Pivotal December 2014 G-Cloud service definitions TABLE OF CONTENTS Service Overview... 3 Business Need... 6 Our Approach... 7 Service Management... 7 Vendor Accreditations/Awards...
More informationInvestor Presentation. Second Quarter 2015
Investor Presentation Second Quarter 2015 Note to Investors Certain non-gaap financial information regarding operating results may be discussed during this presentation. Reconciliations of the differences
More informationT a c k l i ng Big Data w i th High-Performance
Worldwide Headquarters: 211 North Union Street, Suite 105, Alexandria, VA 22314, USA P.571.296.8060 F.508.988.7881 www.idc-gi.com T a c k l i ng Big Data w i th High-Performance Computing W H I T E P A
More informationIBM's Adoption of Sugar: A Lesson in Global Implementation
IBM's Adoption of Sugar: A Lesson in Global Implementation IBM's agile, collaborative, user-centered approach wins over 45,000 sales people Reference Code: IT020-000022 Publication Date: 24 Apr 2014 Author:
More informationNetApp Big Content Solutions: Agile Infrastructure for Big Data
White Paper NetApp Big Content Solutions: Agile Infrastructure for Big Data Ingo Fuchs, NetApp April 2012 WP-7161 Executive Summary Enterprises are entering a new era of scale, in which the amount of data
More informationInfiniteGraph: The Distributed Graph Database
A Performance and Distributed Performance Benchmark of InfiniteGraph and a Leading Open Source Graph Database Using Synthetic Data Objectivity, Inc. 640 West California Ave. Suite 240 Sunnyvale, CA 94086
More informationBig Data Analytics - Accelerated. stream-horizon.com
Big Data Analytics - Accelerated stream-horizon.com StreamHorizon & Big Data Integrates into your Data Processing Pipeline Seamlessly integrates at any point of your your data processing pipeline Implements
More information