Data Integrity & Scalability The Value of Accuracy. Data Quality in Big Data
|
|
- Leonard Adrian Hardy
- 8 years ago
- Views:
Transcription
1 Data Integrity & Scalability The Value of Accuracy Data Quality in Big Data
2 Data Quality in the news 2
3 And some more examples... 3
4 High Quality Information as competitive differentiator Business today... Online, Instantaneous and Connected Critical customer, supplier & portfolio information will change in the blink of an eye Global markets Highly competitive Regulatory compliance High quality information is business critical For industries banking, insurance, logistics, retail, healthcare, telco, government etc. Auctioning, marketplaces, mobile apps, global markets, benchmarking, social media. Copyright 2012, Information Builders. Slide 4
5 Effective Use of HQ Data is Key Competitive Differentiation in a Changing Environment Improve the Top Line Manage Risk and Regulatory Compliance Increase Operational Efficiency Customer insight to improve organic growth. Single view of the customer to provide a seamless and relevant cross channel channel experience. Put trusted information in the hands of frontline employees to improve productivity and keep service levels high Accurateinformation for riskmanagement optimization. Use information to accurately prevent, identify and mitigate fraud and operational risk. Leverage the business value of information with information governance programs that improve the management of people, process and information technology. Deliver trusted information to improve productivity, implement new processes and offerings and enhance existing process via automation Create trusted enterprise insight and analytics to drive business optimization and improve revenue, productivity, it operations, and financial i results. Optimize and truncate content centric process to reduce costs and improve customer experience. Copyright 2012, Information Builders. Slide 5
6 High Quality Data Facts about things in a context Copyright 2012, Information Builders. Slide 6
7 What does data growth look at? Copyright 2012, Information Builders. Slide 7 7
8 Enterprise Information Management Master Data Governance UpStream, InStream, Downstream Copyright 2012, Information Builders. Slide 8
9 Copyright 2012, Information Builders. Slide 9
10 What is Big Data?
11 Setting the stage Big Data: A massive volume of both structured and unstructured data that is so large that it s difficult to process with traditional database and software techniques Big data technologies describe a new generation of technologies and architectures, designed to economically extractvalue from very large volumes of a wide variety of data, by enabling high velocity capture, discovery, and/or analysis Data is a new class of economic assets, like currency and gold Source: World Economic Forum 2012
12 The numbers Walmart handles more than 1 million customer transactions every hour. Facebook handles 30 billion pieces of content shared a month Decoding the human genome originally took 10 years to process; now it can be achieved in one week. At CERN the 27 kilometer long Large Hadron Collider (LHC) creates 1 petabyte of data per second. 15 petabytes per year is available for analytics. Since you are probably not investigating the origin of the universe 12
13 Social 2012, the # s Facebook 900+ million April 2012 United States Qzone 536 million December 2011 China Windows Live 330+ million June 2009 United States Tencent Weibo 310 million December 2011 China Sina Weibo 300+ million February 2012 China Habbo 230 million September 2011 Finland Google million April 2012 United States Vkontakte 290+ million March 2012 Russia Badoo 151+ million April 2012 United States Skype 145 million September 2010 United States Twitter 140+ million March 2012 United States Bebo 117 million July 2010 United States LinkedIn 100+ million March 2011 United States 13
14 Where does your data come from? Online (Web) Applications To run applications (content, video, blog, posts, transactions) To give the data context (friends, social media, collaborative working) The keep applications running (logs, metrics) Regulatory compliance Data Governance initiatives Risk Management Electronic Archiving of Information ProductLifecycle Management and Catalogues Mobile Community Expanding Markets 14
15 External Big data Open Data (openbare data collecties) Overheid maar ook KNMI bv. Vraag antwoord combinaties Verkiezingskaart Scholen CBS Statline Social Media (feeds, likes, tweets, profiles) News, Press, Images, Video External Big Data is a new type of source for analytical purposes. p It does not mean it s your Big Data problem. It might be an integration or integrity problem however. 15
16 Why Big Data is important?
17 Key Drivers for Big Data management Social Network and Relationship Analysis Cost Reduction Fraud Detection and Prevention Digital Marketing Optimization Data Exploration and Discovery Single Views (Quality of Service) Responsiveness and User Experience Flexibility and Scalability supporting growth Data Retention Archiving Compliance 17
18 Potential Value of Big Data Source: McKinsey Global May
19 What is the impact of Big Data to your IT organisation?
20 Big Data spans four dimensions Gigabytes, Terrabytes, Petabytes Volume Velocity Realtime Capture and Realtime Analytics Big Data Variety Unstructured data like Images, Documents, Structured data like files, tables, messages Quality Integrity of the data
21 Two types of Big Data solutions For Analytical purposes Columnar Oriented and In Memory database technology Appliances Schema and Data model based architecture Allow huge amounts of data to be joined, aggregated and queried Super fast response times Limited it DBA requirement Information Builders Hyperstage For Storage purposes Hadoop Distributed File System (Open Source) started by Yahoo and Google Hd Hadoop Map Rd Reduce Distributed ib t computation tti Changing schema support Document Oriented Horizontal Scalability Cloud proof Hive SQL layer Information Builders Hadoop Adapter 21
22 Everybody is making decisions BI Com mmunity Developers Analysts & Power Users 20 % Business Users 80 % Partners Customers And Beyond 22
23 Big Data Analytics at our Customers Automotive data for Product Sales purpose Time To Market: New products Total Cost of Ownership: Development Maintenance User Experience: Performance and Mobile Police of Richmond, Virginia Quality of Service : #5 to #95 (out of 354 cities) Innovation: Predict Crime based on historical data Performance Management: Strategic Insight Banking data for Credit Card dscoring Innovation: new Internet Banking Solutions (SaaS) Scalability: 1 Million users and growing Revenue: Boost usage of online channels Marketing data to improve Campaign Management Increased Sales: Upsell and Cross sell Customer Retention: Single View 23
24 Big Data Technology for Analytics
25 IT Manager s try to mitigate these response times.. How Performance Issues are Typically Addressed by Pace of Data Growth Tune or upgrade existing databases 66% 75% Upgrade server hardware/processors 54% 70% Upgrade/expand storage systems 33% 60% Archive older data on other systems Upgrade networking infrastructure 21% 30% 32% 44% High Growth 4% Low Growth Don't Know / Unsure 7% 0% 20% 40% 60% 80% 100% When organizations have long running queries that limit the business, the response is often to spend much more time and money to resolve the problem Source: KEEPING UP WITH EVER-EXPANDING ENTERPRISE DATA ( Joseph McKendrick Unisphere Research October 2010)
26 Data Warehousing Challenges More Data, More Data Sources Real time data Multiple databases External Sources Limited Resources and Budget More Kinds of Output Needed by More Users, More Quickly Traditional Data Warehousing Labor intensive, heavy indexing, aggregations and partitioning Hardware intensive: massive storage; big servers Expensive and complex
27 Pivoting Your Perspective: Columnar Technology.
28 The Limitation of Rows These Solutions Contribute to Operational Limitations 1. Impediments to business agility: Organizations often must wait for DBAs to create indexes or other tuning structures, thereby delaying access to data. In addition, indexes significantly slow data loading operations and increase the size of the database, sometimes by a factor of 2x. 2. Loss of data and time fidelity: IT generally performs ETL operations in batch mode during non business hours. Such transformations delay access to data and often result in mismatches between operational and analytic databases. 3. Limited ad hoc capability: Response times for ad hoc queries increase as the volume of data dt grows. Unanticipated i t queries (where DBAs have not tuned dthe database in advance) can result in unacceptable response times, and may even fail to complete. 4. Unnecessary expenditures: Attempts to improve performance using hardware acceleration and database tuning schemes raise the capital costs of equipment and the operational costs of database administration. Further, the added complexity of managing a large database diverts operational budgets away from more urgent IT projects.
29 The Limitation of Rows The Ubiquity of Rows 30 columns Row-based databases are ubiquitous because so many of our most important business systems are transactional. 50 millions Rows Row-oriented databases are well suited for transactional environments, such as a call center where a customer s entire record is required when their profile is retrieved and/or when fields are frequently updated. But - Disk I/O becomes a substantial limiting factor since a row-oriented design forces the database to retrieve all column data for any query.
30 Pivoting Your Perspective: Columnar Technology Employee Id Name Location Sales 1 Smith New York 50,000 2 Jones New York 65,000 3 Fraser Boston 40,000 4 Fraser Boston 70,000 Row Oriented (1, Smith, New York, 50000; 2, Jones, New York, 65000; 3, Fraser, Boston, 40000; 4, Fraser, Boston, 70000) Column Oriented Works well if all the columns are needed for every query. Efficient for transactional processing if all the data for the row is available (1, 2, 3, 4; Smith, Jones, Fraser, Fraser; New York, New York, Boston, Boston, 50000, 65000, 40000, 70000) Works well with aggregate results (sum, count, avg. ) Only columns that are relevant need to be touched Consistent performance with any database design Allows for very efficient compression
31 Introducing WebFOCUS Hyperstage
32 Introducing WebFOCUS Hyperstage. How is it architected? Hyperstage combines a columnar database with intelligence we call the Knowledge Grid to deliver fast query responses. Bulk Loader Hyperstage Engine Knowledge Grid Compressor Unmatched Administrative Simplicity No Indexes No data partitioning No Manual tuning
33 Introducing WebFOCUS Hyperstage. What does this mean for Customers? Self managing: 90% less administrative effort Low cost: More than 50% less than alternative solutions Scalable, high performance: Up to 50 TB using a single industry standard server, no appliance required Fast queries: Ad hoc queries are as fast as anticipated queries, so users have total flexibility Compression: Data compression of 10:1 to 40:1 that means a lot less storage is needed, it might mean you can get the entire database in memory!
34 WebFOCUS Hyperstage Engine How does it work? Column Orientation Knowledge Grid statistics and metadata describing the super-compressed data Data Packs data stored in manageably sized, highly compressed data packs Smarter Architecture No maintenance No query planning No partition schemes No DBA Data compressed using algorithms tailored to data type
35 Performance Benchmarks WebFOCUS Hyperstage SQL Server Report Query 1 Query 2
36 Compression Ratio Datastore Size SQL Server WebFOCUS Hyperstage
37 WebFOCUS Hyperstage Example: Query and Knowledge Grid SELECT count(*) FROM employees WHERE salary > AND age < 65 AND job = Shipping AND city = Toronto ; salary age job city All values match Completely Irrelevant Suspect
38 WebFOCUS Hyperstage Example: salary > SELECT count(*) FROM employees WHERE salary > AND age < 65 AND job = Shipping AND city = Toronto ; salary age job city 1. Find the Data Packs with salary > All values match Completely Irrelevant
39 WebFOCUS Hyperstage Example: age < 65 SELECT count(*) FROM employees WHERE salary > AND age < 65 AND job = Shipping AND city = Toronto ; salary age job city 1. Find the Data Packs with salary > Find the Data Packs that contain age < 65 All values match Completely Irrelevant Suspect
40 WebFOCUS Hyperstage Example: job = shipping SELECT count(*) FROM employees WHERE salary > AND age < 65 AND job = Shipping AND city = Toronto ; salary age job city 1. Find the Data Packs with salary > Find the Data Packs that contain age < Find the Data Packs that have job = shipping All values match Completely Irrelevant Suspect
41 WebFOCUS Hyperstage Example: city = Toronto SELECT count(*) FROM employees WHERE salary > AND age < 65 AND job = Shipping AND city = Toronto ; salary age job city 1. Find the Data Packs with salary > Find the Data Packs that contain age < Find the Data Packs that have job = shipping 4. Find the Data Packs that have city = Toronto All values match Completely Irrelevant Suspect
42 WebFOCUS Hyperstage Example: Eliminate Pack Rows SELECT count(*) FROM employees WHERE salary > AND age < 65 AND job = Shipping AND city = Toronto ; 1. Find the Data Packs with salary > Find the Data Packs that contain age < Find the Data Packs that have job = shipping 4. Find the Data Packs that have city = Toronto 5. Eliminate All rows that have been flagged as irrelevant salary age job city All packs ignored All packs ignored All packs ignored All values match Completely Irrelevant Suspect
43 WebFOCUS Hyperstage Example: Decompress and scan SELECT count(*) FROM employees WHERE salary > AND age < 65 AND job = Shipping AND city = Toronto ; 1. Find the Data Packs with salary > Find the Data Packs that contain age < Find the Data Packs that have job = shipping 4. Find the Data Packs that have city = Toronto 5. Eliminate All rows that have been flagged as irrelevant 6. Finally we identify the pack that needs to be decompressed salary age job city Only this pack will be de-compressed All packs ignored All packs ignored All packs ignored All values match Completely Irrelevant Suspect
44 Q&A
Big Data & the LAMP Stack: How to Boost Performance
Big Data & the LAMP Stack: How to Boost Performance Jeff Kibler, Infobright Community Manager Kevin Schroeder, Zend Technologies, Technology Evangelist Agenda The Machine-Generated Data Problem Benchmark:
More informationBreakthrough BI: Mobile. Big Data. Social. Predictive. Cloud. Peter Walker Country Manager UK & Ireland
Breakthrough BI: Mobile. Big Data. Social. Predictive. Cloud. Peter Walker Country Manager UK & Ireland 1 Agenda Information Builders at a Glance Focused Software and Services Business Intelligence & Analytics
More informationHow to make BIG DATA work for you. Faster results with Microsoft SQL Server PDW
How to make BIG DATA work for you. Faster results with Microsoft SQL Server PDW Roger Breu PDW Solution Specialist Microsoft Western Europe Marcus Gullberg PDW Partner Account Manager Microsoft Sweden
More informationAGENDA. What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story. Our BIG DATA Roadmap. Hadoop PDW
AGENDA What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story Hadoop PDW Our BIG DATA Roadmap BIG DATA? Volume 59% growth in annual WW information 1.2M Zetabytes (10 21 bytes) this
More informationHadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, 2012. Viswa Sharma Solutions Architect Tata Consultancy Services
Hadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, 2012 Viswa Sharma Solutions Architect Tata Consultancy Services 1 Agenda What is Hadoop Why Hadoop? The Net Generation is here Sizing the
More informationEnterprise Edition Analytic Data Warehouse Technology White Paper
Enterprise Edition Analytic Data Warehouse Technology White Paper August 2008 Infobright 47 Colborne Lane, Suite 403 Toronto, Ontario M5E 1P8 Canada www.infobright.com info@infobright.com Table of Contents
More informationThe Future of Data Management
The Future of Data Management with Hadoop and the Enterprise Data Hub Amr Awadallah (@awadallah) Cofounder and CTO Cloudera Snapshot Founded 2008, by former employees of Employees Today ~ 800 World Class
More informationQLIKVIEW DEPLOYMENT FOR BIG DATA ANALYTICS AT KING.COM
QLIKVIEW DEPLOYMENT FOR BIG DATA ANALYTICS AT KING.COM QlikView Technical Case Study Series Big Data June 2012 qlikview.com Introduction This QlikView technical case study focuses on the QlikView deployment
More informationWhat happens when Big Data and Master Data come together?
What happens when Big Data and Master Data come together? Jeremy Pritchard Master Data Management fgdd 1 What is Master Data? Master data is data that is shared by multiple computer systems. The Information
More informationSQL Server 2012 Performance White Paper
Published: April 2012 Applies to: SQL Server 2012 Copyright The information contained in this document represents the current view of Microsoft Corporation on the issues discussed as of the date of publication.
More informationUnderstanding the Value of In-Memory in the IT Landscape
February 2012 Understing the Value of In-Memory in Sponsored by QlikView Contents The Many Faces of In-Memory 1 The Meaning of In-Memory 2 The Data Analysis Value Chain Your Goals 3 Mapping Vendors to
More informationHow To Handle Big Data With A Data Scientist
III Big Data Technologies Today, new technologies make it possible to realize value from Big Data. Big data technologies can replace highly customized, expensive legacy systems with a standard solution
More informationHigh-Performance Analytics
High-Performance Analytics David Pope January 2012 Principal Solutions Architect High Performance Analytics Practice Saturday, April 21, 2012 Agenda Who Is SAS / SAS Technology Evolution Current Trends
More informationBig Data Analytics: Today's Gold Rush November 20, 2013
Copyright 2013 Vivit Worldwide Big Data Analytics: Today's Gold Rush November 20, 2013 Brought to you by Copyright 2013 Vivit Worldwide Hosted by Bernard Szymczak Vivit Leader Ohio Chapter TQA SIG Copyright
More informationNews and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren
News and trends in Data Warehouse Automation, Big Data and BI Johan Hendrickx & Dirk Vermeiren Extreme Agility from Source to Analysis DWH Appliances & DWH Automation Typical Architecture 3 What Business
More informationA TECHNICAL WHITE PAPER ATTUNITY VISIBILITY
A TECHNICAL WHITE PAPER ATTUNITY VISIBILITY Analytics for Enterprise Data Warehouse Management and Optimization Executive Summary Successful enterprise data management is an important initiative for growing
More informationBig Data Analytics. with EMC Greenplum and Hadoop. Big Data Analytics. Ofir Manor Pre Sales Technical Architect EMC Greenplum
Big Data Analytics with EMC Greenplum and Hadoop Big Data Analytics with EMC Greenplum and Hadoop Ofir Manor Pre Sales Technical Architect EMC Greenplum 1 Big Data and the Data Warehouse Potential All
More informationParallel Data Warehouse
MICROSOFT S ANALYTICS SOLUTIONS WITH PARALLEL DATA WAREHOUSE Parallel Data Warehouse Stefan Cronjaeger Microsoft May 2013 AGENDA PDW overview Columnstore and Big Data Business Intellignece Project Ability
More informationWe are Big Data A Sonian Whitepaper
EXECUTIVE SUMMARY Big Data is not an uncommon term in the technology industry anymore. It s of big interest to many leading IT providers and archiving companies. But what is Big Data? While many have formed
More informationA financial software company
A financial software company Projecting USD10 million revenue lift with the IBM Netezza data warehouse appliance Overview The need A financial software company sought to analyze customer engagements to
More informationBusiness Analytics In a Big Data World Ted Malone Solutions Architect Data Platform and Cloud Microsoft Federal
Business Analytics In a Big Data World Ted Malone Solutions Architect Data Platform and Cloud Microsoft Federal Information has gone from scarce to super-abundant. That brings huge new benefits. The Economist
More informationIntroducing Oracle Exalytics In-Memory Machine
Introducing Oracle Exalytics In-Memory Machine Jon Ainsworth Director of Business Development Oracle EMEA Business Analytics 1 Copyright 2011, Oracle and/or its affiliates. All rights Agenda Topics Oracle
More informationHur hanterar vi utmaningar inom området - Big Data. Jan Östling Enterprise Technologies Intel Corporation, NER
Hur hanterar vi utmaningar inom området - Big Data Jan Östling Enterprise Technologies Intel Corporation, NER Legal Disclaimers All products, computer systems, dates, and figures specified are preliminary
More informationBusin i ess I n I t n e t ll l i l g i e g nce c T r T e r nds For 2013
Business Intelligence Trends For 2013 10 Trends The last few years the change in Business Intelligence seems to accelerate under the pressure of increased business demand and technology innovations. Here
More informationBIG DATA TRENDS AND TECHNOLOGIES
BIG DATA TRENDS AND TECHNOLOGIES THE WORLD OF DATA IS CHANGING Cloud WHAT IS BIG DATA? Big data are datasets that grow so large that they become awkward to work with using onhand database management tools.
More informationAre You Ready for Big Data?
Are You Ready for Big Data? Jim Gallo National Director, Business Analytics February 11, 2013 Agenda What is Big Data? How do you leverage Big Data in your company? How do you prepare for a Big Data initiative?
More informationTraditional BI vs. Business Data Lake A comparison
Traditional BI vs. Business Data Lake A comparison The need for new thinking around data storage and analysis Traditional Business Intelligence (BI) systems provide various levels and kinds of analyses
More informationW H I T E P A P E R. Deriving Intelligence from Large Data Using Hadoop and Applying Analytics. Abstract
W H I T E P A P E R Deriving Intelligence from Large Data Using Hadoop and Applying Analytics Abstract This white paper is focused on discussing the challenges facing large scale data processing and the
More informationExadata in the Retail Sector
Exadata in the Retail Sector Jon Mead Managing Director - Rittman Mead Consulting Agenda Introduction Business Problem Approach Design Considerations Observations Wins Summary Q&A What it is not... Introductions
More informationGaining the Performance Edge Using a Column-Oriented Database Management System
Analytics in the Federal Government White paper series on how to achieve efficiency, responsiveness and transparency. Gaining the Performance Edge Using a Column-Oriented Database Management System by
More informationSAP HANA PLATFORM Top Ten Questions for Choosing In-Memory Databases. Start Here
PLATFORM Top Ten Questions for Choosing In-Memory Databases Start Here PLATFORM Top Ten Questions for Choosing In-Memory Databases. Are my applications accelerated without manual intervention and tuning?.
More informationAn Integrated Analytics & Big Data Infrastructure September 21, 2012 Robert Stackowiak, Vice President Data Systems Architecture Oracle Enterprise
An Integrated Analytics & Big Data Infrastructure September 21, 2012 Robert Stackowiak, Vice President Data Systems Architecture Oracle Enterprise Solutions Group The following is intended to outline our
More informationThe 3 questions to ask yourself about BIG DATA
The 3 questions to ask yourself about BIG DATA Do you have a big data problem? Companies looking to tackle big data problems are embarking on a journey that is full of hype, buzz, confusion, and misinformation.
More informationAdvanced Analytics. The Way Forward for Businesses. Dr. Sujatha R Upadhyaya
Advanced Analytics The Way Forward for Businesses Dr. Sujatha R Upadhyaya Nov 2009 Advanced Analytics Adding Value to Every Business In this tough and competitive market, businesses are fighting to gain
More informationOracle Database - Engineered for Innovation. Sedat Zencirci Teknoloji Satış Danışmanlığı Direktörü Türkiye ve Orta Asya
Oracle Database - Engineered for Innovation Sedat Zencirci Teknoloji Satış Danışmanlığı Direktörü Türkiye ve Orta Asya Oracle Database 11g Release 2 Shipping since September 2009 11.2.0.3 Patch Set now
More informationConverged, Real-time Analytics Enabling Faster Decision Making and New Business Opportunities
Technology Insight Paper Converged, Real-time Analytics Enabling Faster Decision Making and New Business Opportunities By John Webster February 2015 Enabling you to make the best technology decisions Enabling
More informationBig Data Analytics Platform @ Nokia
Big Data Analytics Platform @ Nokia 1 Selecting the Right Tool for the Right Workload Yekesa Kosuru Nokia Location & Commerce Strata + Hadoop World NY - Oct 25, 2012 Agenda Big Data Analytics Platform
More informationwww.pwc.com/oracle Next presentation starting soon Business Analytics using Big Data to gain competitive advantage
www.pwc.com/oracle Next presentation starting soon Business Analytics using Big Data to gain competitive advantage If every image made and every word written from the earliest stirring of civilization
More informationInnovative technology for big data analytics
Technical white paper Innovative technology for big data analytics The HP Vertica Analytics Platform database provides price/performance, scalability, availability, and ease of administration Table of
More informationKlarna Tech Talk: Mind the Data! Jeff Pollock InfoSphere Information Integration & Governance
Klarna Tech Talk: Mind the Data! Jeff Pollock InfoSphere Information Integration & Governance IBM s statements regarding its plans, directions, and intent are subject to change or withdrawal without notice
More informationApache Hadoop in the Enterprise. Dr. Amr Awadallah, CTO/Founder @awadallah, aaa@cloudera.com
Apache Hadoop in the Enterprise Dr. Amr Awadallah, CTO/Founder @awadallah, aaa@cloudera.com Cloudera The Leader in Big Data Management Powered by Apache Hadoop The Leading Open Source Distribution of Apache
More informationAnalytic Applications With PHP and a Columnar Database
AnalyticApplicationsWithPHPandaColumnarDatabase No matter where you look these days, PHP continues to gain strong use both inside and outside of the enterprise. Although developers have many choices when
More informationAre You Ready for Big Data?
Are You Ready for Big Data? Jim Gallo National Director, Business Analytics April 10, 2013 Agenda What is Big Data? How do you leverage Big Data in your company? How do you prepare for a Big Data initiative?
More information5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014
5 Keys to Unlocking the Big Data Analytics Puzzle Anurag Tandon Director, Product Marketing March 26, 2014 1 A Little About Us A global footprint. A proven innovator. A leader in enterprise analytics for
More informationIndustry Impact of Big Data in the Cloud: An IBM Perspective
Industry Impact of Big Data in the Cloud: An IBM Perspective Inhi Cho Suh IBM Software Group, Information Management Vice President, Product Management and Strategy email: inhicho@us.ibm.com twitter: @inhicho
More informationAnalytics in the Cloud. Peter Sirota, GM Elastic MapReduce
Analytics in the Cloud Peter Sirota, GM Elastic MapReduce Data-Driven Decision Making Data is the new raw material for any business on par with capital, people, and labor. What is Big Data? Terabytes of
More informationHow To Create A Business Intelligence (Bi)
Oracle Business Analytics Overview Markus Päivinen Business Analytics Country Leader, Finland May 2014 1 Presentation content What are the requirements for modern BI Trend in Business Analytics Big Data
More informationOPEN MODERN DATA ARCHITECTURE FOR FINANCIAL SERVICES RISK MANAGEMENT
WHITEPAPER OPEN MODERN DATA ARCHITECTURE FOR FINANCIAL SERVICES RISK MANAGEMENT A top-tier global bank s end-of-day risk analysis jobs didn t complete in time for the next start of trading day. To solve
More informationBig Data-Challenges and Opportunities
Big Data-Challenges and Opportunities White paper - August 2014 User Acceptance Tests Test Case Execution Quality Definition Test Design Test Plan Test Case Development Table of Contents Introduction 1
More informationBig Analytics: A Next Generation Roadmap
Big Analytics: A Next Generation Roadmap Cloud Developers Summit & Expo: October 1, 2014 Neil Fox, CTO: SoftServe, Inc. 2014 SoftServe, Inc. Remember Life Before The Web? 1994 Even Revolutions Take Time
More informationUsing Big Data for Smarter Decision Making. Colin White, BI Research July 2011 Sponsored by IBM
Using Big Data for Smarter Decision Making Colin White, BI Research July 2011 Sponsored by IBM USING BIG DATA FOR SMARTER DECISION MAKING To increase competitiveness, 83% of CIOs have visionary plans that
More informationAutonomy Consolidated Archive
Autonomy Consolidated Archive Dennis Wild Director SME, Information Governance and Archiving POWER PROTECT PROMOTE Meaning-Based Governance Files IM Audio Email Social Video SharePoint Archiving = Gain
More informationBusiness Usage Monitoring for Teradata
Managing Big Analytic Data Business Usage Monitoring for Teradata Increasing Operational Efficiency and Reducing Data Management Costs How to Increase Operational Efficiency and Reduce Data Management
More informationJDSU Partners with Infobright to Help the World s Largest Communications Service Providers Ensure the Highest Quality of Service
JDSU Partners with Infobright to Help the World s Largest Communications Service Providers Ensure the Highest Quality of Service Overview JDSU (NASDAQ: JDSU; and TSX: JDU) innovates and markets diverse
More informationAccelerate Business Advantage with Dynamic Warehousing
Accelerate Business Advantage with Dynamic Warehousing Mark McConnell Marketing Executive, Information Management IBM Asia Pacific 2007 IBM Corporation Is Information Technology delivering? Source: IBM
More informationBIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES
BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES Relational vs. Non-Relational Architecture Relational Non-Relational Rational Predictable Traditional Agile Flexible Modern 2 Agenda Big Data
More information2015 Analyst and Advisor Summit. Advanced Data Analytics Dr. Rod Fontecilla Vice President, Application Services, Chief Data Scientist
2015 Analyst and Advisor Summit Advanced Data Analytics Dr. Rod Fontecilla Vice President, Application Services, Chief Data Scientist Agenda Key Facts Offerings and Capabilities Case Studies When to Engage
More informationFaster Business Insights By Enabling Real-Time Data for Business Intelligence (BI) & Analytics A Best Practices White Paper
Faster Business Insights By Enabling Real-Time Data for Business Intelligence (BI) & Analytics A Best Practices Page 1 of 11 Executive Summary In today s intelligent economy, companies must analyze and
More informationHadoop and its Usage at Facebook. Dhruba Borthakur dhruba@apache.org, June 22 rd, 2009
Hadoop and its Usage at Facebook Dhruba Borthakur dhruba@apache.org, June 22 rd, 2009 Who Am I? Hadoop Developer Core contributor since Hadoop s infancy Focussed on Hadoop Distributed File System Facebook
More informationCustomer Insight Appliance. Enabling retailers to understand and serve their customer
Customer Insight Appliance Enabling retailers to understand and serve their customer Customer Insight Appliance Enabling retailers to understand and serve their customer. Technology has empowered today
More informationThe Future of Data Management with Hadoop and the Enterprise Data Hub
The Future of Data Management with Hadoop and the Enterprise Data Hub Amr Awadallah Cofounder & CTO, Cloudera, Inc. Twitter: @awadallah 1 2 Cloudera Snapshot Founded 2008, by former employees of Employees
More informationBig Data Er Big Data bare en døgnflue? Lasse Bache-Mathiesen CTO BIM Norway
Big Data Er Big Data bare en døgnflue? Lasse Bache-Mathiesen CTO BIM Norway Big Data What is all the fuss about? The effective use of Big Data has the potential to transform economies, delivering a new
More informationHigh-Performance Business Analytics: SAS and IBM Netezza Data Warehouse Appliances
High-Performance Business Analytics: SAS and IBM Netezza Data Warehouse Appliances Highlights IBM Netezza and SAS together provide appliances and analytic software solutions that help organizations improve
More informationMachina Research. Where is the value in IoT? IoT data and analytics may have an answer. Emil Berthelsen, Principal Analyst April 28, 2016
Machina Research Where is the value in IoT? IoT data and analytics may have an answer Emil Berthelsen, Principal Analyst April 28, 2016 About Machina Research Machina Research is the world s leading provider
More informationText Analytics Beginner s Guide. Extracting Meaning from Unstructured Data
Text Analytics Beginner s Guide Extracting Meaning from Unstructured Data Contents Text Analytics 3 Use Cases 7 Terms 9 Trends 14 Scenario 15 Resources 24 2 2013 Angoss Software Corporation. All rights
More informationBig Data: Key Concepts The three Vs
Big Data: Key Concepts The three Vs Big data in general has context in three Vs: Sheer quantity of data Speed with which data is produced, processed, and digested Diversity of sources inside and outside.
More informationEMC/Greenplum Driving the Future of Data Warehousing and Analytics
EMC/Greenplum Driving the Future of Data Warehousing and Analytics EMC 2010 Forum Series 1 Greenplum Becomes the Foundation of EMC s Data Computing Division E M C A CQ U I R E S G R E E N P L U M Greenplum,
More informationMicrosoft Big Data. Solution Brief
Microsoft Big Data Solution Brief Contents Introduction... 2 The Microsoft Big Data Solution... 3 Key Benefits... 3 Immersive Insight, Wherever You Are... 3 Connecting with the World s Data... 3 Any Data,
More informationSAP NetWeaver BW Archiving with Nearline Storage (NLS) and Optimized Analytics
SAP NetWeaver BW Archiving with Nearline Storage (NLS) and Optimized Analytics www.dolphin corp.com Copyright 2011 Dolphin, West Chester PA All rights are reserved, including those of duplication, reproduction,
More informationFOR A FEW TERABYTES MORE THE GOOD, THE BAD and THE BIG DATA. Cenk Kiral Senior Director of BI&EPM solutions ECEMEA region
FOR A FEW TERABYTES MORE THE GOOD, THE BAD and THE BIG DATA Cenk Kiral Senior Director of BI&EPM solutions ECEMEA region Big Data Buzz The promise of big data Intelligent Utility - 8/28/11 Big data, analytics
More informationBig Data & Cloud Computing. Faysal Shaarani
Big Data & Cloud Computing Faysal Shaarani Agenda Business Trends in Data What is Big Data? Traditional Computing Vs. Cloud Computing Snowflake Architecture for the Cloud Business Trends in Data Critical
More informationwww.intelligentbusiness.biz mferguson@intelligentbusiness.biz Twitter: @mikeferguson1
Welcome to Today s Web Seminar! March 15, 2011 12:00PM ET Sponsored by: Hosted by: Eric Kavanagh is the host of DM Radio and Information Management's Webcasts. He is a veteran journalist and consultant
More informationIBM DB2 Near-Line Storage Solution for SAP NetWeaver BW
IBM DB2 Near-Line Storage Solution for SAP NetWeaver BW A high-performance solution based on IBM DB2 with BLU Acceleration Highlights Help reduce costs by moving infrequently used to cost-effective systems
More informationIntroduction to the Event Analysis and Retention Dilemma
Introduction to the Event Analysis and Retention Dilemma Introduction Companies today are encountering a number of business imperatives that involve storing, managing and analyzing large volumes of event
More informationDATAOPT SOLUTIONS. What Is Big Data?
DATAOPT SOLUTIONS What Is Big Data? WHAT IS BIG DATA? It s more than just large amounts of data, though that s definitely one component. The more interesting dimension is about the types of data. So Big
More informationBENCHMARKING CLOUD DATABASES CASE STUDY on HBASE, HADOOP and CASSANDRA USING YCSB
BENCHMARKING CLOUD DATABASES CASE STUDY on HBASE, HADOOP and CASSANDRA USING YCSB Planet Size Data!? Gartner s 10 key IT trends for 2012 unstructured data will grow some 80% over the course of the next
More informationGigaSpaces Real-Time Analytics for Big Data
GigaSpaces Real-Time Analytics for Big Data GigaSpaces makes it easy to build and deploy large-scale real-time analytics systems Rapidly increasing use of large-scale and location-aware social media and
More informationMicrosoft SQL Server 2008 R2 Enterprise Edition and Microsoft SharePoint Server 2010
Microsoft SQL Server 2008 R2 Enterprise Edition and Microsoft SharePoint Server 2010 Better Together Writer: Bill Baer, Technical Product Manager, SharePoint Product Group Technical Reviewers: Steve Peschka,
More informationHP Vertica at MIT Sloan Sports Analytics Conference March 1, 2013 Will Cairns, Senior Data Scientist, HP Vertica
HP Vertica at MIT Sloan Sports Analytics Conference March 1, 2013 Will Cairns, Senior Data Scientist, HP Vertica So What s the market s definition of Big Data? Datasets whose volume, velocity, variety
More informationHow To Understand The Benefits Of Big Data
Findings from the research collaboration of IBM Institute for Business Value and Saïd Business School, University of Oxford Analytics: The real-world use of big data How innovative enterprises extract
More informationHow To Use Hp Vertica Ondemand
Data sheet HP Vertica OnDemand Enterprise-class Big Data analytics in the cloud Enterprise-class Big Data analytics for any size organization Vertica OnDemand Organizations today are experiencing a greater
More informationIII JORNADAS DE DATA MINING
III JORNADAS DE DATA MINING EN EL MARCO DE LA MAESTRÍA EN DATA MINING DE LA UNIVERSIDAD AUSTRAL PRESENTACIÓN TECNOLÓGICA IBM Alan Schcolnik, Cognos Technical Sales Team Leader, IBM Software Group. IAE
More informationUnderstanding Data Warehouse Needs Session #1568 Trends, Issues and Capabilities
Understanding Data Warehouse Needs Session #1568 Trends, Issues and Capabilities Dr. Frank Capobianco Advanced Analytics Consultant Teradata Corporation Tracy Spadola CPCU, CIDM, FIDM Practice Lead - Insurance
More informationTap into Hadoop and Other No SQL Sources
Tap into Hadoop and Other No SQL Sources Presented by: Trishla Maru What is Big Data really? The Three Vs of Big Data According to Gartner Volume Volume Orders of magnitude bigger than conventional data
More informationScaling Your Data to the Cloud
ZBDB Scaling Your Data to the Cloud Technical Overview White Paper POWERED BY Overview ZBDB Zettabyte Database is a new, fully managed data warehouse on the cloud, from SQream Technologies. By building
More informationEmerging Technologies Shaping the Future of Data Warehouses & Business Intelligence
Emerging Technologies Shaping the Future of Data Warehouses & Business Intelligence Appliances and DW Architectures John O Brien President and Executive Architect Zukeran Technologies 1 TDWI 1 Agenda What
More informationThe IBM Cognos Platform
The IBM Cognos Platform Deliver complete, consistent, timely information to all your users, with cost-effective scale Highlights Reach all your information reliably and quickly Deliver a complete, consistent
More informationDigital Transformation
Digital Transformation The Leadership Edge Pascal Giraud Senior Director EMEA Technology Copyright 2014 Oracle and/or its affiliates. All rights reserved. 2 Enterprise Computing Trends GLOBALIZATION DATA
More informationOracle Big Data SQL Technical Update
Oracle Big Data SQL Technical Update Jean-Pierre Dijcks Oracle Redwood City, CA, USA Keywords: Big Data, Hadoop, NoSQL Databases, Relational Databases, SQL, Security, Performance Introduction This technical
More informationThe Big Data Paradigm Shift. Insight Through Automation
The Big Data Paradigm Shift Insight Through Automation Agenda The Problem Emcien s Solution: Algorithms solve data related business problems How Does the Technology Work? Case Studies 2013 Emcien, Inc.
More informationData Masking Checklist
Data Masking Checklist Selecting the Right Data Masking Tool Selecting Your Masking Tool Ensuring compliance with current data protection regulations and guidelines has become a mandatory operation. Non-compliance
More informationBig Data Efficiencies That Will Transform Media Company Businesses
Big Data Efficiencies That Will Transform Media Company Businesses TV, digital and print media companies are getting ever-smarter about how to serve the diverse needs of viewers who consume content across
More informationArchitecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing
Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing Wayne W. Eckerson Director of Research, TechTarget Founder, BI Leadership Forum Business Analytics
More informationTransactions & Interactions
Transactions & Interactions The Correlation of Structured and Unstructured Data Shaun Connolly, Hortonworks December 15, 2011 Big Data Has Reached Every Market Digital data is personal, everywhere, increasingly
More informationFUJITSU Software Interstage Business Operations Platform: A Foundation for Smart Process Applications
FUJITSU Software Interstage Business Operations Platform: A Foundation for Smart Process Applications Keith Swenson VP R&D, Chief Architect Fujitsu America, Inc. May 30, 2013 We are a software company
More informationIntegrating Apache Spark with an Enterprise Data Warehouse
Integrating Apache Spark with an Enterprise Warehouse Dr. Michael Wurst, IBM Corporation Architect Spark/R/Python base Integration, In-base Analytics Dr. Toni Bollinger, IBM Corporation Senior Software
More informationLuncheon Webinar Series May 13, 2013
Luncheon Webinar Series May 13, 2013 InfoSphere DataStage is Big Data Integration Sponsored By: Presented by : Tony Curcio, InfoSphere Product Management 0 InfoSphere DataStage is Big Data Integration
More informationManaging Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database
Managing Big Data with Hadoop & Vertica A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database Copyright Vertica Systems, Inc. October 2009 Cloudera and Vertica
More informationNetApp Big Content Solutions: Agile Infrastructure for Big Data
White Paper NetApp Big Content Solutions: Agile Infrastructure for Big Data Ingo Fuchs, NetApp April 2012 WP-7161 Executive Summary Enterprises are entering a new era of scale, in which the amount of data
More information