ENGINE(S) BEHIND BI Sam Tawfik sam.tawfik@teradata.com @teradata_sam
Transactions and Interactions BIG DATA 2 Transaction Interaction
Data Growth 10 24 10 21 10 18 10 15 10 12 10 9 Yottabyte Zettabyte Exabyte Petabyte Terabyte Gigabyte Interactions Transactions Source: IDC - sponsored by EMC. As the Economy Contracts, the Digital Universe Expands. May 2009 3
Transactional Data An Enterprise Data Warehouse is a Centralized and Historical repository of Integrated, Detailed and Enriched data that supports multiple decision-making applications for multiple groups and is the single source of analytics data for the enterprise. Transactional Systems Accts. Payable/Rec Sales/Orders Finance G/L HR Payroll Purchasing Manufacturing Inventory Enterprise Data Warehouse 4 Users
Integrated Transactional Data is Valuable Profitability Inventory Analysis Supplier Data Business Value Sales Analysis Market Basket Analysis Transaction Data Customer Analysis Customer Data Financial Data EDW Investment Product Data 5
Industry Examples Satellite provider uses geographical data to calculate the revenue and acceptance rate of the new Free HD DVR offer by day, geography, and customer value Retailer uses customer purchasing behavior data to identify expected mothers so they can target them with baby products and change their buying habbits SCE uses Smart metering capability with feedback to consumers/businesses to conserve energy during peak periods, avoiding costly build-out of more peak capacity and reducing operational costs 6
Big Data The Four Axes of Big Data CIOs face significant challenges in addressing the issues surrounding big data New technologies and applications are emerging (examples include Hadoop and MapReduce) and should be investigated to understand their potential value. Source: CEO Advisory: Big Data Equals Big Opportunity, Gartner, 31 March 2011. 7
What is MapReduce? MapReduce is a parallel programming framework originally developed by Google to generate search indexes and for web scoring algorithms Runs on 100s to 100,000 servers Data analyzed where it is stored Use popular developer tools Automatic re-execution Map Function Scheduler map shuffle reduce Hadoop is open source MapReduce Results 8
Big Data in the Entertainment Industry Leveraging content as a capture point for behavioral data products Studio Products Consumer Channels Film & Television STUDIO Interactive Games Explosion of Interactions & New Data Downloads Duration Stopping points Invitations Engagement Mobile Connected TV CUSTOMER DVD Home Entertainment Digital Distribution YouTube VOD Theater 9
Industry Examples Marketing > Influence the influencers > Identify key influencers within an internet community of friends and leverage the viral phenomenon in the social community Retail > Digital marketing optimization > Cross-analysis of massive volumes of web logs and searches, click stream data, and social media for SEO spend optimization, advertising attribution, golden path, and site stickiness Social Media Analysis > Social networking graph analysis > Crowd-sourcing, virility analysis, and content targeting 10
Transactions and Interactions Together! Big Data Analytic Platform Workers Bi-directional High Performance Aster-Teradata Connector Intelligent Applications SQL-MapReduce Loaders/Exporters <<<<<>>>>> Integrated Data Warehouse Bi-directional, high performance data transfer ensures that Big Data Analytics are actively integrated into the Data Warehouse to optimize & drive operations. 11
Putting it all Together Today 12 Source: ebay, ebay Extreme Analytics in a Virtual World, Nov 10,2010 Permission to use publicly granted by ebay.
Q&A