BI Presentation Informatica 2016
Disclaimer: The BI strategy discussed in the following slides is unique to and is not necessarily representative of Siemens AG Corporate BI Strategy. Page 2
Once Upon a Time there was a company that loved Excel Page 3
they owned every Excel-based reporting tool known to mankind Page 4
acquisitions were sad because of the complex reporting silos Page 5
but the ever-wise CIO knew there must be a better way. So he formed a new, highly-skilled team to change the culture Page 6
the challenge was issued
Self-service and cross-application reporting: new goals
- Friendly Data for end user and developer
- Analytics and Visualization would be the way forward
- One Source of the Truth: reduce data movement and shadow IT by creating a Data Lake
- Fast: utilize the Greenplum database's blazing speed
Page 7
the steps set
- ETL: procure enterprise ETL tool to meet data movement and transformation needs
- Reporting: find enterprise reporting tool to meet end-user ad-hoc and visualization reporting needs; host reporting tool bake-off
- Data Management: minimize data transformation while bringing in disparate data sources; provide consistency across enterprise
- Governance: develop rock-solid governance that protects data yet is flexible enough to allow business to prototype and do ad-hoc reporting
Page 8
ETL: Extract, Transform and Load
- Informatica PowerCenter was the obvious choice: industry leader, no bake-off needed
- High-speed connectors can move millions of rows of data very quickly out of source systems such as SAP, BW and SFDC into the Greenplum Data Lake
- Informatica's footprint is expanding: it has become the tool of choice for all data movement!
- Informatica Cloud is also used for some smaller, less complex needs
Page 9
Reporting Tool Bake-off
Tableau
- PROs: easy enough for business analyst to use; direct data connectors; reporting experts really liked product; Analytics and Visualizations tool of choice; does not slow Greenplum DB down (low overhead); good support for row-level security; scales for enterprise rollout; decent admin tools; reasonably priced
- CONs: focused on analytics and visualizations, not necessarily a good choice for operational reporting
WebFOCUS InfoAssist 8
- PROs: existing WebFOCUS environment; inexpensive; Operational Reporting tool of choice
- CONs: limited ease of use, IT support required
Business Objects
- PROs: existing BO environment; great Operational Reporting tool; produces nice-looking reports
- CONs: limited ease of use, IT support required; current environment not scalable; expensive
Qlikview
- CONs: limited ease of use, scripting is required; in-memory engine required, cannot use Greenplum DB directly; expensive
Page 10
Data Management
- Informatica uses high-speed connectors to bring millions of rows of source data into Greenplum
- Data Staging process gets data ready for use:
  - Flatten out / de-normalize data
  - Add friendly, consistent column names
  - Ensure column values are consistent
  - Snapshot data for trending analysis
  - Perform automated error checking
  - Roll back updates if data updating fails (per group)
Page 11
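The staging steps above can be sketched in a few lines of Python. This is only an illustration of the idea, not how Informatica or Greenplum implement it: the column mapping, the country aliases, the specific checks, and the function names are all hypothetical, and the flatten/de-normalize step is left out for brevity.

```python
from datetime import date

# Hypothetical mapping from cryptic source column names to friendly ones.
FRIENDLY_NAMES = {"KUNNR": "customer_number", "LAND1": "country", "NETWR": "net_value"}

# Hypothetical value normalization: one consistent spelling per country.
COUNTRY_ALIASES = {"US": "United States", "USA": "United States", "DE": "Germany"}


def stage_row(raw, snapshot_date):
    """Rename columns, normalize values, and stamp the row with a snapshot date."""
    row = {FRIENDLY_NAMES.get(col, col.lower()): value for col, value in raw.items()}
    row["country"] = COUNTRY_ALIASES.get(row.get("country"), row.get("country"))
    row["snapshot_date"] = snapshot_date.isoformat()  # enables trending analysis
    return row


def check_row(row):
    """Automated error check (assumed rules): key present, net value numeric."""
    if not row.get("customer_number"):
        raise ValueError("missing customer_number")
    if not isinstance(row.get("net_value"), (int, float)):
        raise ValueError("net_value is not numeric")


def load_group(target, raw_rows, snapshot_date):
    """Stage one group of rows; leave the target untouched if any row fails."""
    staged = []
    try:
        for raw in raw_rows:
            row = stage_row(raw, snapshot_date)
            check_row(row)
            staged.append(row)
    except ValueError:
        return False  # rollback: nothing from this group reaches the target
    target.extend(staged)
    return True
```

Rows are collected in a temporary list and appended to the target only after the whole group passes, which is one simple way to get the per-group rollback behavior.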
Governance
- Data loads from source systems and ease of use are verified by IT and signed off by the business
- Tableau Workbooks can only be created or modified by authorized users. Authorized users can use Certified and Uncertified data sources
- IT publishes all Certified Data Sources at request of business
- Tableau Workbooks are published only after business SME approval
[Diagram labels: Greenplum Data Lake; Tableau Desktop; Tableau Server; Tableau Workbooks]
Page 12
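The governance rules on this slide can be expressed as a few simple checks. The sketch below is a plain-Python illustration and does not use any Tableau API; the `Workbook` class, the function names, and the idea of tracking approval as a boolean flag are assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass
class Workbook:
    name: str
    author: str
    data_sources: list          # names of the data sources the workbook reads
    sme_approved: bool = False  # set once a business SME signs off


def can_edit(user, authorized_users):
    """Only authorized users may create or modify workbooks."""
    return user in authorized_users


def uncertified_sources(workbook, certified_sources):
    """List the workbook's data sources that IT has not published as Certified."""
    return [s for s in workbook.data_sources if s not in certified_sources]


def can_publish(workbook, authorized_users):
    """Publishing requires an authorized author and business SME approval."""
    return can_edit(workbook.author, authorized_users) and workbook.sme_approved
```

Note that using an Uncertified source does not block publishing in this sketch, since the slide allows authorized users to work with both kinds; `uncertified_sources` only makes the mix visible.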
Put it all Together
[Diagram labels: SFDC; Tableau Desktop (Super Users); Tableau Server; Other Reporting Systems; O2O; OpenAir; Informatica Extract, Transform and Load (ETL); Greenplum Data Lake; SAP]
Page 13
There were bumps in the road
- IT Team very uncomfortable: wanted more detail when we were making it up as we went
- Difficult to manage expectations
- Impossible to manage timeline: art, not science
- Lots and lots and lots of technical hurdles to overcome
Page 14
some good decisions too
- CIO as project sponsor
- Professional Services helped verify technical installation of product and gave us excellent best-practices help
- Training was extremely important to get team up and running as quickly as possible
- Team sequestered for first year as we worked through standing up reporting solution
- Had real-world Use Case to work through
Page 15
Business Reporting Team
- Executive Reporting Council: sets overarching direction for Enterprise Reporting
- Business Stakeholders: responsible for reporting in a specific area of the business; provide IT with Reporting Demands (data and, in some cases, reports)
- Reporting Council: group of Business Stakeholders that prioritizes all Reporting Demands across the Enterprise
- Super Users: business users who can create reports and, in some instances, combine Uncertified data with Certified data
- Data Czar: responsible for Enterprise Data Cleansing/Monitoring and reporting-specific business processes around DUNS, Industry, Product Hierarchy, etc. Must understand, and has authority to require changes to, business processes and transactional systems where needed to support reporting and other such initiatives
Page 16
IT Reporting Team
- Demand Manager: conducts periodic meeting with Reporting Council to prioritize all Enterprise reporting demands; works closely with Business Stakeholders to ensure alignment in developing Enterprise reporting area vision
- Project Manager(s): works with different areas of the business managing business expectations and reporting projects
- Data Modeler(s): develops overarching database architecture for providing Certified and Uncertified data to appropriate Business users, using data in place and ETL to Stage and De-normalize data into Data Lake
- Reporting Tool Team(s): creates reports, provides support and administers security for Operational reporting tool
- Meta Data Manager: manages Data Certification, Data Cleansing, Data Normalization and provides Business users with Data Guides on all Certified data sources; works closely with and supports Data Czar
- Data Lake DBA: technical resource that manages Data Lake
- Data Management Team: provides support and administers security for Data Management tool(s); provides Data Blending, Data Staging, ETL, Cleansing/Monitoring, Data Normalization and supports Meta Data and Data Modeling
Page 17
Overall Roles and Responsibilities
Business (Data, Workbooks and Priorities):
- Owns data and business processes
- Verifies ease of use of Greenplum data
- Approves/denies user security requests
- Responsible for creating workbooks from published data sources
- Works with IT in training end users
- Sets priorities for what data is transformed when by IT team
IT (Data Transformation, Governance):
- Performs business analysis on data needed per the business
- Extracts, transforms and loads needed data into Greenplum
- Implements security per business requirements
- Ensures governance of data source and workbook publishing
- Creates starter workbooks when needed
- Works with business in training end users
Page 18
Data Alignment Matrix
- Gather each slice of data needed by each Business Stakeholder
- Add to Data Alignment Matrix
- Prioritize based on overall need
Page 19
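One way to picture the Data Alignment Matrix is as a mapping from each data slice to the stakeholders who asked for it. The sketch below assumes "overall need" can be approximated by counting requesting stakeholders; that scoring rule, the function names, and the sample stakeholder and slice names are illustrative assumptions, not the presenters' actual method.

```python
from collections import defaultdict


def build_alignment_matrix(requests):
    """Map each data slice to the set of stakeholders who asked for it."""
    matrix = defaultdict(set)
    for stakeholder, data_slice in requests:
        matrix[data_slice].add(stakeholder)
    return matrix


def prioritize(matrix):
    """Order data slices by number of requesting stakeholders (ties by name)."""
    return sorted(matrix, key=lambda s: (-len(matrix[s]), s))


# Hypothetical requests gathered from three stakeholders.
requests = [
    ("Sales", "SFDC opportunities"),
    ("Finance", "SAP orders"),
    ("Support", "SAP orders"),
    ("Sales", "SAP orders"),
]
matrix = build_alignment_matrix(requests)
ranked = prioritize(matrix)  # slices needed by the most stakeholders come first
```

In practice the ranking would likely also weigh business value and effort, which a simple count does not capture.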
Priorities: More Work Than People
- Each Business Stakeholder prioritizes their work
- Meet with all Business Stakeholders to prioritize overall work
- Resource availability based on overall priority
Page 20
Data Governance Process
- Uncertified Data: often brought into the reporting tool or even into Greenplum as Super Users do prototyping and ad-hoc reporting. This is encouraged by all, because the business will iterate the reports before requesting data to be Certified. Preserves precious IT resources
- Certified Data: data must be verified and signed off by the business before it can be used in a Production report. Data Requests are made by the business in the Help Desk ticketing system
- What Data is Most Important? Deciding what data moves into the Data Lake, and when, is the most important item to prioritize. We look for sweet spots where multiple Business Stakeholders need the same data to help guide prioritization
- ETL is 80-90% of the total effort required
- IT'S ALL ABOUT THE DATA. No Data = No Reports. Bad Data = Bad Reports
Page 21
Security Request Process
1. Requestor fills out Help Desk Ticket and attaches Manager's approval
2. Data Owner(s) reviews and approves
3. Tableau Account set up, Requestor added to proper Group(s)
4. Row-level security added (if needed)
5. Requestor notified, Help Desk Ticket closed
Page 22
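The five steps above can be read as a small workflow with two approval gates. The following sketch models it in plain Python; the ticket fields, the step labels, and the behavior when an approval is missing (the request simply stops) are assumptions for illustration, since the slide does not describe rejection handling.

```python
def process_security_request(ticket, owner_approvals, needs_row_level_security=False):
    """Return the ordered list of steps completed for one access request."""
    steps = []

    # Step 1: the Help Desk ticket must carry the manager's approval.
    if not ticket.get("manager_approval_attached"):
        return steps
    steps.append("ticket_filed")

    # Step 2: every data owner involved must approve.
    if not owner_approvals or not all(owner_approvals.values()):
        return steps
    steps.append("data_owner_approved")

    # Step 3: account is set up and the requestor is added to the proper groups.
    steps.append("account_set_up_and_groups_assigned")

    # Step 4: row-level security is optional.
    if needs_row_level_security:
        steps.append("row_level_security_added")

    # Step 5: notify the requestor and close the ticket.
    steps.append("requestor_notified_ticket_closed")
    return steps
```

Returning the list of completed steps makes it easy to see where a stalled request stopped, for example a ticket that was filed but never approved by the data owner.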
Summary
1. It's all about the data
2. See #1
3. Start small: try it out on a small scale first
4. To optimize results, IT and business must be partners
5. Governance must protect data yet allow for business to prototype and do ad-hoc reporting while not hurting anyone
6. Never underestimate the power of Executive Sponsorship
Page 23
And now the Unicorn!
[Diagram labels: Certified; Uncertified; Essbase; GTAC; O2O; OpenAir; Products; Discovery; Operational; Write-Back; Access; Excel; 3rd Party; SAP; Data Management; SFDC; Data Lake]
Page 24
Questions? Page 25
I'll be at the Tableau booth for a while after the presentation Page 26