The IBM Agile Information Governance Process
|
|
- Betty Leonard
- 8 years ago
- Views:
Transcription
1 IBM Software Thought Leadership White Paper May 2014 The IBM Agile Information Governance Process
2 2 The IBM Agile Information Governance Process We are literally drowning in data today. IDC estimates that the amount of information in the digital universe exceeded 1.8 zettabytes, or 1.8 trillion gigabytes, in 2011, and is doubling every two years. 1 Companies are increasingly turning to big data to drive analytic and operational applications. For example, a credit card company may analyze transactions in real time to uncover fraud. A retailer may engage in listening efforts to understand what its customers are saying in social media. Finally, a utility may use smart meter readings to incent customers to move their electricity consumption to off-peak hours. Big data has the following characteristics: Volume: Enterprises are awash with data, easily amassing terabytes and petabytes of information, and even zettabytes in the future. Velocity: Often time-sensitive, streaming data must be analyzed with millisecond response times to bolster real-time decisions. Variety: Big data includes structured, semi-structured and unstructured data such as s, audio, video, clickstreams, log files and biometrics. Veracity: Veracity is different from the other Vs. As volume, velocity and variety grow, the veracity, or confidence in your data, decreases. If organizations do not have confidence in the underlying big data, then they will not be able to trust the analytics and insights that emanate from this data. Companies are also using big data technologies to modernize their legacy infrastructures. As they do this, information governance best practices, including metadata, data quality, master data, data security and data lifecycle management, are critical to the success of these initiatives. This white paper will focus on the veracity (or confidence) dimension of big data Define business problem Obtain executive sponsors Align teams Understand data risk and value Implement project(s) Measure results Plan Act Assess Enhanced 360-degree view of the customer Big data exploration Security/ intelligence extension Application development and testing Application efficiency Application consolidation and retirement Security and compliance Data warehouse augmentation Operations analysis Figure 1. The IBM Agile Information Governance Process.
3 IBM Software 3 The IBM Agile Information Governance Process, shown in Figure 1, consists of six steps across three distinct phases. In the Plan phase, information governance teams define the business problem, obtain executive sponsorship, align teams and understand data risk and value. In the Act phase, organizations implement one or more projects based on common use cases. Finally, in the Assess phase, information governance teams measure results. The IBM Agile Information Governance Process is built as a continuous loop. As information governance teams measure results on one project, they start anew by defining the business problem that may spawn additional projects. This white paper will explore the steps in the IBM Agile Information Governance Process. Step one: Define the business problem The information governance team should begin by defining the business problem. Best practices across clients often line up with nine common use cases (see Step five: Implement projects for more on the use cases). Here are a few business problems with a big data orientation: Increasing system performance based on log analytics IT departments are turning to big data to analyze application logs for slivers of insight that can improve system performance. Because application vendors log files are in different formats, they need to be standardized first to promote IT s confidence in the results. Optimizing water, gas and electricity consumption through smart meters Several utilities are rolling out smart meters to measure the consumption of water, gas and electricity at regular intervals of an hour or less. These smart meters generate copious amounts of interval data that needs to be governed appropriately. Utilities must safeguard the privacy of this interval data because it can potentially indicate a subscriber s household activities, as well as the comings and goings from his or her home. In addition, utilities need to establish policies for the archiving and deletion of interval data to reduce storage costs. Masking sensitive data within call-center voice recordings Many call centers make voice recordings of some or all of their calls for quality assurance purposes or to comply with regulations. These voice recordings may contain sensitive or personally identifiable information (PII) such as Social Security numbers and Payment Card Industry data such as the three-digit or fourdigit card verification code and primary account numbers. Therefore, they must be safeguarded against unauthorized access and use. Step two: Obtain executive sponsorship Information governance programs historically focused on traditional types of data. Bringing big data within the scope of the information governance program requires strong executive sponsorship. The executive sponsor may be someone from business or IT, but must have the ability to talk to both constituencies. The executive sponsor should be able to prioritize projects, obtain funding and manage headcount. Examples of a cross-functional executive sponsor include the chief information officer, the chief data officer or the vice president of enterprise data management. Specific functional executives from marketing, risk and supply chain departments may also provide executive sponsorship if big data constitutes a competitive advantage for the organization. Step three: Align teams Organizations also need to update certain information governance roles to account for big data. For example, the Information Governance Lead may need to assume the following additional responsibilities to govern big data: Determine the types of big data that need to be governed Assist with the development of a business case to support big data governance Evangelize big data with business stakeholders
4 4 The IBM Agile Information Governance Process Support activities that integrate big data with master data management (MDM) Expand the scope of the business glossary to support terms relating to big data (for example, a session relating to clickstream analytics) Align with multiple organizations, including legal, marketing, privacy and senior management, to establish policies regarding the acceptable use of big data Drive policies regarding the retention, compression and archiving of big data Foster policies to improve the security and privacy of big data Oversee data steward activities relating to big data In addition, the role of a data steward may need to change to accommodate big data. For example, a customer data steward may need to assume the following additional responsibilities to govern social media: Provide input into the attributes to match semi-structured and unstructured data, such as social media profiles, with MDM records Work with internal teams to determine what big data should be moved, federated, archived and ignored Leverage the data stewardship console to link, deduplicate and merge unstructured data with customer MDM records (for example, determine if the Susie Smith from Facebook is the same as Susan Smith in the customer MDM hub) Work with legal counsel, the privacy department and business stakeholders to establish privacy policies regarding the acceptable use of social media IBM InfoSphere Business Glossary and IBM InfoSphere Master Data Management (InfoSphere MDM) support the ownership of data by the business by allowing data stewards to manage business terms, policies and data rules. Step four: Understand data risk and value Data discovery and profiling are a basic requirement of an information governance program. The following examples highlight some of the ways that understanding data can impact a broader information governance program. Support data stewardship IBM InfoSphere Information Analyzer supports data profiling, including analysis of data at the column, key, source and crossdomain levels. The IBM InfoSphere Data Quality Console extends the capabilities of InfoSphere Information Analyzer by providing data stewards with the ability to drill down into exceptions. A data steward can use the InfoSphere Data Quality Console to drill down into expired life insurance policies by date of expiration. Manage reference data InfoSphere Information Analyzer can discover reference tables during the data profiling process. These reference tables can then be exported into the IBM InfoSphere Master Data Management Reference Data Management Hub. If these tables are updated in the latter, they can then be imported back into InfoSphere Information Analyzer. Link data rules with business terms Data rules within InfoSphere Information Analyzer can be linked to information governance rules in InfoSphere Business Glossary. For example, a data rule called CITY_EXISTS in InfoSphere Information Analyzer can be linked to an information governance rule called Data quality rules for customer address in InfoSphere Business Glossary. Identify sensitive data Organizations struggle with hidden sensitive data such as Social Security numbers located in a field called EMP_NUM. IBM InfoSphere Discovery can locate sensitive information such as PII so that data privacy rules can be enforced appropriately.
5 IBM Software 5 Discover complete business objects Complete business objects are logical groupings of related objects such as customers. InfoSphere Discovery can locate complete business objects that can then be archived using the IBM InfoSphere Optim Data Growth Solution. Because of its extreme volume, velocity and variety, big data should be handled differently than traditional types of data. Table 1 compares and contrasts the differences between traditional and big data quality programs. Here is an example scenario: Acme Corporation uses IBM InfoSphere BigInsights to conduct sentiment analysis of Twitter feeds. The social listening department may adopt the following business rules to determine whether mentions of acme refer to Acme Corporation or are noise that needs to be filtered out. Step five: Implement projects In working with customers across many industries, the following sweet spots emerge as the most common starting points in big data and governance projects. These are represented as part of the Implement phase of the IBM Agile Information Governance Process shown in Figure 1. The information governance implications of these big data projects are discussed in the following pages. Enhanced 360-degree view of the customer Gaining a full understanding of customers what makes them tick, why they buy, how they prefer to shop, why they switch and so on is strategic for virtually every company. In fact, in a recent IBM study, the number-one recommendation was that organizations should focus their big data efforts first on customer analytics that enable them to truly understand customer needs and anticipate future behaviors. 2 If tweet then confidence level = 99 percent If tweet contains Acme and Acme product names, then confidence level = 75 percent If tweet is on the ignore list, then confidence level = 0 percent Forward-thinking organizations recognize the need to equip their customer-facing professionals with the right information in context to help them solve customer problems and improve up-selling and cross-selling. However, they need to consider several information governance implications as well. InfoSphere MDM and IBM Watson Explorer combine structured and unstructured information in context from customer relationship management, content management, supply chain, order tracking databases, and many more systems to present a 360-degree view of the client. These offerings can also integrate in-context analytics from social media and other types of big data from InfoSphere BigInsights and IBM InfoSphere Streams.
6 6 The IBM Agile Information Governance Process Dimension Traditional data quality Big data quality Frequency of processing Batch-oriented Real-time and batch-oriented Variety of data Data format is largely structured Data format may be structured, semi- structured or unstructured Confidence levels Timing of data cleansing Critical data elements Location of analysis Stewardship Data needs to be in pristine condition for analytics in the data warehouse Data is cleansed prior to loading into the data warehouse Data quality is assessed for critical data elements such as customer address Data moves to the data quality and analytics engines Stewards can manage a high percentage of the data Noise needs to be filtered out but data needs to be good enough Depending upon confidence levels and intended use, poor data quality may or may not impede analytics and insights Data can be loaded as- is because the critical data elements and relationships may not be fully understood Volume and velocity of data may require streaming, in- memory analytics to cleanse data, thus reducing storage requirements Data may be quasi- or ill- defined and subject to further exploration, hence critical data elements may change iteratively Data quality and analytics engines may move to the data to ensure speed of processing Stewards can manage only a smaller percentage of data due to high volumes and/or velocity Table 1. Traditional versus big data quality programs. Marketing organizations often need to match lists of prospects against internal records to remove any customers who have made do-not-call elections. These large data sets push the limits of existing computational resources when IT needs to match 200 million prospects against a database of 100 million customers and return the results to marketing in 24 hours. IBM InfoSphere MDM can manage massive data sets comprising up to as many as one billion records. IBM has also implemented the InfoSphere MDM probabilistic matching engine within a MapReduce framework on InfoSphere BigInsights. This has helped organizations implement probabilistic matching on ultra-large data sets in hours rather than days or weeks. Big data exploration The first step in leveraging big data is to find out what you have and to establish your ability to access it and use it to support decision making and day-to-day operations. Big data exploration is the way to get started.
7 IBM Software 7 Users should be able to explore big data in the context of operational and analytical applications. For example, call center agents should be able to search for content within the company s intranet portal. Watson Explorer automates the discovery of big data, regardless of its format or where it resides, providing a federated view of key business information necessary to drive new initiatives. The technology is characterized by its unique index and search capabilities that uncover data from multiple repositories. As a result, customer service representatives are able to reduce Average Handle Time by doing text searches while on the call with the customer. IBM has also integrated Watson Explorer with InfoSphere Business Glossary. As shown in Figure 2, a customer is now able to search on the term taxation in the business glossary and pull up instances of that term within unstructured content. Security/intelligence extension To combat sophisticated threats, organizations must adopt new approaches that help spot anomalies and subtle indicators of attack by leveraging all available data. This may include: Traditional log and event data Network flow data Vulnerability and configuration information Identity context Threat intelligence data For example, InfoSphere Streams can correlate real-time feeds from multiple motion sensors to detect any threats to a physical environment. The software provides specialized data quality techniques when handling high volumes of data in real time without landing interim results to disk. Figure 2. Watson Explorer supports text search from InfoSphere Business Glossary.
8 8 The IBM Agile Information Governance Process InfoSphere Streams can also discover the temporal offset when joining, correlating and matching data from different sources. For example, a streaming application that needs to combine data from two sensors needs to know that Sensor A generates events every second, while Sensor B generates events every three seconds. If InfoSphere Streams does not receive a sensor event as expected, it can generate an alarm. Application development and testing The tremendous size and complexity of big data projects create challenges for testers. Because production data contains PII, organizations need to mask that data in test environments. However, big data applications must be delivered rapidly and testing teams must create realistic, right-sized, masked test data sets in short order. The IBM InfoSphere Optim Test Data Management solution streamlines the creation and management of test environments; subsets and migrates data to build realistic and right-sized test databases; masks sensitive data; automates test result comparisons; and helps eliminate the expense and effort of maintaining multiple database clones. The InfoSphere Optim Test Data Management solution can facilitate the following tangible financial benefits: Increased revenues from faster time-to-market due to automated generation of test data sets Reduced storage space for test data Fewer production defects due to better testing Reduced downtime as developers spend less time waiting for the refresh of their test environments Application efficiency Data growth often has an adverse impact on application performance and costs. Big data at rest includes smart meter readings, sensor data, RFID data and web logs that might reside in relational databases, file systems, NoSQL databases and Apache Hadoop. However, there is a myth that more data equals better analytics. According to the CGOC Summit 2012 Survey, approximately 69 percent of enterprise information has outlived its usefulness and can be subject to defensible disposition practices. Organizations can improve the signal-to-noise ratio in their big data environments moving from data swamps to data lakes by retaining only the right data. Companies should compress and archive big data at rest to reduce storage costs and to improve application performance. Because Hadoop avoids data loss by replicating the same data across multiple nodes in a cluster, organizations should consider InfoSphere BigInsights for fault-tolerant data archiving. The InfoSphere Optim Data Growth solution helps organizations reduce storage costs and improve application performance by archiving structured data. Compared with Hadoop, the InfoSphere Optim Data Growth solution can archive structured data in an immutable format where user access is tightly controlled and audited. Archived data can be subject to legal holds and is easily retrievable during legal proceedings. This data can be defensibly disposed to minimize legal impact. Archived data in the InfoSphere Optim Data Growth solution is accessible to business intelligence and enterprise applications, supports search and can be easily restored back to the source. Application consolidation and retirement Chief information officers are always on the lookout for ways to reduce costs, including application consolidation and retirement. For example, one organization had eight instances of SAP across several versions. It was expensive to maintain these versions and to aggregate data for corporate reporting. However, application teams were reluctant to retire legacy applications for fear that they might need the underlying data for legal, regulatory or analytic purposes.
9 IBM Software 9 InfoSphere Optim Data Growth and InfoSphere Information Server facilitate application consolidation and retirement by archiving data from decommissioned applications while providing ongoing access to the underlying data. Tangible cost savings accrue from lower software license and maintenance, hardware and labor costs. Organizations can now leverage the power of Hadoop to perform blended analytics of archived data in InfoSphere Optim as well as structured and unstructured data from other sources. Organizations can store their data in immutable format in the InfoSphere Optim Data Growth solution, which can now create query-able data archive files for storage in HBase. As a result, organizations can combine the immutability of an InfoSphere Optim archive with the processing power and cost-effectiveness of InfoSphere BigInsights Hadoop capabilities. Plus, Watson Explorer can also search for data within InfoSphere Optim archive files. Security and compliance Securing sensitive data complements the security/intelligence extension project mentioned above and focuses on securing and protecting sensitive data, such as credit card numbers, health records and so on. Companies are leveraging new types of internal and external data, which are now being consumed by innovative applications and new users. Because this data may be sensitive, organizations have to consider compliance and reputational risks. This is especially true in the US, as the Securities and Exchange Commission has ordered publicly traded companies to declare their security breaches. Although this sensitive data may be embedded within production, test, training and business intelligence environments, it needs to be protected regardless of location. in real time irrespective of the data type and for data-at-rest and data-in-motion. Developers can invoke this data masking functionality directly from MapReduce routines and Jaql scripts. In addition, this feature has been packaged as database-specific user-defined functions (UDFs) so that data moving into and out of the Hadoop Distributed File System (HDFS), HBase, InfoSphere BigInsights, InfoSphere Streams, the IBM PureData System for Analytics, IBM DB2, IBM DB2 for z/os and Oracle can be masked on demand. Organizations also need to establish policies to monitor access to sensitive big data by privileged users. The Hadoop Activity Monitoring feature of IBM InfoSphere Guardium allows activity monitoring on Hadoop just like traditional environments. This not only has minimal impact on the network, but offers an audit trail with granular details of big data activity. Data warehouse augmentation Data warehouses are built for massive scale. However, many organizations are struggling to manage data storage costs as their volumes grow. As a result, they are integrating Hadoop file systems and data warehouse capabilities to increase operational efficiency. Organizations are taking advantage of Hadoop s relatively inexpensive data storage by using it as a staging area before determining what data should be moved to the warehouse. They can process and analyze streaming data in real time to determine what should be stored, either in Hadoop or directly in the warehouse. Additionally, data can be cleansed and transformed before loading into the warehouse, enabling data exploration and ad hoc queries (see Figure 3). Data masking is the process of systematically transforming confidential data elements, such as trade secrets and PII, into realistic, but fictionalized, values. IBM InfoSphere Optim Data Masking on Demand allows organizations to invoke masking algorithms
10 10 The IBM Agile Information Governance Process All data IBM Watson Foundations New/enhanced applications Transaction and application data Machine and sensor data Enterprise content Image, geospatial and video data Social data Real-time data processing and analytics Operational data zone Landing, exploration and archive data zone Deep analytics data zone Enterprise data warehouse and data mart zone Information integration and governance What action should I take? Decision management What is happening? Discovery and exploration What did I learn? What s best? Cognitive What could happen? Predictive analytics and modeling Why did it happen? Reporting and analytics Customer experience New business models Financial performance Risk Operations, threats and fraud Thirdparty data Systems Security Storage On-premise, cloud, as a service IBM Big Data & Analytics infrastructure Maximize insight, improve IT economics Figure 3. Data flows through multiple zones as it is ingested, transformed and analyzed. IBM InfoSphere DataStage includes the Big Data File Stage, which supports reading and writing multiple files in parallel from and to the HDFS. The Big Data File Stage leverages the parallel engine within InfoSphere DataStage to provide massive scalability. Developers can also use the Balanced Optimization functionality to design a job in the InfoSphere DataStage environment and then deploy all or part of it in InfoSphere BigInsights or Cloudera Enterprise. InfoSphere DataStage autogenerates Jaql for MapReduce through the Balanced Optimization technology. IBM InfoSphere Business Information Exchange supports data lineage and impact analysis in highly heterogeneous environments. It helps organizations create, manage and share enterprise-wide common business terminology, policies and rules relating to all types of data, including big data. For example, InfoSphere Business Information Exchange may define the term unique visitor as a unit used to count individual users of a website for the purposes of clickstream analytics. It may also include a business rule that governs how unique visitors are calculated. All of these decisions are made with the help of an easy-to-use web interface designed to simplify collaboration between business and IT.
11 IBM Software 11 Operations analysis InfoSphere BigInsights provides robust sentiment analysis, enabling companies to leverage vast volumes of social media, machine data and other types of big data to improve the efficiency of their day-to-day operations. A popular brand-name global retailer was experiencing declining product profit margins due to increased promotional activity. In order to address this business challenge, the company decided to collect and analyze product feedback from customers in social media such as Twitter and other websites to determine the pricing strategy for new products. If the so-called sentiment analysis was not very positive during the product launch, the company decided to update its pricing in the master product catalog and offer discounts of 30 percent. This would replace its usual practice of selling merchandise at the end of the season at a 70 percent discount. As a result, the retailer was able to significantly improve its profit margins. The same retailer also piloted a flash event lasting just one afternoon to promote a new line of swimwear. The marketing team used only social media to attract customers to the event and anticipated that the communication would go viral. While the event was extraordinarily successful and sales exceeded projections, the marketing team uncovered some issues when analyzing clickstream data. Customers who had taken pictures of the new line could not easily find the product on line. After examining the root cause, the retailer modified its product hierarchy so that boardshorts could be found in shorts, swimwear and within its own subclass of boardshorts. InfoSphere MDM provides strong capabilities to manage product hierarchies and other attributes of product master data. Step six: Measure results As a final step, the organization should assess the results of the information governance program and make adjustments. After this assessment is completed, the information governance program loops back to define new business problems or to make adjustments to existing business problems. The process then starts over again. Many organizations can leverage their existing governance programs to both accelerate big data initiatives and reduce the time to implement further projects. IBM InfoSphere provides a foundation for big data, integration and governance to support these initiatives and their success. For more information Want to learn more about IBM InfoSphere capabilities? Call your IBM sales representative to schedule a Client Value Engagement at no cost or visit: ibm.com/software/data/infosphere About the author Sunil Soares is the founder and managing partner of Information Asset, LLC, a consulting firm that specializes in helping organizations build out their data governance programs. Prior to this role, he was the Director of Information Governance at IBM, and worked with clients across six continents and multiple industries. Soares has written four books on information governance, including The IBM Data Governance Unified Process, Selling Information Governance to the Business, Big Data Governance and IBM InfoSphere: A Platform for Big Data Governance and Process Data Governance.
12 Copyright IBM Corporation 2014 IBM Corporation Software Group Route 100 Somers, NY Produced in the United States of America May 2014 IBM, the IBM logo, ibm.com, BigInsights, DataStage, DB2, Guardium, IBM Watson, InfoSphere, Optim, PureData, and z/os are trademarks of International Business Machines Corp., registered in many jurisdictions worldwide. Other product and service names might be trademarks of IBM or other companies. A current list of IBM trademarks is available on the web at Copyright and trademark information at ibm.com/legal/copytrade.shtml This document is current as of the initial date of publication and may be changed by IBM at any time. Not all offerings are available in every country in which IBM operates. It is the user s responsibility to evaluate and verify the operation of any other products or programs with IBM products and programs. THE INFORMATION IN THIS DOCUMENT IS PROVIDED AS IS WITHOUT ANY WARRANTY, EXPRESS OR IMPLIED, INCLUDING WITHOUT ANY WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND ANY WARRANTY OR CONDITION OF NON-INFRINGEMENT. IBM products are warranted according to the terms and conditions of the agreements under which they are provided. The client is responsible for ensuring compliance with laws and regulations applicable to it. IBM does not provide legal advice or represent or warrant that its services or products will ensure that the client is in compliance with any law or regulation Digital Universe Study. Extracting Value from Chaos. IDC, IBM Institute for Business Value Executive Report. Analytics: Real-World Use of Big Data in Telecommunications. IBM, Please Recycle IMW14737-USEN-01
IBM Software Five steps to successful application consolidation and retirement
Five steps to successful application consolidation and retirement Streamline your application infrastructure with good information governance Contents 2 Why consolidate or retire applications? Data explosion:
More informationIBM InfoSphere Guardium Data Activity Monitor for Hadoop-based systems
IBM InfoSphere Guardium Data Activity Monitor for Hadoop-based systems Proactively address regulatory compliance requirements and protect sensitive data in real time Highlights Monitor and audit data activity
More informationIBM Software Wrangling big data: Fundamentals of data lifecycle management
IBM Software Wrangling big data: Fundamentals of data management How to maintain data integrity across production and archived data Wrangling big data: Fundamentals of data management 1 2 3 4 5 6 Introduction
More informationIBM Software The fundamentals of data lifecycle management in the era of big data
IBM Software The fundamentals of in the era of big data How complements a big data strategy The fundamentals of in the era of big data 1 2 3 4 5 6 Introduction Big data, big impact: Dealing with the Best
More informationHow the oil and gas industry can gain value from Big Data?
How the oil and gas industry can gain value from Big Data? Arild Kristensen Nordic Sales Manager, Big Data Analytics arild.kristensen@no.ibm.com, tlf. +4790532591 April 25, 2013 2013 IBM Corporation Dilbert
More informationIBM InfoSphere Optim Test Data Management
IBM InfoSphere Optim Test Data Management Highlights Create referentially intact, right-sized test databases or data warehouses Automate test result comparisons to identify hidden errors and correct defects
More informationLuncheon Webinar Series May 13, 2013
Luncheon Webinar Series May 13, 2013 InfoSphere DataStage is Big Data Integration Sponsored By: Presented by : Tony Curcio, InfoSphere Product Management 0 InfoSphere DataStage is Big Data Integration
More informationBeyond the Single View with IBM InfoSphere
Ian Bowring MDM & Information Integration Sales Leader, NE Europe Beyond the Single View with IBM InfoSphere We are at a pivotal point with our information intensive projects 10-40% of each initiative
More informationBuilding Confidence in Big Data Innovations in Information Integration & Governance for Big Data
Building Confidence in Big Data Innovations in Information Integration & Governance for Big Data IBM Software Group Important Disclaimer THE INFORMATION CONTAINED IN THIS PRESENTATION IS PROVIDED FOR INFORMATIONAL
More informationIBM Software Understanding big data so you can act with confidence
IBM Software Understanding big data so you can act with confidence More data, more problems? Not if you have an agile, automated information integration and governance program in place 1 2 3 4 5 Introduction
More informationBig Data & Analytics for Semiconductor Manufacturing
Big Data & Analytics for Semiconductor Manufacturing 半 導 体 生 産 におけるビッグデータ 活 用 Ryuichiro Hattori 服 部 隆 一 郎 Intelligent SCM and MFG solution Leader Global CoC (Center of Competence) Electronics team General
More informationIBM Analytics. Just the facts: Four critical concepts for planning the logical data warehouse
IBM Analytics Just the facts: Four critical concepts for planning the logical data warehouse 1 2 3 4 5 6 Introduction Complexity Speed is businessfriendly Cost reduction is crucial Analytics: The key to
More informationBusiness-driven governance: Managing policies for data retention
August 2013 Business-driven governance: Managing policies for data retention Establish and support enterprise data retention policies for ENTER» Table of contents 3 4 5 Step 1: Identify the complete business
More informationExploiting Data at Rest and Data in Motion with a Big Data Platform
Exploiting Data at Rest and Data in Motion with a Big Data Platform Sarah Brader, sarah_brader@uk.ibm.com What is Big Data? Where does it come from? 12+ TBs of tweet data every day 30 billion RFID tags
More informationHow To Use Big Data To Help A Retailer
IBM Software Big Data Retail Capitalizing on the power of big data for retail Adopt new approaches to keep customers engaged, maintain a competitive edge and maximize profitability 2 Capitalizing on the
More informationIBM Analytics Prepare and maintain your data
Data quality and master data management in a hybrid environment Table of contents 3 4 6 6 9 10 11 12 13 14 16 19 2 Cloud-based data presents a wealth of potential information for organizations seeking
More informationIBM Software. The MDM advantage: Creating insight from big data
IBM Software The MDM advantage: Creating insight from The MDM advantage: Creating insight from 1 2 3 4 5 6 Introduction The importance of understanding your How MDM enhances big data and vice Leveraging
More informationIBM Unstructured Data Identification and Management
IBM Unstructured Data Identification and Management Discover, recognize, and act on unstructured data in-place Highlights Identify data in place that is relevant for legal collections or regulatory retention.
More informationIBM AND NEXT GENERATION ARCHITECTURE FOR BIG DATA & ANALYTICS!
The Bloor Group IBM AND NEXT GENERATION ARCHITECTURE FOR BIG DATA & ANALYTICS VENDOR PROFILE The IBM Big Data Landscape IBM can legitimately claim to have been involved in Big Data and to have a much broader
More informationIBM BigInsights for Apache Hadoop
IBM BigInsights for Apache Hadoop Efficiently manage and mine big data for valuable insights Highlights: Enterprise-ready Apache Hadoop based platform for data processing, warehousing and analytics Advanced
More informationIBM Software Delivering trusted information for the modern data warehouse
Delivering trusted information for the modern data warehouse Make information integration and governance a best practice in the big data era Contents 2 Introduction In ever-changing business environments,
More informationTapping the power of big data for the oil and gas industry
IBM Software White Paper Petroleum Industry Tapping the power of big data for the oil and gas industry 2 Tapping the power of big data for the oil and gas industry The petroleum industry is no stranger
More informationDelivering new insights and value to consumer products companies through big data
IBM Software White Paper Consumer Products Delivering new insights and value to consumer products companies through big data 2 Delivering new insights and value to consumer products companies through big
More informationAddressing government challenges with big data analytics
IBM Software White Paper Government Addressing government challenges with big data analytics 2 Addressing government challenges with big data analytics Contents 2 Introduction 4 How big data analytics
More informationTest Data Management in the New Era of Computing
Test Data Management in the New Era of Computing Vinod Khader IBM InfoSphere Optim Development Agenda Changing Business Environment and Data Management Challenges What is Test Data Management Best Practices
More informationKlarna Tech Talk: Mind the Data! Jeff Pollock InfoSphere Information Integration & Governance
Klarna Tech Talk: Mind the Data! Jeff Pollock InfoSphere Information Integration & Governance IBM s statements regarding its plans, directions, and intent are subject to change or withdrawal without notice
More informationData virtualization: Delivering on-demand access to information throughout the enterprise
IBM Software Thought Leadership White Paper April 2013 Data virtualization: Delivering on-demand access to information throughout the enterprise 2 Data virtualization: Delivering on-demand access to information
More informationIBM Software Top tips for securing big data environments
IBM Software Top tips for securing big data environments Why big data doesn t have to mean big security challenges 2 Top Comprehensive tips for securing data big protection data environments for physical,
More informationIBM Analytics Make sense of your data
Using metadata to understand data in a hybrid environment Table of contents 3 The four pillars 4 7 Trusting your information: A business requirement 7 9 Helping business and IT talk the same language 10
More informationIBM Analytical Decision Management
IBM Analytical Decision Management Deliver better outcomes in real time, every time Highlights Organizations of all types can maximize outcomes with IBM Analytical Decision Management, which enables you
More informationIBM System x reference architecture solutions for big data
IBM System x reference architecture solutions for big data Easy-to-implement hardware, software and services for analyzing data at rest and data in motion Highlights Accelerates time-to-value with scalable,
More informationStrengthen security with intelligent identity and access management
Strengthen security with intelligent identity and access management IBM Security solutions help safeguard user access, boost compliance and mitigate insider threats Highlights Enable business managers
More informationIBM Software InfoSphere Guardium. Planning a data security and auditing deployment for Hadoop
Planning a data security and auditing deployment for Hadoop 2 1 2 3 4 5 6 Introduction Architecture Plan Implement Operationalize Conclusion Key requirements for detecting data breaches and addressing
More informationIBM InfoSphere: Solutions for retail
IBM Software Solution Brief IBM InfoSphere: Solutions for retail Build a single view of customer information and a trusted source for product information with data integration and master data management
More informationThe top five ways to get started with big data
IBM Software Thought Leadership White Paper June 2013 The top five ways to get started with big data 2 The top five ways to get started with big data Big data: A high-stakes opportunity Remember what life
More informationSafeguarding the cloud with IBM Dynamic Cloud Security
Safeguarding the cloud with IBM Dynamic Cloud Security Maintain visibility and control with proven security solutions for public, private and hybrid clouds Highlights Extend enterprise-class security from
More informationContinuing the MDM journey
IBM Software White paper Information Management Continuing the MDM journey Extending from a virtual style to a physical style for master data management 2 Continuing the MDM journey Organizations implement
More informationGetting the most out of big data
IBM Software White Paper Financial Services Getting the most out of big data How banks can gain fresh customer insight with new big data capabilities 2 Getting the most out of big data Banks thrive on
More informationIBM Software Four steps to a proactive big data security and privacy strategy
Four steps to a proactive big data security and privacy strategy Elevate data security to the boardroom agenda Contents 2 Introduction You ve probably heard the saying Data is the new oil. Just as raw
More informationThe Smart Archive strategy from IBM
The Smart Archive strategy from IBM IBM s comprehensive, unified, integrated and information-aware archiving strategy Highlights: A smarter approach to archiving Today, almost all processes and information
More informationIBM Software June 2014 Thought Leadership White Paper. The top five ways to get started with big data
IBM Software June 2014 Thought Leadership White Paper The top five ways to get started with big data 2 The top five ways to get started with big data Big data: A high-stakes opportunity Remember what life
More informationIBM Software Hadoop in the cloud
IBM Software Hadoop in the cloud Leverage big data analytics easily and cost-effectively with IBM InfoSphere 1 2 3 4 5 Introduction Cloud and analytics: The new growth engine Enhancing Hadoop in the cloud
More informationIBM InfoSphere BigInsights Enterprise Edition
IBM InfoSphere BigInsights Enterprise Edition Efficiently manage and mine big data for valuable insights Highlights Advanced analytics for structured, semi-structured and unstructured data Professional-grade
More informationBeyond Watson: The Business Implications of Big Data
Beyond Watson: The Business Implications of Big Data Shankar Venkataraman IBM Program Director, STSM, Big Data August 10, 2011 The World is Changing and Becoming More INSTRUMENTED INTERCONNECTED INTELLIGENT
More informationIBM Software Integrating and governing big data
IBM Software big data Does big data spell big trouble for integration? Not if you follow these best practices 1 2 3 4 5 Introduction Integration and governance requirements Best practices: Integrating
More informationIBM Social Media Analytics
IBM Social Media Analytics Analyze social media data to better understand your customers and markets Highlights Understand consumer sentiment and optimize marketing campaigns. Improve the customer experience
More informationThe Future of Data Management
The Future of Data Management with Hadoop and the Enterprise Data Hub Amr Awadallah (@awadallah) Cofounder and CTO Cloudera Snapshot Founded 2008, by former employees of Employees Today ~ 800 World Class
More informationData Refinery with Big Data Aspects
International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 3, Number 7 (2013), pp. 655-662 International Research Publications House http://www. irphouse.com /ijict.htm Data
More informationPredictive analytics with System z
Predictive analytics with System z Faster, broader, more cost effective access to critical insights Highlights Optimizes high-velocity decisions that can consistently generate real business results Integrates
More informationThe Future of Business Analytics is Now! 2013 IBM Corporation
The Future of Business Analytics is Now! 1 The pressures on organizations are at a point where analytics has evolved from a business initiative to a BUSINESS IMPERATIVE More organization are using analytics
More informationOptimize data management for. smarter banking and financial markets
Optimize data management for smarter banking and financial markets 2 Flexibility, transparency, quick response times: Are you ready for the new financial environment? 1 2 and profitability Meeting customer
More informationAddressing customer analytics with effective data matching
IBM Software Information Management Addressing customer analytics with effective data matching Analyze multiple sources of operational and analytical information with IBM InfoSphere Big Match for Hadoop
More informationLeveraging innovative security solutions for government. Helping to protect government IT infrastructure, meet compliance demands and reduce costs
IBM Global Technology Services Leveraging innovative security solutions for government. Helping to protect government IT infrastructure, meet compliance demands and reduce costs Achieving a secure government
More informationIBM Social Media Analytics
IBM Analyze social media data to improve business outcomes Highlights Grow your business by understanding consumer sentiment and optimizing marketing campaigns. Make better decisions and strategies across
More informationIBM InfoSphere Optim Test Data Management Solution
IBM InfoSphere Optim Test Data Management Solution Highlights Create referentially intact, right-sized test databases Automate test result comparisons to identify hidden errors Easily refresh and maintain
More informationPreemptive security solutions for healthcare
Helping to secure critical healthcare infrastructure from internal and external IT threats, ensuring business continuity and supporting compliance requirements. Preemptive security solutions for healthcare
More informationMaking confident decisions with the full spectrum of analysis capabilities
IBM Software Business Analytics Analysis Making confident decisions with the full spectrum of analysis capabilities Making confident decisions with the full spectrum of analysis capabilities Contents 2
More informationIBM InfoSphere Optim Test Data Management solution for Oracle E-Business Suite
IBM InfoSphere Optim Test Data Management solution for Oracle E-Business Suite Streamline test-data management and deliver reliable application upgrades and enhancements Highlights Apply test-data management
More informationBusiness Analytics for Big Data
IBM Software Business Analytics Big Data Business Analytics for Big Data Unlock value to fuel performance 2 Business Analytics for Big Data Contents 2 Introduction 3 Extracting insights from big data 4
More informationIBM Content Analytics with Enterprise Search, Version 3.0
IBM Content Analytics with Enterprise Search, Version 3.0 Highlights Enables greater accuracy and control over information with sophisticated natural language processing capabilities to deliver the right
More informationWhite paper September 2009. Realizing business value with mainframe security management
White paper September 2009 Realizing business value with mainframe security management Page 2 Contents 2 Executive summary 2 Meeting today s security challenges 3 Addressing risks in the mainframe environment
More informationLeverage big data to fight claims fraud
Leverage big data to fight claims fraud How big data supports smarter approaches to addressing claims fraud Highlights Identify patterns and trends across emerging information to pinpoint fraudsters quickly
More informationBoosting enterprise security with integrated log management
IBM Software Thought Leadership White Paper May 2013 Boosting enterprise security with integrated log management Reduce security risks and improve compliance across diverse IT environments 2 Boosting enterprise
More informationReduce your data storage footprint and tame the information explosion
IBM Software White paper December 2010 Reduce your data storage footprint and tame the information explosion 2 Reduce your data storage footprint and tame the information explosion Contents 2 Executive
More informationIBM ediscovery Identification and Collection
IBM ediscovery Identification and Collection Turning unstructured data into relevant data for intelligent ediscovery Highlights Analyze data in-place with detailed data explorers to gain insight into data
More informationHow To Create An Insight Analysis For Cyber Security
IBM i2 Enterprise Insight Analysis for Cyber Analysis Protect your organization with cyber intelligence Highlights Quickly identify threats, threat actors and hidden connections with multidimensional analytics
More informationIBM Big Data Platform
IBM Big Data Platform Turning big data into smarter decisions Stefan Söderlund. IBM kundarkitekt, Försvarsmakten Sesam vår-seminarie Big Data, Bigga byte kräver Pigga Hertz! May 16, 2013 By 2015, 80% of
More informationIBM SmartCloud Monitoring
IBM SmartCloud Monitoring Gain greater visibility and optimize virtual and cloud infrastructure Highlights Enhance visibility into cloud infrastructure performance Seamlessly drill down from holistic cloud
More informationBreaking down silos of protection: An integrated approach to managing application security
IBM Software Thought Leadership White Paper October 2013 Breaking down silos of protection: An integrated approach to managing application security Protect your enterprise from the growing volume and velocity
More informationPredictive Analytics for Donor Management
IBM Software Business Analytics IBM SPSS Predictive Analytics Predictive Analytics for Donor Management Predictive Analytics for Donor Management Contents 2 Overview 3 The challenges of donor management
More informationChapter 6. Foundations of Business Intelligence: Databases and Information Management
Chapter 6 Foundations of Business Intelligence: Databases and Information Management VIDEO CASES Case 1a: City of Dubuque Uses Cloud Computing and Sensors to Build a Smarter, Sustainable City Case 1b:
More informationIBM Cognos Enterprise: Powerful and scalable business intelligence and performance management
: Powerful and scalable business intelligence and performance management Highlights Arm every user with the analytics they need to act Support the way that users want to work with their analytics Meet
More informationReducing the cost and complexity of endpoint management
IBM Software Thought Leadership White Paper October 2014 Reducing the cost and complexity of endpoint management Discover how midsized organizations can improve endpoint security, patch compliance and
More informationBig Data, Integration and Governance: Ask the Experts
Big, Integration and Governance: Ask the Experts January 29, 2013 1 The fourth dimension of Big : Veracity handling data in doubt Volume Velocity Variety Veracity* at Rest Terabytes to exabytes of existing
More informationGanzheitliches Datenmanagement
Ganzheitliches Datenmanagement für Hadoop Michael Kohs, Senior Sales Consultant @mikchaos The Problem with Big Data Projects in 2016 Relational, Mainframe Documents and Emails Data Modeler Data Scientist
More informationManaging big data for smart grids and smart meters
IBM Software White Paper Information Management Managing big data for smart grids and smart meters Meet the challenge posed by the growing volume, velocity and variety of information in the energy industry
More informationA financial software company
A financial software company Projecting USD10 million revenue lift with the IBM Netezza data warehouse appliance Overview The need A financial software company sought to analyze customer engagements to
More informationExtending security intelligence with big data solutions
IBM Software Thought Leadership White Paper January 2013 Extending security intelligence with big data solutions Leverage big data technologies to uncover actionable insights into modern, advanced data
More informationHow To Use Big Data Effectively
Why is BIG Data Important? March 2012 1 Why is BIG Data Important? A Navint Partners White Paper May 2012 Why is BIG Data Important? March 2012 2 What is Big Data? Big data is a term that refers to data
More informationThe Next Wave of Data Management. Is Big Data The New Normal?
The Next Wave of Data Management Is Big Data The New Normal? Table of Contents Introduction 3 Separating Reality and Hype 3 Why Are Firms Making IT Investments In Big Data? 4 Trends In Data Management
More informationEmpowering intelligent utility networks with visibility and control
IBM Software Energy and Utilities Thought Leadership White Paper Empowering intelligent utility networks with visibility and control IBM Intelligent Metering Network Management software solution 2 Empowering
More informationFor healthcare, change is in the air and in the cloud
IBM Software Healthcare Thought Leadership White Paper For healthcare, change is in the air and in the cloud Scalable and secure private cloud solutions can meet the challenges of healthcare transformation
More informationDatenverwaltung im Wandel - Building an Enterprise Data Hub with
Datenverwaltung im Wandel - Building an Enterprise Data Hub with Cloudera Bernard Doering Regional Director, Central EMEA, Cloudera Cloudera Your Hadoop Experts Founded 2008, by former employees of Employees
More informationWHITEPAPER. A Technical Perspective on the Talena Data Availability Management Solution
WHITEPAPER A Technical Perspective on the Talena Data Availability Management Solution BIG DATA TECHNOLOGY LANDSCAPE Over the past decade, the emergence of social media, mobile, and cloud technologies
More informationIBM Big Data in Government
IBM Big in Government Turning big data into smarter decisions Deepak Mohapatra Sr. Consultant Government IBM Software Group dmohapatra@us.ibm.com The Big Paradigm Shift 2 Big Creates A Challenge And an
More informationTaking control of the virtual image lifecycle process
IBM Software Thought Leadership White Paper March 2012 Taking control of the virtual image lifecycle process Putting virtual images to work for you 2 Taking control of the virtual image lifecycle process
More informationIBM Tivoli Netcool network management solutions for enterprise
IBM Netcool network management solutions for enterprise The big picture view that focuses on optimizing complex enterprise environments Highlights Enhance network functions in support of business goals
More information8 Steps to Holistic Database Security
Information Management White Paper 8 Steps to Holistic Database Security By Ron Ben Natan, Ph.D., IBM Distinguished Engineer, CTO for Integrated Data Management 2 8 Steps to Holistic Database Security
More informationAchieving customer loyalty with customer analytics
IBM Software Business Analytics Customer Analytics Achieving customer loyalty with customer analytics 2 Achieving customer loyalty with customer analytics Contents 2 Overview 3 Using satisfaction to drive
More informationBig Data Integration and Governance Considerations for Healthcare
White Paper Big Data Integration and Governance Considerations for Healthcare by Sunil Soares, Founder & Managing Partner, Information Asset, LLC Big Data Integration and Governance Considerations for
More informationWashington State s Use of the IBM Data Governance Unified Process Best Practices
STATS-DC 2012 Data Conference July 12, 2012 Washington State s Use of the IBM Data Governance Unified Process Best Practices Bill Huennekens Washington State Office of Superintendent of Public Instruction,
More informationIBM InfoSphere Information Server Ready to Launch for SAP Applications
IBM Information Server Ready to Launch for SAP Applications Drive greater business value and help reduce risk for SAP consolidations Highlights Provides a complete solution that couples data migration
More informationHow To Use Social Media To Improve Your Business
IBM Software Business Analytics Social Analytics Social Business Analytics Gaining business value from social media 2 Social Business Analytics Contents 2 Overview 3 Analytics as a competitive advantage
More informationBig Data Use Case Deep Dive 5 Game Changing Use Cases for Big Data
Big Data Use Case Deep Dive 5 Game Changing Use Cases for Big Data Disruptive forces impact long standing business models across industries Pressure to do more with less Shift of power to the consumer
More informationCloudera Enterprise Data Hub in Telecom:
Cloudera Enterprise Data Hub in Telecom: Three Customer Case Studies Version: 103 Table of Contents Introduction 3 Cloudera Enterprise Data Hub for Telcos 4 Cloudera Enterprise Data Hub in Telecom: Customer
More informationHow To Understand The Benefits Of Big Data
Findings from the research collaboration of IBM Institute for Business Value and Saïd Business School, University of Oxford Analytics: The real-world use of big data How innovative enterprises extract
More informationIBM DB2 Near-Line Storage Solution for SAP NetWeaver BW
IBM DB2 Near-Line Storage Solution for SAP NetWeaver BW A high-performance solution based on IBM DB2 with BLU Acceleration Highlights Help reduce costs by moving infrequently used to cost-effective systems
More information