BIG DATA IEs can play a key role in mining numbers for dramatic improvements



Similar documents
Industrial Internet & Advanced Manufacturing

Industrial Dr. Stefan Bungart

Lean manufacturing in the age of the Industrial Internet

A Hurwitz white paper. Inventing the Future. Judith Hurwitz President and CEO. Sponsored by Hitachi

From Big Data to Smart Data Thomas Hahn

Types of Engineering Jobs

Tapping the benefits of business analytics and optimization

Social Innovation through Utilization of Big Data

Optimizing Energy Operations with Machine-to-Machine Communications

decisions that are better-informed leading to long-term competitive advantage Business Intelligence solutions

I. TODAY S UTILITY INFRASTRUCTURE vs. FUTURE USE CASES...1 II. MARKET & PLATFORM REQUIREMENTS...2

Big Data, Physics, and the Industrial Internet! How Modeling & Analytics are Making the World Work Better."

Using Predictive Maintenance to Approach Zero Downtime

The Scientific Data Mining Process

Kepware Whitepaper. Enabling Big Data Benefits in Upstream Systems. Steve Sponseller, Business Director, Oil & Gas. Introduction

What is Data Mining, and How is it Useful for Power Plant Optimization? (and How is it Different from DOE, CFD, Statistical Modeling)

Smarter Energy: optimizing and integrating renewable energy resources

Why your business decisions still rely more on gut feel than data driven insights.

Asset Tracking Application Can It Drive Business Efficiencies?

IBM Executive Point of View: Transform your business with IBM Cloud Applications

Hyper-connectivity and Artificial Intelligence

Operations Management and the Integrated Manufacturing Facility

White Paper February IBM Cognos Supply Chain Analytics

Innovative Big Data Platform Revolutionizes Maritime Fleet Management

BPM for Structural Integrity Management in Oil and Gas Industry

ORACLE RAPID PLANNING

smart systems and internet of things forecast

Data Aggregation and Cloud Computing

INVENSYS OPERATIONS MANAGEMENT: AUTOMATING THE FUTURE OF A SUSTAINABLE SAN FRANCISCO. Invensys Operations Management

Healthcare Measurement Analysis Using Data mining Techniques

Igniting the Next Industrial Revolution

The hidden reality of payroll & HR administration costs

WASTEWATER TREATMENT OBJECTIVES

Real Options for the Future Energy Mix

Big Data. Fast Forward. Putting data to productive use

Smart Data THE driving force for industrial applications

Nuclear Energy: Nuclear Energy

Accenture Utilities Podcast Series A perspective on enterprise asset management in the power generation sector

WHITE PAPER ON. Operational Analytics. HTC Global Services Inc. Do not copy or distribute.

Data mining for prediction

Industrial Automation. A Manufacturing Revolution in Automotive and Industrial Equipment

STRUCTURAL HEALTH MONITORING AT ROME UNDERGROUND, ROMA, ITALY

Energy Insight from OMNETRIC Group. Achieving quality and speed in analytics with data discovery

SWAN. The Value of Online Water Network Monitoring SUMMARY

Information Visualization WS 2013/14 11 Visual Analytics

Business Process Services. White Paper. Optimizing Extended Warranty Processes by Embracing Analytics

Customer Analysis - Customer analysis is done by analyzing the customer's buying preferences, buying time, budget cycles, etc.

How To Understand Engineering

The Future of Business Analytics is Now! 2013 IBM Corporation

Renewable Energy. Winner, Loser, or Innocent Victim? by Dallas Burtraw, Joel Darmstadter, Karen Palmer, and James McVeigh

A Green Sector Overview

12.5: Generating Current Electricity pg. 518

Gerard Mc Nulty Systems Optimisation Ltd BA.,B.A.I.,C.Eng.,F.I.E.I

ARC VIEW. OSIsoft-SAP Partnership Deepens SAP s Predictive Analytics at the Plant Floor. Keywords. Summary. By Peter Reynolds

CANADIAN FRAMEWORK FOR LICENSURE

Onshore Wind Services

Patient Relationship Management

BUSINESS INTELLIGENCE MATURITY AND THE QUEST FOR BETTER PERFORMANCE

Portfolio Management 101:

Introduction To Engineering Design

Enhancing Business Performance using Integrated Visibility and Big Data

Analytics For Everyone - Even You

Managed Print Services

WHITE PAPER. The PLM Domino Effect. - Jagmeet Singh and Jeff Kavanaugh

Intelligent Terminal Automation System

Introduction. Background

Investor day. November 17, Energy business Michel Crochon Executive Vice President

Ten Critical Questions to Ask a Manufacturing ERP Vendor

T E A C H E R S N O T E S

SUSTAINING COMPETITIVE DIFFERENTIATION

Power Quality For The Digital Age INVERTING SOLAR POWER A N E N V IRONME N TA L P OT E N T I A L S W HI T E PA PER

SYSTEMS, CONTROL AND MECHATRONICS

Cisco Context-Aware Mobility Solution: Put Your Assets in Motion

Big Data Collection and Utilization for Operational Support of Smarter Social Infrastructure

Mobile Real-Time Bidding and Predictive

White Paper. Convergence of Information and Operation Technologies (IT & OT) to Build a Successful Smart Grid

Data Isn't Everything

SPATIAL DATA CLASSIFICATION AND DATA MINING

Equipment Performance Monitoring

End Small Thinking about Big Data

Energy Systems Integration

How To Change A Business Model

Business Process Management In An Application Development Environment

Generating Current Electricity: Complete the following summary table for each way that electrical energy is generated. Pros:

A Forrester Consulting Thought Leadership Paper Commissioned By Zebra Technologies. November 2014

Good morning. It is a pleasure to be with you here today to talk about the value and promise of Big Data.

Intelligent Systems: Unlocking hidden business value with data Microsoft Corporation. All Right Reserved

Core Ideas of Engineering and Technology

DEXTER. Ship Health Monitoring Software. Efficiency & Reliability. Smart Solutions for

IBM Cognos Enterprise: Powerful and scalable business intelligence and performance management

Firm Uses Internet Service Bus to Enable Smart Grid for Dynamic Energy Savings

Business Intelligence and Decision Support Systems

How To Monitor The Environment Online

Uniformance Asset Sentinel. Advanced Solutions. A real-time sentinel for continuous process performance monitoring and equipment health surveillance

Move beyond monitoring to holistic management of application performance

The Internet of Things: Connected Home, Auto, and Life

At a recent industry conference, global

Profile. Company. areas of competence

Transcription:

Break through with BIG DATA IEs can play a key role in mining numbers for dramatic improvements By Andrew Kusiak 38 Industrial Engineer www.iienet.org/iemagazine

TThe growth of data streams and data analysis will generate significant developments in the 21st century. Big data will play a key role in shaping the design of products, systems and services by making them smarter, better connected, more efficient and widely accessible. Data will become a powerful economic engine. Data analysis has been enlarging its footprint in business and engineering applications for the last two decades. In particular, the two labels of data science and big data have emerged recently, attracting attention and receiving massive press coverage. Expectations from data science are enormous. In the research community, the interest in data is not new, as it has accompanied developments in science and practice for centuries. The novelty is in the scale at which the data is available, its growth and the acknowledgement that data research has been equated to data science. The industrial engineering profession can be at the forefront of applying big data analysis and research in a number of business sectors. What is big data? The concept of big data can be illustrated by likening it to an Excel-like file. Such a file could include countless columns and rows of data. The rows of the Excel file would be generated at high frequency (data streaming). However, it needs to be realized that a big data scientist does not expect the data to appear in an Excel format. Big data sets seldom exist in a ready-to-use form. Instead, they need to be harvested. Another important aspect of big data is its acquisition, which the author calls data farming. To date, most data analysis projects have been based on the data generated for purposes other than data science research and applications. This limited scope of data constraints is what data science has to offer. To meet the expectations of data science, data farming needs to become a reality. The industrial engineering connection to data farming comes naturally through the design of experiments, which may be considered as a data farming tool. Other tools for data farming, e.g., sensor design and deployment, will emerge. Despite prolific writings about big data, there are only a handful of places where big data are readily available. Many organizations store data in volumes measured in large numbers that we are unaccustomed to. The fact that the data stores are large does not make them truly big, however. For example, deep-sequenced genetic data may include tens of thousands of parameters (columns of an Excel file); however, in a healthcare environment, such data may be available for only a few patients. On the other hand, industrial process data may have been collected for years, although the number of parameters for which the data has been collected could be limited. The question of who owns the data often is overlooked. It usually is assumed that an industry or an individual that has purchased equipment owns the data the equipment generates. Many individuals and companies may be surprised that the equipment manufacturer owns the data generated by the equipment, especially when the equipment is under warranty. Realizing the flow of data is important to the ownership of the problems and solutions that are developed through data analysis. Areas ripe for big data mining Embracing big data adds another dimension to industrial engineering research and practice. It translates into an area of intellectual and economic activity, thus creating new jobs. The fact that the term process is prominent in the definition of industrial engineering establishes a natural bridge to the phenomenon of big data. After all, the very reason for collecting process data is to reflect process behavior. Such data can be used to model processes, which in turn can be optimized. Big data modeling offers a significant advantage, as the models created can be more accurate than models created with less data. Such models may be considered digital replicas of the processes or phenomena of interest. Thus, it needs to be realized that models are created by machine learning algorithms rather than being developed by people. The models can be refreshed as frequently as needed to maintain the expected accuracy. Industrial experience in modeling stationary and nonstationary processes becomes useful in designing data-driven models. As a profession, industrial engineering stands to gain tremendously by embracing and engaging in big data research and applications. Such opportunities either exist or will emerge in the near future across many industrial and service domains. Industrial engineering can apply big data research in areas that include manufacturing and remanufacturing; product design and development; the service, energy, transportation, healthcare and biology sectors; sustainability and innovation. Manufacturing and remanufacturing. Traditional manufacturing and remanufacturing as well as emerging manufacturing technologies and systems involve data. Areas of industrial engineering activities such as continuous improvement, process modeling, production planning and scheduling, and quality improvement are naturally amenable to big data analysis. Data science offers tools for fusing diverse data steams (parameters) that would not be possible with traditional approaches such as operations research or simulation. Models involving diverse parameters could be highly accurate, which could be a boon since forecasting in any industry usually includes the chance of significant error. For example, predicting the demand for remanufactured parts and assemblies in a market where original and remanu- March 2015 Industrial Engineer 39

Break through with big data FIGURE 1 Fix it with data A data-driven model helped resolve manufacturing issues associated with rounded surfaces on wafers. Wafer surfaces with a roll-off greater issues associated than zero cannot with rounded be used surfaces for integrated on wafers. circuits. Wafer surfaces with a roll-off greater than zero cannot be used for integrated circuits. FIGURE 2 From dirty to clean The basic processes in a wastewater processing plant are just waiting for data solutions provided by industrial engineers. Screens Primary clarifier Aeration basin Secondary clarifier Effluent Raw wastewater Sediment Air Disinfection Primary sludge Blower Returned sludge Activated sludge Sludge pump To sludge treatment factured parts and assemblies are sold is difficult. Our studies have shown that data-driven models accurately predict production volumes using historical production volumes, stock prices, product reliability data and parameters as esoteric as average temperature in the region. In one case, a data-driven model combining data about product, material and process parameters dramatically reduced the number of roll-off errors in wafer production. The roll-off error illustrated in Figure 1 involves a rounded surface of a wafer that cannot be used for integrated circuits. Such a rounded surface is created from shortcomings of the wafer manufacturing process. Product design and development. Industrial engineering needs to be open to embracing all engineering areas where data offers value. Product design and development is a highly multidisciplinary process awaiting new solutions. It is widely recognized that design of an innovative product considers data streams originating at customers, experts, trails of data left by products throughout their lifetime, and cyberspace. It has been established that markets get conquered by products inspired by requirements that go beyond basic functions. Disruptive innovations often succeed because they meet this expanded set of requirements. Service. Business and computing communities have engaged in research of service systems and service computing. The industrial engineering profession may learn from the frameworks developed by these sister communities and translate them into practical applications versed in data. Text (e.g., emails), images (e.g., facial features) and numerical data (process parameters) can be combined to support decision-making in industry. A model versed in data from sources that never have been used before could revolutionize decision-making. There is no doubt that data from heterogeneous data streams are truly big data. Energy. The energy industry is undergoing significant changes, triggered largely by environmental concerns. Energy generation is becoming distributed rather than centralized as a shift to clean energy is taking place. A combustion engine or a nuclear power plant is replaced by dozens to thousands of wind turbines or solar panels, thus creating a large energy-supply network. Coordinating energy consumption by thousands or millions of households and industries with the energy supply makes it a complex optimization problem. At some level, a wind farm is not that different from a manufacturing plant. A machine tool changes the material form; a wind turbine converts the energy of the wind into electrical energy. Besides controlling, monitoring and maintenance of turbines, the energy produced needs to be managed. The 40 Industrial Engineer www.iienet.org/iemagazine

Big data, minimal changes A recent report in Forbes shows that the big data money train has yet to pull into the station. According to a survey of 226 executives, only 13 percent have fully implemented their big data programs to where they are integrating predictive insights into business operations. While 79 percent have not fully integrated their data sources across the organization, 67 percent admit that they don t have well-defined criteria to measure the success of big data. Even under such limited criteria, only 27 percent labeled their big data initiatives successful, and only 8 percent called them very successful. According to Forbes, the problems are remarkably similar to those experienced when enterprises try to integrate any advanced business technology client/server computing, ERP woes and service-oriented architecture. emerging energy model could benefit from the logic and solutions embedded in enterprise resource planning systems widely known to industrial engineering practitioners. Production system expertise is applicable to the energy industry, and this includes wind energy. Wind farms offer tremendous volumes of data that come at low cost. It is rare to see an industrial process generating as much data as wind turbines provide. This offers a new area for industrial engineering solutions. Transportation. In the transportation domain, industrial engineering has been present for years. Now is the time to integrate the data modeling and human factors perspectives in a uniform research and development approach. A car generates data from its engine and sensors installed in the cabin. It also receives data from external sources such as cameras, satellites and other vehicles. As the term autonomous vehicle gains attention, the industrial engineering community should act. Applying industrial engineering to these tools can yield better capacity utilization of highways and vehicles. How different is managing transportation assets and road capacity from that of production management? The single most differentiating factor between the transportation and production domain is in the data. Vehicles move while production machinery remains stationary. Therefore, telematic systems limit the width and frequency of the volumes of data transmitted. Determining the type of data and the frequency of data transmission from a vehicle is, in fact, a valid industrial engineering problem. Making use of the data attributed to a transportation network is a great challenge. Sustainability. Industrial engineers could not find a more interesting area of research and practice than sustainability. Partnerships could be formed with civil and geo-informatics communities to create data-driven models involving water, soil and climate applications. One of many applications awaiting attention is performance optimization of wastewater processing plants. The basic processes involved in these plants are shown in Figure 2. A wastewater processing plant involves processes that consume energy, and it also produces energy (methane gas). It is not unusual to see at least 10,000 parameters monitored at different time scales. This is a large system that needs big data solutions due to the diversity of different processes that have to work seamlessly. This industry is large in numbers, as there are municipal and industrial facilities. Facilities that process clean drinking water usually operate independently of wastewater plants and use different processes. In countries facing water shortages, such as Singapore, it is not unusual to see the waste- and clean-water processing plants combined into one, with wastewater flowing in and usable water flowing out of the plant. Healthcare and biology. The industrial engineering profession has a long history of engagement in healthcare research and practice, particularly involving process improvement, safety and information technology. Embracing the data perspective offers the opportunity to enhance clinical research and applications. Healthcare awaits solutions for disease diagnosis, treatment optimization and process improvement to benefit patients. Embracing clinical and genetic data, including data from deep-sequencing equipment, offers great opportunities for implementing process and modeling perspectives. Industrial engineering research in medical decision-making has been visible for decades, mostly in administrative domains. The key challenges ahead for medical research include discovering knowledge from all of the data gathered and making use March 2015 Industrial Engineer 41

Break through with big data FIGURE 3 Innovation framework Analyzing and mining data could yield knowledge that would improve the process of innovation. In many ways, a wind farm is like a manufacturing plant, as turbines need to be controlled, monitored and maintained and the energy produced needs to be managed. of this data in clinical practice. Such data may include patient data, medical tests, imaging and genetic data. This new information represents dozens to even thousands of parameters. The new knowledge and models discovered from medical data could lead to dramatic productivity and quality improvements, thus making the healthcare system sustainable. Innovation. Innovation has been studied for decades; however, long awaited breakthroughs are still to come. The four data sources discussed in the product design and development section are of great interest to innovation, particularly the data trail attributed to a product or a system over its lifetime and the data distributed in cyberspace, as shown in Figure 3. Analyzing and mining such data would enhance innovation. Some industries use data in fostering innovation, while most do not use it at all. Any area of economic significance is usually backed up with science. For example, management has management science and manufacturing has manufacturing science. Though innovation appears to have the highest impact on economic output, its science is missing. One comes across methods, rules and tools supporting innovation; however, a unified theory of innovation is missing. The number of publications on innovation topics is growing rapidly, offering a hope for establishing innovation science. The fact that meaningful innovations are rare can be interpreted to mean that our innovation processes carry a huge error. Depending on the assumptions and area of economic activity, this error is estimated to be as high as 95 percent and can even exceed 99 percent. Innovation needs a science that would reduce this error dramatically. This large error suggests that massive experimentation is needed in the generation of high-impact solutions. Any innovative solution needs to be accepted by the market, which implies that a market model versed in data and allowing for massive simulations would lead to disruptive and continuous innovations. Simulation and industrial engineering go hand in hand. Opportunities to imagine Systematic studies and imagination aimed at improving processes and outcomes by better use of data are issues that are worth the attention of the industrial engineering profession. Big data has not been explored to its fullest, and IEs could be the first to power the data-driven economic engine. Opportunities for discovery are plentiful. Even novice IEs introduced to a technology or research area could have a meaningful impact on products, processes and services by taking advantage of the value hidden in big data. Andrew Kusiak is professor and chair of the Department of Mechanical and Industrial Engineering at the University of Iowa. He previously served as associate professor at the University of Manitoba and assistant professor at the Technical University of Nova Scotia. 42 Industrial Engineer www.iienet.org/iemagazine