BaiQing DIAO 2014.9.15



Similar documents
How to use Big Data in Industry 4.0 implementations. LAURI ILISON, PhD Head of Big Data and Machine Learning

Big Data Analytics. An Introduction. Oliver Fuchsberger University of Paderborn 2014

Amplify Serviceability and Productivity by integrating machine /sensor data with Data Science

Survey of Big Data Architecture and Framework from the Industry

ANALYTICS CENTER LEARNING PROGRAM

Data Science & Big Data Practice

Big Data Storage Architecture Design in Cloud Computing

CHAPTER SIX DATA. Business Intelligence The McGraw-Hill Companies, All Rights Reserved

Research of Smart Distribution Network Big Data Model

The 4 Pillars of Technosoft s Big Data Practice

Advanced Big Data Analytics with R and Hadoop

Where is... How do I get to...

Course DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Is a Data Scientist the New Quant? Stuart Kozola MathWorks

This Symposium brought to you by

Alexander Nikov. 5. Database Systems and Managing Data Resources. Learning Objectives. RR Donnelley Tries to Master Its Data

Keywords Big Data; OODBMS; RDBMS; hadoop; EDM; learning analytics, data abundance.

Digital Modernization of Oilfields Digital Oilfield to Intelligent Oilfield. Karamay Hongyou Software Co., Ltd.

Testing Big data is one of the biggest

Big Data and Analytics in Government

Transforming the Telecoms Business using Big Data and Analytics

BIG DATA IN THE CLOUD : CHALLENGES AND OPPORTUNITIES MARY- JANE SULE & PROF. MAOZHEN LI BRUNEL UNIVERSITY, LONDON

Chapter 6 8/12/2015. Foundations of Business Intelligence: Databases and Information Management. Problem:

Big Data and Advanced Analytics Technologies for the Smart Grid

Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Big Data: Image & Video Analytics

Chapter 5. Warehousing, Data Acquisition, Data. Visualization

Chapter ML:XI. XI. Cluster Analysis

Zero-in on business decisions through innovation solutions for smart big data management. How to turn volume, variety and velocity into value

TURN YOUR DATA INTO KNOWLEDGE

ANALYTICS IN BIG DATA ERA

The University of Jordan

Foundations of Business Intelligence: Databases and Information Management

Associate Professor, Department of CSE, Shri Vishnu Engineering College for Women, Andhra Pradesh, India 2

Oracle Big Data SQL Technical Update

5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014

VIEWPOINT. High Performance Analytics. Industry Context and Trends

SURVEY REPORT DATA SCIENCE SOCIETY 2014

BEHAVIOR BASED CREDIT CARD FRAUD DETECTION USING SUPPORT VECTOR MACHINES

B.Sc (Computer Science) Database Management Systems UNIT-V

Application and practice of parallel cloud computing in ISP. Guangzhou Institute of China Telecom Zhilan Huang

Big Data Use Case: Business Analytics

The WAMS Power Data Processing based on Hadoop

ORACLE UTILITIES ANALYTICS

Introduction to Data Mining

Chapter 6. Foundations of Business Intelligence: Databases and Information Management

Using reporting and data mining techniques to improve knowledge of subscribers; applications to customer profiling and fraud management

Alejandro Vaisman Esteban Zimanyi. Data. Warehouse. Systems. Design and Implementation. ^ Springer

A Knowledge Management Framework Using Business Intelligence Solutions

In-Memory Analytics for Big Data

SAP Predictive Analytics: An Overview and Roadmap. Charles Gadalla, SESSION CODE: 603

UPS battery remote monitoring system in cloud computing

Enterprise Resource Planning Analysis of Business Intelligence & Emergence of Mining Objects

TURN YOUR DATA INTO KNOWLEDGE

Leveraging Big Data Technologies to Support Research in Unstructured Data Analytics

Search and Data Mining: Techniques. Applications Anya Yarygina Boris Novikov

BIG DATA What it is and how to use?

Microsoft Services Exceed your business with Microsoft SharePoint Server 2010

Foundations of Business Intelligence: Databases and Information Management

Pentaho High-Performance Big Data Reference Configurations using Cisco Unified Computing System

BIG DATA Alignment of Supply & Demand Nuria de Lama Representative of Atos Research &

Data Refinery with Big Data Aspects

Automated Data Ingestion. Bernhard Disselhoff Enterprise Sales Engineer

The Challenge of Handling Large Data Sets within your Measurement System

Ganzheitliches Datenmanagement

Sunnie Chung. Cleveland State University

Preparing for the Future: How Asset Management Will Evolve in the Age of the Smart Grid

Advanced In-Database Analytics

Client Overview. Engagement Situation. Key Requirements

5.5 Copyright 2011 Pearson Education, Inc. publishing as Prentice Hall. Figure 5-2

The basic data mining algorithms introduced may be enhanced in a number of ways.

Technology in Action. Alan Evans Kendall Martin Mary Anne Poatsy. Eleventh Edition. Copyright 2015 Pearson Education, Inc.

ANALYTICS IN BIG DATA ERA

Reference Architecture, Requirements, Gaps, Roles

Oracle s Big Data solutions. Roger Wullschleger. <Insert Picture Here>

White Paper. How Streaming Data Analytics Enables Real-Time Decisions

Luncheon Webinar Series May 13, 2013

How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time

PRIME DIMENSIONS. Revealing insights. Shaping the future.

Advanced analytics at your hands

Course MIS. Foundations of Business Intelligence

Hortonworks & SAS. Analytics everywhere. Page 1. Hortonworks Inc All Rights Reserved

Hadoop Evolution In Organizations. Mark Vervuurt Cluster Data Science & Analytics

Augmented Search for IT Data Analytics. New frontier in big log data analysis and application intelligence

News and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren

Tax Fraud in Increasing

2015 Workshops for Professors

Outlines. Business Intelligence. What Is Business Intelligence? Data mining life cycle

Surfing the Data Tsunami: A New Paradigm for Big Data Processing and Analytics

HOW TO DO A SMART DATA PROJECT

Embracing the Cloud, Mobile, Social & Big Data

Introduction to Engineering Using Robotics Experiments Lecture 17 Big Data

Enabling the Digital Enterprise, embracing Cloud, Mobile, Social & Big Data Introducing ARIS 9.0 and webmethods 9.0

What do Big Data & HAVEn mean? Robert Lejnert HP Autonomy

Big Data: Opportunities & Challenges, Myths & Truths 資 料 來 源 : 台 大 廖 世 偉 教 授 課 程 資 料

Business Intelligence. A Presentation of the Current Lead Solutions and a Comparative Analysis of the Main Providers

Research of Postal Data mining system based on big data

Comprehensive Analytics on the Hortonworks Data Platform

MOC 20467B: Designing Business Intelligence Solutions with Microsoft SQL Server 2012

WebFOCUS RStat. RStat. Predict the Future and Make Effective Decisions Today. WebFOCUS RStat

Transcription:

Intelligent Operation Analysis and Application of Power Big Data BaiQing DIAO 2014.9.15 1

Contents Ⅰ Introduction of Power Big Data Ⅱ Application of Technology Framework Ⅲ Power Big Data Quality Management Ⅳ Data Mining and Visualization 2

Introduction of Power Big Data-Types of Business Operation data of the company includes all kinds of business information of power grid. Transmission Condition mo nitoring for transmission line HVDC Flexible Transformation Intelligent power station Distribution Distribution Automation MDS Management Utilization Electricity Consumption Information Gathering Intelligent b uilding Electric Vehicles Dispatch EMS OMS Resources Guarantee HR Science and t echnology 3

Introduction of Power Big Data- Business System With business system deployment and application, data volume increases sharply, various types of data, the data storage, handling and value mining put forward higher requirements, and the need to frame a unified data platform and data management. ERP HR System Finalcial System Materials System TB->PB Manufacturing system Storage of equipment information, maintenance and order information Data volume to TB Dispatching System Cover scope broad huge data volume high data processing performance Marketing system Data volume increases from TB to PB Amount of data from customer service senter to PB Structured Data Platform Data volume of data warehouse to TB Growth of data volume to TB every month Mass/Real time Data Platform Electricity consumption and transformation systems information gathering Data storage to TB Unstructured Data Platform Amount of data to 10 million in existing business systmes Total data storage to TB GIS Data Platform GIS data platform includes graphic data, attribute data and topological data Data storage in GIS to TB Data volume from TB to Pb, high data saving performance and extensibility Intelligent and subtilized busines s, requires real time and high data processing complexity Data mining capability in cross-business and platform needs to improve 4

Introduction of Power Big Data-Business Data Power Big Data include:structured Data Unstructured Data Mass/Real time Data and GIS Data. Structured Data Structured Data(Relational Data),Data from business applications, for horizontal sharing, vertical cascade and data analysis, etc Unstructured Data Unstructured Data(Files Data) Includes office documents, pictures, XML, HTML, videos and audios. Mass/Real time Data Mass/Real time Data(Timestamp Data) generated from the acquisition system and the condition monitoring system. GIS Data Graphic data Attribute data Topological data 5

Introduction of Power Big Data-Data Attribute 4 V : Volume Variety Velocity Value 3 E : Energy Exchange Empathy 4V Engergy Data Factory Efficiency Power Plant Power to Data Exchange Visualization Real-time Two-way interaction Exchange Empathy Reflect requirements Cross boundary Ecological benefit

Contents Ⅰ Introduction of Power Big Data Ⅱ Application of Technology Framework Ⅲ Power Big Data Quality Management Ⅳ Data Mining and Visualization 7

Application of Technology Framework BI/Report Retrieval/Vi sualization Functional Application Professional Application Analytics Application Predictive Analytics DATA Services Platform 数 据 管 理 Metadata Management Model Management Data Management And Service Data Service Data Quality Data Mining Stream Computing Relational Database Data Application Data Mining And Analysis Data Integration and Govemance Data Integration Data Analysis Data Computing Data Calculation Data Storage Data Store NoSQL Data Retrieval Parallel Computing Distributed File System Data Integration And Governance Data Governance Structural Data Unstructured Data Data Source Data Mass Source Real Time Data GIS Data 8

Key Technology 1-Data Integration Using technologies such as ETL, OGG, extraction in the distribution of structured/unstructured/mass/real-time/gis data in a business system. Data Source Structured data Unstructured data Mass/realtime data GIS data Data Extraction ETL SQL WebService Copy OPC Data Association Data filtering Data standardization Extracts features Associate metadata Data Quality Check Integrity Accuracy Timeliness Target Data Structured data Unstructure d data Mass/realtime data GIS data 9

Key Technology 2-Data Storage Improving the capacity of data storage using infrastructure such as Relational database cluster, Distributed real-time database and Distributed file System. Structured Data Management Platform Unstructured Data Management Platform Mass/real-time Data Management Platform GIS Data Management Platform Relational Database Cluster Distributed real-time database Distributed file System RDB RDB RDB Database management in-memory database Metadata management Access control NoSQL Database redundancy strategy Cloud Storage 10

Key Technology 3-Data Calculation Improving the capacity of data calculation with technology such as Parallel Processing Technology, Stream Computing Technology and Indexing Technology. Parallel Processing Technology Stream computing technology Indexing Technology 11

Key Technology 4-Data Mining Using data mining technology, to build the business analysis model, find business value. Data Mining Technology Multi-dimensional Analysis Parallel computing Cluster Association Rules MapReduce Semantic engines Data Mining Classify Memory computing Time Series Regression Analysis Data Mining Model State grid analysis Model Load forecast of distribution Model Electricity sales forecast Model User behavior Model Optimize purchasing structure Model Analytical Application Power grid state tracking Load forecast of distribution Analytical Electricity sales Analytical User behavior Analytical Intelligence knowledge base 12

Key Technology 5-Data Application and Presentation Data presentation: By using different kinds of data presentation, such as charts, GIS, video, and map. To display data analysis results through PC, large screen, tablet, and mobile devices. Display Vector Display Form Analysis and Mining Clustering analysis Predictive analytics Association Analysis Variance analysis Data Source Structured Data Unstructured Data Mass/Real time Data GIS Data 13

Key Technology 6-Data Management and Services Data management and services: By establishing data service platform, to unified manage data storage, calculation, and data mining. Data Quality Management Platform Interface verify management Code verify management Monitor class management Data Quality Management Data quality statistic Verify scene management Data interface monitoring Data interface monitoring Data quality report Verify result board Data access monitoring Data schedule management Data stream monitoring Automati c dispatch Data service management Connecti on Service Access Service Data model manageme nt Data operations manageme nt Metadata Management Master Data Management Structured Data Unstructured Data GIS Data Mass/Real time Data Metadata Master Data Integration enterprise layer management model 14

Contents Ⅰ Introduction of Power Big Data Ⅱ Application of Technology Framework Ⅲ Power Big Data Quality Management Ⅳ Data Mining and Visualization 15

Power Big Data Quality Management-Data Link Power big data come from various business systems. Through organizing data link, to make sure the management responsibility of each point in data link, to draw the data distribution and flow map. Application Storage Stream Storage Business 16

Power Big Data Quality Management- Data Link Monitoring To design monitor rule of data link points in creation, calculation, transformation, and circulation, so that to locate data quality related issues and realize closed-loop management. 17

Power Big Data Quality Management- Technical Standard Development of data quality technical standard, including data model, data access, application development, and utilization. 18 18

Power Big Data Quality Management- Management Standard Development of data quality management standard, including data connection, alteration, and utilization. 19 19

Contents Ⅰ Introduction of Power Big Data Ⅱ Application of Technology Framework Ⅲ Power Big Data Quality Management Ⅳ Data Mining and Visualization 20

Data Mining-Application Framework 11 22 44 33 1. Indicator Data mining of single indicator of historical data, find change law of indicators,monitor indicator of data quality and business tansaction. Suitable data mining algorithms: time series, linear fitting. 2. Vertical Indicator relationship The historical data of the lower layer indicator as the basis, through the data mining for the upper point of concern. Suitable data mining algorithms: Regression, associatio n rules, clustering 3. Horizontal Indicator relationship Through studying the relevance between the index of two two types of data, build the indicator data correlation network di agram, to find potential business association t o provide decision basis. Suitable data mining algorithms: correlation analysis, association rules, clustering. 21

Data Mining1-Indicator Example Short-term Power Load Forecasting based on Fuzzy Information Granulation SVM Predicted Results: [L,R,U]=[595.24,680.69,727.26] 716 703 690 669 658 648 638 637 644 646 648 635 623 619 602 604 627 637 669 667 669 693 701 698 711 712 727 723 721 708 692 713 716 718 722 710 705 725 706 718 700 670 666 676 678 690 686 723 22

Data Mining2-Vertical Indicator Relationship Example power distribution reliability (27indicators,6 notes,21 parameters) The average outage time of user Power supply reliability rate User average failure times of interruption User average scheduled interruption time Customer average interru ption time Transformer fault outage rate weight?% weight?% weight?% weight?% weight?% weight?% weight?% Customer average insufficient power supply Scene selection Indicator Set This scene aims at studying electricity average interruption duration and 27 indicators, to analyze the weights and score of power distribution reliability. To solve current business issues. The analysis based on power supply reliability indicator, by selecting the time history data of power supply reliability, to analyze the relationship between correlation factor and weight. 23

Data Mining2-Vertical Indicator Relationship Example Through factor analysis can obtain the factor of original 27 indicators: varianc e weight The average outage time of user Transform er fault outag e rate User average failure times of interruptio n 业 务 稳 定 Customer 性 分 析 average insufficient power supply Factor total value 0.066 0.052 0.048 0.040 0.5305 factor1/facto r total Factor2/fac tor total Factor3/factor total factor4/factor total Factor n/factor total 1 power distribution reli ability 0.124 The average outage time of user 0.09 Transformer fault outage rate 0.15 0.09 User average failure times of interruption Supply Radius 0.10 Service Life 0.13 Cost of Deprecia tion Facility Cost 24

Data Mining3-Hrizontal Indicator Realtionship Among the indicator rule is starting from one or more concern, research the data regularity between the two indicators and business rules, so as to construct the indicators correlation network diagram, improve business personnel s understanding of the indicator rule. Monitoring the lifting point: (1) through the positive and negative correlation between indicators provide monitoring means for monitoring the quality of data (2) through the positive and negative correlation between business state can monitor the current integrated service quality Analysis of lifting points: (1) the strength of the correlation between indicators, provide the data analysis range of thematic analysis later (2) the correlation between indicators found potential business rules, business optimization and upgrading to provide analysis basis Indicator1 Indicator2 25

Detailed monitoring and data analysis E 市 Power feature with area: The city E has 60% samples in the cluster-3, 40% samples in cluster-1. From the time trend point of view, city E has the trend to change from cluster-3 to cluster-1 recently. 10 年 11 年 12 年 13 年 10 年 11 年 12 年 13 年 10 年 11 年 12 年 13 年 聚 类 一 聚 类 二 聚 类 三 A 市 47 1 B 市 48 C 市 48 D 市 48 E 市 18 30 F 市 48 G 市 48 H 市 48 I 市 47 1 J 市 48 K 市 1 46 1 L 市 1 47 M 市 47 1 N 市 48 26

Big Data Application Technology Power load flow analysis Use the data flow diagram to show the area of enterprise power load variation with time changes. electric power load change. Usage: (1) each ribbon r epresents the load of corresponding city; (2) ribbon width represents load value or the percentage of load in the province; (3) by moving the mouse can view the load of any time and city. 27

Big Data Application Technology Treemap Analysis Using Treemap to show the detailed data of various categories. (1) the block area on behalf of the blocks represent the total applied electricity capacity; (2) using the nested model conveniently show multilevel data constitute relationship. 28

Big Data Application Technology Bubble chart analysis To access different electricity capacity increase of cities, through the analysis of the data, for each city using a bubble chart display, a bubble chart size represents electricity increase that appeared, the bubble number withi n a block represents the number of classification of the electricity increase that the staff had made. Example The chart records different electricity capacity increase of cities. As we can see, the pink one(rural residents living electricity) is the largest part of every city except city C, grey part is the second part (general business). The city in the lower right chart has few capacity increase(7 kinds) compared with other city. 29