TECHNOLOGY ANALYSIS FOR INTERNET OF THINGS USING BIG DATA LEARNING



Similar documents
Patent Big Data Analysis by R Data Language for Technology Management

A Statistical Text Mining Method for Patent Analysis

A Divided Regression Analysis for Big Data

Developing the SMEs Innovative Capacity Using a Big Data Approach

From Data to Insight: Big Data and Analytics for Smart Manufacturing Systems

The emergence of big data technology and analytics

Development of Taekwondo Trainer System for Training on Electronic Protector with Hitting Target Indicator

Indian Journal of Science The International Journal for Science ISSN EISSN Discovery Publication. All Rights Reserved

Dept. of Financial Information Security

A Study on Quantitative Analysis for New Product Development Strategy

Development of CEP System based on Big Data Analysis Techniques and Its Application

Data Mining in Web Search Engine Optimization and User Assisted Rank Results

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

Performance Evaluation of Large-Scale Wireless Sensor Networks Communication Protocols that can be Integrated in a Smart City

3rd International Symposium on Big Data and Cloud Computing Challenges (ISBCC-2016) March 10-11, 2016 VIT University, Chennai, India

Internet of Things (IoT): A vision, architectural elements, and future directions

ASSOCIATION RULE MINING ON WEB LOGS FOR EXTRACTING INTERESTING PATTERNS THROUGH WEKA TOOL

Horizontal IoT Application Development using Semantic Web Technologies

Business Intelligence and Big Data Analytics: An Overview

Exchange of Data for Big Data in Hybrid Cloud Environment

How To Filter Spam Image From A Picture By Color Or Color

Systems of Discovery The Perfect Storm of Big Data, Cloud and Internet-of-Things

ECIS 2017 CALL FOR TRACKS

Enhancing MapReduce Functionality for Optimizing Workloads on Data Centers

Use of Data Mining Techniques to Improve the Effectiveness of Sales and Marketing

IEEE IoT IoT Scenario & Use Cases: Social Sensors

Technology offer: Machine-to-Cloud Management System of Distributed Heterogeneous Devices

DBaaS Using HL7 Based on XMDR-DAI for Medical Information Sharing in Cloud

Pattern Recognition Using Feature Based Die-Map Clusteringin the Semiconductor Manufacturing Process

Single Level Drill Down Interactive Visualization Technique for Descriptive Data Mining Results

FUTURE RESEARCH DIRECTIONS OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING *

Improving Data Processing Speed in Big Data Analytics Using. HDFS Method

Big Data Framework for u-healthcare System. Tae-Woong Kim 1, Jai-Hyun Seu 2.

A STUDY REGARDING INTER DOMAIN LINKED DOCUMENTS SIMILARITY AND THEIR CONSEQUENT BOUNCE RATE

Chapter 17: M2M-Based Metropolitan Platform for IMS-Enabled Road Traffic Management in IoT

The Development of an Intellectual Tracking App System based on IoT and RTLS

A Big Data Analytical Framework For Portfolio Optimization Abstract. Keywords. 1. Introduction

Big Data Analytics of Multi-Relationship Online Social Network Based on Multi-Subnet Composited Complex Network

Keywords cosine similarity, correlation, standard deviation, page count, Enron dataset

Biomarker Discovery and Data Visualization Tool for Ovarian Cancer Screening

IEEE International Conference on Computing, Analytics and Security Trends CAST-2016 (19 21 December, 2016) Call for Paper

IMAV: An Intelligent Multi-Agent Model Based on Cloud Computing for Resource Virtualization

A Survey on Web Mining From Web Server Log

Data Mining Solutions for the Business Environment

A Log Analysis System with REST Web Services for Desktop Grids and its Application to Resource Group-based Task Scheduling

M2M Communications and Internet of Things for Smart Cities. Soumya Kanti Datta Mobile Communications Dept.

Smart Integrated Multiple Tracking System Development for IOT based Target-oriented Logistics Location and Resource Service

DATA MINING TECHNOLOGY. Keywords: data mining, data warehouse, knowledge discovery, OLAP, OLAM.

Internet of Things Value Proposition for Europe

How To Understand The Power Of The Internet Of Things

An Approach to Implement Map Reduce with NoSQL Databases

International Journal of Advanced Research in Computer Science and Software Engineering

Semantic Concept Based Retrieval of Software Bug Report with Feedback

BIG DATA IN SUPPLY CHAIN MANAGEMENT: AN EXPLORATORY STUDY

White Paper. How Streaming Data Analytics Enables Real-Time Decisions

Sentiment Analysis on Big Data

IoT R&I on IoT integration and platforms INTERNET OF THINGS FOCUS AREA

Open Access Research and Design for Mobile Terminal-Based on Smart Home System

From Big Data to Smart Data

Overview of the Internet of things

New Matrix Approach to Improve Apriori Algorithm

Big Data: Study in Structured and Unstructured Data

Investigative Research on Big Data: An Analysis

FUZZY CLUSTERING ANALYSIS OF DATA MINING: APPLICATION TO AN ACCIDENT MINING SYSTEM

A GENERAL TAXONOMY FOR VISUALIZATION OF PREDICTIVE SOCIAL MEDIA ANALYTICS

TCM: Transactional Completeness Measure based vulnerability analysis for Business Intelligence Support

Tweets Miner for Stock Market Analysis

Using Data Mining for Mobile Communication Clustering and Characterization

Spam Detection Using Customized SimHash Function

PSG College of Technology, Coimbatore Department of Computer & Information Sciences BSc (CT) G1 & G2 Sixth Semester PROJECT DETAILS.

Augmented Search for Web Applications. New frontier in big log data analysis and application intelligence

A Grid Architecture for Manufacturing Database System

A MEDICAL HEALTH CARE SYSTEM WITH HIGH SECURITY USING ANDROID APPLICATION

Exploitation of Server Log Files of User Behavior in Order to Inform Administrator

NETCONF-based Integrated Management for Internet of Things using RESTful Web Services

Aspect Oriented Strategy to model the Examination Management Systems

A RFID Data-Cleaning Algorithm Based on Communication Information among RFID Readers

Analysis of Internet Topologies: A Historical View

WEB SITE OPTIMIZATION THROUGH MINING USER NAVIGATIONAL PATTERNS

Distributed Framework for Data Mining As a Service on Private Cloud

Internet of Things: IoT Day Special Edition

The Application Method of CRM as Big Data: Focused on the Car Maintenance Industry

Healthcare Measurement Analysis Using Data mining Techniques

EL Program: Smart Manufacturing Systems Design and Analysis

Aggregation Methodology on Map Reduce for Big Data Applications by using Traffic-Aware Partition Algorithm

A Novel Method to Improve Resolution of Satellite Images Using DWT and Interpolation

Cloud Storage Solution for WSN Based on Internet Innovation Union

Efficient Data Recovery scheme in PTS-Based OFDM systems with MATRIX Formulation

Transcription:

TECHNOLOGY ANALYSIS FOR INTERNET OF THINGS USING BIG DATA LEARNING Sunghae Jun 1 1 Professor, Department of Statistics, Cheongju University, Chungbuk, Korea Abstract The internet of things (IoT) is an internet among things through advanced communication protocols without human s operation. The main idea of IoT is to reach and improve the common goal by their intelligent networking. The IoT is an integrated technology of several sub technologies, such as wireless sensor or semantic. The technology of IoT has been evolved according to the environment based on information communication technology and social infrastructure. So we need to know the technological evolution of IoT in the future. In this paper, we analyze the IoT technology to find its technological relationship. We use patents and papers related to the IoT, and consider big data learning as tool for the IoT technology analysis. To verify the performance of our proposed methodology, we perform and show our case study using collected the patent and paper documents. Keywords: Big data learning, Internet of things, Technology analysis, and Patent analysis --------------------------------------------------------------------***---------------------------------------------------------------------- 1. INTRODUCTION Since the ubiquitous computing was introduced in the late 1980s, the technologies of communication between humans and things, or between things and things have been researched and developed rapidly [1]. The technologies of internet of things (IoT) that the things can communicate their information each other without the help of human have been spread at the beginning of the 2000s [2]. Many researches of IoT have been performed in diverse academic and industrial fields, and the developed results have been published as papers and applied as patents [3]. The papers of IoT have been published for a long time. By comparison, the patents of IoT have been applied and registered relatively recently. So, we can get the most recent information of IoT technology from the published journal and conference papers. But in general, the paper has the limitation of developed technology because of its characteristics. On the contrary to this, a patent have detailed and complete information of developed technology because of its exclusive right for the technology in applied and registered patent. In this paper, we consider both paper and patent data for technology analysis of IoT. The analytical result of IoT paper data gives the newest trend about IoT technology. Also, we get more detailed result from IoT patent data analysis. To analyze the paper and patent data, we use big data learning (BDL). The BDL includes statistics and machine learning algorithms. Next section shows the related works which are the introductory IoT and BDL. In section 3, we propose our methodology for IoT technology analysis using BDL, we also show the experimental result of proposed methodology in this section. Our conclusion is shown in section 4. 2. RELATED WORKS The research backgrounds related to our study are IoT and BDL. First we show the description of IoT. 2.1 Internet of Things We are interested in how the things with intelligence are connected each other and acquire new knowledge, finally provide novel value to us. This is IoT which is a system to make what we want to by networking between intelligent things on the internet without human s intervention [2]. In addition, the IoT is a networking of thing space based on censor, network, and information over distributed environment [3]. We should develop the technologies of sensor, network, and interface for the IoT. In this paper, we analyze the technologies related to IoT for its technological evolution. So, we use patent and paper data for technology analysis of IoT. 2.2 Big Data Learning We are in the age of big data [4],[5]. Big data has three typical characteristics which are volume, variety, and velocity [6]. That is, the size of big data is extremely large, the data types of big data are diverse such as number, text, and figure. In addition, the data processing of big data is so rapidly. So we should consider these characteristics when we analyze the big data. The BDL is a technique of learning from big data [7]. BDL is distinguished from traditional data analysis such as statistics. We have to transform the big data into a structured data. This is suitable to data analysis using statistics and machine learning algorithm. And then we analyze the structured data by learning tools which are statistics and machine learning. Finally, we apply the results to real domain for optimal decision. In this paper, we consider paper and patent to big data because the paper and patent data include number, text, and figure, also they are so large and the velocity of their publication and application is rapidly. The Figure 1 shows the process of BDL. Volume: 03 Issue: 12 Dec-2014, Available @ http://www.ijret.org 314

Fig -1: BDL process Using BDL process, we can do optimal decision from big data. Next we propose a methodology for IoT technology analysis using BDL 3. IoT TECHNOLOGY ANALYSIS USING BIG DATA LEARNING In this paper, we use the patent and paper data for IoT technology analysis. We also perform the BDL based on statistics and machine learning algorithm to analyze the data. Figure 2 shows the process of IoT technology analysis using BDL. data into structured data which is called as document-term matrix [11]. This matrix consists of document and term as row and column respectively. Each cell is the occurred frequency of a term in a document. For this transformation, we use text mining techniques and social analysis by tm and sna packages of the R project [11],[12]. Next we extract high ranked keywords by descriptive statistics for keyword analysis. In our research, we can have efficient R&D planning for IoT using the result of keyword analysis. For our case study, we collected the patents and papers related to IoT from KIPRIS and IEEE respectively [8],[9]. First of all, we retrieved total 1,517 patent documents from the patent database which were consisted of the nations of the United States (19 patents), Europe (6 patents), the Patent Cooperation Treaty (PCT) (46 patents), Japan (1 patent), China (1441 patents), the United Kingdom (2 patents), and Taiwan (2 patents). The number of applied patents by year is shown as follow. Chart -1: Number of patents by year We found most technologies related to IoT were developed recently. Also, from the year 2010, the number of applied and registered patents has been increased rapidly. To analyze the patent documents, we constructed patent-term matrix. Using text mining preprocessing, we made this matrix, and the dimension of this matrix was 1517 by 1833. That is, the number of occurred terms was 1833. We extracted top ranked keywords from the patent-term matrix as follow. Fig -2: IoT technology analysis by BDL First we search patent and paper documents related to IoT from patent and paper databases such as KIPRIS, USPTO or IEEE xplore [8],[9],[10]. These data are big data because they include text, number, and others. So we can apply the BDL to analyze patent and paper data. In general, patent and paper consist of texts mainly, and these are not suitable to statistics and machine learning. So we transform the text Rank Table -1: Top Ranked Keywords: Patent Frequent Term 1 st Control, data, device, information, intelligent, management, monitoring, system, technology 2 nd Terminal 3 rd Gateway, equipment 4 th Communication, platform, service 5 th Access, network, wireless Volume: 03 Issue: 12 Dec-2014, Available @ http://www.ijret.org 315

From this result, we found that the technologies based on the IoT patents were focused on the actual usage and service, because the first ranked terms contained control, device, system, and management. Also the IoT patents showed the importance of the technologies with intelligent, wireless, platform, and terminal. Next we considered another technology analysis using papers related to IoT. We retrieved them from IEEE journal site. We selected 100 papers and make paper-term matrix. This matrix consisted of 3375 terms. Like the case of patent data, we extracted top ranked keywords from the matrix as follow. technology of IoT [13]. The technologies of analytical model for social and security frameworks, and smart and semantic architecture for sensor and application were researched heavily in academic field. To know more detailed relationship between the keywords of IoT in patent document data, we structured a SNA graph using correlation coefficients of the keywords. Figure 3 shows the SNA result of IoT patent data. Table -2: Top Ranked Keywords: Paper Rank Frequent Term 1 st Application, information, network 2 nd Data, system, technology, smart 3 rd Management, architecture, communication 4 th Model, framework, social, use, device, object, security 5 th Service, cloud, analysis, design, global, semantic, integration, sensor The IoT papers had the technologies based on application, information, and network. These terms were the first ranked keywords for IoT technology. This shows more scientific point of view than the IoT technology based on the patents. In addition, the IoT papers contain the technologies of model, social, security, cloud, analysis, design, global, semantic, integration, and sensor. Next we combined the keyword results of patent and paper cases for finding keyword distribution of IoT technology. Table 3 shows the keyword distribution of IoT patents and papers. Data Patent & Paper Patent Paper Table -3: Keyword Distribution Frequent Term Data, device, information, management, system, technology, communication, network, service Control, intelligent, monitoring, terminal, gateway, equipment, platform, access, wireless Application, smart, architecture, model, framework, social, use, object, security, cloud, analysis, design, global, semantic, integration, sensor Fig -3: SNA graph based on correlation of IoT patents The technologies of gateway, access, and monitoring are developed collaboratively. We found the technology of wireless is located isolated. So, we knew this technology is developed independently. The other technologies are linearly related. Also the technology of intelligent is intermediate technology of IoT, because it is in center of SNA graph. Next figure shows SNA graph of IoT paper using the correlation matrix of its keywords. From the frequent terms included in patent and paper at the same time, we can know the following technologies are researched and developed in academic and industrial fields concurrently; data and information, device and system, management and service, and communication and network. Also we can find the industrial trend of IoT technology from the frequent terms of patent keyword distribution. Using the frequent terms of IoT patents, we decided the technologies of intelligent platform for controlling and monitoring wireless access was popular issues for industrial domain of IoT. Especially the monitoring technology is important in the wireless Fig -4: SNA graph based on correlation of IoT papers Volume: 03 Issue: 12 Dec-2014, Available @ http://www.ijret.org 316

We found that this SNA graph had three components. The first component consists of cloud, model, semantic, use, architecture, and sensor. These represent the technology of system and architecture of IoT. The terms of smart, design, application, security, framework, analysis, integration, social, object are assigned to the second component. This component shows the technology of IoT design and application. The last component is global, but we cannot add a technological meaning to this component. Also the first and second components were connected by the terms of sensor, and smart. Next two figures show other SNA plots by keyword matrix for IoT technology analysis. Figure 5 is a SNA graph using IoT keyword matrix. Fig -5: SNA graph based on keyword matrix of IoT patents Using the result of above matrix, we decided that the terms of monitoring, and access were located in central areas on the IoT. So, in the patent aspect of IoT, the technology of monitoring and accessing can be considered as meaningful technology. Figure 6 shows another SNA graph of IoT keyword matrix in paper data. Fig -6: SNA graph based on keyword matrix of IoT papers Volume: 03 Issue: 12 Dec-2014, Available @ http://www.ijret.org 317

The terms of smart, model, analysis, framework, design, architecture, and application were regarded as central meanings from the point of view on IoT papers. In this experiment, we found that there were shown more diverse studies in paper than patent. 4. CONCLUSION In this paper, we analyzed the patents and papers related to IoT technology. We collected the IoT patent and paper data from KIPRIS and IEEE. Using the text mining preprocessing, we made structured data for descriptive statistics, and SNA. Also the results of statistics and SNA graph provided the understanding the technological relationship in IoT technology. From our study, we knew the main researched and developed areas on IoT technology were about the following two aspects. The first is the technology of system and architecture for IoT design. The second is the technology of monitoring and accessing between objects in the IoT environment. There are so many methods for technology analysis based on big data learning. In our future studies, we will apply them to the technology analysis works as well as IoT technology analysis. [12]. C. T. Butts, (2008) Social Network Analysis with sna, Journal of Statistical Software, Vol. 24, Iss. 6, pp. 1-51. [13]. P. S. Kumar, and V. A. Kumar, (2014) An Efficient Monitoring System for Sports Person Using Wi-Fi Communication, International Journal of Research in Engineering and Technology, Vol. 3, Iss. 11, pp. 20-23. BIOGRAPHIES Sunghae Jun is a professor of Statistics at Cheongju University, Korea. He received the BS, MS, and PhD degrees in department of Statistics, Inha University, Korea, in 1993, 1996, and 2001. Also, he received PhD degree in department of Computer Science, Sogang University, Korea in 2007. Lastly he got PhD degree in department of Information Management & Security, Korea University, Korea in 2013. From 2009 to 2010, he was a visiting scholar in department of Statistics, Oklahoma State University, the United States. He has researched big data and management of technology. REFERENCES [1]. D. Bandyopadhyay, and J. Sen, (2011) Internet of Things: Applications and Challengers in Technology and Standardization, Wireless Pers Commum Springer, Vol. 58, pp. 49-69. [2]. L. Atzori, A. Iera, and G. Morabito, (2010) The Internet of Things: A survey, Computer Networks, Vol. 54, Iss. 15, pp. 2787-2805. [3]. L. Atzori, A. Iera, G. Morabito, and M. Nitti, (2012) The Social Internet of Things (SIoT) When social networks meet the Internet of Things: Concept, architecture and network characterization, Computer Networks, Vol. 56, Iss. 16, pp. 3594-3608. [4]. S. Lohr, (2012) The Age of Big Data, The New York Times, February 11, 2012. [5]. N. R. Shikalgar, and D. Badgujar, (2013) Online Review Mining for Forecasting Sales, International Journal of Research in Engineering and Technology, Vol. 2, Iss. 12, pp. 53-55. [6]. J. Manyika, M. Chui, B. Brown, J. Bughin, R. Dobbs, C. Roxburgh, and A. H. Byers, (2011) Big data: The next frontier for innovation, competition, and productivity, McKinsey Global Institute. [7]. S. Jun, and J. Choi, (2014) Big Data and Statistics, What s the Connection? Proceedings on KIIS Spring Conference, Vol. 24, No. 1, pp. 183-184. [8]. KIPRIS (2014) Korea Intellectual Property Rights Information Service, http://www.kipris.or.kr. [9]. IEEE (2014) IEEE xplore, http://ieeexplore.ieee.org. [10]. USPTO (2014) The United States Patent and Trademark Office, http://www.uspto.gov. [11]. I. Feinerer, K. Hornik, and D. Meyer, (2008) Text Mining Infrastructure in R, Journal of Statistical Software, Vol. 25, Iss. 5, pp. 1-54. Volume: 03 Issue: 12 Dec-2014, Available @ http://www.ijret.org 318