Open Access Research on Database Massive Data Processing and Mining Method based on Hadoop Cloud Platform
The Open Automation and Control Systems Journal, 2014, Volume 6. Open Access.

Zhao Xiaoyong* and Yang Chunrong
Mathematics and Computer Science Institute, XinYu University, JiangXi, China

Abstract. This paper establishes a mathematical model and algorithm for massive data processing in cloud computing, and introduces the Hadoop distributed computing method into the database management system to realize automatic partitioning of database data and configuration of master-slave nodes. The master-slave node distributed algorithm is implemented in MATLAB to realize distributed data computing, and numerical simulation yields the data processing speed, transmission rate, capacity and other system parameters. A comparison of the Hadoop distributed processing algorithm with two traditional data processing algorithms shows that the Hadoop algorithm processes data faster and offers greater information storage capacity and transmission speed, which satisfies the needs of high-volume data processing.

Keywords: Hadoop; Cloud computing; Massive data; Large capacity; Distributed computing; Master-slave node

1. INTRODUCTION

With the development of computer and internet communication technology, communication systems need to handle very large volumes of data. When processing massive data, a server consumes a large amount of computing resources, and many traditional computing platforms cannot complete the task [1,2]. Using the distributed computing capability of Hadoop together with cloud computing and cloud storage technology, a cloud data processing platform for high-speed processing of massive data is designed [3].
At the same time, combined with automatic assignment of master-slave nodes, the platform uses an automatically partitioned database to achieve high data capacity and high transmission speed, which provides a new scheme for massive data processing algorithms and data communication technology.

2. OVERVIEW OF HADOOP CLOUD PLATFORM DATABASE MASSIVE DATA PROCESSING AND MINING METHODS

The evolution of the personalized internet generates massive data, and the traditional single super server has gradually fallen short in the face of it, so processing massive data has become a thorny problem [4]. The open-source Hadoop cloud platform developed by the Apache Foundation opens up many possibilities for processing huge amounts of data. This paper uses the Hadoop data processing platform, combined with a distributed data processing algorithm, to establish a massive database data processing platform; the main structure is shown in Fig. (1).

Fig. (1) shows the schematic diagram of Hadoop massive data processing. It can be seen that massive data processing on Hadoop mainly consists of two techniques, the design of the allocation computing algorithm and master-slave node distribution, which together achieve high-speed processing of massive database data.

[Figure: flow from Massive database, through Hadoop database processing algorithm, Distributed computing method and Master-slave nodes distribution, to Completing data processing.]

Fig. (1). Hadoop massive data processing.
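The flow in Fig. (1), from partitioning the database to combining the results returned by the slave nodes, can be illustrated with a minimal single-machine MATLAB sketch. It is not the authors' program and does not call Hadoop; the names data, numSlaves, edges and partials, the choice of four slave nodes and the sum-of-squares workload are all assumptions made only for illustration.

```matlab
% Illustrative sketch only: emulates the master-slave flow of Fig. (1) on one machine.
data = 1:20;          % assumed stand-in for the database records
numSlaves = 4;        % assumed number of slave nodes

% Master node: split the records into contiguous partitions, one per slave.
edges = round(linspace(0, numel(data), numSlaves + 1));

% Slave nodes: each processes only its own partition (here, a sum of squares).
partials = zeros(1, numSlaves);
for k = 1:numSlaves
    part = data(edges(k) + 1 : edges(k + 1));
    partials(k) = sum(part .^ 2);
end

% Master node: combine the partial results to complete the processing.
total = sum(partials);
assert(total == sum(data .^ 2));   % same result as processing the data in one piece
fprintf('partial results: %s\n', mat2str(partials));
fprintf('combined result: %d\n', total);
```

On a real cluster the loop body would run on separate slave nodes, and the partitions would typically correspond to blocks of a distributed file rather than index ranges of an in-memory array.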
[Figure: data blocks labeled Part1, Part2, Part3, Part4 and Part5 feeding Reduce stages.]

Fig. (2). Massive data cloud processing Hadoop platform.

3. DESIGN OF DATABASE MASSIVE DATA PROCESSING AND DATA MINING MATHEMATICAL MODEL

Massive image processing is a form of large-scale data processing. It places high demands not only on processing power but also on storage space, and general-purpose storage has difficulty meeting the capacity requirements of image processing [5, 6]. This paper adopts the Hadoop distributed processing algorithm, in which the data space is partitioned to realize high-speed processing of massive data. The database data are divided into intervals $[w_i, w_{i+1}]$, and the two-dimensional Hadoop data processing function of the database is shown in formula (1).

$$\begin{cases} M(m,n) = x_1 + x_2 m + x_3 n + x_4 mn \\ N(m,n) = y_1 + y_2 m + y_3 n + y_4 mn \end{cases} \tag{1}$$

Assume that the memory allocation value is

$$E^{T} = \begin{bmatrix} E_1, & E_2, & E_3, & E_4 \end{bmatrix}. \tag{2}$$

Then formula (1) and formula (2) can be written in matrix form:

$$e(m,n) = \Phi x, \tag{3}$$

where

$$\Phi = \begin{bmatrix} \varphi & 0 \\ 0 & \varphi \end{bmatrix}. \tag{4}$$

For each storage node,

$$Z = ZR. \tag{5}$$

The generalized coordinate is assumed to be

$$F = H^{-1} x. \tag{6}$$

The distributed node calculation formula is then

$$X(x,y) = a(x,y)\,x. \tag{7}$$

In order to realize cloud computing and distributed computing, the master-slave node distributed computing program is written in MATLAB; the main procedures are as follows [7]:

    % Script part: integrate the three-state system on [t0, tf] and plot each state.
    % (Local functions in a script require MATLAB R2016b or later.)
    t0 = 0; tf = 12;
    [T, Y] = ode45(@rigid, [t0 tf], [0 1 1]);
    plot(T, Y(:,1), '-', T, Y(:,2), '*', T, Y(:,3), '+')

    % Two-state stiff system (van der Pol form with coefficient 1000);
    % the listing gives its interval as t0 = 0, tf = 8000.
    function dy = vdp100000(t, y)
        dy = zeros(2,1);
        dy(1) = y(2);
        dy(2) = 1000*(1 - y(1)^2)*y(2) - y(1);
    end

    % Three-state system integrated by the script part above.
    function dy = rigid(t, y)
        dy = zeros(3,1);
        dy(1) = y(2)*y(3);
        dy(2) = -y(1)*y(3);
        dy(3) = -0.51*y(1)*y(2);
    end

4. DESIGN OF HADOOP CLOUD PLATFORM DATABASE MASSIVE DATA PROCESSING SYSTEM

In order to verify the validity and reliability of the massive data processing mathematical model and algorithm of Section 3, this paper uses MATLAB to simulate massive data processing and designs the Hadoop cloud data processing platform, whose main framework is shown in Fig. (2). As shown in Fig. (2), the Hadoop cloud computing platform is mainly composed of three technologies, including the distributed parallel computing framework MapReduce,
the distributed file system HDFS and the distributed database system HBase.

As shown in Fig. (3), master-slave node design is the main mode by which Hadoop implements distributed computing and data storage: each database cluster has one master node and multiple slave nodes, and the daemons running on the master node include NameNode, SecondaryNameNode and JobTracker.

[Figure: blocks labeled Browser, Hadoop utility tool, Secondary, JVM, JobTracker, Win operating system, Server.]

Fig. (3). The schematic diagram of Hadoop master node design.

As shown in Fig. (4), there are many slave nodes in the Hadoop massive data cloud processing system, which realize the distributed data processing function [8]. The daemons running on a slave node are DataNode and TaskTracker.

[Figure: blocks labeled Secondary, JobTracker, JVM, Linux operating system, Server.]

Fig. (4). The schematic diagram of Hadoop slave node design.

Fig. (5). The schematic diagram of Hadoop and Hbase running.

Fig. (5) shows Hadoop and HBase running simultaneously, which makes it possible to lay out the HBase information and the data partitions. This
paper uses the browser Web log to process massive data; the opening of the master node is shown in Fig. (6).

Fig. (6). Master node opening.

Fig. (6) shows the opening of the master node, which is the key to implementing Hadoop database massive data processing [9,10]. In order to verify the validity and reliability of the system, simulation is used to obtain the calculation performance shown in Table 1. Table 1 lists the simulation parameters of the Hadoop cloud massive data processing platform [11]. It can be seen from the table that Hadoop data processing reaches a high level of performance: the minimum calculated data rate is 0.001 Mb/s, the minimum information capacity is 100 Gb and the speed is 1800 frames/s, all of which meet the design requirements.

In order to compare the effect of different algorithms on massive data, this paper compares two traditional algorithms with the Hadoop cloud processing algorithm, with the amount of data growing from dozens of megabytes initially to several hundred megabytes [12, 13]. As shown in Fig. (7), the MATLAB numerical simulation shows that the computing speed of the Hadoop cloud processing platform is significantly higher than that of the two traditional algorithms. The advantage is most pronounced once the data volume reaches the 10^5 level, where processing time is reduced by a factor of several, which makes it a highly efficient data processing algorithm.

Fig. (7). The massive data processing computing time simulation curve.

Table 1. The massive data processing cloud platform simulation parameters.
Main Simulation Parameters        Numerical
Calculated data Min (Mb/s)        0.001
Calculated data Max (Mb/s)        0.01
Information capacity Min (Gb)     100
Information capacity Max (Gb)
Speed (frame/s)                   1800

SUMMARY

Massive image processing places very high demands on the processor and memory, and requires a high-performance processor and large-capacity memory, because
the traditional single-processor or single-core processing and ordinary memory can no longer satisfy the needs of image processing. In this paper, cloud computing is introduced into the massive data processing system, and high-speed data processing is then realized through Hadoop distributed computing combined with automatic distribution of master-slave nodes. Finally, a comparison of the Hadoop distributed processing algorithm with two traditional data processing algorithms shows that the Hadoop algorithm processes data faster and has a higher information capacity. It provides a new method for studying data processing schemes that require large data volumes, high capacity and high transmission rates.

CONFLICT OF INTEREST

The authors confirm that this article content has no conflicts of interest.

ACKNOWLEDGEMENTS

Declared none.

REFERENCES

[1] Wang Wei, Jiang Lina. Research on cloud computing security requirements analysis [J]. Information network security, 2012(1).
[2] Sun Fuquan, Zhang Dawei, Cheng Xu, Liu Chao. Construction of enterprise private cloud storage platform based on Hadoop [J]. Journal of Liaoning Technical University, 2011(3).
[3] Zhou Zichen, Shen Zhenning. The power supply integrity design method in high speed embedded system [J]. Microcontroller and embedded application, 2010(3).
[4] Kuang Shenghui, Li Bo. Analysis of cloud computing architecture and its application case [J]. Computer and digital engineering, 2011, 3(6).
[5] Xin Jun, Chen Kang, Zheng Weimin. Research on virtual cluster management technology [J]. Computer science and exploration, 2011, 4(23).
[6] Yu Fei. Research on data preprocessing technology in Web log mining [J]. Computer technology and development, 2011, 20(5).
[7] Jinhai. On cloud [J].
China Institute of computer communication, 2012, 5(6).
[8] Wu Jiyi, Ping Lingdi, Pan Xuezeng, Li Zhuo. Cloud Computing: from concept to platform [J]. Telecom science, 2011, 12(4).
[9] Xue Zhiqiang, Liu Peng, Wen AI. Research on distributed file system management strategy [J]. Computer knowledge Technique, 2011(1).
[10] Li Cheng, Cheng Xiaoyu. Analysis of high-speed DSP signal integrity simulation based on Hyperlynx [J]. Electronic devices, 2011, 33(2).
[11] Zhu Zhijun, Le Zichun, Zhu ran. Research and implementation of OBS network simulation platform based on NS2 [J]. Journal of communication, 2012, 30(9).
[12] Zhang Jian. The concept of cloud computing and its influence analysis [J]. Telecommunication technology, 2012, 1(12).
[13] Chen Quan, Deng Qianni. Cloud computing and its key technology [J]. Computer application, 2011, 29(9).

Received: September 16, 2014. Revised: December 23, 2014. Accepted: December 31, 2014.

Xiaoyong and Chunrong; Licensee Bentham Open. This is an open access article licensed under the terms of the Creative Commons Attribution Non-Commercial License, which permits unrestricted, non-commercial use, distribution and reproduction in any medium, provided the work is properly cited.
