Towards Realization of Open Cloud Services Architecture for Data Service in Materials Field

Size: px
Start display at page:

Download "Towards Realization of Open Cloud Services Architecture for Data Service in Materials Field"

Transcription

1 Journal of Computational Information Systems 10: 3 (2014) Available at Towards Realization of Open Cloud Services Architecture for Data Service in Materials Field Xin CHENG 1,2, Changjun HU 1,2,, Yang LI 1,2, Xin LIU 1,2 1 School of Computer and Communication Engineering, University of Science and Technology Beijing (USTB), Beijing , China 2 Beijing Key Laboratory of Knowledge Engineering for Materials Science, Beijing , China Abstract Against the new challenges for data management and service architecture due to the evolution of big data, the problems of scientific domain big data management, reuse and on-demand services must be solved. In this paper, we propose an Open Cloud Service Architecture (OCSA) based on the joint building of materials Virtual DataSpace (VDS), Requirement Space (RS) model and Behavior Space (BS) analysis, and discuss the theoretical concept and construction method of them. Based on the cloud computing supporting environment, we adopt the mechanisms of semantic mapping and dynamic evolution to support the idea of data first, on-demand service. The theoretical feasibility and supporting effect of OCSA have been effectively verified through the application of materials scientific data sharing service platform. Keywords: Open Cloud Services Architecture; Virtual DataSpace; Requirement Model; Behavior Analysis; Materials Data Service 1 Introduction In recent years, with the rapid growth of needs for the Internet data and related services, we are facing the new challenges of data management and service modeling. Data has been changed fundamentally in particular with the development of Web 2.0. The big data management issues [1, 2] are addressed in the aspect of domain scientific data. Especially in the materials field, there are some new data features such as massive, distributed, heterogeneous, complex association, dynamic changes, environmental correlation, and so on. Materials scientists find that it is difficult to deal with these new data features for realizing the e-science applications. Many issues are faced Project supported by the R&D Infrastructure and Facility Development Program (No. 2005DKA32800), the 2012 Ladder Plan Project of Beijing Key Laboratory of Knowledge Engineering for Materials Science (No. Z ), the Key Science-Technology Plan of the National Twelfth Five-Year-Plan of China (No. 2011BAK08B04), and the National Key Basic Research and Development Program (973 Program) (No. 2013CB329606). Corresponding author. address: hu.cj.mail@gmail.com (Changjun HU) / Copyright 2014 Binary Information Press DOI: /jcis9426 February 1, 2014

2 1112 X. Cheng et al. /Journal of Computational Information Systems 10: 3 (2014) as: how to efficiently manage the materials science big data; how to reuse and share the scientific data by deep mining data relationships; how to analyze the dynamic changes of domain scientific data and its relationships; how to proactively provide the individualized on-demand services for domain scientists; etc. Currently, the processing of large-scale data sets cannot catch the speed of data generation by modern scientific research. The traditional database management system can not meet the needs of management for the growing data. And the traditional service architecture which has the closed and rigid supporting model cannot satisfy the demands of new applications. Therefore, it is important to build a new data management model and new service architecture to support the realization of management for the large-scale data sets and the diversification of cloud service [3] applications. In this paper, our main object is to present a new data service architecture to manage the materials science data, and provide the effective data services. The rest of this paper is organized as follows. The related work about cloud computing and dataspace are introduced in Section 2. The Open Cloud Services Architecture (OCSA) is presented, and the key technologies involved in OCSA are elaborated in Section 3. The case study of OCSA in the materials field is described in Section 4. The brief conclusion and future work are given in Section 5. 2 Related Works Robert L Grossman et al. propose an Open Science Data Cloud (OSDC) [4] to support the analysis, processing and management of large-scale scientific data sets. Liangjie Zhang et al. introduce a Cloud Computing Open Architecture (CCOA) [5] to support the reusable and customizable of services based on seven architectural principles, realize the effective integration of Oriented Service Architecture (SOA) and virtualization in the ecological environment of cloud computing. K Keahey et al. [6] propose a method to construct scientific cloud platform infrastructure by using Nimbus CloudKit, the virtualization technology enable users to implement data-intensive scientific computing. Paul Watson et al. [7] use work flow to organize and manage the services and build the cloud platform Carmen based on the concept of pay-as-you-go to realize the data sharing, integration and analysis by the supporting of metadata. Pengcheng Xiong et al. propose a resource management system called SmartSLA [8] which can be applied to the cloud environment to support the intelligent management of cloud resources. M F Husain et al. [9] propose a massive data semantic storage and retrieval architecture based on the Hadoop technology to effectively improve the processing performance of storage and query. The researchers are trying to seek a new technology to deal with the new challenges of data management. The concept of dataspace was proposed by M Franklin et al. in 2005 [10]. The research about dataspace mainly includes two directions: one is the theory and algorithms study about model [11, 12], query [13, 14] and index [15] of dataspace; the other is about the technology research of personal data management and application. Currently, the dataspace research is still in its infancy, only a few prototype systems such as imemex [16] from ETH and Semex [17] from University of Washington are developed, and these two systems are mainly research the dataspace model, data storage and indexing, query processing, etc. As a new data management technology, the dataspace research is still in the initial stage. Moreover, the work combined the concept of dataspace with the cloud computing technology has been rarely done. In conclusion, there are many problems of domain data management that remain to be explored and resolved further.

3 X. Cheng et al. /Journal of Computational Information Systems 10: 3 (2014) Open Cloud Services Architecture (OCSA) Early as in 2006, Ibrahim Elsayed et al. [18] proposed an Open Grid Services Architecture (OGSA) by the basis of traditional grid five layers hourglass structure and Web Service technology. This architecture has combined the dataspace concept with the grid technology. It could effectively support the management of heterogeneous and distributed data. However, OGSA is unable to meet the data analysis, management and sharing with the increasingly prominent of big data problems. Based on this, we propose an Open Cloud Services Architecture (OCSA) to more emphasis the characteristics of virtualization and data resources allocated on demand. There are some similarities between OCSA and OGSA, such as the distributed computing, parallel processing, resolve the heterogeneous problem, support the resources sharing, and so on. However, OCSA is not the simple extension of OGSA. The significant differences between them are shown as Table 1. Core Focus OGSA Table 1: Compare OGSA and OCSA service as the core, emphasize resources sharing; resources scattered, applications focus workload is transferred to the available remote computing resources; focus parallel computing centralization demand OCSA data as the core, emphasize proprietary service; resources focus relatively, services scattered computing resources are converted in form to adapt the workload; focus a large number of separate transactional applications demand Treatment parallel job processing across physical machines virtualization processing, free scheduling of resources Extensibility difficult to automatically extend automatic/semi-automatic extend Reuse computing capability and data data and service Synergy working-level co-processing services and even data level (more flexible and diverse) Through the comparing and analyzing, it could be seen that OCSA has the following characteristics: 1) data-centric, i.e., support the Data as a Service (DaaS) ; 2) virtualization processing for data; 3) high-performance computing; 4) Virtual DataSpace (VDS), support the construction and evolution of data association; 5) Requirement Space (RS), support the construction and collaboration of services association; 6) Behavior Space (BS), support the behavior analysis of user habits, data dissemination, service evolution, and so on. Thus, we can define OCSA as follow. Definition 1 Open Cloud Services Architecture (OCSA) could be represented as a six-tuples, OCSA = (DRS, CSE, VDS, RS, BS, AIS). Where DRS denotes the data resource sets in the underlying physical layer of OCSA; CSE denotes the cloud computing supporting environment which support the virtualized processing by the Hadoop technology; VDS denotes the virtual dataspace which could manage and allocate the data resources on-demand by using the semantic mapping and dynamic evolution mechanism; RS denotes the requirement space which could build the requirement model and manage the services by combining the workflow technology; BS denotes the behavior space which could optimize the data services by behavior analysis; AIS denotes the application instance sets which are needed by the top level users of OCSA. The abstract framework of OCSA could be described as in Fig. 1. First of all, OCSA provides the virtual processing for the resource sets in data layer, and support the construction of data association in VDS and the construction of service association in RS based on CSE. It builds a global unified logical resource view through the semantic mapping and dynamic evolution of associated data in VDS. Meanwhile, establish the user requirements model for supporting the domain special services, and through the establishment of mapping between VDS and RS to

4 1114 X. Cheng et al. /Journal of Computational Information Systems 10: 3 (2014) open up the association among data, services and requirements. Furthermore, set up BS based on various types of behavior analysis in order to optimize the data services and meet the various user requirements of application layer. Fig. 1: Abstract framework of OCSA The detailed definition and analysis about DRS, CSE, VDS, RS and BS could be described as the following five parts. 3.1 Data acquisition from DRS The data resource sets (DRS) in the physical underlying of OCSA could be defined as a twotuples, DRS = (NDS, WNR), where NDS denotes the node data sets, WNR denotes the web network resources. We need adopt different technical schemes to achieve the data acquisition for these two types of data resources. For NDS, we use the interoperable interfaces based on data cross-domain sharing technology to realize the node data access. And for WNR, we adopt the network information intelligent crawl technology to get relevant data. They could be defined as follows. Definition 2 Node Data Sets (NDS) is a triples, NDS = (N i, DO ij, U ij ), i=1,2,,n, j=1,2,,m. Where n is the relevant nodes number; m is the number of data operation service provided by a certain node; N i denotes the node name; DO ij denotes the data operation of node N i ; U ij denotes the link URL of data operation DO ij in node N i. Definition 3 Web Network Resources (WNR) is a quadruple, WNR = (OD i, SS, U ij, AA), i=1,2,,n, j=1,2,,m. Where n is the number of crawling objective; m is the number of links about a certain crawling objective; OD i denotes the description about crawling objective; SS denotes the selected search strategy; U ij denotes the link URL of OD i ; AA denotes the selected analysis algorithms. There is the same parameter type link URL between NDS and WNR, therefore we can construct the data acquisition model (DAM) of OCSA as seen in Fig. 2. Through the combination of them, we can effectively obtain the massive data which distributed in different regions to solve the problems of big data distribution.

5 X. Cheng et al. /Journal of Computational Information Systems 10: 3 (2014) Fig. 2: Data acquisition model (DAM) 3.2 Virtualized processing in CSE The data resources updated frequently and have complexity types which acquired from the underlying physical nodes and web network of OCSA. Therefore the virtualization processing technology is adopted based on Hadoop to build the cloud computing supporting environment (CSE) as seen in Fig. 3. Hadoop could provide efficient distributed computing and storage supporting for the big data management because of its features of high transmission rate, high fault-tolerance, high reliability, low cost, high efficiency, and so on. The core technologies of Hadoop are HDFS and MapReduce. HDFS is the distributed file system of Hadoop, which supports the large file operation to realize the large-scale computing, storage and access. MapReduce is the distributed parallel programming model, which provides the Map function to decompose tasks and use the Reduce function to aggregate the distributed processing results into the end results. CSE provides the high performance computing and efficient store access supporting for OCSA, and then effectively meets the physical distribution, logical integration virtualized data processing requirements. Therefore, research the construction mechanism of Hadoop and optimize the programming algorithms of MapReduce, could significantly improve the data service of OCSA. Fig. 3: Cloud computing supporting environment (CSE) 3.3 Data organization and evolution in VDS Based on the high performance computing and efficient store access of CSE, we propose a Virtual DataSpace (VDS) technology to manage the data resources which are acquired from the underlying of OCSA. VDS is the sets of data, services and their relationships which related with the subjects and based on the supporting of virtualization. Compared with the traditional database and dataspace management mode, VDS has obvious technological advantages (as in Fig. 4. and Table 2), such as data first, more emphasis on the data associated mapping and dynamic evolution, more highlights the importance of service, and so on.

6 1116 X. Cheng et al. /Journal of Computational Information Systems 10: 3 (2014) Fig. 4: Comparison of DB, DS and VDS Table 2: Comparison of DB, DS and VDS DB DS VDS mode feature close, focus completion open, with pay-as-you-go feature data source only structured data structured, semi-structured, unstructured data data feature focus on data stability emphasis on data association and evolution data storage remote server each local data center service Web service (perpetual) retrieval services (centralization) cloud service (flexible dispersion) integration first model, after data weakening model, highlight data dilute model, emphasized data, on-demand service core application themes principal needs data services virtualization support virtualization processing of data services serviceability dynamic, diverse and personalized service requirement demand-led service coordination behavior optimize data services by behavior research, improve service initiative Studying how to construct VDS based on OCSA could satisfy the relevant needs of big data management and dynamic service improvement. From the macro level, the whole public virtual dataspace (P-VDS) could be understood as the sets of all the data and their relationships; and it could be defined as follow. Definition 4 Public Virtual DataSpace is a tuple, P-VDS = (ADS, ARS), where ADS is all the data set, ARS is all the relationship set. VDS usually associates to the related subjects, it is equivalent to the subset of public virtual dataspace, i.e. P-VDS = S-V DS v, v=1,2,,v. Where V is the number of sub-vds; S-V DS v denotes the sub virtual dataspace, its subject-related data set is the subset of ADS, its subjectrelated data relationships set is the subset of ARS. So the sub-vds mainly include four parts: subject, service, data and relationship, it could be defined as follow. Definition 5 Sub-VDS is a four-tuples, S-VDS = (P i, S j, DS ij, DRS ij ), i=1,2,,i, j=1,2,,j. Where I is the number of subjects; J is the number of services; P i denotes the owner of this sub-vds, i.e. the subject; S j denotes the service which is supported by this sub-vds; DS ij denotes the data sets related with P i and S j ; DRS ij denotes the data relationship sets related with P i and S j. Combined with the Definition 4, it is not the corresponding relation among the number of subjects, services and sub-vds, one subject might joined several services to several sub-vds, one service might exists in several subjects related sub-vds. Thus the number of sub-vds should

7 X. Cheng et al. /Journal of Computational Information Systems 10: 3 (2014) meet data ranges: V [I, I J]. The data sets of sub-vds could be expressed as: DS ij = DS V = DEd, d=1,2,,d, where D is the number of data entities in this data sets, DE d is the data entity, it could be defined as follow. Definition 6 Data Entity is a five-tuples, DE = (DATA name, pro, value, CD D P, CD D S ). Where DATA data, pro, value is the triad of name, property and value for this data entity; CD D P is the correlation degree between data and subject; CD D S is the correlation degree between data and service. The data relationship set of sub-vds could be expressed as: DRS ij =DRS V = DRE r1, r1= 1,2,,R1, where R1 is the number of data relationship entities in this data sets, DRE r1 is the data relationship entity, it could be defined as follow. Definition 7 Data Relationship Entity is a five-tuples, DRE=(TN, CD D, DES DE 1 DE 2 DE d, O L, W DR OL ). Where TN is the type name of the data relationship entity; CD D is the correlation degree of data; DES DE 1 DE 2 DE d is the data entity sets involved this data relationship entity, d is the number of involved data entities, DE d is the d-th involved data entity; O L is the local ontology which the data relationship entities belong to; W DR OL is the weight of data relationships in the corresponding local ontology. The local ontology set could be expressed as: OS Lij =OS LV = O Lo, o=1,2,,o, where O is the number of local ontologies in sub-vds (DS V ) related with P i and S j, O Lo is the local ontologies in sub-vds (DS V ), it could be defined as follow. Definition 8 Local Ontology is a triples, O L = (ON L, S-VDS v, W OL OG ). Where ON L is the name of this local ontology; S-VDS v is the sub-vds which the local ontologies belonged to; W OL OG is the weight of local ontology in the global ontology. The global ontology is set up by the local ontology set of each sub-vds, it could be expressed as: O G = OS Lij = OS LV = O Lo = (O Lo1 +O Lo2 + +O LoV ), therefore, the number of local ontology in global ontology could be calculated as: O = o 1 +o 2 + +o V = o v, v=1,2,,v, where V is the number of sub-vds in the global ontology, o v is the number of local ontologies in the v-th sub-vds. Therefore, we can construct VDS which supports the semantic mapping and dynamic evolution as the following steps: Step 1 Design the different semantics wrappers to wrap and transform the heterogeneous data; Step 2 Initialization of VDS: determine the domain core concepts, initially set S j that demanded by P i, and then select the interested DS ij and DRS ij to build the initial S-VD S v which is related to P i ; Step 3 Expansion of VDS: determine CD D P and CD D S of the newly included data according to the initial definition of data set, and put the related data into the suitable S-VDS v ; Step 4 Data evolution: amend CD D P and CD D S based on the frequency degree of data access and the supporting of valid algorithms, and then continuously optimize the DS ij of S- VDS v ; Step 5 Relationship evolution: continuously correct the CD D of data and improve DRS ij by combining the initial definition of data relationship set and the related optimization of data entities DE 1 DE 2 DE d which are involved in the data relationship set; Step 6 Local ontology mapping: build the mapping M RL : DRS ij OS Lij, then correct the W DR OL of relationship entities in the belonging local ontology based on the evolution of data

8 1118 X. Cheng et al. /Journal of Computational Information Systems 10: 3 (2014) relationships; Step 7 Global ontology mapping: build the mapping M LG : OS Lij O G, then correct the W OL OG of local ontologies in the global ontology, thereby improve P-VDS which is integrated by lots of S-VDS v. From the building process as in Fig. 5, we can see that VDS could effectively solve the problems of association complex and dynamic changing for big data in OCSA. Fig. 5: The building process of VDS 3.4 Requirements modeling in RS VDS has realized the flexible and personalized service, furthermore, we propose a Requirement Space (RS) to lead the collaborative work and improve the initiative of service in OCSA. VDS provides the bottom-up data leading to satisfy the service demands, in contrast, RS provides the top-down needs leading to optimize the service data. Co-ordinate VDS and RS into OCSA could complement each other and achieve more abundant cloud data services. RS is the set of data services and their relationships which related to the user requirements. It mainly includes five features: 1) demand first ; 2) emphasize the association and evolution of services; 3) support the service semantic collaboration, has higher initiative of services; 4) mapping with VDS by use the services as a link; 5) support the research of behavior evolution analysis. RS could be defined as follow. Definition 9 Requirement Space is a six-tuples, RS = (U i, RQ k, SS ik, SRS ik, DS ik, DRS ik ), i=1,2,,i, k=1,2,,k. Where I is the number of users; K is the number of requirements; U i is the user of RS; RQ k is the requirement included in RS; SS ik is the service set which is needed by users; SRS ik is the service relationship set; DS ik is the data set which is needed by users; DRS ik is the data relationship set. The modeling process of RS could be described as in Fig. 6. Firstly, adopt the method of OWL-S to describe, register, publish and obtain the services in SS ik and the service relationships in SRS ik to build the association mapping of service semantic. Secondly, find the regularity of synergistic combination by using the workflow technology, and then integrate the services of two or more based on the ideology of Mashup to efficiently achieve the service semantic collaboration. Finally, build the mapping of VDS and RS based on the establishment of semantic association between the S ik in RS and S j in VDS to get through the communication between the upper demands and the underlying data. Based on this, optimize the quality of data service to realize the personalized service integration and diversified sharing collaboration in OCSA.

9 X. Cheng et al. /Journal of Computational Information Systems 10: 3 (2014) Fig. 6: The modeling process of RS 3.5 Behavior analysis in BS Based on the data management in VDS and the services coordination in RS, we could optimize and improve the data services in OCSA by behavior analysis technology. Thereby we propose a Behavior Space (BS) as a new concept and method in the OCSA. BS is the scope of the dynamic evolutionary track and its correlation analysis about the related members. The members of BS mainly include user, requirement, data, service, relationship, threat and behavior itself. It forms a relatively stable system of relations based on the interaction of members, and then builds a basic mode for BS. There are different types of behavior analysis in BS, such as the user behavior analysis, the data dissemination analysis, the service evolution analysis, the requirement evolution analysis, the threatening behavior analysis, the relationship evolution analysis, and so on. The supporting model of BS as Fig. 7 shows that it has the characteristics of interaction, real-time performance, effects propagation, and so on. With the relevant members as a link, OCSA supports the effectively construction of mappings among VDS, RS and BS, then provides the optimization of data services. The statistical analysis and probability comparison based on the behavior characteristics could provide the user-interested and personalized service recommendation. And the behavior prediction mechanism constructed by the behavior analysis could support the active service and user-friendly design of OCSA. Fig. 7: The supporting model of BS 4 Materials Scientific Data Sharing Service Platform For the analysis and validation of OCSA, we developed a Materials Scientific Data Sharing Service Platform based on the Materials Domain Cloud Services Architecture (MDCSA) and achieved the practical applications in OCSA. As shown in Fig. 8, this platform has integrated the massive data resources into twelve categories, such as ferrous materials, composite materials, organic polymer materials, etc. The data resource set aggregated various types of materials relevant information such as grades, properties, processing, manufacturer, etc., and covered many types of materials scientific data such as tables, XML documents, images, etc. At present, the

10 1120 X. Cheng et al. /Journal of Computational Information Systems 10: 3 (2014) platform has totally collected nearly six hundred thousand data resource items, and the amount of data is still growing rapidly. Fig. 8: Materials Domain Cloud Services Architecture (MDCSA) Extract and transform the data obtained from each node and network based on the cloud computing supporting environment, and then initially built several basic sub-vds such as the property combination query sub-vds, failure range judgment sub-vds, etc. Meanwhile, initially built various types of RS according to the users demands, for example, the keyword search RS, materials selection RS, life prediction RS, and so on. In addition, according to the variety behavior analysis, initially formed several categories of basic BS model such as user habits BS, service evolution prediction BS, etc. Build the mapping between VDS and RS based on the data associated evolution of VDS and the service associated combination of RS to get through the barriers between the upper layer user demands and the lower layer data construction. And then use the various types of analysis in BS to further optimize the data service for providing a wide variety of featured application services of material scientific domain in the upper layer of MDCSA. The specific application service instances such as keywords fuzzy query

11 X. Cheng et al. /Journal of Computational Information Systems 10: 3 (2014) service, building materials failure warning service, pipeline full life cycle services, and so on. The whole platform building process instances model is shown in Fig. 9. Fig. 9: Platform building process instances model 5 Conclusions and Future Work For the realization of big data management, reuse and on-demand services, this paper proposed an Open Cloud Service Architecture (OCSA), elaborated the involved key technologies, and then verified the feasibility and superiority of OCSA by studying the related application case in materials field. The building of OCSA has combined with the cloud computing, VDS, RS, BS, ontology semantic, etc., it has the scientifically rigorous theory support and flexible extensive application prospects. The future work will further develop the framework of OCSA, depth study the dynamic evolution algorithm of VDS, improve the theoretical research of RS and BS, then deep mining the relevant building technology and realization measures, moreover, add the technical support for real-time and uncertainty in OCSA. References [1] C. Lynch, Big data: How do your data grow?, Nature 455 (2008) [2] D. Howe, M. Costanzo, et al., Big data: The future of biocuration, Nature 455 (2008) [3] A.I. Yong, X. ZHANG, K.E. Jie, et al., Application of Information Resource View on Cloud Computing Service, Journal of Computational Information Systems 9(2) (2013) [4] Robert L. Grossman, Y. Gu, J. Mambretti, et al., An overview of the Open Science Data Cloud, in: Proc. 19th ACM International Symposium on High Performance Distributed Computing, 2010, pp [5] L.J. Zhang, Q. Zhou, CCOA: Cloud Computing Open Architecture, in: Proc. IEEE International Conference on Web Services, 2009, pp [6] K. Keahey, R. Figueiredo, J. Fortes, T. Freeman, M. Tsugawa, Science Clouds: Early Experiences in Cloud Computing for Scientific Applications, Cloud Computing and Its Applications (2008)

12 1122 X. Cheng et al. /Journal of Computational Information Systems 10: 3 (2014) [7] P. Watson, P. Lord, F. Gibson, P. Periorellis, G. Pitsilis, Cloud Computing for e-science with CARMEN, in: Proc. IBERGRID, 2008, pp [8] P. Xiong, Y. Chi, S. Zhu, et al., Intelligent management of virtualized resources for database systems in cloud environment, in: Proc. ICDE, 2011, pp [9] M.F. Husain, et al., Data Intensive Query Processing for Large RDF Graphs Using Cloud Computing Tools, in: Proc. IEEE 3rd International Conference on Cloud Computing (CLOUD), 2010, pp [10] M. Franklin, et al., From databases to dataspaces: a new abstraction for information management, ACM Sigmod Record 34 (2005) [11] J.P. Dittrich, M.A.V Salles, idm: A unified and versatile data model for personal dataspace management, in: Proc. VLDB, 2006, pp [12] C. Hedeler, et al., Flexible dataspace management through model management, in: Proc EDBT/ICDT Workshops, 2010, pp [13] M.A.V Salles, Pay-as-you-go information integration in personal and social dataspaces, PhD thesis, Diss., Eidgen ö ssische Technische Hochschule ETH Z ü rich, Nr , [14] M.A.V Salles, et al., Intensional associations in dataspaces, in: Proc. Data Engineering (ICDE), 2010 IEEE 26th International Conference on. IEEE, 2010, pp [15] X. Dong, A. Halevy, Indexing dataspaces, in: Proc. SIGMOD, 2007, pp [16] L. Blunschi, et al., A dataspace odyssey: The imemex personal dataspace management system, in: Proc. CIDR, 2007, pp [17] X.L. Dong, A. Halevy, A platform for personal information management and integration, in: Proc. VLDB 2005 PhD Workshop, CIDR2005, 2005, pp [18] I. Elsayed, P. Brezany, A.M. Tjoa, Towards Realization of Dataspaces, in: Proc. 17th International Conference on Database and Expert Systems Applications (DEXA 06), 2006, pp

UPS battery remote monitoring system in cloud computing

UPS battery remote monitoring system in cloud computing , pp.11-15 http://dx.doi.org/10.14257/astl.2014.53.03 UPS battery remote monitoring system in cloud computing Shiwei Li, Haiying Wang, Qi Fan School of Automation, Harbin University of Science and Technology

More information

Massive Data Query Optimization on Large Clusters

Massive Data Query Optimization on Large Clusters Journal of Computational Information Systems 8: 8 (2012) 3191 3198 Available at http://www.jofcis.com Massive Data Query Optimization on Large Clusters Guigang ZHANG, Chao LI, Yong ZHANG, Chunxiao XING

More information

Mobile Storage and Search Engine of Information Oriented to Food Cloud

Mobile Storage and Search Engine of Information Oriented to Food Cloud Advance Journal of Food Science and Technology 5(10): 1331-1336, 2013 ISSN: 2042-4868; e-issn: 2042-4876 Maxwell Scientific Organization, 2013 Submitted: May 29, 2013 Accepted: July 04, 2013 Published:

More information

A SaaS-based Logistics Informatization Model for Specialized Farmers Cooperatives in China

A SaaS-based Logistics Informatization Model for Specialized Farmers Cooperatives in China A SaaS-based Logistics Informatization Model for Specialized Farmers Cooperatives in China Zhongqiang Liu 1, Kaiyi Wang 1*, Shufeng Wang 1, Feng Yang 1 and Xiandi Zhang 1, 1 Beijing Research Center for

More information

Big Data Storage Architecture Design in Cloud Computing

Big Data Storage Architecture Design in Cloud Computing Big Data Storage Architecture Design in Cloud Computing Xuebin Chen 1, Shi Wang 1( ), Yanyan Dong 1, and Xu Wang 2 1 College of Science, North China University of Science and Technology, Tangshan, Hebei,

More information

SEMANTIC WEB BASED INFERENCE MODEL FOR LARGE SCALE ONTOLOGIES FROM BIG DATA

SEMANTIC WEB BASED INFERENCE MODEL FOR LARGE SCALE ONTOLOGIES FROM BIG DATA SEMANTIC WEB BASED INFERENCE MODEL FOR LARGE SCALE ONTOLOGIES FROM BIG DATA J.RAVI RAJESH PG Scholar Rajalakshmi engineering college Thandalam, Chennai. ravirajesh.j.2013.mecse@rajalakshmi.edu.in Mrs.

More information

Study on Redundant Strategies in Peer to Peer Cloud Storage Systems

Study on Redundant Strategies in Peer to Peer Cloud Storage Systems Applied Mathematics & Information Sciences An International Journal 2011 NSP 5 (2) (2011), 235S-242S Study on Redundant Strategies in Peer to Peer Cloud Storage Systems Wu Ji-yi 1, Zhang Jian-lin 1, Wang

More information

Open Access The Cooperative Study Between the Hadoop Big Data Platform and the Traditional Data Warehouse

Open Access The Cooperative Study Between the Hadoop Big Data Platform and the Traditional Data Warehouse Send Orders for Reprints to reprints@benthamscience.ae 1144 The Open Automation and Control Systems Journal, 2015, 7, 1144-1152 Open Access The Cooperative Study Between the Hadoop Big Data Platform and

More information

Lightweight Data Integration using the WebComposition Data Grid Service

Lightweight Data Integration using the WebComposition Data Grid Service Lightweight Data Integration using the WebComposition Data Grid Service Ralph Sommermeier 1, Andreas Heil 2, Martin Gaedke 1 1 Chemnitz University of Technology, Faculty of Computer Science, Distributed

More information

Log Mining Based on Hadoop s Map and Reduce Technique

Log Mining Based on Hadoop s Map and Reduce Technique Log Mining Based on Hadoop s Map and Reduce Technique ABSTRACT: Anuja Pandit Department of Computer Science, anujapandit25@gmail.com Amruta Deshpande Department of Computer Science, amrutadeshpande1991@gmail.com

More information

Large-Scale Data Sets Clustering Based on MapReduce and Hadoop

Large-Scale Data Sets Clustering Based on MapReduce and Hadoop Journal of Computational Information Systems 7: 16 (2011) 5956-5963 Available at http://www.jofcis.com Large-Scale Data Sets Clustering Based on MapReduce and Hadoop Ping ZHOU, Jingsheng LEI, Wenjun YE

More information

http://www.paper.edu.cn

http://www.paper.edu.cn 5 10 15 20 25 30 35 A platform for massive railway information data storage # SHAN Xu 1, WANG Genying 1, LIU Lin 2** (1. Key Laboratory of Communication and Information Systems, Beijing Municipal Commission

More information

Applied research on data mining platform for weather forecast based on cloud storage

Applied research on data mining platform for weather forecast based on cloud storage Applied research on data mining platform for weather forecast based on cloud storage Haiyan Song¹, Leixiao Li 2* and Yuhong Fan 3* 1 Department of Software Engineering t, Inner Mongolia Electronic Information

More information

CONCEPTUAL MODEL OF MULTI-AGENT BUSINESS COLLABORATION BASED ON CLOUD WORKFLOW

CONCEPTUAL MODEL OF MULTI-AGENT BUSINESS COLLABORATION BASED ON CLOUD WORKFLOW CONCEPTUAL MODEL OF MULTI-AGENT BUSINESS COLLABORATION BASED ON CLOUD WORKFLOW 1 XINQIN GAO, 2 MINGSHUN YANG, 3 YONG LIU, 4 XIAOLI HOU School of Mechanical and Precision Instrument Engineering, Xi'an University

More information

Research on Operation Management under the Environment of Cloud Computing Data Center

Research on Operation Management under the Environment of Cloud Computing Data Center , pp.185-192 http://dx.doi.org/10.14257/ijdta.2015.8.2.17 Research on Operation Management under the Environment of Cloud Computing Data Center Wei Bai and Wenli Geng Computer and information engineering

More information

International Journal of Innovative Research in Computer and Communication Engineering

International Journal of Innovative Research in Computer and Communication Engineering FP Tree Algorithm and Approaches in Big Data T.Rathika 1, J.Senthil Murugan 2 Assistant Professor, Department of CSE, SRM University, Ramapuram Campus, Chennai, Tamil Nadu,India 1 Assistant Professor,

More information

Cloud Storage Solution for WSN Based on Internet Innovation Union

Cloud Storage Solution for WSN Based on Internet Innovation Union Cloud Storage Solution for WSN Based on Internet Innovation Union Tongrang Fan 1, Xuan Zhang 1, Feng Gao 1 1 School of Information Science and Technology, Shijiazhuang Tiedao University, Shijiazhuang,

More information

The Power Marketing Information System Model Based on Cloud Computing

The Power Marketing Information System Model Based on Cloud Computing 2011 International Conference on Computer Science and Information Technology (ICCSIT 2011) IPCSIT vol. 51 (2012) (2012) IACSIT Press, Singapore DOI: 10.7763/IPCSIT.2012.V51.96 The Power Marketing Information

More information

CLOUDDMSS: CLOUD-BASED DISTRIBUTED MULTIMEDIA STREAMING SERVICE SYSTEM FOR HETEROGENEOUS DEVICES

CLOUDDMSS: CLOUD-BASED DISTRIBUTED MULTIMEDIA STREAMING SERVICE SYSTEM FOR HETEROGENEOUS DEVICES CLOUDDMSS: CLOUD-BASED DISTRIBUTED MULTIMEDIA STREAMING SERVICE SYSTEM FOR HETEROGENEOUS DEVICES 1 MYOUNGJIN KIM, 2 CUI YUN, 3 SEUNGHO HAN, 4 HANKU LEE 1,2,3,4 Department of Internet & Multimedia Engineering,

More information

Conceiving a Multiscale Dataspace for Data Analysis

Conceiving a Multiscale Dataspace for Data Analysis Conceiving a Multiscale Dataspace for Data Analysis Matheus Silva Mota 1, André Santanchè 1 1 Institute of Computing UNICAMP Campinas SP Brazil {mota,santanche}@ic.unicamp.br Abstract. A consequence of

More information

A Load Balancing Algorithm based on the Variation Trend of Entropy in Homogeneous Cluster

A Load Balancing Algorithm based on the Variation Trend of Entropy in Homogeneous Cluster , pp.11-20 http://dx.doi.org/10.14257/ ijgdc.2014.7.2.02 A Load Balancing Algorithm based on the Variation Trend of Entropy in Homogeneous Cluster Kehe Wu 1, Long Chen 2, Shichao Ye 2 and Yi Li 2 1 Beijing

More information

Figure 1: Architecture of a cloud services model for a digital education resource management system.

Figure 1: Architecture of a cloud services model for a digital education resource management system. World Transactions on Engineering and Technology Education Vol.13, No.3, 2015 2015 WIETE Cloud service model for the management and sharing of massive amounts of digital education resources Binwen Huang

More information

Exploration on Security System Structure of Smart Campus Based on Cloud Computing. Wei Zhou

Exploration on Security System Structure of Smart Campus Based on Cloud Computing. Wei Zhou 3rd International Conference on Science and Social Research (ICSSR 2014) Exploration on Security System Structure of Smart Campus Based on Cloud Computing Wei Zhou Information Center, Shanghai University

More information

A STUDY ON HADOOP ARCHITECTURE FOR BIG DATA ANALYTICS

A STUDY ON HADOOP ARCHITECTURE FOR BIG DATA ANALYTICS A STUDY ON HADOOP ARCHITECTURE FOR BIG DATA ANALYTICS Dr. Ananthi Sheshasayee 1, J V N Lakshmi 2 1 Head Department of Computer Science & Research, Quaid-E-Millath Govt College for Women, Chennai, (India)

More information

Design of Electric Energy Acquisition System on Hadoop

Design of Electric Energy Acquisition System on Hadoop , pp.47-54 http://dx.doi.org/10.14257/ijgdc.2015.8.5.04 Design of Electric Energy Acquisition System on Hadoop Yi Wu 1 and Jianjun Zhou 2 1 School of Information Science and Technology, Heilongjiang University

More information

The Key Technology Research of Virtual Laboratory based On Cloud Computing Ling Zhang

The Key Technology Research of Virtual Laboratory based On Cloud Computing Ling Zhang International Conference on Advances in Mechanical Engineering and Industrial Informatics (AMEII 2015) The Key Technology Research of Virtual Laboratory based On Cloud Computing Ling Zhang Nanjing Communications

More information

Cloud Computing for Agent-based Traffic Management Systems

Cloud Computing for Agent-based Traffic Management Systems Cloud Computing for Agent-based Traffic Management Systems Manoj A Patil Asst.Prof. IT Dept. Khyamling A Parane Asst.Prof. CSE Dept. D. Rajesh Asst.Prof. IT Dept. ABSTRACT Increased traffic congestion

More information

CLOUD BASED PEER TO PEER NETWORK FOR ENTERPRISE DATAWAREHOUSE SHARING

CLOUD BASED PEER TO PEER NETWORK FOR ENTERPRISE DATAWAREHOUSE SHARING CLOUD BASED PEER TO PEER NETWORK FOR ENTERPRISE DATAWAREHOUSE SHARING Basangouda V.K 1,Aruna M.G 2 1 PG Student, Dept of CSE, M.S Engineering College, Bangalore,basangoudavk@gmail.com 2 Associate Professor.,

More information

Research of Railway Wagon Flow Forecast System Based on Hadoop-Hazelcast

Research of Railway Wagon Flow Forecast System Based on Hadoop-Hazelcast International Conference on Civil, Transportation and Environment (ICCTE 2016) Research of Railway Wagon Flow Forecast System Based on Hadoop-Hazelcast Xiaodong Zhang1, a, Baotian Dong1, b, Weijia Zhang2,

More information

A Case Study of Question Answering in Automatic Tourism Service Packaging

A Case Study of Question Answering in Automatic Tourism Service Packaging BULGARIAN ACADEMY OF SCIENCES CYBERNETICS AND INFORMATION TECHNOLOGIES Volume 13, Special Issue Sofia 2013 Print ISSN: 1311-9702; Online ISSN: 1314-4081 DOI: 10.2478/cait-2013-0045 A Case Study of Question

More information

Developing Scalable Smart Grid Infrastructure to Enable Secure Transmission System Control

Developing Scalable Smart Grid Infrastructure to Enable Secure Transmission System Control Developing Scalable Smart Grid Infrastructure to Enable Secure Transmission System Control EP/K006487/1 UK PI: Prof Gareth Taylor (BU) China PI: Prof Yong-Hua Song (THU) Consortium UK Members: Brunel University

More information

The Application and Development of Software Testing in Cloud Computing Environment

The Application and Development of Software Testing in Cloud Computing Environment 2012 International Conference on Computer Science and Service System The Application and Development of Software Testing in Cloud Computing Environment Peng Zhenlong Ou Yang Zhonghui School of Business

More information

San Diego Supercomputer Center, UCSD. Institute for Digital Research and Education, UCLA

San Diego Supercomputer Center, UCSD. Institute for Digital Research and Education, UCLA Facilitate Parallel Computation Using Kepler Workflow System on Virtual Resources Jianwu Wang 1, Prakashan Korambath 2, Ilkay Altintas 1 1 San Diego Supercomputer Center, UCSD 2 Institute for Digital Research

More information

Exploring the Efficiency of Big Data Processing with Hadoop MapReduce

Exploring the Efficiency of Big Data Processing with Hadoop MapReduce Exploring the Efficiency of Big Data Processing with Hadoop MapReduce Brian Ye, Anders Ye School of Computer Science and Communication (CSC), Royal Institute of Technology KTH, Stockholm, Sweden Abstract.

More information

Research of Postal Data mining system based on big data

Research of Postal Data mining system based on big data 3rd International Conference on Mechatronics, Robotics and Automation (ICMRA 2015) Research of Postal Data mining system based on big data Xia Hu 1, Yanfeng Jin 1, Fan Wang 1 1 Shi Jiazhuang Post & Telecommunication

More information

Associate Professor, Department of CSE, Shri Vishnu Engineering College for Women, Andhra Pradesh, India 2

Associate Professor, Department of CSE, Shri Vishnu Engineering College for Women, Andhra Pradesh, India 2 Volume 6, Issue 3, March 2016 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Special Issue

More information

A Grid Architecture for Manufacturing Database System

A Grid Architecture for Manufacturing Database System Database Systems Journal vol. II, no. 2/2011 23 A Grid Architecture for Manufacturing Database System Laurentiu CIOVICĂ, Constantin Daniel AVRAM Economic Informatics Department, Academy of Economic Studies

More information

A Virtual Machine Searching Method in Networks using a Vector Space Model and Routing Table Tree Architecture

A Virtual Machine Searching Method in Networks using a Vector Space Model and Routing Table Tree Architecture A Virtual Machine Searching Method in Networks using a Vector Space Model and Routing Table Tree Architecture Hyeon seok O, Namgi Kim1, Byoung-Dai Lee dept. of Computer Science. Kyonggi University, Suwon,

More information

Research on Trust Management Strategies in Cloud Computing Environment

Research on Trust Management Strategies in Cloud Computing Environment Journal of Computational Information Systems 8: 4 (2012) 1757 1763 Available at http://www.jofcis.com Research on Trust Management Strategies in Cloud Computing Environment Wenjuan LI 1,2,, Lingdi PING

More information

Index Terms : Load rebalance, distributed file systems, clouds, movement cost, load imbalance, chunk.

Index Terms : Load rebalance, distributed file systems, clouds, movement cost, load imbalance, chunk. Load Rebalancing for Distributed File Systems in Clouds. Smita Salunkhe, S. S. Sannakki Department of Computer Science and Engineering KLS Gogte Institute of Technology, Belgaum, Karnataka, India Affiliated

More information

OWL based XML Data Integration

OWL based XML Data Integration OWL based XML Data Integration Manjula Shenoy K Manipal University CSE MIT Manipal, India K.C.Shet, PhD. N.I.T.K. CSE, Suratkal Karnataka, India U. Dinesh Acharya, PhD. ManipalUniversity CSE MIT, Manipal,

More information

Development and Application Study of Marine Data Managing and Sharing Platform

Development and Application Study of Marine Data Managing and Sharing Platform Development and Application Study of Marine Data Managing and Sharing Platform Abstract North China Sea Marine Technical Support Center, State Oceanic Administration, China 266033 Corresponding author

More information

Research and realization of Resource Cloud Encapsulation in Cloud Manufacturing

Research and realization of Resource Cloud Encapsulation in Cloud Manufacturing www.ijcsi.org 579 Research and realization of Resource Cloud Encapsulation in Cloud Manufacturing Zhang Ming 1, Hu Chunyang 2 1 Department of Teaching and Practicing, Guilin University of Electronic Technology

More information

A Study on Data Analysis Process Management System in MapReduce using BPM

A Study on Data Analysis Process Management System in MapReduce using BPM A Study on Data Analysis Process Management System in MapReduce using BPM Yoon-Sik Yoo 1, Jaehak Yu 1, Hyo-Chan Bang 1, Cheong Hee Park 1 Electronics and Telecommunications Research Institute, 138 Gajeongno,

More information

A REVIEW PAPER ON THE HADOOP DISTRIBUTED FILE SYSTEM

A REVIEW PAPER ON THE HADOOP DISTRIBUTED FILE SYSTEM A REVIEW PAPER ON THE HADOOP DISTRIBUTED FILE SYSTEM Sneha D.Borkar 1, Prof.Chaitali S.Surtakar 2 Student of B.E., Information Technology, J.D.I.E.T, sborkar95@gmail.com Assistant Professor, Information

More information

Service Virtualization in Large Scale, Heterogeneous and Distributed Environment

Service Virtualization in Large Scale, Heterogeneous and Distributed Environment Service Virtualization in Large Scale, Heterogeneous and Distributed Environment Hong-Hui Chen 1, De-Ke Guo 1, Xue Qun-Wei, Xue-Shan Luo 1, Wei-Ming Zhang 1 1 School of Information System &Management,

More information

Key Technology Study of Agriculture Information Cloud-Services

Key Technology Study of Agriculture Information Cloud-Services Key Technology Study of Agriculture Information Cloud-Services Yunpeng Cui, Shihong Liu Key Laboratory of Digital Agricultural Early-warning Technology, Ministry of Agriculture, Beijing, The People s epublic

More information

Project Knowledge Management Based on Social Networks

Project Knowledge Management Based on Social Networks DOI: 10.7763/IPEDR. 2014. V70. 10 Project Knowledge Management Based on Social Networks Panos Fitsilis 1+, Vassilis Gerogiannis 1, and Leonidas Anthopoulos 1 1 Business Administration Dep., Technological

More information

Evaluation Model for Internet Cloud Data Structure Audit System

Evaluation Model for Internet Cloud Data Structure Audit System Evaluation Model for Internet Data Structure Audit System Wang Fan School of Accounting Zhejiang Gongshang University Hangzhou, 310018, P. R.China wangfanswcd@126.com Journal of Digital Information Management

More information

Towards Cloud Factory Simulation. Abstract

Towards Cloud Factory Simulation. Abstract Towards Cloud Factory Simulation 第 十 八 屆 決 策 分 析 研 討 會 Toly Chen Department of Industrial Engineering and Systems Management, Feng Chia University *tolychen@ms37.hinet.net Abstract An important and practical

More information

Research of the Combination of Distributed Business Processes Based on Dynamic Planning

Research of the Combination of Distributed Business Processes Based on Dynamic Planning , pp.257-266 http://dx.doi.org/10.14257/ijunesst.2015.8.6.25 Research of the Combination of Distributed Business Processes Based on Dynamic Planning Yuan Gang, Sun Rui-zhi and Shi Yin-xue Key laboratory

More information

Efficient Cloud Management for Parallel Data Processing In Private Cloud

Efficient Cloud Management for Parallel Data Processing In Private Cloud 2012 International Conference on Information and Network Technology (ICINT 2012) IPCSIT vol. 37 (2012) (2012) IACSIT Press, Singapore Efficient Cloud Management for Parallel Data Processing In Private

More information

IJSER Figure1 Wrapper Architecture

IJSER Figure1 Wrapper Architecture International Journal of Scientific & Engineering Research, Volume 5, Issue 5, May-2014 24 ONTOLOGY BASED DATA INTEGRATION WITH USER FEEDBACK Devini.K, M.S. Hema Abstract-Many applications need to access

More information

How To Handle Big Data With A Data Scientist

How To Handle Big Data With A Data Scientist III Big Data Technologies Today, new technologies make it possible to realize value from Big Data. Big data technologies can replace highly customized, expensive legacy systems with a standard solution

More information

Research of Smart Distribution Network Big Data Model

Research of Smart Distribution Network Big Data Model Research of Smart Distribution Network Big Data Model Guangyi LIU Yang YU Feng GAO Wendong ZHU China Electric Power Stanford Smart Grid Research Institute Smart Grid Research Institute Research Institute

More information

How To Build A Cloud Based Intelligence System

How To Build A Cloud Based Intelligence System Semantic Technology and Cloud Computing Applied to Tactical Intelligence Domain Steve Hamby Chief Technology Officer Orbis Technologies, Inc. shamby@orbistechnologies.com 678.346.6386 1 Abstract The tactical

More information

A Framework of User-Driven Data Analytics in the Cloud for Course Management

A Framework of User-Driven Data Analytics in the Cloud for Course Management A Framework of User-Driven Data Analytics in the Cloud for Course Management Jie ZHANG 1, William Chandra TJHI 2, Bu Sung LEE 1, Kee Khoon LEE 2, Julita VASSILEVA 3 & Chee Kit LOOI 4 1 School of Computer

More information

Data Services @neurist and beyond

Data Services @neurist and beyond s @neurist and beyond Siegfried Benkner Department of Scientific Computing Faculty of Computer Science University of Vienna http://www.par.univie.ac.at Department of Scientific Computing Parallel Computing

More information

Grid Middleware for Realizing Autonomous Resource Sharing: Grid Service Platform

Grid Middleware for Realizing Autonomous Resource Sharing: Grid Service Platform Grid Middleware for Realizing Autonomous Resource Sharing: Grid Service Platform V Soichi Shigeta V Haruyasu Ueda V Nobutaka Imamura (Manuscript received April 19, 2007) These days, many enterprises are

More information

Capability Service Management System for Manufacturing Equipments in

Capability Service Management System for Manufacturing Equipments in Capability Service Management System for Manufacturing Equipments in Cloud Manufacturing 1 Junwei Yan, 2 Sijin Xin, 3 Quan Liu, 4 Wenjun Xu *1, Corresponding Author School of Information Engineering, Wuhan

More information

Enhancing Dataset Processing in Hadoop YARN Performance for Big Data Applications

Enhancing Dataset Processing in Hadoop YARN Performance for Big Data Applications Enhancing Dataset Processing in Hadoop YARN Performance for Big Data Applications Ahmed Abdulhakim Al-Absi, Dae-Ki Kang and Myong-Jong Kim Abstract In Hadoop MapReduce distributed file system, as the input

More information

GeoSquare: A cloud-enabled geospatial information resources (GIRs) interoperate infrastructure for cooperation and sharing

GeoSquare: A cloud-enabled geospatial information resources (GIRs) interoperate infrastructure for cooperation and sharing GeoSquare: A cloud-enabled geospatial information resources (GIRs) interoperate infrastructure for cooperation and sharing Kai Hu 1, Huayi Wu 1, Zhipeng Gui 2, Lan You 1, Ping Shen 1, Shuang Gao 1, Jie

More information

Remote Sensitive Image Stations and Grid Services

Remote Sensitive Image Stations and Grid Services International Journal of Grid and Distributed Computing 23 Remote Sensing Images Data Integration Based on the Agent Service Binge Cui, Chuanmin Wang, Qiang Wang College of Information Science and Engineering,

More information

Grid Data Integration based on Schema-mapping

Grid Data Integration based on Schema-mapping Grid Data Integration based on Schema-mapping Carmela Comito and Domenico Talia DEIS, University of Calabria, Via P. Bucci 41 c, 87036 Rende, Italy {ccomito, talia}@deis.unical.it http://www.deis.unical.it/

More information

Data-intensive HPC: opportunities and challenges. Patrick Valduriez

Data-intensive HPC: opportunities and challenges. Patrick Valduriez Data-intensive HPC: opportunities and challenges Patrick Valduriez Big Data Landscape Multi-$billion market! Big data = Hadoop = MapReduce? No one-size-fits-all solution: SQL, NoSQL, MapReduce, No standard,

More information

Research on the data warehouse testing method in database design process based on the shared nothing frame

Research on the data warehouse testing method in database design process based on the shared nothing frame Research on the data warehouse testing method in database design process based on the shared nothing frame Abstract Keming Chen School of Continuing Education, XinYu University,XinYu University, JiangXi,

More information

Journal of Chemical and Pharmaceutical Research, 2015, 7(3):1388-1392. Research Article. E-commerce recommendation system on cloud computing

Journal of Chemical and Pharmaceutical Research, 2015, 7(3):1388-1392. Research Article. E-commerce recommendation system on cloud computing Available online www.jocpr.com Journal of Chemical and Pharmaceutical Research, 2015, 7(3):1388-1392 Research Article ISSN : 0975-7384 CODEN(USA) : JCPRC5 E-commerce recommendation system on cloud computing

More information

Volume 3, Issue 6, June 2015 International Journal of Advance Research in Computer Science and Management Studies

Volume 3, Issue 6, June 2015 International Journal of Advance Research in Computer Science and Management Studies Volume 3, Issue 6, June 2015 International Journal of Advance Research in Computer Science and Management Studies Research Article / Survey Paper / Case Study Available online at: www.ijarcsms.com Image

More information

Personalized e-learning a Goal Oriented Approach

Personalized e-learning a Goal Oriented Approach Proceedings of the 7th WSEAS International Conference on Distance Learning and Web Engineering, Beijing, China, September 15-17, 2007 304 Personalized e-learning a Goal Oriented Approach ZHIQI SHEN 1,

More information

Core Enterprise Services, SOA, and Semantic Technologies: Supporting Semantic Interoperability

Core Enterprise Services, SOA, and Semantic Technologies: Supporting Semantic Interoperability Core Enterprise, SOA, and Semantic Technologies: Supporting Semantic Interoperability in a Network-Enabled Environment 2011 SOA & Semantic Technology Symposium 13-14 July 2011 Sven E. Kuehne sven.kuehne@nc3a.nato.int

More information

Design and Implementation of the Self-Management Travel System

Design and Implementation of the Self-Management Travel System Design and Implementation of the Self-Management Travel System Dongbei University of Finance and Economics School of Tourism and Hotel Management,Dalian, China 116025 Abstract Through skeleton system,

More information

A Cloud Computing-Based ERP System under The Cloud Manufacturing

A Cloud Computing-Based ERP System under The Cloud Manufacturing A Cloud Computing-Based ERP System under The Cloud Manufacturing Environment 1 Nan Yang, 2 Dongbo Li, 3 Yifei Tong 1, First Author Department of Industry Engineering,Nanjing University of Science and Technology,Nanjing210094,People

More information

A Survey on Data Warehouse Architecture

A Survey on Data Warehouse Architecture A Survey on Data Warehouse Architecture Rajiv Senapati 1, D.Anil Kumar 2 1 Assistant Professor, Department of IT, G.I.E.T, Gunupur, India 2 Associate Professor, Department of CSE, G.I.E.T, Gunupur, India

More information

Data Modeling for Big Data

Data Modeling for Big Data Data Modeling for Big Data by Jinbao Zhu, Principal Software Engineer, and Allen Wang, Manager, Software Engineering, CA Technologies In the Internet era, the volume of data we deal with has grown to terabytes

More information

JOURNAL OF OBJECT TECHNOLOGY

JOURNAL OF OBJECT TECHNOLOGY JOURNAL OF OBJECT TECHNOLOGY Online at www.jot.fm. Published by ETH Zurich, Chair of Software Engineering JOT, 2008 Vol. 7, No. 8, November-December 2008 What s Your Information Agenda? Mahesh H. Dodani,

More information

Data Mining in Web Search Engine Optimization and User Assisted Rank Results

Data Mining in Web Search Engine Optimization and User Assisted Rank Results Data Mining in Web Search Engine Optimization and User Assisted Rank Results Minky Jindal Institute of Technology and Management Gurgaon 122017, Haryana, India Nisha kharb Institute of Technology and Management

More information

An Oracle White Paper November 2010. Leveraging Massively Parallel Processing in an Oracle Environment for Big Data Analytics

An Oracle White Paper November 2010. Leveraging Massively Parallel Processing in an Oracle Environment for Big Data Analytics An Oracle White Paper November 2010 Leveraging Massively Parallel Processing in an Oracle Environment for Big Data Analytics 1 Introduction New applications such as web searches, recommendation engines,

More information

Fault Analysis in Software with the Data Interaction of Classes

Fault Analysis in Software with the Data Interaction of Classes , pp.189-196 http://dx.doi.org/10.14257/ijsia.2015.9.9.17 Fault Analysis in Software with the Data Interaction of Classes Yan Xiaobo 1 and Wang Yichen 2 1 Science & Technology on Reliability & Environmental

More information

Big Data Provenance: Challenges and Implications for Benchmarking

Big Data Provenance: Challenges and Implications for Benchmarking Big Data Provenance: Challenges and Implications for Benchmarking Boris Glavic Illinois Institute of Technology 10 W 31st Street, Chicago, IL 60615, USA glavic@iit.edu Abstract. Data Provenance is information

More information

HadoopRDF : A Scalable RDF Data Analysis System

HadoopRDF : A Scalable RDF Data Analysis System HadoopRDF : A Scalable RDF Data Analysis System Yuan Tian 1, Jinhang DU 1, Haofen Wang 1, Yuan Ni 2, and Yong Yu 1 1 Shanghai Jiao Tong University, Shanghai, China {tian,dujh,whfcarter}@apex.sjtu.edu.cn

More information

Make search become the internal function of Internet

Make search become the internal function of Internet Make search become the internal function of Internet Wang Liang 1, Guo Yi-Ping 2, Fang Ming 3 1, 3 (Department of Control Science and Control Engineer, Huazhong University of Science and Technology, WuHan,

More information

BIG DATA IN THE CLOUD : CHALLENGES AND OPPORTUNITIES MARY- JANE SULE & PROF. MAOZHEN LI BRUNEL UNIVERSITY, LONDON

BIG DATA IN THE CLOUD : CHALLENGES AND OPPORTUNITIES MARY- JANE SULE & PROF. MAOZHEN LI BRUNEL UNIVERSITY, LONDON BIG DATA IN THE CLOUD : CHALLENGES AND OPPORTUNITIES MARY- JANE SULE & PROF. MAOZHEN LI BRUNEL UNIVERSITY, LONDON Overview * Introduction * Multiple faces of Big Data * Challenges of Big Data * Cloud Computing

More information

Context Capture in Software Development

Context Capture in Software Development Context Capture in Software Development Bruno Antunes, Francisco Correia and Paulo Gomes Knowledge and Intelligent Systems Laboratory Cognitive and Media Systems Group Centre for Informatics and Systems

More information

Open Access Research on Database Massive Data Processing and Mining Method based on Hadoop Cloud Platform

Open Access Research on Database Massive Data Processing and Mining Method based on Hadoop Cloud Platform Send Orders for Reprints to reprints@benthamscience.ae The Open Automation and Control Systems Journal, 2014, 6, 1463-1467 1463 Open Access Research on Database Massive Data Processing and Mining Method

More information

An Ontology-enhanced Cloud Service Discovery System

An Ontology-enhanced Cloud Service Discovery System An Ontology-enhanced Cloud Service Discovery System Taekgyeong Han and Kwang Mong Sim* Abstract This paper presents a Cloud service discovery system (CSDS) that aims to support the Cloud users in finding

More information

SEO Techniques for various Applications - A Comparative Analyses and Evaluation

SEO Techniques for various Applications - A Comparative Analyses and Evaluation IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727 PP 20-24 www.iosrjournals.org SEO Techniques for various Applications - A Comparative Analyses and Evaluation Sandhya

More information

Map-Parallel Scheduling (mps) using Hadoop environment for job scheduler and time span for Multicore Processors

Map-Parallel Scheduling (mps) using Hadoop environment for job scheduler and time span for Multicore Processors Map-Parallel Scheduling (mps) using Hadoop environment for job scheduler and time span for Sudarsanam P Abstract G. Singaravel Parallel computing is an base mechanism for data process with scheduling task,

More information

Design of Remote data acquisition system based on Internet of Things

Design of Remote data acquisition system based on Internet of Things , pp.32-36 http://dx.doi.org/10.14257/astl.214.79.07 Design of Remote data acquisition system based on Internet of Things NIU Ling Zhou Kou Normal University, Zhoukou 466001,China; Niuling@zknu.edu.cn

More information

XMPP A Perfect Protocol for the New Era of Volunteer Cloud Computing

XMPP A Perfect Protocol for the New Era of Volunteer Cloud Computing International Journal of Computational Engineering Research Vol, 03 Issue, 10 XMPP A Perfect Protocol for the New Era of Volunteer Cloud Computing Kamlesh Lakhwani 1, Ruchika Saini 1 1 (Dept. of Computer

More information

A Method of Cloud Resource Load Balancing Scheduling Based on Improved Adaptive Genetic Algorithm

A Method of Cloud Resource Load Balancing Scheduling Based on Improved Adaptive Genetic Algorithm Journal of Information & Computational Science 9: 16 (2012) 4801 4809 Available at http://www.joics.com A Method of Cloud Resource Load Balancing Scheduling Based on Improved Adaptive Genetic Algorithm

More information

Study on Cloud Service Mode of Agricultural Information Institutions

Study on Cloud Service Mode of Agricultural Information Institutions Study on Cloud Service Mode of Agricultural Information Institutions Xiaorong Yang, Nengfu Xie, Dan Wang, Lihua Jiang To cite this version: Xiaorong Yang, Nengfu Xie, Dan Wang, Lihua Jiang. Study on Cloud

More information

International Journal of Engineering Research ISSN: 2348-4039 & Management Technology November-2015 Volume 2, Issue-6

International Journal of Engineering Research ISSN: 2348-4039 & Management Technology November-2015 Volume 2, Issue-6 International Journal of Engineering Research ISSN: 2348-4039 & Management Technology Email: editor@ijermt.org November-2015 Volume 2, Issue-6 www.ijermt.org Modeling Big Data Characteristics for Discovering

More information

Towards the Optimization of Data Mining Execution Process in Distributed Environments

Towards the Optimization of Data Mining Execution Process in Distributed Environments Journal of Computational Information Systems 7: 8 (2011) 2931-2939 Available at http://www.jofcis.com Towards the Optimization of Data Mining Execution Process in Distributed Environments Yan ZHANG 1,,

More information

IBM AND NEXT GENERATION ARCHITECTURE FOR BIG DATA & ANALYTICS!

IBM AND NEXT GENERATION ARCHITECTURE FOR BIG DATA & ANALYTICS! The Bloor Group IBM AND NEXT GENERATION ARCHITECTURE FOR BIG DATA & ANALYTICS VENDOR PROFILE The IBM Big Data Landscape IBM can legitimately claim to have been involved in Big Data and to have a much broader

More information

Open Access Research of Massive Spatiotemporal Data Mining Technology Based on Cloud Computing

Open Access Research of Massive Spatiotemporal Data Mining Technology Based on Cloud Computing Send Orders for Reprints to reprints@benthamscience.ae 2244 The Open Automation and Control Systems Journal, 2015, 7, 2244-2252 Open Access Research of Massive Spatiotemporal Data Mining Technology Based

More information

Operation and Maintenance Management Strategy of Cloud Computing Data Center

Operation and Maintenance Management Strategy of Cloud Computing Data Center , pp.5-9 http://dx.doi.org/10.14257/astl.2014.78.02 Operation and Maintenance Management Strategy of Cloud Computing Data Center Wei Bai 1, Wenli Geng 1 1 Computer and information engineering institute

More information

FUTURE RESEARCH DIRECTIONS OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING *

FUTURE RESEARCH DIRECTIONS OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING * International Journal of Software Engineering and Knowledge Engineering World Scientific Publishing Company FUTURE RESEARCH DIRECTIONS OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING * HAIPING XU Computer

More information

A Semantic Approach for Access Control in Web Services

A Semantic Approach for Access Control in Web Services A Semantic Approach for Access Control in Web Services M. I. Yagüe, J. Mª Troya Computer Science Department, University of Málaga, Málaga, Spain {yague, troya}@lcc.uma.es Abstract One of the most important

More information

Semantic Concept Based Retrieval of Software Bug Report with Feedback

Semantic Concept Based Retrieval of Software Bug Report with Feedback Semantic Concept Based Retrieval of Software Bug Report with Feedback Tao Zhang, Byungjeong Lee, Hanjoon Kim, Jaeho Lee, Sooyong Kang, and Ilhoon Shin Abstract Mining software bugs provides a way to develop

More information

International Journal of Computer Science Trends and Technology (IJCST) Volume 3 Issue 3, May-June 2015

International Journal of Computer Science Trends and Technology (IJCST) Volume 3 Issue 3, May-June 2015 RESEARCH ARTICLE OPEN ACCESS Ensuring Reliability and High Availability in Cloud by Employing a Fault Tolerance Enabled Load Balancing Algorithm G.Gayathri [1], N.Prabakaran [2] Department of Computer

More information