Dynamic Querying In NoSQL System
|
|
|
- Matilda Kennedy
- 10 years ago
- Views:
Transcription
1 Dynamic Querying In NoSQL System Reshma Suku 1, Kasim K. 2 Student, Dept. of Computer Science and Engineering, M. E. A Engineering College, Perinthalmanna, Kerala, India 1 Assistant Professor, Dept. of Computer Science and Engineering, M. E. A Engineering College, Perinthalmanna, Kerala, India 2 ABSTRACT: Three mege trends Big data, Big user and cloud computing that leads to the adoption of NoSQL. NoSQL simply means Not Only SQL. Today most of the applications are hosted in cloud and that available through internet. They must support large number of users 24 hours a day, 365 days a year. This create explosion in number of concurrent users. So here needs a technique to handle large number of data. This paper proposes a dynamic query form interface for database exploration of an organization. Here use a document oriented NoSQL database ie, MONGODB. MONGODB support dynamic queries that do not require predefined map reduce function. The generation of a query form is an iterative process guided by user. At each iteration, system automatically generate ranking list of form components and user adds the desired form component into query form and submit queries to view query result. There are two traditional measures to evaluate the quality of query result ie, precision and recall. From the quality measures we can derive overall performance measure as F- measure. KEYWORDS: query form, user interaction, F-measure. I. INTRODUCTION Modern scientific database and web database maintain large and heterogeneous data. Web databases include Freebase[5] and DBPedia[3] have thousands of structured web entities. These real world database contain over hundreds or even thousands of relations and attributes. Traditional predefined query forms are not able to satisfy various ad-hoc queries on those complex database. Query form is one of the most widely used user interface to return users desired result. In this paper, here proposes Dynamic Query Form System[1], a query form interface which is able to dynamically generate query form according to user desire at run time. This is entirely different from document retrieval, users in database retrieval perform many refining query condition before identifying the final candidates. Interactive applications have changed dramatically over last 15 years. In the late 90 s, large web companies deals with increase in scale on many dimensions inclue: i. Big Data ii. Big User iii. Cloud Computing Dealing with these issues was more difficult using relational database technology. The key reason behind is that relational database are architected to run on a single machine and use a rigid, schema based approach to modelling data. Google, Amazon, Facebook and LinkedIn were among the first companies to discover the limitation of relational database technology for supporting new application requirement. Copyright to IJIRSET 282
2 In this paper, use MONGODB[14] instead of relational base. It is a cross platform, document oriented database that provide high performance, high availability and scalability. MONGODB works on the concept of collection and document. Document oriented database are different from traditional relational database. Rather than store data in rigid structures like tables, they store data in loosely defined document. With relational database management system table, if we need to add new column, we need to change the definition of table itself, which will add that column to every existing record. This is due to RDBMS strict schema based design. However, with documents we can add new attributes to individual document without changing any other documents. This is because document oriented database are generally schema less by design. II. RELATED WORKS In Assisted querying using instant response interfaces, A.Nandi and H.V. Jagdish[6], This proposed auto completion approaches for database queries. Here a novel user interface have been developed to assist the user to type the database queries based on query workload, the data distribution and the database schema. In automated creation of a form-based database query interface and In expressive query specification through form customization, M. Jayapandian and H.V. Jagadish[7][8], Here proposed a customized approach to provide visual interface for developers to create or customize query form. EasyQuery, ColdFusion, SAP are the main tools to help developers design and generate the query form. The problem of these tool is that, they are provided for the professional developers who are familiar with their database not for end users. Another problem is that, if the database schema is very large, it is difficult for them to find appropriate database entities, attributes and to create desired query form. In Combining keyword search and forms for adhoc querying of database, E.Chu, A.Baid X. Chai, Doan and J.F. Naughton[10], It automatically generate a lot of query form in advance. Here user input several keywords to find relevant query form. This leads to the conclusion that a query rewrite by mapping data values to schema values during keyword search. Another one is that, simply displaying the returned form as a flat list. In automating the design and construction of query forms, M.Jayapandian and H.V. Jagadish[9], Here propose automatic approaches to generate the database query without user participation. It is a work-load driven model. It applies clustering algorithm to find representative queries. One of the disadvantage is that if we generate lots of query forms in advance, there are still user queries that cannot be satisfied by one of the query form. Query recommendation for interactive database exploration, M. Eirinaki and N. Polyzotis[11], Here introduce a collaborative approach to recommend database query components for database exploration. They treat SQL queries as item in the collaborative filtering approach and recommend similar queries to related users. One of the problem is that they do not consider the goodness of query result. In proposed system recommendation is a query component for each iteration. In Building dynamic faceted search systems over database, S.B. Roy, H. Nambiar, G. Das and M.K. Mohania[12], a domain independent system, that provides effective minimum- effort based dynamic faceted search solution over enterprise database. It present relevant facets for the users according to navigation path. Dynamic faceted search engine are similar to our dynamic query form if we only consider selection components in query. In Usher: Improving Data Quality with dynamic forms, K.Chen, H.Chen, N.Conway, J.M. Hellerstein and T.S. Parikh[13], Data quality is a critical problem in modern database. USHER, an end to end system for form design, entry and data quality assurance. In DQF, deals with database query forms instead of data entry forms. Copyright to IJIRSET 283
3 III. PROBLEM FORMULATION Fig1: Flow chart of dynamic query form Figure shows flow chart of dynamic query form. A dynamic query form system which generates query form according to the users desire at run time. The system provides a solution for query interface in large and complex database. Apply F- measure to estimate the goodness of a query form. F-measure is a typical metric to evaluate query result. The metric is also appropriate for query form because query forms are designed to help users query the database. The goodness of a query form is determined by the query results generated from the query form. Based on this, we can rank and recommend the potential query form components. Here efficiency is important because dynamic query form is an online system where users often expect quick response. Each query form corresponds to SQL query template. Query forms allow users to fill parameters to generate different queries. In this paper, we focus on the projection and selection components of a query form. Ad-hoc join is not handled by our dynamic query form because join is not a part of the query form and is invisible for users. To decide whether a query form is desired or not, a user does not have time to go over every data instance in the query result. In addition, many database queries output a huge amount of data instances. In order to avoid this many-answer problem, only outputcompressed result table to show a high level view of the query result first. Each instance in the compressed table represents a cluster of actual data instances. Then, the user can click through interested clusters to view the detailed data instances. There are many one- pass clustering algorithms for generating the compressed view efficiently. In our implementation, we choose the incremental data clustering framework, because of the efficiency issue. Certainly, different data clustering methods would have different compressed views for users. Also, different clustering methods are preferable to different data types. Here, clustering is just to provide a better view of the query result for user. The system developers can select a different clustering algorithm if needed. Another important usage of the compressed view is to collect the user feedback. Using the collected feedback, we can estimate the goodness of a query form so that we could recommend appropriate query form components. In real world, end users are reluctant to provide explicit feedback. Figure below shows the user action. Copyright to IJIRSET 284
4 Fig.2 User action i). Component ranking IV. METHODOLOGY DQF has two-level ranked list for projection components. The first level is the ranked list of entities. Second level is the ranked list of attributes in the same entity. For ranking attribute we have to calculate F-Score. This can be calculated from corresponding precision and recall as follows. Let current query form be F i, the next query form be F i+1, we obtain FScore(F i+1 ) as follows: FScore(F i+1 ) = (1+β).Precision E (F i+1 ).Recall E (F i+1 ) β 2 Precision E (F i+1 )+ Recall E (F i+1 ) Here apply kernel density estimation method to estimate user interest on instances in the query result. We utilize the schema graph to compute the relevance of two attribute. The ranking score of an entity is just average of FScore(F i+1 ) of that entity s attributes. If one entity has many high score attribute, then it should have a higher rank. The selection attribute are relevant to the current projected entities, otherwise selection would be meaningless. Relevant attributes can be selected based on the database schema. For simplicity of user interface, most of the query form s selection component are simple binary relation in the form A j op C j. Where A j is an attribute, C j is a constant and op is a relational operator. The following algorithm describes the algorithm of one-query s query construction. One Query adds all the selection attributes into projection of query. The function Generate Query is to generate the database query based on projection attribute A one and selection expression σ one. When the component attributes were ranked, we can determine the probability of user interested form components. Here we can use schema graph to show relevance of attribute. When comes to entity ranking, if one entity has many high score attribute, then it should have higher rank. Copyright to IJIRSET 285
5 Algorithm 1: Query Construction Data: Q={Q 1, Q 2,.,} is a set of previous queries executed on F i+1 Result: Q one is the query of one-query begin σ one 0 for Qϵ Q do σ one σ one V σq A one A Fi U A r (F r ) Q one GenerateQuery(A one,σ one ) ii). Quality Metric Module The quality of query result can be described by paying more importance to precision and recall. Precision is also called positive predicate value. It is the fraction of retrieved instance that are relevant. Recall is also called sensitivity. Recall is the fraction of relevant instance that are retrieved. Query forms are able to produce different queries by different input and different queries can output different query result and achieve different precision and recall. Hence we use expected precision and expected recall to evaluate expected performance of query form. Probabilistic model can be used to find precision and recall. iii). Metaprocessor Module Metadata can be defined as the data providing information about one or more aspects of the data. This provides user friendly interface to novel users. Map-reduce function is used to extract keys from collection. In map-reduce operation, MONGODB applies map phase to each input documents. The map function emit key-value pairs. For those keys have multiple values, MONGODB applies the reduce phase, which collect and condense the aggregated data. All map reduce functions in MONGODB are javascript and run within the mongod process. iv). Query Processor Module The essence of DQF is to capture users interest during user interactions and adapt the query form iteratively. In each iteration the user interaction be two types as follows: Query Form Enrichment 1. DQF recommends a ranked list of query form components to the user. 2. The user select the desired form components into the current query form. Copyright to IJIRSET 286
6 Query Execution 1. The user fills out the current query form and submit a query. 2. DQF executes the query and shows the result. 3. The user provides the feedback about the query result. SQL to MONGODB mapping chart as follows: Fig 3: Data retrieval SQL Terms/Concept Database Table Row Column Index MONGODB Tems/Concept Database Collection Document/ BSON Document Field Index Copyright to IJIRSET 287
7 Table Join Primary Key Specify any unique coloumn or coloumn combination as primary key Aggregation(Group By) Embedded document or linking Primary Key In MONGODB, primary key is set to the _id field Aggregation pipeline V. FACILITIES FOR THE REQUIRED WORK The system will be implemented as a J2EE application and is a web based system. The application is based on MVC application architecture and is confirmed to the modern n- tier application pattern. The system front end is normal web browser(internet explorer). This constitutes the thin client tier. The browser connects to a web server and web server make use of a Java application server to invoke the JSP components(the web tier ). These JSP components realize presentation logic for the system. The JSP components in turn call into Java Bean components. The Java Bean components are responsible for implementing the business logic of the application. This is the business logic tier. Finally the data persistence is done using an NoSQL system comprising the data tier. Here use a document oriented NoSQL such as MONGODB. A standard server machine will be required which will act as the server machine and a web container like Apache Tomcat will be installed in the server on which the application will be running. VI. CONCLUSION AND FUTURE WORK Propose a dynamic query form generation approach which helps users dynamically generate query forms. The key idea is to use a probabilistic model to rank form components based on user preferences. We capture user preference using both historical queries and run-time feedback such as clickthrough. Experimental result shows that the dynamic approach often leads to higher success rate and simpler query form compared with a static approach. The ranking of form components also makes it easier for users to customize query forms. Here use a NoSQL database such as MONGODB[14]. As for future work, we plan to develop multiple methods to capture the user s interest for the queries besides the click feedback. For instance, we can add a text-box for users to input some keywords queries. The relevance score between the keywords and the query form can be incorporated into the ranking form components at each step. REFERENCES [1]. Dynamic Query Forms for Database Queries Liang Tang, Tao Li, Yexi Jiang and Zhiyuan Chen [2]. Cold Fusion. [3]. DBPedia. [4]. EasyQuery. [5]. Freebase. Copyright to IJIRSET 288
8 [6]. A.Nandi and H.V. Jagdish. Assisted querying using instant responseinterfaces. In proceedings of ACMSIGMOD, pages ,2007. [7]. M. Jayapandian and H. V. Jagadish. Automated creation of a forms-based database query interface. In proceedings of the VLDB Endowment, pages , August [8]. M. Jayapandian and H. V. Jagadish. Expressive query specification through form customization. In proceedings of International Conference on Extending Database Technology(EDBT), pages , Nantes, France, March [9]. M. Jayapandian and H. V. Jagadish. Automating the design and construction of query forms. IEEE TKDE, 21(10): ,2009. [10]. E.Chu, A. Baid, X. Chai, A. Doan and J. F. Naughton. Combaining keyword search and form for ad hoc querying of databases. In proceedings of ACM SIGMOD Conference, pages , providence, Rhode IsLand, USA, June [11] G. Chatzopoulou, M. Eirinaki and N. Polyzotis. Query recommendations for interactive database exploration. In proceedings of SSDBM, pages 3-18, New Orleans, LA, USA, June [12]. S. B. Roy, H. Wang, U. Nambiar, G. Das and M. K. Mohsnia. Dynacet: Building faceted search systems over databases. In proceedings of ICDE Conference, pages , Shanghai, China, March [13]. K. Chen, H. Chen, N. Conway, J. M. Hellerstein and T. S. Parikh. Usher: Improving data quality with dynamic forms. Im proceedings of ICDE conference, pages , Long Beach, California, USA, March [14]. MONGODBmanual. Copyright to IJIRSET 289
DYNAMIC QUERY FORMS WITH NoSQL
IMPACT: International Journal of Research in Engineering & Technology (IMPACT: IJRET) ISSN(E): 2321-8843; ISSN(P): 2347-4599 Vol. 2, Issue 7, Jul 2014, 157-162 Impact Journals DYNAMIC QUERY FORMS WITH
Cloud Scale Distributed Data Storage. Jürmo Mehine
Cloud Scale Distributed Data Storage Jürmo Mehine 2014 Outline Background Relational model Database scaling Keys, values and aggregates The NoSQL landscape Non-relational data models Key-value Document-oriented
An Efficient Human Resource Management Through Web Based Analytics
Middle-East Journal of Scientific Research 23 (7): 1512-1520, 2015 ISSN 1990-9233 IDOSI Publications, 2015 DOI: 10.5829/idosi.mejsr.2015.23.07.22381 An Efficient Human Resource Management Through Web Based
Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database
Managing Big Data with Hadoop & Vertica A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database Copyright Vertica Systems, Inc. October 2009 Cloudera and Vertica
How To Handle Big Data With A Data Scientist
III Big Data Technologies Today, new technologies make it possible to realize value from Big Data. Big data technologies can replace highly customized, expensive legacy systems with a standard solution
An Approach to Implement Map Reduce with NoSQL Databases
www.ijecs.in International Journal Of Engineering And Computer Science ISSN: 2319-7242 Volume 4 Issue 8 Aug 2015, Page No. 13635-13639 An Approach to Implement Map Reduce with NoSQL Databases Ashutosh
KEYWORD SEARCH IN RELATIONAL DATABASES
KEYWORD SEARCH IN RELATIONAL DATABASES N.Divya Bharathi 1 1 PG Scholar, Department of Computer Science and Engineering, ABSTRACT Adhiyamaan College of Engineering, Hosur, (India). Data mining refers to
CiteSeer x in the Cloud
Published in the 2nd USENIX Workshop on Hot Topics in Cloud Computing 2010 CiteSeer x in the Cloud Pradeep B. Teregowda Pennsylvania State University C. Lee Giles Pennsylvania State University Bhuvan Urgaonkar
BIG DATA: BIG CHALLENGE FOR SOFTWARE TESTERS
BIG DATA: BIG CHALLENGE FOR SOFTWARE TESTERS Megha Joshi Assistant Professor, ASM s Institute of Computer Studies, Pune, India Abstract: Industry is struggling to handle voluminous, complex, unstructured
Chapter 11 Map-Reduce, Hadoop, HDFS, Hbase, MongoDB, Apache HIVE, and Related
Chapter 11 Map-Reduce, Hadoop, HDFS, Hbase, MongoDB, Apache HIVE, and Related Summary Xiangzhe Li Nowadays, there are more and more data everyday about everything. For instance, here are some of the astonishing
Analytics March 2015 White paper. Why NoSQL? Your database options in the new non-relational world
Analytics March 2015 White paper Why NoSQL? Your database options in the new non-relational world 2 Why NoSQL? Contents 2 New types of apps are generating new types of data 2 A brief history of NoSQL 3
A UPS Framework for Providing Privacy Protection in Personalized Web Search
A UPS Framework for Providing Privacy Protection in Personalized Web Search V. Sai kumar 1, P.N.V.S. Pavan Kumar 2 PG Scholar, Dept. of CSE, G Pulla Reddy Engineering College, Kurnool, Andhra Pradesh,
Chapter 7. Using Hadoop Cluster and MapReduce
Chapter 7 Using Hadoop Cluster and MapReduce Modeling and Prototyping of RMS for QoS Oriented Grid Page 152 7. Using Hadoop Cluster and MapReduce for Big Data Problems The size of the databases used in
Cloud computing - Architecting in the cloud
Cloud computing - Architecting in the cloud [email protected] 1 Outline Cloud computing What is? Levels of cloud computing: IaaS, PaaS, SaaS Moving to the cloud? Architecting in the cloud Best practices
Oracle Big Data SQL Technical Update
Oracle Big Data SQL Technical Update Jean-Pierre Dijcks Oracle Redwood City, CA, USA Keywords: Big Data, Hadoop, NoSQL Databases, Relational Databases, SQL, Security, Performance Introduction This technical
Manifest for Big Data Pig, Hive & Jaql
Manifest for Big Data Pig, Hive & Jaql Ajay Chotrani, Priyanka Punjabi, Prachi Ratnani, Rupali Hande Final Year Student, Dept. of Computer Engineering, V.E.S.I.T, Mumbai, India Faculty, Computer Engineering,
Why NoSQL? Your database options in the new non- relational world. 2015 IBM Cloudant 1
Why NoSQL? Your database options in the new non- relational world 2015 IBM Cloudant 1 Table of Contents New types of apps are generating new types of data... 3 A brief history on NoSQL... 3 NoSQL s roots
Towards Privacy aware Big Data analytics
Towards Privacy aware Big Data analytics Pietro Colombo, Barbara Carminati, and Elena Ferrari Department of Theoretical and Applied Sciences, University of Insubria, Via Mazzini 5, 21100 - Varese, Italy
NoSQL and Hadoop Technologies On Oracle Cloud
NoSQL and Hadoop Technologies On Oracle Cloud Vatika Sharma 1, Meenu Dave 2 1 M.Tech. Scholar, Department of CSE, Jagan Nath University, Jaipur, India 2 Assistant Professor, Department of CSE, Jagan Nath
Reference Architecture, Requirements, Gaps, Roles
Reference Architecture, Requirements, Gaps, Roles The contents of this document are an excerpt from the brainstorming document M0014. The purpose is to show how a detailed Big Data Reference Architecture
TRENDS IN THE DEVELOPMENT OF BUSINESS INTELLIGENCE SYSTEMS
9 8 TRENDS IN THE DEVELOPMENT OF BUSINESS INTELLIGENCE SYSTEMS Assist. Prof. Latinka Todoranova Econ Lit C 810 Information technology is a highly dynamic field of research. As part of it, business intelligence
AD-HOC QUERY BUILDER
AD-HOC QUERY BUILDER International Institute of Information Technology Bangalore Submitted By: Bratati Mohapatra (MT2009089) Rashmi R Rao (MT2009116) Niranjani S (MT2009124) Guided By: Prof Chandrashekar
Open Source Technologies on Microsoft Azure
Open Source Technologies on Microsoft Azure A Survey @DChappellAssoc Copyright 2014 Chappell & Associates The Main Idea i Open source technologies are a fundamental part of Microsoft Azure The Big Questions
Composite Data Virtualization Composite Data Virtualization And NOSQL Data Stores
Composite Data Virtualization Composite Data Virtualization And NOSQL Data Stores Composite Software October 2010 TABLE OF CONTENTS INTRODUCTION... 3 BUSINESS AND IT DRIVERS... 4 NOSQL DATA STORES LANDSCAPE...
Search Result Optimization using Annotators
Search Result Optimization using Annotators Vishal A. Kamble 1, Amit B. Chougule 2 1 Department of Computer Science and Engineering, D Y Patil College of engineering, Kolhapur, Maharashtra, India 2 Professor,
BIG DATA CHALLENGES AND PERSPECTIVES
BIG DATA CHALLENGES AND PERSPECTIVES Meenakshi Sharma 1, Keshav Kishore 2 1 Student of Master of Technology, 2 Head of Department, Department of Computer Science and Engineering, A P Goyal Shimla University,
Hacettepe University Department Of Computer Engineering BBM 471 Database Management Systems Experiment
Hacettepe University Department Of Computer Engineering BBM 471 Database Management Systems Experiment Subject NoSQL Databases - MongoDB Submission Date 20.11.2013 Due Date 26.12.2013 Programming Environment
ESS event: Big Data in Official Statistics. Antonino Virgillito, Istat
ESS event: Big Data in Official Statistics Antonino Virgillito, Istat v erbi v is 1 About me Head of Unit Web and BI Technologies, IT Directorate of Istat Project manager and technical coordinator of Web
International Journal of Engineering Research ISSN: 2348-4039 & Management Technology November-2015 Volume 2, Issue-6
International Journal of Engineering Research ISSN: 2348-4039 & Management Technology Email: [email protected] November-2015 Volume 2, Issue-6 www.ijermt.org Modeling Big Data Characteristics for Discovering
Application of NoSQL Database in Web Crawling
Application of NoSQL Database in Web Crawling College of Computer and Software, Nanjing University of Information Science and Technology, Nanjing, 210044, China doi:10.4156/jdcta.vol5.issue6.31 Abstract
THE SEMANTIC WEB AND IT`S APPLICATIONS
15-16 September 2011, BULGARIA 1 Proceedings of the International Conference on Information Technologies (InfoTech-2011) 15-16 September 2011, Bulgaria THE SEMANTIC WEB AND IT`S APPLICATIONS Dimitar Vuldzhev
Automated Data Ingestion. Bernhard Disselhoff Enterprise Sales Engineer
Automated Data Ingestion Bernhard Disselhoff Enterprise Sales Engineer Agenda Pentaho Overview Templated dynamic ETL workflows Pentaho Data Integration (PDI) Use Cases Pentaho Overview Overview What we
CLOUD BASED PEER TO PEER NETWORK FOR ENTERPRISE DATAWAREHOUSE SHARING
CLOUD BASED PEER TO PEER NETWORK FOR ENTERPRISE DATAWAREHOUSE SHARING Basangouda V.K 1,Aruna M.G 2 1 PG Student, Dept of CSE, M.S Engineering College, Bangalore,[email protected] 2 Associate Professor.,
GigaSpaces Real-Time Analytics for Big Data
GigaSpaces Real-Time Analytics for Big Data GigaSpaces makes it easy to build and deploy large-scale real-time analytics systems Rapidly increasing use of large-scale and location-aware social media and
Oracle Data Integrator 11g: Integration and Administration
Oracle University Contact Us: Local: 1800 103 4775 Intl: +91 80 4108 4709 Oracle Data Integrator 11g: Integration and Administration Duration: 5 Days What you will learn Oracle Data Integrator is a comprehensive
Big Data and Analytics: Challenges and Opportunities
Big Data and Analytics: Challenges and Opportunities Dr. Amin Beheshti Lecturer and Senior Research Associate University of New South Wales, Australia (Service Oriented Computing Group, CSE) Talk: Sharif
Associate Professor, Department of CSE, Shri Vishnu Engineering College for Women, Andhra Pradesh, India 2
Volume 6, Issue 3, March 2016 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Special Issue
JOURNAL OF COMPUTER SCIENCE AND ENGINEERING
Exploration on Service Matching Methodology Based On Description Logic using Similarity Performance Parameters K.Jayasri Final Year Student IFET College of engineering [email protected] R.Rajmohan
NoSQL systems: introduction and data models. Riccardo Torlone Università Roma Tre
NoSQL systems: introduction and data models Riccardo Torlone Università Roma Tre Why NoSQL? In the last thirty years relational databases have been the default choice for serious data storage. An architect
Beyond Web Application Log Analysis using Apache TM Hadoop. A Whitepaper by Orzota, Inc.
Beyond Web Application Log Analysis using Apache TM Hadoop A Whitepaper by Orzota, Inc. 1 Web Applications As more and more software moves to a Software as a Service (SaaS) model, the web application has
On- Prem MongoDB- as- a- Service Powered by the CumuLogic DBaaS Platform
On- Prem MongoDB- as- a- Service Powered by the CumuLogic DBaaS Platform Page 1 of 16 Table of Contents Table of Contents... 2 Introduction... 3 NoSQL Databases... 3 CumuLogic NoSQL Database Service...
Similarity Search in a Very Large Scale Using Hadoop and HBase
Similarity Search in a Very Large Scale Using Hadoop and HBase Stanislav Barton, Vlastislav Dohnal, Philippe Rigaux LAMSADE - Universite Paris Dauphine, France Internet Memory Foundation, Paris, France
INTRODUCTION TO CASSANDRA
INTRODUCTION TO CASSANDRA This ebook provides a high level overview of Cassandra and describes some of its key strengths and applications. WHAT IS CASSANDRA? Apache Cassandra is a high performance, open
CloudDB: A Data Store for all Sizes in the Cloud
CloudDB: A Data Store for all Sizes in the Cloud Hakan Hacigumus Data Management Research NEC Laboratories America http://www.nec-labs.com/dm www.nec-labs.com What I will try to cover Historical perspective
Lecture Data Warehouse Systems
Lecture Data Warehouse Systems Eva Zangerle SS 2013 PART C: Novel Approaches in DW NoSQL and MapReduce Stonebraker on Data Warehouses Star and snowflake schemas are a good idea in the DW world C-Stores
Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing
Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing Wayne W. Eckerson Director of Research, TechTarget Founder, BI Leadership Forum Business Analytics
Unified Batch & Stream Processing Platform
Unified Batch & Stream Processing Platform Himanshu Bari Director Product Management Most Big Data Use Cases Are About Improving/Re-write EXISTING solutions To KNOWN problems Current Solutions Were Built
Dynamic Resource allocation in Cloud
Dynamic Resource allocation in Cloud ABSTRACT: Cloud computing allows business customers to scale up and down their resource usage based on needs. Many of the touted gains in the cloud model come from
Business Intelligence and Service Oriented Architectures. An Oracle White Paper May 2007
Business Intelligence and Service Oriented Architectures An Oracle White Paper May 2007 Note: The following is intended to outline our general product direction. It is intended for information purposes
A Survey on Product Aspect Ranking
A Survey on Product Aspect Ranking Charushila Patil 1, Prof. P. M. Chawan 2, Priyamvada Chauhan 3, Sonali Wankhede 4 M. Tech Student, Department of Computer Engineering and IT, VJTI College, Mumbai, Maharashtra,
Online Content Optimization Using Hadoop. Jyoti Ahuja Dec 20 2011
Online Content Optimization Using Hadoop Jyoti Ahuja Dec 20 2011 What do we do? Deliver right CONTENT to the right USER at the right TIME o Effectively and pro-actively learn from user interactions with
Monitis Project Proposals for AUA. September 2014, Yerevan, Armenia
Monitis Project Proposals for AUA September 2014, Yerevan, Armenia Distributed Log Collecting and Analysing Platform Project Specifications Category: Big Data and NoSQL Software Requirements: Apache Hadoop
Oracle Data Miner (Extension of SQL Developer 4.0)
An Oracle White Paper October 2013 Oracle Data Miner (Extension of SQL Developer 4.0) Generate a PL/SQL script for workflow deployment Denny Wong Oracle Data Mining Technologies 10 Van de Graff Drive Burlington,
Domain driven design, NoSQL and multi-model databases
Domain driven design, NoSQL and multi-model databases Java Meetup New York, 10 November 2014 Max Neunhöffer www.arangodb.com Max Neunhöffer I am a mathematician Earlier life : Research in Computer Algebra
A Novel Cloud Based Elastic Framework for Big Data Preprocessing
School of Systems Engineering A Novel Cloud Based Elastic Framework for Big Data Preprocessing Omer Dawelbeit and Rachel McCrindle October 21, 2014 University of Reading 2008 www.reading.ac.uk Overview
Understanding Web personalization with Web Usage Mining and its Application: Recommender System
Understanding Web personalization with Web Usage Mining and its Application: Recommender System Manoj Swami 1, Prof. Manasi Kulkarni 2 1 M.Tech (Computer-NIMS), VJTI, Mumbai. 2 Department of Computer Technology,
A REVIEW PAPER ON THE HADOOP DISTRIBUTED FILE SYSTEM
A REVIEW PAPER ON THE HADOOP DISTRIBUTED FILE SYSTEM Sneha D.Borkar 1, Prof.Chaitali S.Surtakar 2 Student of B.E., Information Technology, J.D.I.E.T, [email protected] Assistant Professor, Information
Customer Bank Account Management System Technical Specification Document
Customer Bank Account Management System Technical Specification Document Technical Specification Document Page 1 of 15 Table of Contents Contents 1 Introduction 3 2 Design Overview 4 3 Topology Diagram.6
International Journal of Innovative Research in Computer and Communication Engineering
FP Tree Algorithm and Approaches in Big Data T.Rathika 1, J.Senthil Murugan 2 Assistant Professor, Department of CSE, SRM University, Ramapuram Campus, Chennai, Tamil Nadu,India 1 Assistant Professor,
Optimization of Search Results with Duplicate Page Elimination using Usage Data A. K. Sharma 1, Neelam Duhan 2 1, 2
Optimization of Search Results with Duplicate Page Elimination using Usage Data A. K. Sharma 1, Neelam Duhan 2 1, 2 Department of Computer Engineering, YMCA University of Science & Technology, Faridabad,
QOS Based Web Service Ranking Using Fuzzy C-means Clusters
Research Journal of Applied Sciences, Engineering and Technology 10(9): 1045-1050, 2015 ISSN: 2040-7459; e-issn: 2040-7467 Maxwell Scientific Organization, 2015 Submitted: March 19, 2015 Accepted: April
So today we shall continue our discussion on the search engines and web crawlers. (Refer Slide Time: 01:02)
Internet Technology Prof. Indranil Sengupta Department of Computer Science and Engineering Indian Institute of Technology, Kharagpur Lecture No #39 Search Engines and Web Crawler :: Part 2 So today we
Performance Management of SQL Server
Performance Management of SQL Server Padma Krishnan Senior Manager When we design applications, we give equal importance to the backend database as we do to the architecture and design of the application
Ramesh Bhashyam Teradata Fellow Teradata Corporation [email protected]
Challenges of Handling Big Data Ramesh Bhashyam Teradata Fellow Teradata Corporation [email protected] Trend Too much information is a storage issue, certainly, but too much information is also
Performance and Scalability Overview
Performance and Scalability Overview This guide provides an overview of some of the performance and scalability capabilities of the Pentaho Business Analytics platform. PENTAHO PERFORMANCE ENGINEERING
Augmented Search for Web Applications. New frontier in big log data analysis and application intelligence
Augmented Search for Web Applications New frontier in big log data analysis and application intelligence Business white paper May 2015 Web applications are the most common business applications today.
Business Application Services Testing
Business Application Services Testing Curriculum Structure Course name Duration(days) Express 2 Testing Concept and methodologies 3 Introduction to Performance Testing 3 Web Testing 2 QTP 5 SQL 5 Load
NoSQL: Robust and Efficient Data Management on Deduplication Process by using a mobile application
NoSQL: Robust and Efficient Data Management on Deduplication Process by using a mobile application Hemn B.Abdalla, Jinzhao Lin, Guoquan Li Abstract Present Information technology has an enormous responsibility
Oracle Data Integrator 12c: Integration and Administration
Oracle University Contact Us: +33 15 7602 081 Oracle Data Integrator 12c: Integration and Administration Duration: 5 Days What you will learn Oracle Data Integrator is a comprehensive data integration
Client Overview. Engagement Situation. Key Requirements
Client Overview Our client is one of the leading providers of business intelligence systems for customers especially in BFSI space that needs intensive data analysis of huge amounts of data for their decision
The 4 Pillars of Technosoft s Big Data Practice
beyond possible Big Use End-user applications Big Analytics Visualisation tools Big Analytical tools Big management systems The 4 Pillars of Technosoft s Big Practice Overview Businesses have long managed
How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time
SCALEOUT SOFTWARE How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time by Dr. William Bain and Dr. Mikhail Sobolev, ScaleOut Software, Inc. 2012 ScaleOut Software, Inc. 12/27/2012 T wenty-first
Exploiting Tag Clouds for Database Browsing and Querying
Exploiting Tag Clouds for Database Browsing and Querying Stefania Leone, Matthias Geel, and Moira C. Norrie Institute for Information Systems, ETH Zurich CH-8092 Zurich, Switzerland {leone geel norrie}@inf.ethz.ch
What is Analytic Infrastructure and Why Should You Care?
What is Analytic Infrastructure and Why Should You Care? Robert L Grossman University of Illinois at Chicago and Open Data Group [email protected] ABSTRACT We define analytic infrastructure to be the services,
Large-Scale Data Sets Clustering Based on MapReduce and Hadoop
Journal of Computational Information Systems 7: 16 (2011) 5956-5963 Available at http://www.jofcis.com Large-Scale Data Sets Clustering Based on MapReduce and Hadoop Ping ZHOU, Jingsheng LEI, Wenjun YE
How To Write A Database Program
SQL, NoSQL, and Next Generation DBMSs Shahram Ghandeharizadeh Director of the USC Database Lab Outline A brief history of DBMSs. OSs SQL NoSQL 1960/70 1980+ 2000+ Before Computers Database DBMS/Data Store
What s New in JReport 12
What s New in JReport 12 Introduction JReport 12 helps take data visualization to the next level by providing new tools such as Visual Analysis that enhances self-service interactive data analysis powered
Automatic Annotation Wrapper Generation and Mining Web Database Search Result
Automatic Annotation Wrapper Generation and Mining Web Database Search Result V.Yogam 1, K.Umamaheswari 2 1 PG student, ME Software Engineering, Anna University (BIT campus), Trichy, Tamil nadu, India
BIG DATA Alignment of Supply & Demand Nuria de Lama Representative of Atos Research &
BIG DATA Alignment of Supply & Demand Nuria de Lama Representative of Atos Research & Innovation 04-08-2011 to the EC 8 th February, Luxembourg Your Atos business Research technologists. and Innovation
What s Cooking in KNIME
What s Cooking in KNIME Thomas Gabriel Copyright 2015 KNIME.com AG Agenda Querying NoSQL Databases Database Improvements & Big Data Copyright 2015 KNIME.com AG 2 Querying NoSQL Databases MongoDB & CouchDB
Data Mining in Web Search Engine Optimization and User Assisted Rank Results
Data Mining in Web Search Engine Optimization and User Assisted Rank Results Minky Jindal Institute of Technology and Management Gurgaon 122017, Haryana, India Nisha kharb Institute of Technology and Management
Sisense. Product Highlights. www.sisense.com
Sisense Product Highlights Introduction Sisense is a business intelligence solution that simplifies analytics for complex data by offering an end-to-end platform that lets users easily prepare and analyze
A Brief Analysis on Architecture and Reliability of Cloud Based Data Storage
Volume 2, No.4, July August 2013 International Journal of Information Systems and Computer Sciences ISSN 2319 7595 Tejaswini S L Jayanthy et al., Available International Online Journal at http://warse.org/pdfs/ijiscs03242013.pdf
Big Systems, Big Data
Big Systems, Big Data When considering Big Distributed Systems, it can be noted that a major concern is dealing with data, and in particular, Big Data Have general data issues (such as latency, availability,
MongoDB Developer and Administrator Certification Course Agenda
MongoDB Developer and Administrator Certification Course Agenda Lesson 1: NoSQL Database Introduction What is NoSQL? Why NoSQL? Difference Between RDBMS and NoSQL Databases Benefits of NoSQL Types of NoSQL
Scalable Multiple NameNodes Hadoop Cloud Storage System
Vol.8, No.1 (2015), pp.105-110 http://dx.doi.org/10.14257/ijdta.2015.8.1.12 Scalable Multiple NameNodes Hadoop Cloud Storage System Kun Bi 1 and Dezhi Han 1,2 1 College of Information Engineering, Shanghai
Data Mining for Data Cloud and Compute Cloud
Data Mining for Data Cloud and Compute Cloud Prof. Uzma Ali 1, Prof. Punam Khandar 2 Assistant Professor, Dept. Of Computer Application, SRCOEM, Nagpur, India 1 Assistant Professor, Dept. Of Computer Application,
ifinder ENTERPRISE SEARCH
DATA SHEET ifinder ENTERPRISE SEARCH ifinder - the Enterprise Search solution for company-wide information search, information logistics and text mining. CUSTOMER QUOTE IntraFind stands for high quality
Horizontal Aggregations In SQL To Generate Data Sets For Data Mining Analysis In An Optimized Manner
24 Horizontal Aggregations In SQL To Generate Data Sets For Data Mining Analysis In An Optimized Manner Rekha S. Nyaykhor M. Tech, Dept. Of CSE, Priyadarshini Bhagwati College of Engineering, Nagpur, India
NoSQL web apps. w/ MongoDB, Node.js, AngularJS. Dr. Gerd Jungbluth, NoSQL UG Cologne, 4.9.2013
NoSQL web apps w/ MongoDB, Node.js, AngularJS Dr. Gerd Jungbluth, NoSQL UG Cologne, 4.9.2013 About us Passionate (web) dev. since fallen in love with Sinclair ZX Spectrum Academic background in natural
Lofan Abrams Data Services for Big Data Session # 2987
Lofan Abrams Data Services for Big Data Session # 2987 Big Data Are you ready for blast-off? Big Data, for better or worse: 90% of world s data generated over last two years. ScienceDaily, ScienceDaily
Lightweight Data Integration using the WebComposition Data Grid Service
Lightweight Data Integration using the WebComposition Data Grid Service Ralph Sommermeier 1, Andreas Heil 2, Martin Gaedke 1 1 Chemnitz University of Technology, Faculty of Computer Science, Distributed
