Survey On: Nearest Neighbour Search With Keywords In Spatial Databases
|
|
|
- Robyn Watts
- 10 years ago
- Views:
Transcription
1 Survey On: Nearest Neighbour Search With Keywords In Spatial Databases SayaliBorse 1, Prof. P. M. Chawan 2, Prof. VishwanathChikaraddi 3, Prof. Manish Jansari 4 P.G. Student, Dept. of Computer Engineering& IT, V.J.T.I., Mumbai, Maharashtra, India 1 Associate Professor, Dept. of Computer Engineering& IT, V.J.T.I., Mumbai, Maharashtra, India 2 Assistant Professor, Dept. of Computer Engineering & IT, V.J.T.I., Mumbai, Maharashtra, India 3 Assistant Professor, Dept. of Computer Engineering & IT, V.J.T.I., Mumbai, Maharashtra, India 4 ABSTRACT:Users may search for different type of things from anywhere. But Search results depend on the user entered query which has to satisfy their searched properties that is stored in the spatial database. Due to rapid growth of users it become essential to optimize search results based on nearest neighbour property in spatial databases. Conventional spatial queries, such as range search and nearest neighbour retrieval, involve only geometric properties of objects which satisfies condition on geometric objects. Now a days many modern applications aim to find objects satisfying both a spatial condition and a condition on their associated texts which is known as Spatial keyword search.for example, the search engine can be used to find those nearest restaurant which offers Chinese Dishes like dumplings, pastry, stew and porridge or hospitals or chemists all at the same time. There are some straight forward approach which first deals with spatial condition and then on the non-spatial condition as a process of reduction. But these approaches are not good with complex queries. Solution to such queries is based on the IR2-tree, but IR2-tree also have some drawbacks which reduces the efficiency of IR2-tree badly. KEYWORDS:Spatial database, Data mining, Spatial queries, Spatial keyword search, Spatial inverted index I. INTRODUCTION Spatial databases are the databases used to hold geographical information. They contain data which are multidimensional in nature. A spatial database manages the multidimensional objects such as points and rectangles, etc. Such databases can be used for obtaining information based on queries and provides fast access to which objects based on the different the selection criteria. Now a days, the general use of the search engines had made it possible to write the spatial queries in the new way. Conventionally, queries much focus on the object's geometric properties only, for example, whether the point is in the rectangle, or how close two points are from the each other in the rectangle likewise query can be made on hotels, locations, schools, hospitals, lakes, parks and so on with various combinations of criteria. In the world there is need oflocation based services to search nearest neighbour. The NN search needs to process huge amount of geography information. As the number of geographic properties are more and multi-dimensional in nature, developing such systems is considered difficult task. Search speed is an important factor in such systems. With respect to a web application for making searches on geographical data, speed is an essential element. The application should produce desired results in short span of time. This is a challenging problem needs to be addressed. Nearest neighbour search is to find the nearest object contains the set of keywords. For example if the users are interested to find the nearest restaurants that offers Chinese Dishes like dumplings, pastry, stew and porridge all the same time or chemists offering particular brand medicines. There are two approaches to implement such queries that combine spatial and text features. In the first approach, we could first fetch all the restaurants whose menu contains the set of keywords {Chinese Food, dumplings, pastry, stew and porridge} and then from retrieved restaurants, find the Copyright to IJIRSET DOI: /IJIRSET
2 nearest one. Similarly in the second approach same query result can also find out reversely by targeting first spatial condition and then textual condition. IR2-tree is used in the existing system for providing best solution for finding nearest neighbour but this method has few deficiencies. II. RELATED WORK Nearest neighbour search (NNS), also known as closest point search, similarity search. It is an optimization problem for finding closest (or most similar) points. We can search closest point by giving keywords as input; it can be spatial or textual. Cao et al. [1] presented paper on the new problem for retrieving a spatial object s groups that are nearest to the query point location and each associated with a set of keywords. They collective known as spatial keyword query. G. Cong, C.S. Jensen, and D. Wu [2] proposed an approach that computes the relevancy of a query result by means of language models and a probabilistic ranking function. This relevance is then incorporated with the Euclidean distance between object and query to calculate an overall similarity of object to query. Zhang and Chee [3] introduced hybrid indexing structure br*-tree, that combines the R*-tree and bitmap indexing to process the m-closest keyword query that returns the spatially closest objects matching m keywords. In [4] Ian De Felipe et al. focused on finding objects closest to a specified location contains set of keywords. A method to answer top k spatial queries is effectively presented the method has tight of data structure and algorithms used in spatial database search and Information Retrieval. In [5] Chen li and BijiHore focused on location based information retrieval. A framework is proposed for Geographical Information Retrieval (GIR) system and focus on indexing strategies can process Spatial Keywords (SK) queries effectively. Two type of indexing mechanism is used. First method is separate index for spatial and text attribute. Second method is hybrid indices techniques combine the spatial and inverted file indices. 1. Quad Tree: III. SPATIAL DATA STRUCTURES The quad tree is used for indexing 2-Dimensional space. Each internal node of the quad tree splits the 2D space into four disjoint subspaces according to the axes [6]. Further each subspaces is split until there is one object or no object inside each of subspaces recursively. The quad tree is not balanced tree. Its balance depends on the data distribution among subspaces and the order of inserting the data points. Fig.1 Quad Tree Copyright to IJIRSET DOI: /IJIRSET
3 2. k-d Tree: In k-d tree method a binary tree is used to split k dimensional space [7]. According to one of the coordinates of the splitting point k-d tree splits the space into two subspaces as shown in below figure. Let level(n) be the length of the path from the root to the node n and suppose the axes are numbered from 0 to k 1. At the level level(n) in every node the space is split according to the coordinate number (level(n) mod k). Insertion and searching of the data are similar to the binary trees. According to the coordinate number (level(n) mod k) we only have to compare nodes. Disadvantage: It is very sensitive to the order of inserting the objects. 3. R-Tree: Fig.2 k-d Tree The R tree [7] is the modified B-tree for spatial data. R-tree is balanced tree. In R-tree spaces are splits into the rectangles which can overlap unlike quad tree. Except root node each node contains n to N children, where 2 <= n <= N/2. The root node contains 2 or more children unless it is a leaf. Below figure shows an example of order 3 R-tree. Each node is represented by the minimum bounding rectangle contains all the objects of its subtree. Each node is split recursively. In the leaf node pointers are stored which points to the data objects. Because of the overlapping of bounding rectangles it could be necessary to search more than one branch of the tree. Therefore, it is important to separate the rectangles as much as possible. This problem must be solved by the operation INSERT which uses some kind of heuristic. It finds such a leaf that inserting a new object into it will cause as small changes in the tree as possible. The splitting operation is also important. We want to decrease the probability that we will have to search both new nodes. Testing all the possibilities has exponential complexity, so the algorithms for approximate solution are used. The R-tree is one of the most cited spatial data structures and it is very often used for comparison with new structures. Fig.3 R-Tree Copyright to IJIRSET DOI: /IJIRSET
4 4. R*-Tree: R*-tree [8] is a modified R tree. It uses different kinds of heuristic for inserting data into the tree like R-tree. R* tree combines the criteria of minimizing the area of all nodes of the tree like in the R-tree with the area under the bounding rectangle, the margin of a rectangle and the overlap between rectangles. The objective of decreasing the area under the bounding rectangle is to decrease the dead space, which means the space covered by the bounding rectangle but not by the enclosed rectangles. This decreases the number of branches that are searched unnecessarily. Minimization of the margin (the sum of the lengths of the edges) of a bounding rectangle prefers the squares. Minimization of the overlap between rectangles decreases the number of paths that must be searched. The implementation of this method is harder, but R* trees are more effective than R-trees. 5. R+ -Tree: R+ tree is an extension of the R tree. In the R+ tree bounding rectangles of the nodes at one level can overlap. This decreases the number of searching branches of the tree and reduces the time for searching. R+ tree allows splitting of data objects due to which different parts of one object can be stored in more than one node on the same level of the tree. If in case a rectangles overlaps we decompose them into a group of non-overlapping rectangles which cover the same data objects. This increases a space consumption. But due to zero overlap of the nodes it reduces the time consumption. Fig.4 R+ - Tree 1. IR-Tree: IV. NEAREST NEIGHBOUR SEARCH TECHNIQUE IR-tree is used to retrieve a group of spatial web objects. Those group of objects contains query s keywords and objects are near to the query location and have the lowest inter object distances. This method addresses the two illustration of the group keyword query. In the first illustration we have to find the group of objects that cover the query keywords such that the sum of their distances to the query point is minimum. In the second illustration we find a group of objects that cover the query keywords such that sum of the maximum distance among an object in group of objects and query and maximum distance among two objects in group of objects is minimum. Both of these sub problems are NPcomplete. An approximate solution to the above problem is provided by Greedy algorithm that utilizes IR-tree (spatial keyword index) to reduce the searching space. But in some application query contain a small number of keywords, for this exact algorithm is used that uses the dynamic programming. Copyright to IJIRSET DOI: /IJIRSET
5 2. IUR-Tree (Intersection union R-tree): IR-tree is used to retrieve a group of spatial web objects. Those group of objects contains query s keywords and objects are near to the query location and have the lowest inter object distances. This method addresses the two illustration of the group keyword query. In the first illustration we have to find the group of objects that cover the query keywords such that the sum of their distances to the query point is minimum. In the second illustration we find a group of objects that cover the query keywords such that sum of the maximum distance among an object in group of objects and query and maximum distance among two objects in group of objects is minimum. Both of these sub problems are NPcomplete. An approximate solution to the above problem is provided by Greedy algorithm that utilizes IR-tree (spatial keyword index) to reduce the searching space. But in some application query contain a small number of keywords, for this exact algorithm is used that uses the dynamic programming. 3. BR*-Tree: This hybrid index structure is used to search m-closest keywords [9]. This technique finds the closest tuples that matches the keywords provided by the user. For processing the m-closest keyword query BR*-tree structure combines the features of R*-tree and bitmap indexing and returns the spatially closest objects matching m keywords as a results. A priori based search strategy is used to reduce the search space. To facilitate efficient pruning, two monotone constraints are used as a priori properties. But this method has some disadvantages while handling ranking queries and in this number of false hits is large. 4. IR2-Tree: IR2-tree is a combination of R-tree and signature files [10]. R-trees and the best-first algorithm for NN search are wellknown techniques in spatial databases. In general, Signature file refers to a hashing-based framework which is known as superimposed coding (SC), which is more effective than other instantiations. It is designed to perform membership tests: determine whether a query word t exists in a set T of words. SC is conservative, if it says no, then w is definitely not in W. On the other hand, if SC returns yes, the true answer can be either way, in which case the whole W must be scanned to avoid a false hit. V. CONCLUSION This paper presents the survey of various spatial data structures and techniques for nearest neighbor search for spatial database. There were many drawbacks in the previous methods. The current existing solutions for finding nearest neighbor keyword are unable to give real time answer and have drawback of more space consumption. REFERENCES [1] X. Cao, G. Cong, C.S. Jensen, and B.C. Ooi, Collective Spatial Keyword Querying, in Proc. ACM SIGMOD Int l Conf. Management of Data, pp , [2] G. Cong, C.S. Jensen, and D. Wu, Efficient Retrieval of the Top-k Most Relevant Spatial Web Objects, PVLDB, vol. 2, no. 1, pp , [3] D. Zhang, Y.M. Chee, A. Mondal, A.K.H. Tung, and M. Kitsuregawa, Keyword Search in Spatial Databases: Towards Searching by Document,in Proc. Int l Conf. Data Eng. (ICDE), pp , [4] D. Felipe, V. Hristidis, and N. Rishe., Keyword search on spatial databases, in Proc. of International Conference on Data Engineering (ICDE), pp , [5] R. Hariharan, B. Hore, C. Li, and S. Mehrotra, Processing spatial-keyword (SK) queries in geographic information retrieval (GIR) systems, in Proc. of Scientific and Statistical Database Management (SSDBM), [6] Ester M., Kriegel H. P., Sander J., Spatial Data Mining: A Database Approach,in Proc. of the Fifth Int l Symposium on Large Spatial Databases (SSD 97), Berlin, Germany, Springer, Copyright to IJIRSET DOI: /IJIRSET
6 [7] Gaede V., G unther O., Multidimensional Access Methods,ACM Computing Surveys, Vol. 30, No. 2, pp , June [8] Beckmann N., Kriegel H. P., Schneider R. and Seeger B., The R*-tree: an efficient and robust access method for points and rectangles,in Proc.of the ACM SIGMOD International conference on Managementof data, Atlantic City, NJ USA, pp , [9] Yufei Tao and Cheng Sheng, Fast Nearest Neighbor Search with Keywords, IEEE transactions on knowledge and data engineering, VOL. 26, NO. 4, APRIL [10] Cao,Cong, G., and Jensen, C. S., Retrieving top-k prestige based relevant spatial web objects, PVLDB, 3(1), pp , Copyright to IJIRSET DOI: /IJIRSET
International Journal of Advance Research in Computer Science and Management Studies
Volume 3, Issue 11, November 2015 ISSN: 2321 7782 (Online) International Journal of Advance Research in Computer Science and Management Studies Research Article / Survey Paper / Case Study Available online
R-trees. R-Trees: A Dynamic Index Structure For Spatial Searching. R-Tree. Invariants
R-Trees: A Dynamic Index Structure For Spatial Searching A. Guttman R-trees Generalization of B+-trees to higher dimensions Disk-based index structure Occupancy guarantee Multiple search paths Insertions
Efficient and Scalable Method for Processing Top-k Spatial Boolean Queries
Efficient and Scalable Method for Processing Top-k Spatial Boolean Queries Ariel Cary 1, Ouri Wolfson 2, and Naphtali Rishe 1 1 School of Computing and Information Sciences, Florida International University,
Data Warehousing und Data Mining
Data Warehousing und Data Mining Multidimensionale Indexstrukturen Ulf Leser Wissensmanagement in der Bioinformatik Content of this Lecture Multidimensional Indexing Grid-Files Kd-trees Ulf Leser: Data
An Analysis on Density Based Clustering of Multi Dimensional Spatial Data
An Analysis on Density Based Clustering of Multi Dimensional Spatial Data K. Mumtaz 1 Assistant Professor, Department of MCA Vivekanandha Institute of Information and Management Studies, Tiruchengode,
Ag + -tree: an Index Structure for Range-aggregation Queries in Data Warehouse Environments
Ag + -tree: an Index Structure for Range-aggregation Queries in Data Warehouse Environments Yaokai Feng a, Akifumi Makinouchi b a Faculty of Information Science and Electrical Engineering, Kyushu University,
Processing Spatial-Keyword (SK) Queries in Geographic Information Retrieval (GIR) Systems
Processing Spatial-Keyword (SK) Queries in Geographic Information Retrieval (GIR) Systems Ramaswamy Hariharan, Bijit Hore, Chen Li, Sharad Mehrotra Donald Bren School of Information and Computer Sciences
REAL ESTATE APPLICATION USING SPATIAL DATABASE
REAL ESTATE APPLICATION USING SPATIAL DATABASE 1 M. Kiruthika, 2 Smita Dange, 3 Swati Kinhekar, 4 Girish B, 5 Trupti G, 6 Sushant R. 1 Assoc. Prof., Deptt. of Comp. Engg., Fr. CRIT, Vashi, Navi Mumbai,
Indexing and Retrieval of Historical Aggregate Information about Moving Objects
Indexing and Retrieval of Historical Aggregate Information about Moving Objects Dimitris Papadias, Yufei Tao, Jun Zhang, Nikos Mamoulis, Qiongmao Shen, and Jimeng Sun Department of Computer Science Hong
Efficient Iceberg Query Evaluation for Structured Data using Bitmap Indices
Proc. of Int. Conf. on Advances in Computer Science, AETACS Efficient Iceberg Query Evaluation for Structured Data using Bitmap Indices Ms.Archana G.Narawade a, Mrs.Vaishali Kolhe b a PG student, D.Y.Patil
KEYWORD SEARCH IN RELATIONAL DATABASES
KEYWORD SEARCH IN RELATIONAL DATABASES N.Divya Bharathi 1 1 PG Scholar, Department of Computer Science and Engineering, ABSTRACT Adhiyamaan College of Engineering, Hosur, (India). Data mining refers to
Vector storage and access; algorithms in GIS. This is lecture 6
Vector storage and access; algorithms in GIS This is lecture 6 Vector data storage and access Vectors are built from points, line and areas. (x,y) Surface: (x,y,z) Vector data access Access to vector
A UPS Framework for Providing Privacy Protection in Personalized Web Search
A UPS Framework for Providing Privacy Protection in Personalized Web Search V. Sai kumar 1, P.N.V.S. Pavan Kumar 2 PG Scholar, Dept. of CSE, G Pulla Reddy Engineering College, Kurnool, Andhra Pradesh,
Oracle8i Spatial: Experiences with Extensible Databases
Oracle8i Spatial: Experiences with Extensible Databases Siva Ravada and Jayant Sharma Spatial Products Division Oracle Corporation One Oracle Drive Nashua NH-03062 {sravada,jsharma}@us.oracle.com 1 Introduction
Classification algorithm in Data mining: An Overview
Classification algorithm in Data mining: An Overview S.Neelamegam #1, Dr.E.Ramaraj *2 #1 M.phil Scholar, Department of Computer Science and Engineering, Alagappa University, Karaikudi. *2 Professor, Department
International journal of Engineering Research-Online A Peer Reviewed International Journal Articles available online http://www.ijoer.
RESEARCH ARTICLE ISSN: 2321-7758 GLOBAL LOAD DISTRIBUTION USING SKIP GRAPH, BATON AND CHORD J.K.JEEVITHA, B.KARTHIKA* Information Technology,PSNA College of Engineering & Technology, Dindigul, India Article
A Note on Maximum Independent Sets in Rectangle Intersection Graphs
A Note on Maximum Independent Sets in Rectangle Intersection Graphs Timothy M. Chan School of Computer Science University of Waterloo Waterloo, Ontario N2L 3G1, Canada [email protected] September 12,
Cluster Analysis for Optimal Indexing
Proceedings of the Twenty-Sixth International Florida Artificial Intelligence Research Society Conference Cluster Analysis for Optimal Indexing Tim Wylie, Michael A. Schuh, John Sheppard, and Rafal A.
EOST-An Access Method for Obfuscating Spatio-Temporal Data in LBS
IJCST Vo l. 3, Is s u e 2, Ap r i l - Ju n e 2012 EOST-An Access Method for Obfuscating Spatio-Temporal Data in LBS 1 Gavali A. B, 2 Limkar Suresh, 3 D. S. Patil 1,2 GHRCEM, Wagholi, Pune, Maharashtra,
Efficient File Sharing Scheme in Mobile Adhoc Network
Efficient File Sharing Scheme in Mobile Adhoc Network 1 Y. Santhi, 2 Mrs. M. Maria Sheeba 1 2ndMECSE, Ponjesly College of engineering, Nagercoil 2 Assistant professor, Department of CSE, Nagercoil Abstract:
Lecture 10: Regression Trees
Lecture 10: Regression Trees 36-350: Data Mining October 11, 2006 Reading: Textbook, sections 5.2 and 10.5. The next three lectures are going to be about a particular kind of nonlinear predictive model,
AN EFFICIENT STRATEGY OF AGGREGATE SECURE DATA TRANSMISSION
INTERNATIONAL JOURNAL OF REVIEWS ON RECENT ELECTRONICS AND COMPUTER SCIENCE AN EFFICIENT STRATEGY OF AGGREGATE SECURE DATA TRANSMISSION K.Anusha 1, K.Sudha 2 1 M.Tech Student, Dept of CSE, Aurora's Technological
QuickDB Yet YetAnother Database Management System?
QuickDB Yet YetAnother Database Management System? Radim Bača, Peter Chovanec, Michal Krátký, and Petr Lukáš Radim Bača, Peter Chovanec, Michal Krátký, and Petr Lukáš Department of Computer Science, FEECS,
Bitmap Index as Effective Indexing for Low Cardinality Column in Data Warehouse
Bitmap Index as Effective Indexing for Low Cardinality Column in Data Warehouse Zainab Qays Abdulhadi* * Ministry of Higher Education & Scientific Research Baghdad, Iraq Zhang Zuping Hamed Ibrahim Housien**
CLOUD BASED PEER TO PEER NETWORK FOR ENTERPRISE DATAWAREHOUSE SHARING
CLOUD BASED PEER TO PEER NETWORK FOR ENTERPRISE DATAWAREHOUSE SHARING Basangouda V.K 1,Aruna M.G 2 1 PG Student, Dept of CSE, M.S Engineering College, Bangalore,[email protected] 2 Associate Professor.,
A Survey on Accessing Data over Cloud Environment using Data mining Algorithms
A Survey on Accessing Data over Cloud Environment using Data mining Algorithms A.Selvaraj 1, B.Prasanalakshmi 2 Assistant Professor 1, Associate Professor 2, P.G. Department of Computer Applications 1,
Indexing the Trajectories of Moving Objects in Networks
Indexing the Trajectories of Moving Objects in Networks Victor Teixeira de Almeida Ralf Hartmut Güting Praktische Informatik IV Fernuniversität Hagen, D-5884 Hagen, Germany {victor.almeida, rhg}@fernuni-hagen.de
Complex Network Visualization based on Voronoi Diagram and Smoothed-particle Hydrodynamics
Complex Network Visualization based on Voronoi Diagram and Smoothed-particle Hydrodynamics Zhao Wenbin 1, Zhao Zhengxu 2 1 School of Instrument Science and Engineering, Southeast University, Nanjing, Jiangsu
Clustering on Large Numeric Data Sets Using Hierarchical Approach Birch
Global Journal of Computer Science and Technology Software & Data Engineering Volume 12 Issue 12 Version 1.0 Year 2012 Type: Double Blind Peer Reviewed International Research Journal Publisher: Global
Portable Bushy Processing Trees for Join Queries
Reihe Informatik 11 / 1996 Constructing Optimal Bushy Processing Trees for Join Queries is NP-hard Wolfgang Scheufele Guido Moerkotte 1 Constructing Optimal Bushy Processing Trees for Join Queries is NP-hard
Computational Geometry. Lecture 1: Introduction and Convex Hulls
Lecture 1: Introduction and convex hulls 1 Geometry: points, lines,... Plane (two-dimensional), R 2 Space (three-dimensional), R 3 Space (higher-dimensional), R d A point in the plane, 3-dimensional space,
IMPROVING BUSINESS PROCESS MODELING USING RECOMMENDATION METHOD
Journal homepage: www.mjret.in ISSN:2348-6953 IMPROVING BUSINESS PROCESS MODELING USING RECOMMENDATION METHOD Deepak Ramchandara Lad 1, Soumitra S. Das 2 Computer Dept. 12 Dr. D. Y. Patil School of Engineering,(Affiliated
Multi-dimensional index structures Part I: motivation
Multi-dimensional index structures Part I: motivation 144 Motivation: Data Warehouse A definition A data warehouse is a repository of integrated enterprise data. A data warehouse is used specifically for
Spatial Data Warehouse and Mining. Rajiv Gandhi
Spatial Data Warehouse and Mining Rajiv Gandhi Roll Number 05331002 Centre of Studies in Resource Engineering Indian Institute of Technology Bombay Powai, Mumbai -400076 India. As part of the first stage
SEARCH ENGINE OPTIMIZATION USING D-DICTIONARY
SEARCH ENGINE OPTIMIZATION USING D-DICTIONARY G.Evangelin Jenifer #1, Mrs.J.Jaya Sherin *2 # PG Scholar, Department of Electronics and Communication Engineering(Communication and Networking), CSI Institute
International Journal of Advance Research in Computer Science and Management Studies
Volume 2, Issue 12, December 2014 ISSN: 2321 7782 (Online) International Journal of Advance Research in Computer Science and Management Studies Research Article / Survey Paper / Case Study Available online
Email Spam Detection Using Customized SimHash Function
International Journal of Research Studies in Computer Science and Engineering (IJRSCSE) Volume 1, Issue 8, December 2014, PP 35-40 ISSN 2349-4840 (Print) & ISSN 2349-4859 (Online) www.arcjournals.org Email
Natural Language to Relational Query by Using Parsing Compiler
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 3, March 2015,
A Survey on Product Aspect Ranking
A Survey on Product Aspect Ranking Charushila Patil 1, Prof. P. M. Chawan 2, Priyamvada Chauhan 3, Sonali Wankhede 4 M. Tech Student, Department of Computer Engineering and IT, VJTI College, Mumbai, Maharashtra,
Mining Mobile Group Patterns: A Trajectory-Based Approach
Mining Mobile Group Patterns: A Trajectory-Based Approach San-Yih Hwang, Ying-Han Liu, Jeng-Kuen Chiu, and Ee-Peng Lim Department of Information Management National Sun Yat-Sen University, Kaohsiung, Taiwan
Principles of Data Mining by Hand&Mannila&Smyth
Principles of Data Mining by Hand&Mannila&Smyth Slides for Textbook Ari Visa,, Institute of Signal Processing Tampere University of Technology October 4, 2010 Data Mining: Concepts and Techniques 1 Differences
Part-Based Recognition
Part-Based Recognition Benedict Brown CS597D, Fall 2003 Princeton University CS 597D, Part-Based Recognition p. 1/32 Introduction Many objects are made up of parts It s presumably easier to identify simple
Binary Coded Web Access Pattern Tree in Education Domain
Binary Coded Web Access Pattern Tree in Education Domain C. Gomathi P.G. Department of Computer Science Kongu Arts and Science College Erode-638-107, Tamil Nadu, India E-mail: [email protected] M. Moorthi
Secure and Faster NN Queries on Outsourced Metric Data Assets Renuka Bandi #1, Madhu Babu Ch *2
Secure and Faster NN Queries on Outsourced Metric Data Assets Renuka Bandi #1, Madhu Babu Ch *2 #1 M.Tech, CSE, BVRIT, Hyderabad, Andhra Pradesh, India # Professor, Department of CSE, BVRIT, Hyderabad,
SEARCH ENGINE WITH PARALLEL PROCESSING AND INCREMENTAL K-MEANS FOR FAST SEARCH AND RETRIEVAL
SEARCH ENGINE WITH PARALLEL PROCESSING AND INCREMENTAL K-MEANS FOR FAST SEARCH AND RETRIEVAL Krishna Kiran Kattamuri 1 and Rupa Chiramdasu 2 Department of Computer Science Engineering, VVIT, Guntur, India
Analysis of Algorithms I: Binary Search Trees
Analysis of Algorithms I: Binary Search Trees Xi Chen Columbia University Hash table: A data structure that maintains a subset of keys from a universe set U = {0, 1,..., p 1} and supports all three dictionary
Evaluation of Bitmap Index Compression using Data Pump in Oracle Database
IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661, p- ISSN: 2278-8727Volume 16, Issue 3, Ver. III (May-Jun. 2014), PP 43-48 Evaluation of Bitmap Index Compression using Data Pump in Oracle
Big Data and Scripting. Part 4: Memory Hierarchies
1, Big Data and Scripting Part 4: Memory Hierarchies 2, Model and Definitions memory size: M machine words total storage (on disk) of N elements (N is very large) disk size unlimited (for our considerations)
Automatic Annotation Wrapper Generation and Mining Web Database Search Result
Automatic Annotation Wrapper Generation and Mining Web Database Search Result V.Yogam 1, K.Umamaheswari 2 1 PG student, ME Software Engineering, Anna University (BIT campus), Trichy, Tamil nadu, India
Sustaining Privacy Protection in Personalized Web Search with Temporal Behavior
Sustaining Privacy Protection in Personalized Web Search with Temporal Behavior N.Jagatheshwaran 1 R.Menaka 2 1 Final B.Tech (IT), [email protected], Velalar College of Engineering and Technology,
Load Balancing in Structured Peer to Peer Systems
Load Balancing in Structured Peer to Peer Systems DR.K.P.KALIYAMURTHIE 1, D.PARAMESWARI 2 Professor and Head, Dept. of IT, Bharath University, Chennai-600 073 1 Asst. Prof. (SG), Dept. of Computer Applications,
MUTI-KEYWORD SEARCH WITH PRESERVING PRIVACY OVER ENCRYPTED DATA IN THE CLOUD
MUTI-KEYWORD SEARCH WITH PRESERVING PRIVACY OVER ENCRYPTED DATA IN THE CLOUD A.Shanthi 1, M. Purushotham Reddy 2, G.Rama Subba Reddy 3 1 M.tech Scholar (CSE), 2 Asst.professor, Dept. of CSE, Vignana Bharathi
Challenges in Finding an Appropriate Multi-Dimensional Index Structure with Respect to Specific Use Cases
Challenges in Finding an Appropriate Multi-Dimensional Index Structure with Respect to Specific Use Cases Alexander Grebhahn [email protected] Reimar Schröter [email protected] David Broneske [email protected]
Data Mining and Database Systems: Where is the Intersection?
Data Mining and Database Systems: Where is the Intersection? Surajit Chaudhuri Microsoft Research Email: [email protected] 1 Introduction The promise of decision support systems is to exploit enterprise
Scalable Cluster Analysis of Spatial Events
International Workshop on Visual Analytics (2012) K. Matkovic and G. Santucci (Editors) Scalable Cluster Analysis of Spatial Events I. Peca 1, G. Fuchs 1, K. Vrotsou 1,2, N. Andrienko 1 & G. Andrienko
International Journal of Computer Science Trends and Technology (IJCST) Volume 3 Issue 3, May-June 2015
RESEARCH ARTICLE OPEN ACCESS Data Mining Technology for Efficient Network Security Management Ankit Naik [1], S.W. Ahmad [2] Student [1], Assistant Professor [2] Department of Computer Science and Engineering
A REVIEW ON EFFICIENT DATA ANALYSIS FRAMEWORK FOR INCREASING THROUGHPUT IN BIG DATA. Technology, Coimbatore. Engineering and Technology, Coimbatore.
A REVIEW ON EFFICIENT DATA ANALYSIS FRAMEWORK FOR INCREASING THROUGHPUT IN BIG DATA 1 V.N.Anushya and 2 Dr.G.Ravi Kumar 1 Pg scholar, Department of Computer Science and Engineering, Coimbatore Institute
B+ Tree Properties B+ Tree Searching B+ Tree Insertion B+ Tree Deletion Static Hashing Extendable Hashing Questions in pass papers
B+ Tree and Hashing B+ Tree Properties B+ Tree Searching B+ Tree Insertion B+ Tree Deletion Static Hashing Extendable Hashing Questions in pass papers B+ Tree Properties Balanced Tree Same height for paths
Index Terms Domain name, Firewall, Packet, Phishing, URL.
BDD for Implementation of Packet Filter Firewall and Detecting Phishing Websites Naresh Shende Vidyalankar Institute of Technology Prof. S. K. Shinde Lokmanya Tilak College of Engineering Abstract Packet
A Novel Approach for Load Balancing In Heterogeneous Cellular Network
A Novel Approach for Load Balancing In Heterogeneous Cellular Network Bittu Ann Mathew1, Sumy Joseph2 PG Scholar, Dept of Computer Science, Amal Jyothi College of Engineering, Kanjirappally, Kerala, India1
Artificial Neural Network, Decision Tree and Statistical Techniques Applied for Designing and Developing E-mail Classifier
International Journal of Recent Technology and Engineering (IJRTE) ISSN: 2277-3878, Volume-1, Issue-6, January 2013 Artificial Neural Network, Decision Tree and Statistical Techniques Applied for Designing
Echidna: Efficient Clustering of Hierarchical Data for Network Traffic Analysis
Echidna: Efficient Clustering of Hierarchical Data for Network Traffic Analysis Abdun Mahmood, Christopher Leckie, Parampalli Udaya Department of Computer Science and Software Engineering University of
Bitmap Index an Efficient Approach to Improve Performance of Data Warehouse Queries
Bitmap Index an Efficient Approach to Improve Performance of Data Warehouse Queries Kale Sarika Prakash 1, P. M. Joe Prathap 2 1 Research Scholar, Department of Computer Science and Engineering, St. Peters
AN EFFECTIVE PROPOSAL FOR SHARING OF DATA SERVICES FOR NETWORK APPLICATIONS
INTERNATIONAL JOURNAL OF REVIEWS ON RECENT ELECTRONICS AND COMPUTER SCIENCE AN EFFECTIVE PROPOSAL FOR SHARING OF DATA SERVICES FOR NETWORK APPLICATIONS Koyyala Vijaya Kumar 1, L.Sunitha 2, D.Koteswar Rao
Social Media Mining. Data Mining Essentials
Introduction Data production rate has been increased dramatically (Big Data) and we are able store much more data than before E.g., purchase data, social media data, mobile phone data Businesses and customers
CUBE INDEXING IMPLEMENTATION USING INTEGRATION OF SIDERA AND BERKELEY DB
CUBE INDEXING IMPLEMENTATION USING INTEGRATION OF SIDERA AND BERKELEY DB Badal K. Kothari 1, Prof. Ashok R. Patel 2 1 Research Scholar, Mewar University, Chittorgadh, Rajasthan, India 2 Department of Computer
Technologies & Applications
Chapter 10 Emerging Database Technologies & Applications Truong Quynh Chi [email protected] Spring - 2013 Contents 1 Distributed Databases & Client-Server Architectures 2 Spatial and Temporal Database
GraphZip: A Fast and Automatic Compression Method for Spatial Data Clustering
GraphZip: A Fast and Automatic Compression Method for Spatial Data Clustering Yu Qian Kang Zhang Department of Computer Science, The University of Texas at Dallas, Richardson, TX 75083-0688, USA {yxq012100,
ADVANCED DATA STRUCTURES FOR SURFACE STORAGE
1Department of Mathematics, Univerzitni Karel, Faculty 22, JANECKA1, 323 of 00, Applied Pilsen, Michal, Sciences, Czech KARA2 Republic University of West Bohemia, ADVANCED DATA STRUCTURES FOR SURFACE STORAGE
A Distribution-Based Clustering Algorithm for Mining in Large Spatial Databases
Published in the Proceedings of 14th International Conference on Data Engineering (ICDE 98) A Distribution-Based Clustering Algorithm for Mining in Large Spatial Databases Xiaowei Xu, Martin Ester, Hans-Peter
Fuzzy Spatial Data Warehouse: A Multidimensional Model
4 Fuzzy Spatial Data Warehouse: A Multidimensional Model Pérez David, Somodevilla María J. and Pineda Ivo H. Facultad de Ciencias de la Computación, BUAP, Mexico 1. Introduction A data warehouse is defined
AI: A Modern Approach, Chpts. 3-4 Russell and Norvig
AI: A Modern Approach, Chpts. 3-4 Russell and Norvig Sequential Decision Making in Robotics CS 599 Geoffrey Hollinger and Gaurav Sukhatme (Some slide content from Stuart Russell and HweeTou Ng) Spring,
Checking for Dimensional Correctness in Physics Equations
Checking for Dimensional Correctness in Physics Equations C.W. Liew Department of Computer Science Lafayette College [email protected] D.E. Smith Department of Computer Science Rutgers University [email protected]
DATA MINING USING INTEGRATION OF CLUSTERING AND DECISION TREE
DATA MINING USING INTEGRATION OF CLUSTERING AND DECISION TREE 1 K.Murugan, 2 P.Varalakshmi, 3 R.Nandha Kumar, 4 S.Boobalan 1 Teaching Fellow, Department of Computer Technology, Anna University 2 Assistant
Performance Analysis of Naive Bayes and J48 Classification Algorithm for Data Classification
Performance Analysis of Naive Bayes and J48 Classification Algorithm for Data Classification Tina R. Patil, Mrs. S. S. Sherekar Sant Gadgebaba Amravati University, Amravati [email protected], [email protected]
CHAPTER-24 Mining Spatial Databases
CHAPTER-24 Mining Spatial Databases 24.1 Introduction 24.2 Spatial Data Cube Construction and Spatial OLAP 24.3 Spatial Association Analysis 24.4 Spatial Clustering Methods 24.5 Spatial Classification
