Similarity Search for Numerous Patterns in Multiple High-Speed Time-Series Streams
|
|
- Marcia Sabina Kelly
- 7 years ago
- Views:
Transcription
1 Similarity Search for Numerous Patterns in Multiple High-Speed Time-Series Streams Bui Cong Giao, Duong Tuan Anh Presenter: Bui Cong Giao
2 Contents 1. Introduction 2. Preliminaries 3. The Proposed Method 4. Experimental Evaluation 5. Conclusions
3 Introduction Pattern discovery by similarity search in streaming context where new values are continuously appended as time progresses Retrievvals of newcoming time-series subsequences of streaming time series, which are approximately matched with static time-series patterns under the Euclidean distance (ED) Important scenario in which incoming time-series data are from many concurrent time-series streams at high-speed rates, and there are numerous patterns 3
4 Main contributions A novel multi-scale representation of time-series data for similarity search in streaming context Range search over streaming time-series for numerous patterns in which every pattern has its own search radius 4
5 Preliminaries Two ways to search patterns in time-series sequences under ED Whole matching : the sequences to be compared have the same length, e.g UCR-ED (UCR- Euclidean Distance) Subsequence matching : the sequences is partitioned into many segments. The search procedure begins from the first segment to the last one, e.g SS-NOS (Similar Search using Non- Overlapped Segmentation) 5
6 UCR-ED Introduced by Rakthanmanon et al. in 2012 Conduct similarity search for patterns in static timeseries sequences Read the time-series sequence into many big sections. After that, UCR-ED uses z-normalization in an incremental fashion while the window slides over a big section of the time-series sequence to find matching pairs Change UCR-ED so that the method accommodates with multi-threading, referred as TUCR-ED 6
7 SS-NOS Similar Search using Non-Overlapped Segmentation Introduced by us in 2014 Similar search for patterns over streaming time-series using non-overlapped segmentation Fig. 1 The non-overlapped segmentation of a time-series pattern 7
8 SS-NOS (cont.) Phase 1 Phase 2 Retrieve the coefficient vectors of the z-normalized non-overlapped segments of patterns by DFT, or Haar DWT, or PAA Store the coefficient vectors in an array of R-trees as a multi-resolution index structure Equipped with multi-threading, SS-NOS carries out similarity search in streaming time series using the array of R-trees 8
9 Restrictions of SS-NOS If the length of the remainder is long, then the filtering process is likely inefficient for such a time- series pattern since the filtering process can miss out the unpromising patterns. SS-NOS performs range search with one search radius for all time-series patterns, so this is inflexible and rather impractical. 9
10 The Proposed Method Similar search for patterns over streaming time-series using overlapped segmentation, Similar Search using Overlapped Segmentation (SS-OS) Fig. 2 The overlapped segmentation of a time-series pattern 10
11 The Proposed Method (cont.) SS-OS is basically similar to SS-NOS in similarity search Fig. 3 SS-OS conducts similar search for patterns in a time-series stream. 11
12 The Proposed Method (cont.) Algorithm RangeSearch( S) When there is a new-coming data of S, T n // Phase 2 1. postcheckset // the set of patterns for post-checking 2. pset P // the set of potential patterns 3. for i = 1 to maxlevel 4. Incrementally normalize s i 5. for i = 1 to maxlevel 6. Retrieve v i 7. pset SearchInRtree( R-tree[i], pset, v i ) 8. if pset = then 9. break // go to phase foreach (p in pset) 11. if i is the maximum filter level of p then 12. postcheckset postcheckset p 13. Remove p from pset 14. foreach (p in postcheckset) // Phase Normalize c 16. Compute the ED distance between np and z-normalized c to check whether the distance is within p.r The core subroutine searches patterns whose i th coefficient vector is similar to v i within their own search radius. The range search takes place in the R-tree of the i th filter level. 12
13 Experimental Evaluation Platform Intel Dual Core i3 M GHz, 4GB RAM PC C# Parameters The circular buffers of the time-series streams have the size of 1,024. The minimum node occupancy of R-trees is 4 and the maximum node occupancy is
14 Three query sets were created from the time-series dataset. The number of queries in each query set is The length of the query sequences varies from 8 to
15 Experimental Evaluation Implement range search by UCR-ED, TUCR-ED, SS- NOS, and SS-OS on the three pattern sets with the same radius search (0.01) Use Haar DWT in SS-NOS and SS-OS. Compare the search methods in terms of their precision, the number of distance function calls in post processing, and wall-clock time. 15
16 Experimental Results SS-OS has the same precision as UCR-ED and SS- NOS. The number of distance function calls of the UCR-ED and TUCR-ED are very large, while SS-OS and SS- NOS use multi-scale filtering so their numbers are very small. The pruning power of SS-OS is over 99.92%, whereas that of SS-NOS is only over 99.89%. 16
17 Experimental Results Fig. 4 The number of distance function calls in the post-processing phase 17
18 Experimental Results On average, the wall-clock times of SS-OS and SS- NOS are tiny, varying from 16 seconds to 19 seconds. The wall-clock times of UCR-ED for the three pattern sets are roughly 10 minutes, 13 minutes, and 11 minutes, respectively. The wall-clock times of TUCR-ED for the three pattern sets are roughly 2 minutes. 18
19 Experimental Results SearchInRtree in Algorithm RangeSearch performs range search in R-trees precisely. The average CPU times to process a new-coming data point of RangeSearch in all cases are tiny, varying from 2,000 ticks (*) to 2,600 ticks. PAA has the best performance in run time. Able to perform similarity search for numerous patterns over multiple high-speed time-series streams. (*) 1 millisecond = 10,000 ticks 19
20 Conclusions Propose an efficient multi-scale representation of timeseries data, the overlapped segmentation, for similarity search Perform range search for time-series patterns in which each pattern has its own search radius Work precisely and have fast responses while dealing with multiple streaming time series at high-speed rates 20
21 References [1] B. C. Giao and D. T. Anh, "Efficient similarity search for static queries in streaming time series," in Proceedings of International Conference on Green and Human Information Technology (ICGHIT) 2014, HoChiMinh City, 2014, pp [2] T. Rakthanmanon, B. Campana, A. Mueen, G. Batista, B. Westover, Q. Zhu, J. Zakaria and E. Keogh, "Searching and mining trillions of time series subsequences under Dynamic Time Warping," in Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining, New York, NY, USA, August 12 16, 2012, pp [3] R. Agrawal, C. Faloutsos, and A. Swami, "Efficient similarity search in sequence databases," in Proceedings of the 4th International Conference on Foundations of Data Organization and Algorithms (FODO '93), Chicago, Illinois, USA, October 13-15, 1993, pp [4] K.-p. Chan and A. W.-c. Fu, "Efficient time series matching by wavelets," in Proceedings of the 15th IEEE International Conference on Data Engineering, March 23-26, 1999, pp [5] A. Guttman, "R-tree : A dynamic index structure for spatial searching," in Proceedings of the 1984 ACM SIGMOD International Conference on Management of Data, New York, NY, USA, 1984, pp [6] E. Keogh, K. Chakrabarti, S. Mehrotra, and M. Pazzani, "Locally adaptive dimensionality reduction for indexing large time series databases," in Proceedings of the 2001 ACM SIGMOD International Conference on Management of Data, May 2001, pp [7] E. Keogh. The UCR time series classification/clustering page. [Online].
22 Thanks for listening Questions & Answers
Time series databases. Indexing Time Series. Time series data. Time series are ubiquitous
Time series databases Indexing Time Series A time series is a sequence of real numbers, representing the measurements of a real variable at equal time intervals Stock prices Volume of sales over time Daily
More informationTemporal Data Mining for Small and Big Data. Theophano Mitsa, Ph.D. Independent Data Mining/Analytics Consultant
Temporal Data Mining for Small and Big Data Theophano Mitsa, Ph.D. Independent Data Mining/Analytics Consultant What is Temporal Data Mining? Knowledge discovery in data that contain temporal information.
More informationChapter 5: Stream Processing. Big Data Management and Analytics 193
Chapter 5: Big Data Management and Analytics 193 Today s Lesson Data Streams & Data Stream Management System Data Stream Models Insert-Only Insert-Delete Additive Streaming Methods Sliding Windows & Ageing
More informationStatic Data Mining Algorithm with Progressive Approach for Mining Knowledge
Global Journal of Business Management and Information Technology. Volume 1, Number 2 (2011), pp. 85-93 Research India Publications http://www.ripublication.com Static Data Mining Algorithm with Progressive
More informationTHE concept of Big Data refers to systems conveying
EDIC RESEARCH PROPOSAL 1 High Dimensional Nearest Neighbors Techniques for Data Cleaning Anca-Elena Alexandrescu I&C, EPFL Abstract Organisations from all domains have been searching for increasingly more
More informationReal-Time Adaptive Algorithm for Resource Monitoring
Real-Time Adaptive Algorithm for Resource Monitoring Mauro Andreolini, Michele Colajanni, Marcello Pietri, Stefania Tosi University of Modena and Reggio Emilia {mauro.andreolini,michele.colajanni,marcello.pietri,stefania.tosi}@unimore.it
More informationMINING TIME SERIES DATA
Chapter 1 MINING TIME SERIES DATA Chotirat Ann Ratanamahatana, Jessica Lin, Dimitrios Gunopulos, Eamonn Keogh University of California, Riverside Michail Vlachos IBM T.J. Watson Research Center Gautam
More informationMAXIMAL FREQUENT ITEMSET GENERATION USING SEGMENTATION APPROACH
MAXIMAL FREQUENT ITEMSET GENERATION USING SEGMENTATION APPROACH M.Rajalakshmi 1, Dr.T.Purusothaman 2, Dr.R.Nedunchezhian 3 1 Assistant Professor (SG), Coimbatore Institute of Technology, India, rajalakshmi@cit.edu.in
More informationDWMiner : A tool for mining frequent item sets efficiently in data warehouses
DWMiner : A tool for mining frequent item sets efficiently in data warehouses Bruno Kinder Almentero, Alexandre Gonçalves Evsukoff and Marta Mattoso COPPE/Federal University of Rio de Janeiro, P.O.Box
More information56 Mining Time Series Data
56 Mining Time Series Data Chotirat Ann Ratanamahatana 1, Jessica Lin 1, Dimitrios Gunopulos 1, Eamonn Keogh 1, Michail Vlachos 2, and Gautam Das 3 1 University of California, Riverside 2 IBM T.J. Watson
More informationPerformance of KDB-Trees with Query-Based Splitting*
Performance of KDB-Trees with Query-Based Splitting* Yves Lépouchard Ratko Orlandic John L. Pfaltz Dept. of Computer Science Dept. of Computer Science Dept. of Computer Science University of Virginia Illinois
More informationEffective Clustering of Time-Series Data Using FCM
Effective Clustering of Time-Series Data Using FCM Saeed Aghabozorgi and Teh Ying Wah Abstract Today, wide important advances in clustering time series have been obtained in the field of data mining. A
More informationInternational Journal of Advance Research in Computer Science and Management Studies
Volume 3, Issue 11, November 2015 ISSN: 2321 7782 (Online) International Journal of Advance Research in Computer Science and Management Studies Research Article / Survey Paper / Case Study Available online
More informationDistributed Framework for Data Mining As a Service on Private Cloud
RESEARCH ARTICLE OPEN ACCESS Distributed Framework for Data Mining As a Service on Private Cloud Shraddha Masih *, Sanjay Tanwani** *Research Scholar & Associate Professor, School of Computer Science &
More informationEmail Spam Detection Using Customized SimHash Function
International Journal of Research Studies in Computer Science and Engineering (IJRSCSE) Volume 1, Issue 8, December 2014, PP 35-40 ISSN 2349-4840 (Print) & ISSN 2349-4859 (Online) www.arcjournals.org Email
More informationMEMBERSHIP LOCALIZATION WITHIN A WEB BASED JOIN FRAMEWORK
MEMBERSHIP LOCALIZATION WITHIN A WEB BASED JOIN FRAMEWORK 1 K. LALITHA, 2 M. KEERTHANA, 3 G. KALPANA, 4 S.T. SHWETHA, 5 M. GEETHA 1 Assistant Professor, Information Technology, Panimalar Engineering College,
More informationVisualization Techniques in Data Mining
Tecniche di Apprendimento Automatico per Applicazioni di Data Mining Visualization Techniques in Data Mining Prof. Pier Luca Lanzi Laurea in Ingegneria Informatica Politecnico di Milano Polo di Milano
More informationExample application (1) Telecommunication. Lecture 1: Data Mining Overview and Process. Example application (2) Health
Lecture 1: Data Mining Overview and Process What is data mining? Example applications Definitions Multi disciplinary Techniques Major challenges The data mining process History of data mining Data mining
More informationTime Series Representation for Elliott Wave Identification in Stock Market Analysis
Time Series Representation for Elliott Wave Identification in Stock Market Analysis Chaliaw Phetking Faculty of Science and Technology Suan Dusit Rajabhat University Bangkok, Thailand +662-244-5600 chaliaw_phe@dusit.ac.th
More informationStock Trend Prediction by Using K-Means and AprioriAll Algorithm for Sequential Chart Pattern Mining *
JOURNAL OF INFORMATION SCIENCE AND ENGINEERING 30, 653-667 (2014) Stock Trend Prediction by Using K-Means and AprioriAll Algorithm for Sequential Chart Pattern Mining * KUO-PING WU 1, YUNG-PIAO WU 1 AND
More informationSEARCH ENGINE WITH PARALLEL PROCESSING AND INCREMENTAL K-MEANS FOR FAST SEARCH AND RETRIEVAL
SEARCH ENGINE WITH PARALLEL PROCESSING AND INCREMENTAL K-MEANS FOR FAST SEARCH AND RETRIEVAL Krishna Kiran Kattamuri 1 and Rupa Chiramdasu 2 Department of Computer Science Engineering, VVIT, Guntur, India
More informationContinuous Fastest Path Planning in Road Networks by Mining Real-Time Traffic Event Information
Continuous Fastest Path Planning in Road Networks by Mining Real-Time Traffic Event Information Eric Hsueh-Chan Lu Chi-Wei Huang Vincent S. Tseng Institute of Computer Science and Information Engineering
More informationFinding Frequent Patterns Based On Quantitative Binary Attributes Using FP-Growth Algorithm
R. Sridevi et al Int. Journal of Engineering Research and Applications RESEARCH ARTICLE OPEN ACCESS Finding Frequent Patterns Based On Quantitative Binary Attributes Using FP-Growth Algorithm R. Sridevi,*
More informationLoad Distribution in Large Scale Network Monitoring Infrastructures
Load Distribution in Large Scale Network Monitoring Infrastructures Josep Sanjuàs-Cuxart, Pere Barlet-Ros, Gianluca Iannaccone, and Josep Solé-Pareta Universitat Politècnica de Catalunya (UPC) {jsanjuas,pbarlet,pareta}@ac.upc.edu
More informationThree Myths about Dynamic Time Warping Data Mining
Three Myths about Dynamic Time Warping Data Mining Chotirat Ann Ratanamahatana Eamonn Keogh Department of Computer Science and Engineering University of California, Riverside Riverside, CA 92521 { ratana,
More informationTrIMPI: A Data Structure for Efficient Pattern Matching on Moving Objects
TrIMPI: A Data Structure for Efficient Pattern Matching on Moving Objects ABSTRACT Tsvetelin Polomski Christian-Albrechts-University at Kiel Hermann-Rodewald-Straße 3 24118 Kiel tpo@is.informatik.uni-kiel.de
More informationCHAPTER FIVE RESULT ANALYSIS
CHAPTER FIVE RESULT ANALYSIS 5.1 Chapter Introduction 5.2 Discussion of Results 5.3 Performance Comparisons 5.4 Chapter Summary 61 5.1 Chapter Introduction This chapter outlines the results obtained from
More informationSpeed Performance Improvement of Vehicle Blob Tracking System
Speed Performance Improvement of Vehicle Blob Tracking System Sung Chun Lee and Ram Nevatia University of Southern California, Los Angeles, CA 90089, USA sungchun@usc.edu, nevatia@usc.edu Abstract. A speed
More informationCS 377: Operating Systems. Outline. A review of what you ve learned, and how it applies to a real operating system. Lecture 25 - Linux Case Study
CS 377: Operating Systems Lecture 25 - Linux Case Study Guest Lecturer: Tim Wood Outline Linux History Design Principles System Overview Process Scheduling Memory Management File Systems A review of what
More informationEfficient Integration of Data Mining Techniques in Database Management Systems
Efficient Integration of Data Mining Techniques in Database Management Systems Fadila Bentayeb Jérôme Darmont Cédric Udréa ERIC, University of Lyon 2 5 avenue Pierre Mendès-France 69676 Bron Cedex France
More informationBenchmark Hadoop and Mars: MapReduce on cluster versus on GPU
Benchmark Hadoop and Mars: MapReduce on cluster versus on GPU Heshan Li, Shaopeng Wang The Johns Hopkins University 3400 N. Charles Street Baltimore, Maryland 21218 {heshanli, shaopeng}@cs.jhu.edu 1 Overview
More informationOracle8i Spatial: Experiences with Extensible Databases
Oracle8i Spatial: Experiences with Extensible Databases Siva Ravada and Jayant Sharma Spatial Products Division Oracle Corporation One Oracle Drive Nashua NH-03062 {sravada,jsharma}@us.oracle.com 1 Introduction
More informationDevelopment and Evaluation of Point Cloud Compression for the Point Cloud Library
Development and Evaluation of Point Cloud Compression for the Institute for Media Technology, TUM, Germany May 12, 2011 Motivation Point Cloud Stream Compression Network Point Cloud Stream Decompression
More informationA Survey on Association Rule Mining in Market Basket Analysis
International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 4, Number 4 (2014), pp. 409-414 International Research Publications House http://www. irphouse.com /ijict.htm A Survey
More informationAn Order-Invariant Time Series Distance Measure [Position on Recent Developments in Time Series Analysis]
An Order-Invariant Time Series Distance Measure [Position on Recent Developments in Time Series Analysis] Stephan Spiegel and Sahin Albayrak DAI-Lab, Technische Universität Berlin, Ernst-Reuter-Platz 7,
More informationSIGMOD RWE Review Towards Proximity Pattern Mining in Large Graphs
SIGMOD RWE Review Towards Proximity Pattern Mining in Large Graphs Fabian Hueske, TU Berlin June 26, 21 1 Review This document is a review report on the paper Towards Proximity Pattern Mining in Large
More informationMining Multi Level Association Rules Using Fuzzy Logic
Mining Multi Level Association Rules Using Fuzzy Logic Usha Rani 1, R Vijaya Praash 2, Dr. A. Govardhan 3 1 Research Scholar, JNTU, Hyderabad 2 Dept. Of Computer Science & Engineering, SR Engineering College,
More informationBuffer Operations in GIS
Buffer Operations in GIS Nagapramod Mandagere, Graduate Student, University of Minnesota npramod@cs.umn.edu SYNONYMS GIS Buffers, Buffering Operations DEFINITION A buffer is a region of memory used to
More informationBinary Coded Web Access Pattern Tree in Education Domain
Binary Coded Web Access Pattern Tree in Education Domain C. Gomathi P.G. Department of Computer Science Kongu Arts and Science College Erode-638-107, Tamil Nadu, India E-mail: kc.gomathi@gmail.com M. Moorthi
More informationData Backup and Archiving with Enterprise Storage Systems
Data Backup and Archiving with Enterprise Storage Systems Slavjan Ivanov 1, Igor Mishkovski 1 1 Faculty of Computer Science and Engineering Ss. Cyril and Methodius University Skopje, Macedonia slavjan_ivanov@yahoo.com,
More informationRiMONITOR. Monitoring Software. for RIEGL VZ-Line Laser Scanners. Ri Software. visit our website www.riegl.com. Preliminary Data Sheet
Monitoring Software RiMONITOR for RIEGL VZ-Line Laser Scanners for stand-alone monitoring applications by autonomous operation of all RIEGL VZ-Line Laser Scanners adaptable configuration of data acquisition
More informationOn the Need for Time Series Data Mining Benchmarks: A Survey and Empirical Demonstration
On the Need for Time Series Data Mining Benchmarks: A Survey and Empirical Demonstration EAMONN KEOGH SHRUTI KASETTY University of California, Riverside eamonn@cs.ucr.edu skasetty@cs.ucr.edu Editors: Hand,
More informationEM Clustering Approach for Multi-Dimensional Analysis of Big Data Set
EM Clustering Approach for Multi-Dimensional Analysis of Big Data Set Amhmed A. Bhih School of Electrical and Electronic Engineering Princy Johnson School of Electrical and Electronic Engineering Martin
More informationSurvey On: Nearest Neighbour Search With Keywords In Spatial Databases
Survey On: Nearest Neighbour Search With Keywords In Spatial Databases SayaliBorse 1, Prof. P. M. Chawan 2, Prof. VishwanathChikaraddi 3, Prof. Manish Jansari 4 P.G. Student, Dept. of Computer Engineering&
More informationAn Ontology-enhanced Cloud Service Discovery System
An Ontology-enhanced Cloud Service Discovery System Taekgyeong Han and Kwang Mong Sim* Abstract This paper presents a Cloud service discovery system (CSDS) that aims to support the Cloud users in finding
More informationConquer the 5 Most Common Magento Coding Issues to Optimize Your Site for Performance
Conquer the 5 Most Common Magento Coding Issues to Optimize Your Site for Performance Written by: Oleksandr Zarichnyi Table of Contents INTRODUCTION... TOP 5 ISSUES... LOOPS... Calculating the size of
More informationHorizontal Aggregations in SQL to Prepare Data Sets for Data Mining Analysis
IOSR Journal of Computer Engineering (IOSRJCE) ISSN: 2278-0661, ISBN: 2278-8727 Volume 6, Issue 5 (Nov. - Dec. 2012), PP 36-41 Horizontal Aggregations in SQL to Prepare Data Sets for Data Mining Analysis
More informationSelf-Compressive Approach for Distributed System Monitoring
Self-Compressive Approach for Distributed System Monitoring Akshada T Bhondave Dr D.Y Patil COE Computer Department, Pune University, India Santoshkumar Biradar Assistant Prof. Dr D.Y Patil COE, Computer
More informationINDEXING BIOMEDICAL STREAMS IN DATA MANAGEMENT SYSTEM 1. INTRODUCTION
JOURNAL OF MEDICAL INFORMATICS & TECHNOLOGIES Vol. 9/2005, ISSN 1642-6037 Michał WIDERA *, Janusz WRÓBEL *, Adam MATONIA *, Michał JEŻEWSKI **,Krzysztof HOROBA *, Tomasz KUPKA * centralized monitoring,
More informationMethodology for Emulating Self Organizing Maps for Visualization of Large Datasets
Methodology for Emulating Self Organizing Maps for Visualization of Large Datasets Macario O. Cordel II and Arnulfo P. Azcarraga College of Computer Studies *Corresponding Author: macario.cordel@dlsu.edu.ph
More informationIntelligent Stock Market Assistant using Temporal Data Mining
Intelligent Stock Market Assistant using Temporal Data Mining Gerasimos Marketos 1, Konstantinos Pediaditakis 2, Yannis Theodoridis 1, and Babis Theodoulidis 2 1 Database Group, Information Systems Laboratory,
More informationAccelerating BIRCH for Clustering Large Scale Streaming Data Using CUDA Dynamic Parallelism
Accelerating BIRCH for Clustering Large Scale Streaming Data Using CUDA Dynamic Parallelism Jianqiang Dong, Fei Wang and Bo Yuan Intelligent Computing Lab, Division of Informatics Graduate School at Shenzhen,
More information0 )1, 2! 3!( +++ 4 5 677.689 :#4
! # % & ( )!! ) ) +++,! &, ( &. / 0 )1, 2! 3!( +++ 4 5 677.689 :#4./7 9.8 7. ; A neural network for mining large volumes of time series data Bojian Liang and James Austin Advanced Computer Architectures
More informationParallel Simplification of Large Meshes on PC Clusters
Parallel Simplification of Large Meshes on PC Clusters Hua Xiong, Xiaohong Jiang, Yaping Zhang, Jiaoying Shi State Key Lab of CAD&CG, College of Computer Science Zhejiang University Hangzhou, China April
More informationAlgorithmic Techniques for Big Data Analysis. Barna Saha AT&T Lab-Research
Algorithmic Techniques for Big Data Analysis Barna Saha AT&T Lab-Research Challenges of Big Data VOLUME Large amount of data VELOCITY Needs to be analyzed quickly VARIETY Different types of structured
More informationHYBRID INTRUSION DETECTION FOR CLUSTER BASED WIRELESS SENSOR NETWORK
HYBRID INTRUSION DETECTION FOR CLUSTER BASED WIRELESS SENSOR NETWORK 1 K.RANJITH SINGH 1 Dept. of Computer Science, Periyar University, TamilNadu, India 2 T.HEMA 2 Dept. of Computer Science, Periyar University,
More informationAPPLICATION OF DATA MINING TECHNIQUES FOR BUILDING SIMULATION PERFORMANCE PREDICTION ANALYSIS. email paul@esru.strath.ac.uk
Eighth International IBPSA Conference Eindhoven, Netherlands August -4, 2003 APPLICATION OF DATA MINING TECHNIQUES FOR BUILDING SIMULATION PERFORMANCE PREDICTION Christoph Morbitzer, Paul Strachan 2 and
More informationCommunity Mining from Multi-relational Networks
Community Mining from Multi-relational Networks Deng Cai 1, Zheng Shao 1, Xiaofei He 2, Xifeng Yan 1, and Jiawei Han 1 1 Computer Science Department, University of Illinois at Urbana Champaign (dengcai2,
More informationBitmap Index an Efficient Approach to Improve Performance of Data Warehouse Queries
Bitmap Index an Efficient Approach to Improve Performance of Data Warehouse Queries Kale Sarika Prakash 1, P. M. Joe Prathap 2 1 Research Scholar, Department of Computer Science and Engineering, St. Peters
More informationRVS-Seminar Implementation and Evaluation of WinJTAP Interface. Milan Nikolic Universität Bern
RVS-Seminar Implementation and Evaluation of WinJTAP Interface Milan Nikolic Universität Bern Overview > Short introduction > TAP interface on Win32 OS > Implementation of WinJTAP interface > Test of WinJTAP:
More informationAn Evaluation of Machine Learning Method for Intrusion Detection System Using LOF on Jubatus
An Evaluation of Machine Learning Method for Intrusion Detection System Using LOF on Jubatus Tadashi Ogino* Okinawa National College of Technology, Okinawa, Japan. * Corresponding author. Email: ogino@okinawa-ct.ac.jp
More informationBlog Post Extraction Using Title Finding
Blog Post Extraction Using Title Finding Linhai Song 1, 2, Xueqi Cheng 1, Yan Guo 1, Bo Wu 1, 2, Yu Wang 1, 2 1 Institute of Computing Technology, Chinese Academy of Sciences, Beijing 2 Graduate School
More informationClassification of Household Devices by Electricity Usage Profiles
Classification of Household Devices by Electricity Usage Profiles Jason Lines 1, Anthony Bagnall 1, Patrick Caiger-Smith 2, and Simon Anderson 2 1 School of Computing Sciences University of East Anglia
More informationA Novel Way of Deduplication Approach for Cloud Backup Services Using Block Index Caching Technique
A Novel Way of Deduplication Approach for Cloud Backup Services Using Block Index Caching Technique Jyoti Malhotra 1,Priya Ghyare 2 Associate Professor, Dept. of Information Technology, MIT College of
More informationSupporting Software Development Process Using Evolution Analysis : a Brief Survey
Supporting Software Development Process Using Evolution Analysis : a Brief Survey Samaneh Bayat Department of Computing Science, University of Alberta, Edmonton, Canada samaneh@ualberta.ca Abstract During
More informationDistributed Image Processing using Hadoop MapReduce framework. Binoy A Fernandez (200950006) Sameer Kumar (200950031)
using Hadoop MapReduce framework Binoy A Fernandez (200950006) Sameer Kumar (200950031) Objective To demonstrate how the hadoop mapreduce framework can be extended to work with image data for distributed
More informationISSN 2278-3091. Index Terms Cloud computing, outsourcing data, cloud storage security, public auditability
Outsourcing and Discovering Storage Inconsistencies in Cloud Through TPA Sumathi Karanam 1, GL Varaprasad 2 Student, Department of CSE, QIS College of Engineering and Technology, Ongole, AndhraPradesh,India
More informationEfficient Storage and Temporal Query Evaluation of Hierarchical Data Archiving Systems
Efficient Storage and Temporal Query Evaluation of Hierarchical Data Archiving Systems Hui (Wendy) Wang, Ruilin Liu Stevens Institute of Technology, New Jersey, USA Dimitri Theodoratos, Xiaoying Wu New
More informationProjection error evaluation for large multidimensional data sets
ISSN 1392-5113 Nonlinear Analysis: Modelling and Control, 2016, Vol. 21, No. 1, 92 102 http://dx.doi.org/10.15388/na.2016.1.6 Projection error evaluation for large multidimensional data sets Kotryna Paulauskienė,
More informationA Comparison of Dictionary Implementations
A Comparison of Dictionary Implementations Mark P Neyer April 10, 2009 1 Introduction A common problem in computer science is the representation of a mapping between two sets. A mapping f : A B is a function
More informationMobile Phone APP Software Browsing Behavior using Clustering Analysis
Proceedings of the 2014 International Conference on Industrial Engineering and Operations Management Bali, Indonesia, January 7 9, 2014 Mobile Phone APP Software Browsing Behavior using Clustering Analysis
More informationProject Participants
Annual Report for Period:10/2006-09/2007 Submitted on: 08/15/2007 Principal Investigator: Yang, Li. Award ID: 0414857 Organization: Western Michigan Univ Title: Projection and Interactive Exploration of
More informationMultiple Programming Models For Linux System Design and Development
A Flexible Scheduling Framework (for Linux): Supporting Multiple Programming Models with Arbitrary Semantics Noah Watkins, Jared Straub*, Douglas Niehaus* Presented by Noah Watkins Systems Research Lab
More informationAn Analysis on Density Based Clustering of Multi Dimensional Spatial Data
An Analysis on Density Based Clustering of Multi Dimensional Spatial Data K. Mumtaz 1 Assistant Professor, Department of MCA Vivekanandha Institute of Information and Management Studies, Tiruchengode,
More informationSEARCH ENGINE OPTIMIZATION USING D-DICTIONARY
SEARCH ENGINE OPTIMIZATION USING D-DICTIONARY G.Evangelin Jenifer #1, Mrs.J.Jaya Sherin *2 # PG Scholar, Department of Electronics and Communication Engineering(Communication and Networking), CSI Institute
More informationData Mining in Web Search Engine Optimization and User Assisted Rank Results
Data Mining in Web Search Engine Optimization and User Assisted Rank Results Minky Jindal Institute of Technology and Management Gurgaon 122017, Haryana, India Nisha kharb Institute of Technology and Management
More informationScalable Cluster Analysis of Spatial Events
International Workshop on Visual Analytics (2012) K. Matkovic and G. Santucci (Editors) Scalable Cluster Analysis of Spatial Events I. Peca 1, G. Fuchs 1, K. Vrotsou 1,2, N. Andrienko 1 & G. Andrienko
More informationPATTERN DISCOVERY - A SAX-GA Based Investment Strategy
PATTERN DISCOVERY - A SAX-GA Based Investment Strategy António Canelas Instituto de Telecomunicações Instituto Supertior Técnico Torre Norte Piso 10 Av. Rovisco Pais, 1 1049-001 Lisboa Portugal Phone :
More informationComplex Network Visualization based on Voronoi Diagram and Smoothed-particle Hydrodynamics
Complex Network Visualization based on Voronoi Diagram and Smoothed-particle Hydrodynamics Zhao Wenbin 1, Zhao Zhengxu 2 1 School of Instrument Science and Engineering, Southeast University, Nanjing, Jiangsu
More informationCURVE is the Institutional Repository for Coventry University http://curve.coventry.ac.uk/open
Anomaly detection in time series data using a combination of wavelets, neural networks and Hilbert transform Kanarachos, S., Mathew, J., Chroneos, A. and Fitzpatrick, M.E. Author post-print (accepted)
More informationCommunity Detection Proseminar - Elementary Data Mining Techniques by Simon Grätzer
Community Detection Proseminar - Elementary Data Mining Techniques by Simon Grätzer 1 Content What is Community Detection? Motivation Defining a community Methods to find communities Overlapping communities
More informationMining Pairs-Trading Patterns: A Framework
, pp.19-28 http://dx.doi.org/10.14257/ijdta.2013.6.6.02 Mining Pairs-Trading Patterns: A Framework Ghazi Al-Naymat College of Computer Science and Information Technology University of Dammam, KSA ghalnaymat@ud.edu.sa
More informationDiscovering Local Subgroups, with an Application to Fraud Detection
Discovering Local Subgroups, with an Application to Fraud Detection Abstract. In Subgroup Discovery, one is interested in finding subgroups that behave differently from the average behavior of the entire
More informationSimultaneous Gamma Correction and Registration in the Frequency Domain
Simultaneous Gamma Correction and Registration in the Frequency Domain Alexander Wong a28wong@uwaterloo.ca William Bishop wdbishop@uwaterloo.ca Department of Electrical and Computer Engineering University
More informationOptimization of Search Results with Duplicate Page Elimination using Usage Data A. K. Sharma 1, Neelam Duhan 2 1, 2
Optimization of Search Results with Duplicate Page Elimination using Usage Data A. K. Sharma 1, Neelam Duhan 2 1, 2 Department of Computer Engineering, YMCA University of Science & Technology, Faridabad,
More informationMap-Reduce for Machine Learning on Multicore
Map-Reduce for Machine Learning on Multicore Chu, et al. Problem The world is going multicore New computers - dual core to 12+-core Shift to more concurrent programming paradigms and languages Erlang,
More informationStudy on Redundant Strategies in Peer to Peer Cloud Storage Systems
Applied Mathematics & Information Sciences An International Journal 2011 NSP 5 (2) (2011), 235S-242S Study on Redundant Strategies in Peer to Peer Cloud Storage Systems Wu Ji-yi 1, Zhang Jian-lin 1, Wang
More informationMINING THE DATA FROM DISTRIBUTED DATABASE USING AN IMPROVED MINING ALGORITHM
MINING THE DATA FROM DISTRIBUTED DATABASE USING AN IMPROVED MINING ALGORITHM J. Arokia Renjit Asst. Professor/ CSE Department, Jeppiaar Engineering College, Chennai, TamilNadu,India 600119. Dr.K.L.Shunmuganathan
More informationA Frequency-Based Approach to Intrusion Detection
A Frequency-Based Approach to Intrusion Detection Mian Zhou and Sheau-Dong Lang School of Electrical Engineering & Computer Science and National Center for Forensic Science, University of Central Florida,
More informationIMPLEMENTATION OF RELIABLE CACHING STRATEGY IN CLOUD ENVIRONMENT
INTERNATIONAL JOURNAL OF ADVANCED RESEARCH IN ENGINEERING AND SCIENCE IMPLEMENTATION OF RELIABLE CACHING STRATEGY IN CLOUD ENVIRONMENT M.Swapna 1, K.Ashlesha 2 1 M.Tech Student, Dept of CSE, Lord s Institute
More informationIJCSES Vol.7 No.4 October 2013 pp.165-168 Serials Publications BEHAVIOR PERDITION VIA MINING SOCIAL DIMENSIONS
IJCSES Vol.7 No.4 October 2013 pp.165-168 Serials Publications BEHAVIOR PERDITION VIA MINING SOCIAL DIMENSIONS V.Sudhakar 1 and G. Draksha 2 Abstract:- Collective behavior refers to the behaviors of individuals
More informationExtension of Decision Tree Algorithm for Stream Data Mining Using Real Data
Fifth International Workshop on Computational Intelligence & Applications IEEE SMC Hiroshima Chapter, Hiroshima University, Japan, November 10, 11 & 12, 2009 Extension of Decision Tree Algorithm for Stream
More informationA Dynamic Load Balancing Strategy for Parallel Datacube Computation
A Dynamic Load Balancing Strategy for Parallel Datacube Computation Seigo Muto Institute of Industrial Science, University of Tokyo 7-22-1 Roppongi, Minato-ku, Tokyo, 106-8558 Japan +81-3-3402-6231 ext.
More informationOptimization of Image Search from Photo Sharing Websites Using Personal Data
Optimization of Image Search from Photo Sharing Websites Using Personal Data Mr. Naeem Naik Walchand Institute of Technology, Solapur, India Abstract The present research aims at optimizing the image search
More informationParallel Compression and Decompression of DNA Sequence Reads in FASTQ Format
, pp.91-100 http://dx.doi.org/10.14257/ijhit.2014.7.4.09 Parallel Compression and Decompression of DNA Sequence Reads in FASTQ Format Jingjing Zheng 1,* and Ting Wang 1, 2 1,* Parallel Software and Computational
More informationAn Ants Algorithm to Improve Energy Efficient Based on Secure Autonomous Routing in WSN
An Ants Algorithm to Improve Energy Efficient Based on Secure Autonomous Routing in WSN *M.A.Preethy, PG SCHOLAR DEPT OF CSE #M.Meena,M.E AP/CSE King College Of Technology, Namakkal Abstract Due to the
More informationHybrid model rating prediction with Linked Open Data for Recommender Systems
Hybrid model rating prediction with Linked Open Data for Recommender Systems Andrés Moreno 12 Christian Ariza-Porras 1, Paula Lago 1, Claudia Jiménez-Guarín 1, Harold Castro 1, and Michel Riveill 2 1 School
More informationultra fast SOM using CUDA
ultra fast SOM using CUDA SOM (Self-Organizing Map) is one of the most popular artificial neural network algorithms in the unsupervised learning category. Sijo Mathew Preetha Joy Sibi Rajendra Manoj A
More informationR-trees. R-Trees: A Dynamic Index Structure For Spatial Searching. R-Tree. Invariants
R-Trees: A Dynamic Index Structure For Spatial Searching A. Guttman R-trees Generalization of B+-trees to higher dimensions Disk-based index structure Occupancy guarantee Multiple search paths Insertions
More informationAn Adaptive Regression Tree for Non-stationary Data Streams
An Adaptive Regression Tree for Non-stationary Data Streams ABSTRACT Data streams are endless flow of data produced in high speed, large size and usually non-stationary environments. These characteristics
More information