Analyzing Trajectory Data

Size: px
Start display at page:

Download "Analyzing Trajectory Data"

Transcription

1 Analyzing Trajectory Data Gerasimos Marketos Free University of Bolzano 20/3/2009 Information Systems Lab (InfoLab) University of Piraeus (UniPi), Greece

2 Outline Preliminaries Trajectory Data Warehousing Our framework for trajectory analysis Reconstructing trajectories ETL processing Addressing the distinct count problem Traffic Mining Modeling traffic flow in a road network Our framework for mining traffic patterns Detecting traffic flow in a road network G. Marketos: Analyzing Trajectory Data 2

3 Preliminaries G. Marketos: Analyzing Trajectory Data 3

4 The big picture (the GeoPKDD project) Bandwidth/Power optimization Mobile cells planning Traffic Management Accessibility of services Mobility evolution Urban planning. Aggregative Location - based services Telecommunication provider interpretation visualization Public administration or business companies Privacy - aware Data mining GeoKnowledge trajectory reconstruction p(x)=0.02 ST patterns warehouse Trajectory warehouse Privacy enforcement G. Marketos: Analyzing Trajectory Data 4

5 A tour on GeoPKDD Mobility data management Acquiring and storing trajectories in MODs Location-aware querying Trajectory indexing Mobility data mining and the geographic KDD process Trajectory warehousing and OLAP Mobility data mining and reasoning Visual analytics for mobility data Privacy aspects on mobility data Anonymity-preserving mobility mining G. Marketos: Analyzing Trajectory Data 5

6 Key questions (for this talk) How to store and query trajectory data? Database technology is extended: Moving Object Databases How to reconstruct a trajectory from raw logs? Position devices provide us information just about location points and not about trajectories How to analyze trajectory data Data Warehousing technology is adapted to handle trajectory data Are there any spatio-temporal patterns in my data? New Data Mining algorithms are needed so as to discover such patterns Traffic Mining is a very interesting topic in this area G. Marketos: Analyzing Trajectory Data 6

7 Mobility Data Typical structure and size: N;Time;Lat;Lon;Height;Course;Speed;PDOP;State;NSat 8;22/03/07 08:51:52; ; ; 67.6;345.4;21.817;3.8;1808;4 9;22/03/07 08:51:56; ; ; 68.4;35.6;14.223;3.8;1808;4 10;22/03/07 08:51:59; ; ; 68.3;112.7;25.298;3.8;1808;4 11;22/03/07 08:52:03; ; ; 68.8;119.8;32.447;3.8;1808;4 12;22/03/07 08:52:06; ; ; 68.1;124.1;30.058;3.8;1808;4 13;22/03/07 08:52:09; ; ; 67.9;117.7;34.003;3.8;1808;4 14;22/03/07 08:52:12; ; ; 66.9;117.5;37.151;3.8;1808;4 15;22/03/07 08:52:15; ; ; 67.0;99.2;39.188;3.8;1808;4 16;22/03/07 08:52:18; ; ; 68.8;90.6;41.170;3.8;1808;4 17;22/03/07 08:52:21; ; ; 71.1;82.0;35.058;3.8;1808;4 18;22/03/07 08:52:24; ; ; 68.6;117.1;11.371;3.8;1808;4 G. Marketos: Analyzing Trajectory Data 7

8 Location data producers: GSM, GPS, WiFi Ti ( xi, yi, ti ),..., ( xi, yi, ti ni ni ni ) Location data (id, x, y, t) are generated Trajectory stream manager + Trajectory reconstruction trajectory data (obj-id, traj-id, (x, y, t) * ) are reconstructed T i ( x i, y i, t i ),..., ( x,, t ni ni ni i y i i ) Moving Object Database G. Marketos: Analyzing Trajectory Data 8

9 Moving Objects Databases The traditional database technology has been extended into Moving Object Databases (MODs) that handle modeling, indexing and query processing issues for trajectories Spatial and temporal dimensions are considered as first-class citizens. Both past and current (as well as anticipated future) positions of moving objects are of interest. Several prototype MODs DOMINO (Wolfson et al.) EDBT 02 PLACE (Mokbel et al.) VLDB 04 SECONDO (Güting et. al.) ICDE 05 HERMES (Pelekis et. al.) EDBT 06, SIGMOD 08 G. Marketos: Analyzing Trajectory Data 9

10 Hermes MOD engine Built-in ORACLE ORDBMS Data model: absolute vs. relative location coordinates Current location as a function in time over the starting location linear and arc movement functions MOD management Insert / Update / Delete a moving object or a segment of its trajectory Functions over trajectories or sets of trajectories Indexing support Supported indices: R-tree (for stationary data) Development of a specialized index (TB-tree) G. Marketos: Analyzing Trajectory Data 10

11 Hermes prototype architecture G. Marketos: Analyzing Trajectory Data 11

12 Hermes Moving Data Cartridge G. Marketos: Analyzing Trajectory Data 13

13 Hermes: trajectory data type Primitive definition: Unit_Function = d x i :double, y i :double, x e :double, y e :double, x c :double, y c :double, v:double, a:double, flag:typeoffunction, where TypeOfFunction={ CONST, PLNML_1, ARC_<1..8> } Unit_Moving_Point = d p: Period SEC, m: Unit_Function Moving_Point = d { tab: set Unit_Moving_Point constraints } xx' Y t ε [t1, t2) -> Linear movement Y' (x t,y t,t) (xe,y e,t e ) t ε [t2, t3) -> Arc movement t ε [t3, t4) -> Const movement (x i,y i,t i ) X' t ε [t4, t5) -> Linear movement tt' (x c,y c ) yy' o X t1 t2 t3 t4 t5 G. Marketos: Analyzing Trajectory Data 14

14 TB-Tree support in Hermes MOD engine TB-Tree Index (Pfoser et al., 2000) Maintains the trajectory concept Each node consists of segments of a single trajectory Nodes are linked together in a chain Effective for trajectory-oriented queries Implemented in Hermes using Oracle s indexing extensibility t1 t7 t3 t11 t1 t2 t3 t4 t5 t6 t7 t8 t9 t10 t11 t12 G. Marketos: Analyzing Trajectory Data 15

15 Trajectory Data Warehousing & OLAP G. Marketos: Analyzing Trajectory Data 16

16 Trajectory data warehousing The analysis of trajectory data raises opportunities for discovering behavioral patterns that can be exploited in various applications TDW should extract aggregate information from MOD support a variety of dimensions (temporal, spatial, thematic, ) and measures (about space, time and their derivatives) Storing measures associated with facts, concerning the set of trajs crossing the cell aggregate information in base cells Challenges design of a trajectory-oriented data cube high volume and complex nature of data; special query processing requirements extensions of traditional aggregation techniques to produce summary information for OLAP analysis G. Marketos: Analyzing Trajectory Data 17

17 Trajectory data warehousing A Trajectory Reconstruction process is applied on the raw timestamped location data in order to generate trajectories, which are then stored into a MOD An Extract-Transform-Load (ETL) procedure is activated that feeds the data cube(s) with aggregate information on trajectories The final step of the process offers OLAP (and, eventually, DM) capabilities over the aggregated information contained in the trajectory cube model trajectory data analyst location data producers Location data (id, x, y, t) are recorded Analysis over aggregate data is performed (OLAP) Trajectory Data Cube MOD Trajectory reconstruction module Aggregates are loaded in the data cube (ETL procedure) Reconstructed trajectory data are stored in MOD G. Marketos: Analyzing Trajectory Data 18

18 Related work Data Warehousing (DW) is widely investigated for conventional, non-spatial data. Some research work has been done in the area of Spatial Data Warehousing (Han et al., 1998). (Shekhar et al., 2001) extend the idea of cube dimensions so as to include spatial and non-spatial ones, and of cube measures so as to represent space regions and/or calculate numerical data One step beyond Spatial DW is modeling a TDW (Pelekis et al., 2007). A simple TDW model is presented in (Orlando et al., 2007). The notion of multiple aggregation paths is defined in (Jensen et al., 2004) The integration of spatial and temporal dimensions is proposed in (Tao and Papadias, 2005) G. Marketos: Analyzing Trajectory Data 19

19 Store Conventional DWs In conventional DWs, measures about entities (e.g. shopping baskets, customers etc.) refer to a specific base cell that are formed by the dimensions distinct objects appear only once at the minimum abstraction level of the cube Time 1/1 2/1 3/1 TV VCR PC 101 count 201 Count of transactions 301 A transaction cannot be found in both base cells sum G. Marketos: Analyzing Trajectory Data 20

20 Location Trajectory DW The challenge: In TDW, measures regarding trajectories may affect more than one base cell the same object might affect measures in several base cell due to its movement (!!) 50 < Age 30 < Age 50 Age 30 count Time Count of trajectories R1 R2 R3 A trajectory can be found in both base cells (the distinct counting problem) sum G. Marketos: Analyzing Trajectory Data 21

21 Sample schemas Moving Object Database OBJECTS (object-id: identifier, description: text, gender: {M F}, birth-date: date, profession: text, device-type: text) RAW_LOCATIONS (object-id: identifier, timestamp: datetime, eastings-x: numeric, northings-y: numeric, altitude-z: numeric) MOD_TRAJECTORIES (trajectory-id: identifier, object-id: identifier, trajectory: 3D geometry) Trajectory Data Warehouse Dimensions: Spatial, Temporal, Object Profile Measures: count (trajectories), count (users), avg (distance traveled), avg (travel duration), avg (speed), avg (abs (acceler) ) PK SPACE_DIM PARTITION_ID PARTITION_GEOMETRY DISTRICT CITY STATE COUNTRY PK,FK3 PK,FK2 PK,FK1 OBJECT_PROFILE_DIM PK OBJPROFILE_ID GENDER BIRTHYEAR PROFESSION MARITAL_STATUS DEVICE_TYPE FACT_TBL INTERVAL_ID PARTITION_ID OBJPROFILE_ID COUNT_TRAJECTORIES COUNT_USERS AVG_DISTANCE_TRAVELED AVG_TRAVEL_DURATION AVG_SPEED AVG_ABS_ACCELER PK TIME_DIM INTERVAL_ID INTERVAL_START INTERVAL_END HOUR DAY MONTH QUARTER YEAR DAY_OF_WEEK RUSH_HOUR G. Marketos: Analyzing Trajectory Data 22

22 Our framework for trajectory analysis We focus our research on three issues that are critical to trajectory data warehousing: The preprocessing phase that deals with the explicit reconstruction of the trajectories, which are then stored into a MOD Alternative ETL processes that feed the trajectory data warehouse The solutions proposed on the challenging issue of measure aggregation G. Marketos: Analyzing Trajectory Data 23

23 Reconstructing trajectories Collected raw data represent time-stamped geographical locations Apart from storing raw data in the MOD, we are also interested in reconstructing trajectories: Raw points arrive in bulk sets We need a filter that decides if the new series of data is to be appended to an existing trajectory or not: Tolerance distance Temporal gap Spatial gap Maximum speed Maximum noise duration t y t y x x G. Marketos: Analyzing Trajectory Data 24

24 Reconstructing trajectories: parameters Tolerance distance The tolerance of the transmitted time-stamped positions. In other words, it is the maximum distance between two consecutive time-stamped positions of the same object in order for the object to be considered as stationary t y t y x x G. Marketos: Analyzing Trajectory Data 25

25 Reconstructing trajectories: parameters Tolerance distance Temporal gap between trajectories The maximum allowed time interval between two consecutive time-stamped positions of the same trajectory for a single moving object t y t y temporal gap x x G. Marketos: Analyzing Trajectory Data 26

26 Reconstructing trajectories: parameters Tolerance distance Temporal gap between trajectories Spatial gap between trajectories The maximum allowed distance in 2D plane between two consecutive time-stamped positions of the same trajectory t y t y spatial gap x x G. Marketos: Analyzing Trajectory Data 27

27 Reconstructing trajectories: parameters Tolerance distance Temporal gap between trajectories Spatial gap between trajectories Maximum speed It is used in order to determine whether a reported time-stamped position must be considered as noise and consequently discarded from the output trajectory t y t y x x G. Marketos: Analyzing Trajectory Data 28

28 Reconstructing trajectories: parameters Tolerance distance Temporal gap between trajectories Spatial gap between trajectories Maximum speed Maximum noise duration The maximum duration of a noisy part of a trajectory. Any sequence of noisy time-stamped positions of the same object will result in a new trajectory given that its duration exceeds noise max t y t y x x G. Marketos: Analyzing Trajectory Data 29

29 Reconstructing trajectories: algorithm As a first step (lines 1-6), the algorithm checks whether the object has been processed if so, it retrieves its partial trajectory from the corresponding list otherwise, it creates a new trajectory and adds it to list Then (lines 7-31), it compares the incoming point P with the tail of the partial trajectory (LastPoint) by applying the trajectory reconstruction parameters Algorithm Trajectory-Reconstruction (PartialTrajectories List, P Point, OId ObjectId) IF NOT PartialTrajectories.Contains(OId) THEN CTrajectory=New Trajectory; CTrajectory.AddPoint(P); PartialTrajectories.Add(CTrajectory); ELSE CTrajectory=PartialTrajectories(OId); IF Distance(CTrajectory.LastPoint,P)<= D TOL THEN IF P.T CTrajectory.LastPoint.T > gap Time THEN Report Ctrajectory.LastPoint; CTrajectory.Id=CTrajectory.Id+1; CTrajectory.AddPoint(P); ENDIF ELSEIF Speed(CTrajectory.LastPoint,P)> V max THEN IF P.T CTrajectory.LastPoint.T > noise max THEN Report CTrajectory.Noise; ELSE CTrajectory.AddNoise(P); ENDIF ELSEIF Distance(CTrajectory.LastPoint,P)> gap space THEN Report Ctrajectory.LastPoint; CTrajectory.Id=CTrajectory.Id+1; CTrajectory.AddPoint(P); ELSE IF P.T CTrajectory.LastPoint.T > gap Time THEN Report Ctrajectory.LastPoint; CTrajectory.Id=CTrajectory.Id+1; CTrajectory.AddPoint(P); ELSE CTrajectory.AddPoint(P); ENDIF ENDIF ENDIF G. Marketos: Analyzing Trajectory Data 30

30 Our framework for trajectory analysis We focus our research on three issues that are critical to trajectory data warehousing: The preprocessing phase that deals with the explicit reconstruction of the trajectories, which are then stored into a MOD Alternative ETL processes that feed the trajectory data warehouse The solutions proposed on the challenging issue of measure aggregation G. Marketos: Analyzing Trajectory Data 31

31 ETL processing: loading Loading data into the dimension tables straightforward Loading data into the fact table complex Fill in the measures with the appropriate numeric values In order to calculate the measures, we have to extract the portions of the trajectories that fit into the base cells of the cube We propose two alternative solutions to this problem: cell-oriented trajectory-oriented y G. Marketos: Analyzing Trajectory Data 32 x

32 ETL processing: measures Measure COUNT_ TRAJECTORIES COUNT_USERS AVG_DISTANCE_ TRAVELED AVG_TRAVEL_ DURATION AVG_SPEED AVG_ABS_ ACCELER Formula count all distinct trajectory ids that pass through base cell (bc) count all the distinct object ids that pass through bc SUM _ DISTANCE ( bc) AVG _ DISTANCE _ TRAVELED ( bc) COUNT _ TRAJECTORIES( bc) SUM _ DISTANCEbc ( ) len( TP) SUM _ DURATION ( bc) AVG _ TRAVEL _ DURATION ( bc) COUNT _ TRAJECTORIES( bc) SUM _ DURATIONbc ( ) lifespantp ( ) SUM _ SPEED( bc) AVG _ SPEED( bc) COUNT _ TRAJECTORIES( bc) SUM _ SPEED( bc) speed fin ( TPi ) speedinit ( TPi ) SUM _ ABS _ ACCELER ( bc) G. Marketos: Analyzing Trajectory Data lifespan TP bc 33 ( TPi ) 33 TP i TP TP i bc i bc bc i len( TPi ) lifespan( TP ) SUM _ ABS _ ACCELER ( bc) AVG _ ABS _ ACCELER ( bc) COUNT _ TRAJECTORIES( bc) i i i

33 ETL processing: algorithms Cell-oriented approach (COA) Search for the portions of trajectories that they reside inside a spatiotemporal cell Perform a spatiotemporal range query that returns the portions of trajectories that satisfy the range constraints This is efficiently supported by the TB-tree Decompose the trajectory portions with respect to the user profiles they belong to Compute measures for this cell Repeat for the next cells y COUNT_TRAJECTORIES = 2 COUNT_USERS = 2 x G. Marketos: Analyzing Trajectory Data 34

34 ETL processing: algorithms Trajectory-oriented approach (TOA) Discover the spatiotemporal cells where each trajectory resides in In order to avoid checking all cells, use the trajectory MBR Identify the cells that overlap with the MBR and contain portions of the trajectory Compute measures for each cell Repeat for the next trajectories y COUNT_TRAJECTORIES = 12 COUNT_USERS = 12 x G. Marketos: Analyzing Trajectory Data 35

35 Our framework for trajectory analysis We focus our research on three issues that are critical to trajectory data warehousing: The preprocessing phase that deals with the explicit reconstruction of the trajectories, which are then stored into a MOD Alternative ETL processes that feed the trajectory data warehouse The solutions proposed on the challenging issue of measure aggregation G. Marketos: Analyzing Trajectory Data 36

36 The distinct count problem: definition During the ETL process, measures can be computed in an accurate way by executing MOD queries Once the fact table has been fed, aggregate-only information is stored inside the TDW (no trajectory / user ids) When rolling up, COUNT_USERS, COUNT_TRAJECTORIES and, hence, all other measures defined over COUNT_TRAJECTORIES are subject to the distinct count problem (Tao et al., 2004): if an object remains in the query region for several timestamps during the query interval, instead of counting this object once, it is counted multiple times in the result y G. Marketos: Analyzing Trajectory Data 37 x

37 The distinct count problem: solution (1/3) We store in the base cells (C (x,y),t,p ) a tuple of auxiliary measures that help us correct the errors due to the duplicates when rolling-up: C (x,y),t,p.traj : number of distinct trajectories of profile p intersecting the cell C (x,y),t,p.cross-x: number of distinct trajectories of profile p crossing the spatial border between C (x-1,y),t,p and C (x,y),t,p C (x,y),t,p.cross-y: number of distinct trajectories of profile p crossing the spatial border between C (x,y-1),t,p and C (x,y),t,p C (x,y),t,p.cross-t: number of distinct trajectories of profile p crossing the temporal border between C (x,y),t-1,p and C (x,y),t,p T Cell C (x,y),t,p X Y G. Marketos: Analyzing Trajectory Data 38 38

38 The distinct count problem: solution (2/3) Let C (x,y ),t,p be a cell consisting of the union of two adjacent cells (i.e. C (x,y),t.p C (x+1,y),t,p ) In order to compute the number of distinct trajectories: C (x,y ),t,p.traj = C (x,y),t,p.traj + C (x+1,y),t,p.traj C (x+1,y),t,p.cross-x application of the well-known Inclusion/Exclusion principle for sets: A B = A + B A B BUT in some cases it holds that C (x+1,y),t,p.cross-x A B Example: fast and agile trajectories G. Marketos: Analyzing Trajectory Data 39

39 The distinct count problem: solution (3/3) Compute the number of distinct trajectories: C x,y,t,p.traj = 1 C x+1,y,t,p.cross-x = 1 C x+1,y,t,p.traj = 1 C x+1,y,t,p.cross-x = 0 C x,y,t,p C x+1,y,t,p C x,y,t,p C x+1,y,t,p C x,y+1,t,p C x+1,y+1,t,p C x,y+1,t,p C x+1,y+1,t,p (a) Correct! (b) Not Correct! G. Marketos: Analyzing Trajectory Data 40

40 Conclusions (for this part) We explored solutions for the efficient and effective development of trajectory warehouses Related work does not include research work on the complete flow of processes in a TDW In particular, we discussed about techniques for: the solution of the trajectory reconstruction problem efficient ETL support for trajectory data handling measure aggregation issues, with a special attention to the distinct count problem G. Marketos: Analyzing Trajectory Data 41

41 Future work Extend the proposed framework by applying OLAP analysis and DM techniques over the aggregated data Examine new measures for TDW, specifically suited for trajectories. Two examples: A typical trajectory describing the trend of movement within a cell The average direction measure of the trajectories within a cell G. Marketos: Analyzing Trajectory Data 42

42 Traffic Mining G. Marketos: Analyzing Trajectory Data 43

43 The traffic problem and challenges Traffic in cities is a well-known problem: the majority of people living in large cities and use cars for their transportation the number of cars moving in a city increases from year to year Recently, the technology achievements allow us to collect huge amount of traffic related data: mobile phones, traffic cameras, sensors, GPS devices How can we exploit this huge volume of data so as to gain insights on the traffic problem? The focus of our work is to detect how the different road segments of the network are related to each other. G. Marketos: Analyzing Trajectory Data 44

44 Related work (Liu et al, 2006), proposed a distributed traffic stream mining system they focus on description of the distributed traffic stream system, rather on the discovery of traffic related patterns (Li et al., 2007) proposed a technique for the discovery of hot routes in a road network They introduced a density-based algorithm, called FlowScan The algorithm requires the trajectories of the objects that move within the network G. Marketos: Analyzing Trajectory Data 45

45 How can the road segments be related? - 1 Kifissias Ave. G. Marketos: Analyzing Trajectory Data 46

46 How can the road segments be related? - 2 Poseidonos Ave. Syngrou Ave. Poseidonos Ave. G. Marketos: Analyzing Trajectory Data 47

47 How can the road segments be related? - 3 Kifissias Ave. Mesogeion Ave. Vasilissis Sofias Ave. G. Marketos: Analyzing Trajectory Data 48

48 Network Traffic We consider a fixed network consisting of a set of nonoverlapping regions regions = road intersections or landmarks of interest R 1 R 2 R 4 R 3 G. Marketos: Analyzing Trajectory Data 49

49 Network Graph The network is modeled as a directed graph G=(V,E) nodes V regions edges E direct connections between regions R 1 R 2 R 3 R 4 R 5 R 6 R 7 R 8 R 9 R 10 R 11 R 12 R 13 R 14 R 15 R 16 G. Marketos: Analyzing Trajectory Data 50

50 Capturing traffic through sensors Each edge e=(v, v ) is equipped with sensor technology that captures the movement from region v to region v. Definition: The traffic series of a sensor s S during a time period [t s, t e ] consists of the number of cars passed through this sensor during this period, recorded at Δt intervals and ordered in time: TS s = {v i, t i }, t s t i t e, Δt=t i -t i-1 the transmission rate of the sensor t s t s +Δ t t s +2 Δt t e time v e v G. Marketos: Analyzing Trajectory Data 51

51 Network Traffic Traffic series of the network: TS = {TS s, s S} R 1 R 2 R 3 R 4 R 5 R 6 R 7 R 8 R 9 R 10 R 11 R 12 R 13 R 14 R 15 R 16 G. Marketos: Analyzing Trajectory Data 52

52 Similarity between edges: value distance Let e 1, e 2 be two network edges and let TS 1 ={(v 1i,t i )}, TS 2 ={(v 2i,t i )} be their corresponding traffic series, t i [t s, t e ] Definition: The value-based distance between e 1, e 2 during periods p and p is given by the absolute distance of their corresponding traffic series TS 1, TS 2 at these periods: dis value ( e p 1, e p' 2 ) dis value ( TS p 1, TS p' 2 ) G. Marketos: Analyzing Trajectory Data 53

53 Similarity between edges: shape distance Value-based distance is not appropriate for shape similarity task since it is sensible to: different baselines e.g. one series fluctuating around 100 and the other series fluctuating around 30 different scales e.g. one series having a small amplitude between 90 and 100 and the other series having a larger amplitude between 20 and 40 To deal with this problem, different techniques can be applied for comparing time series, e.g.: Dynamic Time Warping that allows us to compare time series either on the same or adjacent time periods Correlation-Coefficient measure Euclidean distance on normalized time series G. Marketos: Analyzing Trajectory Data 54

54 Traffic relationships Traffic propagation traffic from e 12 propagates to e 23 This might indicate objects that continue moving in a highway p p' Condition: dis ( TS, TS ) value e 12 e 23 value R 1 R 5 e 12 R 2 R 6 e 23 R 3 R 7 R 4 R 8 Traffic split/ spread traffic from e 12 splits into e 23 and e 26 This might indicate objects that leave a highway and follow different directions to their destination Conditions: R 1 e 12 e 26 R 2 e 23 R 3 R 4 dis shape ( TS, TS 23, 62 ) dis ( TS,( TS, TS )) value Traffic merge p e 12 p e 12 p' e e p' e 23 shape p' e 26 value R 5 R 6 R 7 R 8 traffic to e 23 merges traffic from e 12 and e 62 This might indicate objects that enter a highway from different directions Conditions: dis shape ( TS, TS 12, 62 ) dis TS, TS ), TS ) value p e 23 p' e p' e e p' e (( shape p e, value e62e R 1 e 12 R 2 23 R 3 R 4 R 5 R 6 R 7 R 8 G. Marketos: Analyzing Trajectory Data 55

55 Traffic relationships Traffic sink traffic in e 12 is sank because there is no propagation or split relationship with its outgoing edges R 1 e 12 R 2 R 3 R 4 Traffic source R 5 R 6 R 7 R 8 traffic starts (source) from e 23 because there is no propagation or merge relationship with its incoming edges R 1 R 2 R 3 e 23 e 34 R 4 R 5 R 6 R 7 R 8 G. Marketos: Analyzing Trajectory Data 56

56 Conclusions (for this part) We consider the problem of mining traffic flow in a road network monitored through sensors placed at regions of interest (crossroads, etc.). We employ edge similarity measures based on time series comparison Traffic relationships are introduced and formally defined G. Marketos: Analyzing Trajectory Data 57

57 Thank you! Questions? Acknowledgement: Research partially supported by EU under the GeoPKDD (Geographic Privacy-aware Knowledge Discovery and Delivery) G. Marketos: Analyzing Trajectory Data 58

58 References Han, J., Stefanovic, N., and Koperski, K. Selective Materialization: An Efficient Method for Spatial Data Cube Construction. Proc. PAKDD, Jensen, C.S., Kligys, A., Pedersen, T.B., Dyreson, C.E., and Timko, I. Multidimensional data modeling for locationbased services, The VLDB Journal, 13:1 21, Li, X., Han, J., Lee, J.-G. and Gonzalez, H.: Traffic density-based discovery of hot routes in road networks. in Proc. 10th International Symposium on Spatial and Temporal Databases., Liu, Y., Choudhary, A. N., Zhou, J., and Khokhar, A. A.: A scalable distributed stream mining system for highway traffic data. in Proc. 10th European Conference on Principles and Practice of Knowledge Discovery in Databases, 2006, pp Orlando, S., Orsini, R., Raffaetà, A., Roncato, A., and Silvestri, C. Spatio-Temporal Aggregations in Trajectory Data Warehouses. Proc. DaWaK, Pelekis, N., Frentzos, E., Giatrakos, N., Theodoridis, Y.: HERMES: aggregative LBS via a trajectory DB engine. SIGMOD Conference 2008: Pelekis, N., Raffaetà, A., Damiani, M.-L., Vangenot, C., Marketos, G., Frentzos, E., Ntoutsi, I., and Theodoridis, Y. Towards Trajectory Data Warehouses. Chapter in Mobility, Data Mining and Privacy: Geographic Knowledge Discovery. Springer-Verlag Pfoser, D., Jensen, C.S., and Theodoridis, Y. Novel Approaches to the Indexing of Moving Object Trajectories, Proc. VLDB, Shekhar, S.., Lu, C., Tan, X., Chawla, S., Vatsavai, R. Map Cube: a Visualization Tool for Spatial Data Warehouses, Chapter in Geographic Data Mining and Knowledge Discovery. Taylor and Francis, Tao, T., and Papadias, D. Historical Spatio-Temporal Aggregation. ACM TODS, 23(1):61-102, Tao, Y., Kollios, G., Considine, J., Li, F., and Papadias, D. Spatio-Temporal Aggregation Using Sketches. Proc. ICDE, G. Marketos: Analyzing Trajectory Data 59

59 Selected literature on Mobility Data Modeling & MOD engines Nikos Pelekis, Elias Frentzos, Nikos Giatrakos, Yannis Theodoridis: HERMES: aggregative LBS via a trajectory DB engine. SIGMOD Conference 2008: Nikos Pelekis, Yannis Theodoridis, Spyros Vosinakis, Themis Panayiotopoulos: Hermes - A Framework for Location-Based Data Management. EDBT 2006: Güting, R.H. et al. (2000) A Foundation for Representing and Querying Moving Objects. ACM Transactions on Database Systems, 25(1):1-42. Karimi, H. and X. Liu (2003) A Predictive Location Model for Location-Based Services, Proceedings of ACM- GIS. Pelekis, N. et al. (2005) An Oracle Data Cartridge for Moving Objects. UNIPI-ISL-TR , Univ. of Piraeus. Retrieved from Schlieder, C. et al. (2001) Location Modeling for Intentional Behavior in Spatial Partonomies. Proceedings of Location Modeling for Ubiquitous Computing Workshop. Sistla, P. et al. (1997) Modeling and Querying Moving Objects. Proceedings of IEEE ICDE Conference. Wolfson, O. et al. (1998) Moving Objects Databases: Issues and Solutions. Proceedings of SSDBM Conference. G. Marketos: Analyzing Trajectory Data 60

60 Selected literature on Mobility Data Warehousing Han, J., Stefanovic, N., and Koperski, K. Selective Materialization: An Efficient Method for Spatial Data Cube Construction. Proc. PAKDD, Jensen, C.S., Kligys, A., Pedersen, T.B., Dyreson, C.E., and Timko, I. Multidimensional data modeling for location-based services, The VLDB Journal, 13:1 21, Leonardi, L., Orlando, S., Raffaetà, A., Roncato, A. and Silvestri, C. Frequent Spatio-Temporal Patterns in Trajectory Data Warehouses. Proceedings of 24th Annual ACM Symposium on Applied Computing, 2009 Marketos, G., Frentzos, E., Ntoutsi, I., Pelekis, N., Raffaetà, A., and Theodoridis, Y. Building Real World Trajectory Warehouses. Proc. MobiDE 08, Vancouver, Canada Orlando, S., Orsini, R., Raffaetà, A., Roncato, A., and Silvestri, C. Spatio-Temporal Aggregations in Trajectory Data Warehouses. Proc. DaWaK, Pelekis, N., Raffaetà, A., Damiani, M.-L., Vangenot, C., Marketos, G., Frentzos, E., Ntoutsi, I., and Theodoridis, Y. Towards Trajectory Data Warehouses. Chapter in Mobility, Data Mining and Privacy: Geographic Knowledge Discovery. Springer-Verlag Pfoser, D., Jensen, C.S., and Theodoridis, Y. Novel Approaches to the Indexing of Moving Object Trajectories, Proc. VLDB, Shekhar, S., Lu, C., Tan, X., Chawla, S., Vatsavai, R. Map Cube: a Visualization Tool for Spatial Data Warehouses, Chapter in Geographic Data Mining and Knowledge Discovery. Taylor and Francis, Tao, Y., Kollios, G., Considine, J., Li, F., and Papadias, D. Spatio-Temporal Aggregation Using Sketches. Proc. ICDE, G. Marketos: Analyzing Trajectory Data 61

61 Selected literature on Mining Traffic Patterns Horvitz, E., Apacible, J., Sarin, R., and Liao, L.: Prediction, expectation, and surprise: Methods, designs, and study of a deployed traffic forecasting service, in In Twenty-First Conference on Uncertainty in Artificial Intelligence, Li, X., Han, J., Lee, J.-G. and Gonzalez, H.: Traffic density-based discovery of hot routes in road networks. in Proc. 10th International Symposium on Spatial and Temporal Databases., Liu, Y., Choudhary, A. N., Zhou, J., and Khokhar, A. A.: A scalable distributed stream mining system for highway traffic data. in Proc. 10th European Conference on Principles and Practice of Knowledge Discovery in Databases, 2006, pp Nakata T., and Takeuchi, J.: Mining traffic data from probe-car system for travel time prediction, in Proc. 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2004, pp Ntoutsi, I., Mitsou, N., Marketos, G.: Traffic mining in a road-network: How does the traffic flow? IJBIDM 3(1): (2008) Shekhar, S., Lu, C.-T., Chawla, S., and Zhang, P.: Data mining and visualization of twin-cities traffic data, in Technical Report (TR ), University of Minnesota, G. Marketos: Analyzing Trajectory Data 62

Traffic mining in a road-network: How does the

Traffic mining in a road-network: How does the 82 Int. J. Business Intelligence and Data Mining, Vol. 3, No. 1, 2008 Traffic mining in a road-network: How does the traffic flow? Irene Ntoutsi Department of Informatics, University of Piraeus, Greece

More information

SPATIO-TEMPORAL QUERIES FOR MOVING OBJECTS DATA WAREHOUSING

SPATIO-TEMPORAL QUERIES FOR MOVING OBJECTS DATA WAREHOUSING SPATIO-TEMPORAL QUERIES FOR MOVING OBJECTS DATA WAREHOUSING 1 Leila Esheiba, Hoda M.O.Mokhtar 2, and Mohamed El-Sharkawi 3 1 Department of Information Systems, Faculty of Computers and Information, Cairo

More information

Trajectory Data Warehouses: Design and Implementation Issues

Trajectory Data Warehouses: Design and Implementation Issues Trajectory Data Warehouses: Design and Implementation Issues Salvatore Orlando, Renzo Orsini, Alessandra Raffaetà, Alessandro Roncato Dipartimento di Informatica - Università Ca Foscari Venezia {orlando,

More information

Mining Mobile Group Patterns: A Trajectory-Based Approach

Mining Mobile Group Patterns: A Trajectory-Based Approach Mining Mobile Group Patterns: A Trajectory-Based Approach San-Yih Hwang, Ying-Han Liu, Jeng-Kuen Chiu, and Ee-Peng Lim Department of Information Management National Sun Yat-Sen University, Kaohsiung, Taiwan

More information

Tracking System for GPS Devices and Mining of Spatial Data

Tracking System for GPS Devices and Mining of Spatial Data Tracking System for GPS Devices and Mining of Spatial Data AIDA ALISPAHIC, DZENANA DONKO Department for Computer Science and Informatics Faculty of Electrical Engineering, University of Sarajevo Zmaja

More information

St-Toolkit: A Framework for Trajectory Data Warehousing

St-Toolkit: A Framework for Trajectory Data Warehousing St-Toolkit: A Framework for Trajectory Data Warehousing Simone Campora 1, Jose Antonio Fernandes de Macedo 2, Laura Spinsanti 3 1 LBD,EPFL Lausanne, CH 2 LIA, UFC Fortaleza, BR 3 LBD,EPFL Lausanne CH)

More information

Oracle8i Spatial: Experiences with Extensible Databases

Oracle8i Spatial: Experiences with Extensible Databases Oracle8i Spatial: Experiences with Extensible Databases Siva Ravada and Jayant Sharma Spatial Products Division Oracle Corporation One Oracle Drive Nashua NH-03062 {sravada,jsharma}@us.oracle.com 1 Introduction

More information

CONTENTS. List of Contributors Preface Acknowledgments. mobility data modeling and representation

CONTENTS. List of Contributors Preface Acknowledgments. mobility data modeling and representation CONTENTS List of Contributors Preface Acknowledgments page xi xv xix part i mobility data modeling and representation 1 Trajectories and Their Representations 3 S. Spaccapietra, C. Parent, L. Spinsanti

More information

Analyzing Polls and News Headlines Using Business Intelligence Techniques

Analyzing Polls and News Headlines Using Business Intelligence Techniques Analyzing Polls and News Headlines Using Business Intelligence Techniques Eleni Fanara, Gerasimos Marketos, Nikos Pelekis and Yannis Theodoridis Department of Informatics, University of Piraeus, 80 Karaoli-Dimitriou

More information

A Survey on Spatio-Temporal Data Warehousing

A Survey on Spatio-Temporal Data Warehousing A Survey on Spatio-Temporal Data Warehousing Leticia Gómez Instituto Tecnológico de Buenos Aires Bart Kuijpers, Bart Moelans, Alejandro Vaisman University of Hasselt and Transnational University of Limburg,

More information

Introduction to Data Mining

Introduction to Data Mining Introduction to Data Mining 1 Why Data Mining? Explosive Growth of Data Data collection and data availability Automated data collection tools, Internet, smartphones, Major sources of abundant data Business:

More information

IMPLEMENTING SPATIAL DATA WAREHOUSE HIERARCHIES IN OBJECT-RELATIONAL DBMSs

IMPLEMENTING SPATIAL DATA WAREHOUSE HIERARCHIES IN OBJECT-RELATIONAL DBMSs IMPLEMENTING SPATIAL DATA WAREHOUSE HIERARCHIES IN OBJECT-RELATIONAL DBMSs Elzbieta Malinowski and Esteban Zimányi Computer & Decision Engineering Department, Université Libre de Bruxelles 50 av.f.d.roosevelt,

More information

Spatial Data Warehouse and Mining. Rajiv Gandhi

Spatial Data Warehouse and Mining. Rajiv Gandhi Spatial Data Warehouse and Mining Rajiv Gandhi Roll Number 05331002 Centre of Studies in Resource Engineering Indian Institute of Technology Bombay Powai, Mumbai -400076 India. As part of the first stage

More information

CubeView: A System for Traffic Data Visualization

CubeView: A System for Traffic Data Visualization CUBEVIEW: A SYSTEM FOR TRAFFIC DATA VISUALIZATION 1 CubeView: A System for Traffic Data Visualization S. Shekhar, C.T. Lu, R. Liu, C. Zhou Computer Science Department, University of Minnesota 200 Union

More information

A Seismic Data Management and Mining System

A Seismic Data Management and Mining System A Seismic Data Management and Mining System Sotiris Brakatsoulas and Yannis Theodoridis Computer Technology Institute, P.O. Box 1122, GR-26110 Patras, Greece http://www.cti.gr/rd3/dke Abstract. A Seismic

More information

Continuous Spatial Data Warehousing

Continuous Spatial Data Warehousing Continuous Spatial Data Warehousing Taher Omran Ahmed Faculty of Science Aljabal Algharby University Azzentan - Libya Taher.ahmed@insa-lyon.fr Abstract Decision support systems are usually based on multidimensional

More information

EasyTracker: An Android application for capturing mobility behavior

EasyTracker: An Android application for capturing mobility behavior EasyTracker: An Android application for capturing mobility behavior Alexandros Doulamis Dept. of Informatics University of Piraeus Piraeus, Greece alex.doulamis@gmail.com Nikos Pelekis Dept. of Statistics

More information

Continuous Fastest Path Planning in Road Networks by Mining Real-Time Traffic Event Information

Continuous Fastest Path Planning in Road Networks by Mining Real-Time Traffic Event Information Continuous Fastest Path Planning in Road Networks by Mining Real-Time Traffic Event Information Eric Hsueh-Chan Lu Chi-Wei Huang Vincent S. Tseng Institute of Computer Science and Information Engineering

More information

CHAPTER-24 Mining Spatial Databases

CHAPTER-24 Mining Spatial Databases CHAPTER-24 Mining Spatial Databases 24.1 Introduction 24.2 Spatial Data Cube Construction and Spatial OLAP 24.3 Spatial Association Analysis 24.4 Spatial Clustering Methods 24.5 Spatial Classification

More information

Building Data Cubes and Mining Them. Jelena Jovanovic Email: jeljov@fon.bg.ac.yu

Building Data Cubes and Mining Them. Jelena Jovanovic Email: jeljov@fon.bg.ac.yu Building Data Cubes and Mining Them Jelena Jovanovic Email: jeljov@fon.bg.ac.yu KDD Process KDD is an overall process of discovering useful knowledge from data. Data mining is a particular step in the

More information

IMPROVING DATA INTEGRATION FOR DATA WAREHOUSE: A DATA MINING APPROACH

IMPROVING DATA INTEGRATION FOR DATA WAREHOUSE: A DATA MINING APPROACH IMPROVING DATA INTEGRATION FOR DATA WAREHOUSE: A DATA MINING APPROACH Kalinka Mihaylova Kaloyanova St. Kliment Ohridski University of Sofia, Faculty of Mathematics and Informatics Sofia 1164, Bulgaria

More information

Distributed Continuous Range Query Processing on Moving Objects

Distributed Continuous Range Query Processing on Moving Objects Distributed Continuous Range Query Processing on Moving Objects Haojun Wang, Roger Zimmermann, and Wei-Shinn Ku University of Southern California, Computer Science Department, Los Angeles, CA, USA {haojunwa,

More information

Survey On: Nearest Neighbour Search With Keywords In Spatial Databases

Survey On: Nearest Neighbour Search With Keywords In Spatial Databases Survey On: Nearest Neighbour Search With Keywords In Spatial Databases SayaliBorse 1, Prof. P. M. Chawan 2, Prof. VishwanathChikaraddi 3, Prof. Manish Jansari 4 P.G. Student, Dept. of Computer Engineering&

More information

The STC for Event Analysis: Scalability Issues

The STC for Event Analysis: Scalability Issues The STC for Event Analysis: Scalability Issues Georg Fuchs Gennady Andrienko http://geoanalytics.net Events Something [significant] happened somewhere, sometime Analysis goal and domain dependent, e.g.

More information

Indexing and Retrieval of Historical Aggregate Information about Moving Objects

Indexing and Retrieval of Historical Aggregate Information about Moving Objects Indexing and Retrieval of Historical Aggregate Information about Moving Objects Dimitris Papadias, Yufei Tao, Jun Zhang, Nikos Mamoulis, Qiongmao Shen, and Jimeng Sun Department of Computer Science Hong

More information

Speed Up Your Moving Object Using Spatio-Temporal Predictors

Speed Up Your Moving Object Using Spatio-Temporal Predictors Time-Series Prediction with Applications to Traffic and Moving Objects Databases Bo Xu Department of Computer Science University of Illinois at Chicago Chicago, IL 60607, USA boxu@cs.uic.edu Ouri Wolfson

More information

Mobility data analysis to understand unknown diseases behavior The case of facial paralysis

Mobility data analysis to understand unknown diseases behavior The case of facial paralysis Mobility data analysis to understand unknown diseases behavior The case of facial paralysis Jalel Akaichi PhD, University of Sciences and Technologies of Lille Associate Professor, University of Tunis

More information

Privacy-preserving data warehousing for spatiotemporal

Privacy-preserving data warehousing for spatiotemporal Privacy-preserving data warehousing for spatiotemporal data Maria L. Damiani, Università Milano (I) GEOPKDD - Meeting Venezia 17 Oct 05 1 Report The report contains two contributions: M.L. Damiani, S.

More information

A Platform for Supporting Data Analytics on Twitter: Challenges and Objectives 1

A Platform for Supporting Data Analytics on Twitter: Challenges and Objectives 1 A Platform for Supporting Data Analytics on Twitter: Challenges and Objectives 1 Yannis Stavrakas Vassilis Plachouras IMIS / RC ATHENA Athens, Greece {yannis, vplachouras}@imis.athena-innovation.gr Abstract.

More information

Multi-dimensional index structures Part I: motivation

Multi-dimensional index structures Part I: motivation Multi-dimensional index structures Part I: motivation 144 Motivation: Data Warehouse A definition A data warehouse is a repository of integrated enterprise data. A data warehouse is used specifically for

More information

Taxi Trajectories and Multidimensional Modeling

Taxi Trajectories and Multidimensional Modeling A CONCEPTUAL TRAJECTORY MULTIDIMENSIONAL MODEL: AN APPLICATION TO PUBLIC TRANSPORTATION UN MODELO CONCEPTUAL MULTIDIMENSIONAL PARA TRAYECTORIAS: UNA APLICACIÓN EN EL TRANSPORTE PÚBLICO FRANCISCO MORENO

More information

MOBILITY DATA MODELING AND REPRESENTATION

MOBILITY DATA MODELING AND REPRESENTATION PART I MOBILITY DATA MODELING AND REPRESENTATION 1 Trajectories and Their Representations Stefano Spaccapietra, Christine Parent, and Laura Spinsanti 1.1 Introduction For a long time, applications have

More information

Discovering Trajectory Outliers between Regions of Interest

Discovering Trajectory Outliers between Regions of Interest Discovering Trajectory Outliers between Regions of Interest Vitor Cunha Fontes 1, Lucas Andre de Alencar 1, Chiara Renso 2, Vania Bogorny 1 1 Dep. de Informática e Estatística Universidade Federal de Santa

More information

On Mining Group Patterns of Mobile Users

On Mining Group Patterns of Mobile Users On Mining Group Patterns of Mobile Users Yida Wang 1, Ee-Peng Lim 1, and San-Yih Hwang 2 1 Centre for Advanced Information Systems, School of Computer Engineering Nanyang Technological University, Singapore

More information

SeaCloudDM: Massive Heterogeneous Sensor Data Management in the Internet of Things

SeaCloudDM: Massive Heterogeneous Sensor Data Management in the Internet of Things SeaCloudDM: Massive Heterogeneous Sensor Data Management in the Internet of Things Jiajie Xu Institute of Software, Chinese Academy of Sciences (ISCAS) 2012-05-15 Outline 1. Challenges in IoT Data Management

More information

1.5.3 Project 3: Traffic Monitoring

1.5.3 Project 3: Traffic Monitoring 1.5.3 Project 3: Traffic Monitoring This project aims to provide helpful information about traffic in a given geographic area based on the history of traffic patterns, current weather, and time of the

More information

Big Data Analytics in Mobile Environments

Big Data Analytics in Mobile Environments 1 Big Data Analytics in Mobile Environments 熊 辉 教 授 罗 格 斯 - 新 泽 西 州 立 大 学 2012-10-2 Rutgers, the State University of New Jersey Why big data: historical view? Productivity versus Complexity (interrelatedness,

More information

Measuring Performance in the Retail Industry

Measuring Performance in the Retail Industry Measuring Performance in the Retail Industry (Position Paper) Gerasimos Marketos and Yannis Theodoridis Department of Informatics, University of Piraeus, 80 Karaoli-Dimitriou St., GR-18534 Piraeus, Greece

More information

Intelligent Stock Market Assistant using Temporal Data Mining

Intelligent Stock Market Assistant using Temporal Data Mining Intelligent Stock Market Assistant using Temporal Data Mining Gerasimos Marketos 1, Konstantinos Pediaditakis 2, Yannis Theodoridis 1, and Babis Theodoulidis 2 1 Database Group, Information Systems Laboratory,

More information

Abstract. Key words: space-time cube; GPS; spatial-temporal data model; spatial-temporal query; trajectory; cube cell. 1.

Abstract. Key words: space-time cube; GPS; spatial-temporal data model; spatial-temporal query; trajectory; cube cell. 1. Design and Implementation of Spatial-temporal Data Model in Vehicle Monitor System Jingnong Weng, Weijing Wang, ke Fan, Jian Huang College of Software, Beihang University No. 37, Xueyuan Road, Haidian

More information

Dynamic Modeling of Trajectory Patterns using Data Mining and Reverse Engineering

Dynamic Modeling of Trajectory Patterns using Data Mining and Reverse Engineering Dynamic Modeling of Trajectory Patterns using Data Mining and Reverse Engineering Luis Otavio Alvares 1,2 Vania Bogorny 1 Jose Antonio Fernandes de Macedo 3 Bart Moelans 1 Stefano Spaccapietra 3 1 Theoretical

More information

International Journal of Scientific & Engineering Research, Volume 5, Issue 4, April-2014 442 ISSN 2229-5518

International Journal of Scientific & Engineering Research, Volume 5, Issue 4, April-2014 442 ISSN 2229-5518 International Journal of Scientific & Engineering Research, Volume 5, Issue 4, April-2014 442 Over viewing issues of data mining with highlights of data warehousing Rushabh H. Baldaniya, Prof H.J.Baldaniya,

More information

Business Intelligence: Using Data for More Than Analytics

Business Intelligence: Using Data for More Than Analytics Business Intelligence: Using Data for More Than Analytics Session 672 Session Overview Business Intelligence: Using Data for More Than Analytics What is Business Intelligence? Business Intelligence Solution

More information

MONIC and Followups on Modeling and Monitoring Cluster Transitions

MONIC and Followups on Modeling and Monitoring Cluster Transitions MONIC and Followups on Modeling and Monitoring Cluster Transitions Myra Spiliopoulou 1, Eirini Ntoutsi 2, Yannis Theodoridis 3, and Rene Schult 4 1 Otto-von-Guericke University of Magdeburg, Germany, myra@iti.cs.uni-magdeburg.de,

More information

Requirements engineering for a user centric spatial data warehouse

Requirements engineering for a user centric spatial data warehouse Int. J. Open Problems Compt. Math., Vol. 7, No. 3, September 2014 ISSN 1998-6262; Copyright ICSRS Publication, 2014 www.i-csrs.org Requirements engineering for a user centric spatial data warehouse Vinay

More information

Concept and Applications of Data Mining. Week 1

Concept and Applications of Data Mining. Week 1 Concept and Applications of Data Mining Week 1 Topics Introduction Syllabus Data Mining Concepts Team Organization Introduction Session Your name and major The dfiiti definition of dt data mining i Your

More information

Example application (1) Telecommunication. Lecture 1: Data Mining Overview and Process. Example application (2) Health

Example application (1) Telecommunication. Lecture 1: Data Mining Overview and Process. Example application (2) Health Lecture 1: Data Mining Overview and Process What is data mining? Example applications Definitions Multi disciplinary Techniques Major challenges The data mining process History of data mining Data mining

More information

SPATIAL DATA CLASSIFICATION AND DATA MINING

SPATIAL DATA CLASSIFICATION AND DATA MINING , pp.-40-44. Available online at http://www. bioinfo. in/contents. php?id=42 SPATIAL DATA CLASSIFICATION AND DATA MINING RATHI J.B. * AND PATIL A.D. Department of Computer Science & Engineering, Jawaharlal

More information

OLAP Online Privacy Control

OLAP Online Privacy Control OLAP Online Privacy Control M. Ragul Vignesh and C. Senthil Kumar Abstract--- The major issue related to the protection of private information in online analytical processing system (OLAP), is the privacy

More information

Introduction to Data Mining

Introduction to Data Mining Introduction to Data Mining Jay Urbain Credits: Nazli Goharian & David Grossman @ IIT Outline Introduction Data Pre-processing Data Mining Algorithms Naïve Bayes Decision Tree Neural Network Association

More information

SQL Server 2012 Business Intelligence Boot Camp

SQL Server 2012 Business Intelligence Boot Camp SQL Server 2012 Business Intelligence Boot Camp Length: 5 Days Technology: Microsoft SQL Server 2012 Delivery Method: Instructor-led (classroom) About this Course Data warehousing is a solution organizations

More information

International Journal of Advance Research in Computer Science and Management Studies

International Journal of Advance Research in Computer Science and Management Studies Volume 3, Issue 11, November 2015 ISSN: 2321 7782 (Online) International Journal of Advance Research in Computer Science and Management Studies Research Article / Survey Paper / Case Study Available online

More information

Data Structures for Moving Objects

Data Structures for Moving Objects Data Structures for Moving Objects Pankaj K. Agarwal Department of Computer Science Duke University Geometric Data Structures S: Set of geometric objects Points, segments, polygons Ask several queries

More information

MINING CLICKSTREAM-BASED DATA CUBES

MINING CLICKSTREAM-BASED DATA CUBES MINING CLICKSTREAM-BASED DATA CUBES Ronnie Alves and Orlando Belo Departament of Informatics,School of Engineering, University of Minho Campus de Gualtar, 4710-057 Braga, Portugal Email: {alvesrco,obelo}@di.uminho.pt

More information

Dimensional Modeling for Data Warehouse

Dimensional Modeling for Data Warehouse Modeling for Data Warehouse Umashanker Sharma, Anjana Gosain GGS, Indraprastha University, Delhi Abstract Many surveys indicate that a significant percentage of DWs fail to meet business objectives or

More information

Indexing the Trajectories of Moving Objects in Networks

Indexing the Trajectories of Moving Objects in Networks Indexing the Trajectories of Moving Objects in Networks Victor Teixeira de Almeida Ralf Hartmut Güting Praktische Informatik IV Fernuniversität Hagen, D-5884 Hagen, Germany {victor.almeida, rhg}@fernuni-hagen.de

More information

Lecture Data Warehouse Systems

Lecture Data Warehouse Systems Lecture Data Warehouse Systems Eva Zangerle SS 2013 PART A: Architecture Chapter 1: Motivation and Definitions Motivation Goal: to build an operational general view on a company to support decisions in

More information

Big Data Mining Services and Knowledge Discovery Applications on Clouds

Big Data Mining Services and Knowledge Discovery Applications on Clouds Big Data Mining Services and Knowledge Discovery Applications on Clouds Domenico Talia DIMES, Università della Calabria & DtoK Lab Italy talia@dimes.unical.it Data Availability or Data Deluge? Some decades

More information

Dynamic Data in terms of Data Mining Streams

Dynamic Data in terms of Data Mining Streams International Journal of Computer Science and Software Engineering Volume 2, Number 1 (2015), pp. 1-6 International Research Publication House http://www.irphouse.com Dynamic Data in terms of Data Mining

More information

Quality Assessment in Spatial Clustering of Data Mining

Quality Assessment in Spatial Clustering of Data Mining Quality Assessment in Spatial Clustering of Data Mining Azimi, A. and M.R. Delavar Centre of Excellence in Geomatics Engineering and Disaster Management, Dept. of Surveying and Geomatics Engineering, Engineering

More information

Alejandro Vaisman Esteban Zimanyi. Data. Warehouse. Systems. Design and Implementation. ^ Springer

Alejandro Vaisman Esteban Zimanyi. Data. Warehouse. Systems. Design and Implementation. ^ Springer Alejandro Vaisman Esteban Zimanyi Data Warehouse Systems Design and Implementation ^ Springer Contents Part I Fundamental Concepts 1 Introduction 3 1.1 A Historical Overview of Data Warehousing 4 1.2 Spatial

More information

Data Mining, Predictive Analytics with Microsoft Analysis Services and Excel PowerPivot

Data Mining, Predictive Analytics with Microsoft Analysis Services and Excel PowerPivot www.etidaho.com (208) 327-0768 Data Mining, Predictive Analytics with Microsoft Analysis Services and Excel PowerPivot 3 Days About this Course This course is designed for the end users and analysts that

More information

INTEROPERABILITY IN DATA WAREHOUSES

INTEROPERABILITY IN DATA WAREHOUSES INTEROPERABILITY IN DATA WAREHOUSES Riccardo Torlone Roma Tre University http://torlone.dia.uniroma3.it/ SYNONYMS Data warehouse integration DEFINITION The term refers to the ability of combining the content

More information

Efficient Integration of Data Mining Techniques in Database Management Systems

Efficient Integration of Data Mining Techniques in Database Management Systems Efficient Integration of Data Mining Techniques in Database Management Systems Fadila Bentayeb Jérôme Darmont Cédric Udréa ERIC, University of Lyon 2 5 avenue Pierre Mendès-France 69676 Bron Cedex France

More information

Spatial Hierarchy & OLAP-Favored Search in Spatial Data Warehouse

Spatial Hierarchy & OLAP-Favored Search in Spatial Data Warehouse Spatial Hierarchy & OLAP-Favored Search in Spatial Data Warehouse Fangyan Rao IBM China Research Lab Nov 7, 23 DOLAP 23 Outline Motivation Spatial hierarchy OLAP-favored search Heuristic OLAP-favored search

More information

A Knowledge Management Framework Using Business Intelligence Solutions

A Knowledge Management Framework Using Business Intelligence Solutions www.ijcsi.org 102 A Knowledge Management Framework Using Business Intelligence Solutions Marwa Gadu 1 and Prof. Dr. Nashaat El-Khameesy 2 1 Computer and Information Systems Department, Sadat Academy For

More information

Investigating the Effects of Spatial Data Redundancy in Query Performance over Geographical Data Warehouses

Investigating the Effects of Spatial Data Redundancy in Query Performance over Geographical Data Warehouses Investigating the Effects of Spatial Data Redundancy in Query Performance over Geographical Data Warehouses Thiago Luís Lopes Siqueira Ricardo Rodrigues Ciferri Valéria Cesário Times Cristina Dutra de

More information

DIMENSION HIERARCHIES UPDATES IN DATA WAREHOUSES A User-driven Approach

DIMENSION HIERARCHIES UPDATES IN DATA WAREHOUSES A User-driven Approach DIMENSION HIERARCHIES UPDATES IN DATA WAREHOUSES A User-driven Approach Cécile Favre, Fadila Bentayeb, Omar Boussaid ERIC Laboratory, University of Lyon, 5 av. Pierre Mendès-France, 69676 Bron Cedex, France

More information

Clustering Data Streams

Clustering Data Streams Clustering Data Streams Mohamed Elasmar Prashant Thiruvengadachari Javier Salinas Martin gtg091e@mail.gatech.edu tprashant@gmail.com javisal1@gatech.edu Introduction: Data mining is the science of extracting

More information

PartJoin: An Efficient Storage and Query Execution for Data Warehouses

PartJoin: An Efficient Storage and Query Execution for Data Warehouses PartJoin: An Efficient Storage and Query Execution for Data Warehouses Ladjel Bellatreche 1, Michel Schneider 2, Mukesh Mohania 3, and Bharat Bhargava 4 1 IMERIR, Perpignan, FRANCE ladjel@imerir.com 2

More information

Spatial Data Preparation for Knowledge Discovery

Spatial Data Preparation for Knowledge Discovery Spatial Data Preparation for Knowledge Discovery Vania Bogorny 1, Paulo Martins Engel 1, Luis Otavio Alvares 1 1 Instituto de Informática Universidade Federal do Rio Grande do Sul (UFRGS) Caixa Postal

More information

Advanced Data Management Technologies

Advanced Data Management Technologies ADMT 2015/16 Unit 2 J. Gamper 1/44 Advanced Data Management Technologies Unit 2 Basic Concepts of BI and Data Warehousing J. Gamper Free University of Bozen-Bolzano Faculty of Computer Science IDSE Acknowledgements:

More information

Business Intelligence: Effective Decision Making

Business Intelligence: Effective Decision Making Business Intelligence: Effective Decision Making Bellevue College Linda Rumans IT Instructor, Business Division Bellevue College lrumans@bellevuecollege.edu Current Status What do I do??? How do I increase

More information

Chapter 3 - Data Replication and Materialized Integration

Chapter 3 - Data Replication and Materialized Integration Prof. Dr.-Ing. Stefan Deßloch AG Heterogene Informationssysteme Geb. 36, Raum 329 Tel. 0631/205 3275 dessloch@informatik.uni-kl.de Chapter 3 - Data Replication and Materialized Integration Motivation Replication:

More information

Course Outline: Course: Implementing a Data Warehouse with Microsoft SQL Server 2012 Learning Method: Instructor-led Classroom Learning

Course Outline: Course: Implementing a Data Warehouse with Microsoft SQL Server 2012 Learning Method: Instructor-led Classroom Learning Course Outline: Course: Implementing a Data with Microsoft SQL Server 2012 Learning Method: Instructor-led Classroom Learning Duration: 5.00 Day(s)/ 40 hrs Overview: This 5-day instructor-led course describes

More information

Scalable Cluster Analysis of Spatial Events

Scalable Cluster Analysis of Spatial Events International Workshop on Visual Analytics (2012) K. Matkovic and G. Santucci (Editors) Scalable Cluster Analysis of Spatial Events I. Peca 1, G. Fuchs 1, K. Vrotsou 1,2, N. Andrienko 1 & G. Andrienko

More information

Data Warehousing and Data Mining

Data Warehousing and Data Mining Data Warehousing and Data Mining Part I: Data Warehousing Gao Cong gaocong@cs.aau.dk Slides adapted from Man Lung Yiu and Torben Bach Pedersen Course Structure Business intelligence: Extract knowledge

More information

How To Understand The History Of Navigation In French Marine Science

How To Understand The History Of Navigation In French Marine Science E-navigation, from sensors to ship behaviour analysis Laurent ETIENNE, Loïc SALMON French Naval Academy Research Institute Geographic Information Systems Group laurent.etienne@ecole-navale.fr loic.salmon@ecole-navale.fr

More information

Mario Guarracino. Data warehousing

Mario Guarracino. Data warehousing Data warehousing Introduction Since the mid-nineties, it became clear that the databases for analysis and business intelligence need to be separate from operational. In this lecture we will review the

More information

Fuzzy Spatial Data Warehouse: A Multidimensional Model

Fuzzy Spatial Data Warehouse: A Multidimensional Model 4 Fuzzy Spatial Data Warehouse: A Multidimensional Model Pérez David, Somodevilla María J. and Pineda Ivo H. Facultad de Ciencias de la Computación, BUAP, Mexico 1. Introduction A data warehouse is defined

More information

Information Management course

Information Management course Università degli Studi di Milano Master Degree in Computer Science Information Management course Teacher: Alberto Ceselli Lecture 01 : 06/10/2015 Practical informations: Teacher: Alberto Ceselli (alberto.ceselli@unimi.it)

More information

Oracle9i Data Warehouse Review. Robert F. Edwards Dulcian, Inc.

Oracle9i Data Warehouse Review. Robert F. Edwards Dulcian, Inc. Oracle9i Data Warehouse Review Robert F. Edwards Dulcian, Inc. Agenda Oracle9i Server OLAP Server Analytical SQL Data Mining ETL Warehouse Builder 3i Oracle 9i Server Overview 9i Server = Data Warehouse

More information

Mapping Linear Networks Based on Cellular Phone Tracking

Mapping Linear Networks Based on Cellular Phone Tracking Ronen RYBOWSKI, Aaron BELLER and Yerach DOYTSHER, Israel Key words: Cellular Phones, Cellular Network, Linear Networks, Mapping. ABSTRACT The paper investigates the ability of accurately mapping linear

More information

Distance Learning and Examining Systems

Distance Learning and Examining Systems Lodz University of Technology Distance Learning and Examining Systems - Theory and Applications edited by Sławomir Wiak Konrad Szumigaj HUMAN CAPITAL - THE BEST INVESTMENT The project is part-financed

More information

Chapter ML:XI. XI. Cluster Analysis

Chapter ML:XI. XI. Cluster Analysis Chapter ML:XI XI. Cluster Analysis Data Mining Overview Cluster Analysis Basics Hierarchical Cluster Analysis Iterative Cluster Analysis Density-Based Cluster Analysis Cluster Evaluation Constrained Cluster

More information

Introduction to Data Mining

Introduction to Data Mining Bioinformatics Ying Liu, Ph.D. Laboratory for Bioinformatics University of Texas at Dallas Spring 2008 Introduction to Data Mining 1 Motivation: Why data mining? What is data mining? Data Mining: On what

More information

Mining various patterns in sequential data in an SQL-like manner *

Mining various patterns in sequential data in an SQL-like manner * Mining various patterns in sequential data in an SQL-like manner * Marek Wojciechowski Poznan University of Technology, Institute of Computing Science, ul. Piotrowo 3a, 60-965 Poznan, Poland Marek.Wojciechowski@cs.put.poznan.pl

More information

Data Warehousing. Jens Teubner, TU Dortmund jens.teubner@cs.tu-dortmund.de. Winter 2014/15. Jens Teubner Data Warehousing Winter 2014/15 1

Data Warehousing. Jens Teubner, TU Dortmund jens.teubner@cs.tu-dortmund.de. Winter 2014/15. Jens Teubner Data Warehousing Winter 2014/15 1 Jens Teubner Data Warehousing Winter 2014/15 1 Data Warehousing Jens Teubner, TU Dortmund jens.teubner@cs.tu-dortmund.de Winter 2014/15 Jens Teubner Data Warehousing Winter 2014/15 152 Part VI ETL Process

More information

Horizontal Aggregations in SQL to Prepare Data Sets for Data Mining Analysis

Horizontal Aggregations in SQL to Prepare Data Sets for Data Mining Analysis IOSR Journal of Computer Engineering (IOSRJCE) ISSN: 2278-0661, ISBN: 2278-8727 Volume 6, Issue 5 (Nov. - Dec. 2012), PP 36-41 Horizontal Aggregations in SQL to Prepare Data Sets for Data Mining Analysis

More information

Introduction. Introduction. Spatial Data Mining: Definition WHAT S THE DIFFERENCE?

Introduction. Introduction. Spatial Data Mining: Definition WHAT S THE DIFFERENCE? Introduction Spatial Data Mining: Progress and Challenges Survey Paper Krzysztof Koperski, Junas Adhikary, and Jiawei Han (1996) Review by Brad Danielson CMPUT 695 01/11/2007 Authors objectives: Describe

More information

Data Warehouse: Introduction

Data Warehouse: Introduction Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of base and data mining group,

More information

East Asia Network Sdn Bhd

East Asia Network Sdn Bhd Course: Analyzing, Designing, and Implementing a Data Warehouse with Microsoft SQL Server 2014 Elements of this syllabus may be change to cater to the participants background & knowledge. This course describes

More information

Gain insight, agility and advantage by analyzing change across time and space.

Gain insight, agility and advantage by analyzing change across time and space. White paper Location Intelligence Gain insight, agility and advantage by analyzing change across time and space. Spatio-temporal information analysis is a Big Data challenge. The visualization and decision

More information

CHAPTER 5: BUSINESS ANALYTICS

CHAPTER 5: BUSINESS ANALYTICS Chapter 5: Business Analytics CHAPTER 5: BUSINESS ANALYTICS Objectives The objectives are: Describe Business Analytics. Explain the terminology associated with Business Analytics. Describe the data warehouse

More information

Turkish Journal of Engineering, Science and Technology

Turkish Journal of Engineering, Science and Technology Turkish Journal of Engineering, Science and Technology 03 (2014) 106-110 Turkish Journal of Engineering, Science and Technology journal homepage: www.tujest.com Integrating Data Warehouse with OLAP Server

More information

Load Distribution in Large Scale Network Monitoring Infrastructures

Load Distribution in Large Scale Network Monitoring Infrastructures Load Distribution in Large Scale Network Monitoring Infrastructures Josep Sanjuàs-Cuxart, Pere Barlet-Ros, Gianluca Iannaccone, and Josep Solé-Pareta Universitat Politècnica de Catalunya (UPC) {jsanjuas,pbarlet,pareta}@ac.upc.edu

More information

Data Mining and Database Systems: Where is the Intersection?

Data Mining and Database Systems: Where is the Intersection? Data Mining and Database Systems: Where is the Intersection? Surajit Chaudhuri Microsoft Research Email: surajitc@microsoft.com 1 Introduction The promise of decision support systems is to exploit enterprise

More information

Constructing Semantic Interpretation of Routine and Anomalous Mobility Behaviors from Big Data

Constructing Semantic Interpretation of Routine and Anomalous Mobility Behaviors from Big Data Constructing Semantic Interpretation of Routine and Anomalous Mobility Behaviors from Big Data Georg Fuchs 1,3, Hendrik Stange 1, Dirk Hecker 1, Natalia Andrienko 1,2,3, Gennady Andrienko 1,2,3 1 Fraunhofer

More information

Why Business Intelligence

Why Business Intelligence Why Business Intelligence Ferruccio Ferrando z IT Specialist Techline Italy March 2011 page 1 di 11 1.1 The origins In the '50s economic boom, when demand and production were very high, the only concern

More information

SQL Server 2012 End-to-End Business Intelligence Workshop

SQL Server 2012 End-to-End Business Intelligence Workshop USA Operations 11921 Freedom Drive Two Fountain Square Suite 550 Reston, VA 20190 solidq.com 800.757.6543 Office 206.203.6112 Fax info@solidq.com SQL Server 2012 End-to-End Business Intelligence Workshop

More information