How To Mine Location Based Social Network Data From Social Networks
|
|
|
- Frederick Bond
- 5 years ago
- Views:
Transcription
1 Smart Cities, Urban Sensing and Big Data: Mining Geo-location in Social Networks Daniele Sacco 1, Gianmario Motta 2, Linlin You 3, Nicola Bertolazzo 4, Chen Chen 5 Università di Pavia Via Ferrata 1, 27100, Pavia (PV) 1 [email protected] 2 [email protected] 3 [email protected] 4 [email protected] 5 [email protected] Location based social networks offer spatio-temporal information which can be accessed through public Application Programming Interfaces (APIs) and drew the interest of researchers with diverse scientific backgrounds. This availability of data enables a potential use of geolocated content as an additional, low cost and infrastructureless source of information for urban sensing in Smart Cities. All these aspects bounded with the need of real-time analytics for urban sensing takes to Big Data management and its related issues. A systematic literature review outlines related works and gaps in current research. We propose a reference model to exploit Big Data and Open Data for urban sensing and we validate it by a case study. Finally, we give recommendations for future research about location and mobility mining of social network data. Keywords: big data, urban sensing, smart city, locationbased social network, twitter, data mining, open data 1. Introduction Last years have seen a paradigm shift of the role of the user on the web, from consumer to producer of information. Due to last technological developments and their world-wide dissemination, Web 2.0 changed users approach to information creation and exploitation. The paradigm shift has been also a social shift. Web has become an essential need for people, a sort of commodity, that is used to communicate, interact, share information and even maintain relationships thanks to social networks. The last advent of smartphones, equipped with GPS sensors allowing users to geo-locate themselves, can take to the next shift, from a social and collaborative Web 2.0 to a local and mobile Web 3.0. One among the first achievements has been the integration of Geographic Information Systems Congresso Nazionale AICA 2013
2 Congresso Nazionale AICA 2013 (GISs) and social networks resulting in new location-based capabilities. Social networks that include location information into their contents are called Location Based Social Networks (LBSNs). So, LBSNs offer spatio-temporal information which can be accessed through public Application Programming Interfaces (APIs) and draw the interest of researchers with diverse scientific backgrounds. This availability of data enables a potential use of geo-located content as an additional, low cost and infrastructure-less source of information for urban sensing in Smart Cities. All these aspects bounded with the need of real-time analytics for urban sensing takes to Big Data management and its related issues. Real-time urban sensing use citizens as active and passive sensors and can reveal important insights of human behavior in the city. Diverse use scenarios can enable new perspectives in society level, e.g. community healthcare, public safety, city resource management and transportation management. 2. Systematic Literature Review Systematic Literature Review (SLR) can be regarded as explicitly formulated, reproducible and up-to-date summary [Egger et al, 2008] that includes and extends the statistical results of a meta-analysis methodology. As opposed to narrative reviews, it is based on a structured method that is always explicitly specified at the beginning of the review. Our objective is to identify initiatives, experiences and viewpoints on location and mobility mining of social network data. So, our research question is How to exploit geo-located data from social networks and what level of maturity has reached its application?. So, the expected outcomes of our SLR are: (a) a complete overview of the state of the art, (b) the identification of gaps in current research, solutions, trends and future research and suggestions to the community of researchers and practitioners, and (c) recommendations about best practices for location and mobility mining of social network data. Extracted information contains techniques, issues, models and any other kind of topic useful to provide an accurate snapshot of the current state of location and mobility mining of social network data. 87 out of 109 articles have been selected and classified by year of publication, geographical area, research method and publication channel. There are no significant articles before 2009 because Location Based Social Networks (LBSNs) raised in the same year. Afterwards researchers could start to consider data provided by LBSNs. The articles considered in our research range from 2009 to second quarter of Considering that publications in 2013 refer only to the first months of the year, the number of publications tends to have an exponential growth (2 x ): 2 publications in 2009, 4 in 2010, 14 in 2011, 38 in Case studies represent the majority of the study types. The number of case studies and instrument development publications (87.3% in total) reflects the experimental approach to the issue. It is also motivated by the low number of theoretical papers (5.7%) and by the lack of position papers.
3 3. Discussion Smart Cities, Urban Sensing and Big Data: Mining Geo-location in Social Networks We have identified 3 main domains: (a) data sources, (b) technologies, (c) use scenarios. Let us consider the main contributions to each domain. 4.1 Data Sources The data sources that can be inferred for urban sensing are heterogeneous. Three innovative data sources exist: (a) mobile sensor data about the individual devices, (b) infrastructure sensor data about the context, and (c) social data from social network and other internet services [Zhang et al, 2011]. Data sources can be used independently but the combination of the three kinds can provide a comprehensive understanding of human behavior and its context. Here we focus only on data sources for geo-location from the social network tier and how they have been used in literature. Twitter is a widely-used platform for the real-time social sharing of short textbased messages called tweets. Twitter and smartphone usage reflected same growth, indeed Twitter users interact frequently on mobile devices. As Twitter is easy to use and interactions are short, many users post tweets despite they are engaged in other activities. This gives Twitter data good spatial and temporal coverage because tweets can be automatically geo-located [Mai et Hranac, 2013]. Twitter provides a free real-time streaming API through which a sample of all tweets can be retrieved. The API provides filters that can be set on these data streams to capture tweets within a geographic area or only those containing certain terms. However, data stream is limited to 1% of total tweet volume. So, only a subset of the total tweets can be used. Foursquare is a location-based social network where users can check in to different locations and share them with friends both on Foursquare itself and other social networks. Users can upload pictures at a venue or leave tips on the venue page (e.g. a user may check-in to a hotel and leave a tip about how bad the service is) [Cheng et al, 2011]. Foursquare check-in data is not directly accessible: however, users typically decide to share their check-ins publicly on Twitter, so they can be retrieved via Twitter streaming API. Several other papers use data sets published for research purposes, because no API is available or the social network stopped its service. For example, Gowalla was a location-based social network created in The concept behind the service was to advertise your exact location to all your friends in real-time [Scellato et al, 2010]. Also BrightKite was a social networking website where users could share their location, to post notes and to upload photos. By making check-ins at places, users could see people who is nearby. Now, only already collected data sets are available [Li et Chen, 2009]. Other available services are Momo and Flickr. However, a question could rise: why not to use Facebook, the most popular service? The main reason is that Facebook API can be used to retrieve data from those users who accepted to publish their posts to your application or system, so it is not publicly available.
4 Congresso Nazionale AICA Technologies We discuss here the main technologies used for urban sensing. The SLR shows that related works focus on machine learning techniques. K-means can be used to reveal clusters of common behaviors across land segments. The land use of each cluster can be derived by analyzing the activity vectors of the regions within the cluster. K-means depends on the initial random selected seeds and it needs to specify the number of clusters k (land uses) to identify [Frias-Martinex et al, 2012]. [Lee et al, 2011] used a similar approach, but they also formed a Voronoi diagram using the center points (latitude, longitude) of the K-means results and regarded the formed regions as a set of region of interests, to identify the occurrence of local events. A Self-Organizing Map (SOM) is an unsupervised neural network that reduces the input data dimensionality to be able to represent its distribution as a map. So, SOM forms a map where similar samples are mapped close together. [Frias-Martinex et al, 2012] used SOM to build a map that segments the urban land into geographical areas with different concentrations of tweets in the time period under study. Density-Based Spatial Clustering of Applications with Noise (DBSCAN) has specific characteristics: (a) it is based on the concept of density reachability, producing satisfying results identifying arbitrarily shaped clusters, (b) the number of clusters is not given a priori, and (c) the algorithm tolerates noise, allowing for some data points not to be assigned to any cluster [Villatoro et al, 2013]. The advantage of density based clustering algorithms is that clusters are defined by the density of data points and not by spatial size and form of cluster. Spectral methods for data clustering are popular because of the quality of the clusters that are produced and the simplicity of implementation. Spectral clustering is able to find arbitrarily shaped clusters and does not pose any constraints on them (in contrast to the k-means, for example, which assumes cluster to be convex). It requires anyway the parameter k to define the number of desired clusters [Roslrer et Liebig, 2013]. Mean-shift is a non-parametric clustering technique that detects the modes of an underlying probability distribution from a set of discrete samples. So, mean-shift can be used both as an algorithm to detect local maxima (modes) as well as a clustering technique (areas associated to the modes). [Frias-Martinex et al, 2012] assume that there exists an unobservable underlying probability distribution of where people tweet from. The modes of that distribution are determined to represent urban landmarks or points of interest in the city. 4.3 Use scenarios The identification of use scenarios aims to enable new perspectives in society level, e.g. community healthcare, public safety, city resource management and transportation management.. We here propose a classification for visualization objectives in urban sensing, representing an evolution of existing Business Intelligence (BI) solutions towards Geographic Information Systems (GISs) and Big Data visualization [Stodder, 2013], as shown in Tab. 1.
5 Smart Cities, Urban Sensing and Big Data: Mining Geo-location in Social Networks Class Description BI similarities Objectives Urban characterization The results can be stored for users as a snapshot of a certain point in time. Users examine snapshots to identify changes in data over time, so they must be provisioned and presented consistently so that trends and comparisons are valid It recalls dashboards. The view is static and it is previously defined by data analysts. To visualize and predict social ties and urban structure Spatial discovery Exception alerting It enables users to interact with data through analytical processes. Visual functionality for filtering, comparing, slicing and dicing, drilling down, and correlating data can then be integrated with the users analytical application functions for forecasting, modeling, and statistical, what-if, and predictive analytics. It notifies users of particularly important changes in the data or when situations arise that demand immediate attention. Alerts mean that something important in real-time data or event streams is happening. It recalls On-Line Analytical Processing (OLAP). The view is dynamic and it allows users to navigate data. It recalls event processing in modern Business Activity Monitoring (BAM) solutions, that detect and warn of problems or exceptions in realtime. Tab. 1 Visualization objectives classification To analyze behaviors online, in space and time To detect events or exceptions to standard behaviors, e.g. disasters, diseases, unexpected crowds The three visualization classes reflect papers analyzed by our SLR. Respectively, 30 papers deal with urban characterization, 18 with spatial discovery, 12 with exception alerting. The rest of papers use social networks geo-located data mainly to build recommender systems or user profiling. These papers are single user oriented, so we do not take them into account in following paragraphs because they do not give a broad view of the city Urban characterization [Rosler et Liebig, 2013] provide insights on the activity profiles in urban environments. Clusters identified by Foursquare check-ins help to describe the socio-dynamics of urban districts in different times of the day. [Ferrari et al, 2011] extend the work on activity profiles by providing also mobility patterns that occur in an urban environment and understanding of social commonalities between people. Traditional municipal organizational units such as neighborhoods are studied by Livehoods project from [Cranshaw et al, 2012], who shows that their boundaries do not always reflect the character of life in these areas. [Joseph et al, 2012] mine check-ins to identify groups of people which are of different types (e.g. tourists), communities (e.g. users tightly clustered in space) and interests, and how they use urban space.
6 Congresso Nazionale AICA 2013 By processing Twitter data, [Wakamiya et al, 2011] are able to examine the relation between regions of common crowd activity patterns and major categories of local facilities. [Wakamiya et al, 2012] extend their work and correlate psychological and geospatial proximity of urban areas by borrowing crowd s experiences from geo-tagged tweets, in order to demonstrate that people often rely on geospatial cognition to the real space than the exact physical distance in the real world Spatial discovery [Silva et al, 2012] study social behaviors by monitoring check-ins in Foursquare, [Sagl et al, 2012] by Twitter and Flickr data. They are able to analyze city dynamics spatially and temporally and to identify seasonality in human behaviors. [Cheng et al, 2011] investigate 22 million check-ins across 220,000 users and report a quantitative assessment of human mobility patterns by analyzing the spatial, temporal, social, and textual aspects associated with these footprints Exception alerting [Boettcher et Lee, 2012] introduce EventRadar, a detection system that finds local events such as release parties, musicians in a park, or art exhibitions. [Watanabe et al, 2011] provide a similar system to detect events. [Baldwin et al, 2012] also try to predict events impact. [Mai et Hranac, 2013] assert that tweets can be matched to traffic incidents by examining the content of the tweets for key words and comparing locations of the tweets and incidents. Results are confirmed for areas with sufficient density of Twitter usage. 4. A Big Data-based approach Big data is a popular term used to refer to the exponential growth, availability and use of information. However, as Gartner states ( Big Data doesn t focus only on the high volumes of information, but also on data velocity, that involves streams of data that are produced and processed in real-time, and on data variety, that refers to more types of information to analyze, e.g. social media, context aware data, documents, images, videos, audio, etc. All papers in our review obviously consider data variety because locationbased social networks data is geo-located and unstructured (typically JSON files), however the challenge results from the expansion of all three properties given by Gartner, rather than just data volume or data velocity alone. We here propose a big data-based approach to mining of social network geo-located data. Our approach intends to consider the 3 properties by applying technologies that fit Big Data management. Our literature review found out no papers take into account big data-based system architectures to process variety, velocity and volume of social network data. Here we propose a simple architecture to process this kind of data and possible extensions to support more analytics.
7 Smart Cities, Urban Sensing and Big Data: Mining Geo-location in Social Networks In order to validate our approach, we report a case study about public transport monitoring. First we developed a simulator that transforms timetables from open transport data in Torino (Italy) to a visualization tool that shows planned position of each bus in a specific time (each moving point in Fig. 1). We decided to enrich this visualization and leverage urban sensing. Our specific question was: how to correlate transport planning and urban activity areas?. Exploitation of location-based data from social network is a viable way. To define activity areas in a city we built a tool for spatial discovery that could allow us to model crowds by density clusters within the city and drill down them according to different scales. In order to keep the case study simple, our approach has been to compute density on a fixed grid applied over the map (rectangles with same dimension). However, what kind of data could be used? We exploited Twitter streaming API, collected geo-located data in Torino area in real-time and clustered them on the map for each specific time range (1 hour). Fig. 1 Public transport simulator To retrieve Twitter data we implemented FluenTD agents by Node.js on a virtual machine in Amazon Elastic BeanStalk. FluenTD ( is an open-source log collector that enables to have a logging architecture with more than an hundred types of systems, by treating logs as JSON. Node.js ( is a platform built on JavaScript for easily building fast, scalable network applications. It uses an event-driven, non-blocking I/O model that makes it lightweight, efficient, and oriented to data-intensive real-time applications that run across distributed devices. FluenTD and Node.js integration allows to build real-time collection and filtering of geo-located tweets, and their storage in TreasureData ( TreasureData is a Big Data as a Service cloud solution that offers a time series, columnar, Hadoop-based, schema-free data warehouse stored on Amazon S3. It allows you to access data using Hive query language ( by JDBC. An Extraction-Transformation-Loading (ETL) process access data in TreasureData, processes it and uploads it in CartoDB every 5 minutes. CartoDB is a database as a service cloud solution, based on a PostgreSQL database with GIS extension. As PostgreSQL databases can use
8 Congresso Nazionale AICA 2013 extensions to run data mining algorithms, we decided to apply density-based clustering online, whenever the user changes the zoom scale in the map. Fig. 2 shows the final architecture for our prototype. As this solution intends to implement Service Oriented Architecture (SOA), it has two main advantages: (a) it can be easily extended, and (b) a component can be replaced by others because of decoupling. For example, the data stored in the data mart can be used to perform different analytics or to build an alerting system. As SAP Hana has a Predictive Analysis Library that offers native support to DBSCAN, we could use SAP Hana to implement it as a supplementary layer between TreasureData and CartoDB. If we want to easily implement Latent Dirichelet Allocation we could replace TreasureData with Mahout on a Hadoop cluster. The ETL process, that may represent a bottleneck in terms of performances, can be replaced by ad hoc solutions, e.g. interfaces developed in Node.js to provide data filtering and storage in the data mart. Fig. 2 A Service Oriented Architecture for Big Data management Fig. 4 shows early results of our prototype. It considers tweets collected from 8 AM to 9 AM on July 10 th, Color density of rectangles change according to tweet density in the same area. Fig. 4 - Tweet density in Torino Final integration between the crowd modeling tool and the public transport simulator allows spatial discovery of urban areas not covered by transport service during peaks of presence of people who may need to move from one place to another within the city.
9 5.Conclusion Smart Cities, Urban Sensing and Big Data: Mining Geo-location in Social Networks Our SLR demonstrated the exponential growth in number of papers about this trending topic. The high number of case studies and instrument development publications reflects the experimental approach to the issue and it may also lead developers and freelancers to provide smart solutions to contextual problems in the city. Most of related works focus on validation of data mining techniques, rather than their application in real use scenarios. Our prototype demonstrated that the integration of urban sensing and open data can help municipalities to reveal enhanced insights about their services to citizens. So, future research should move from validation of techniques to their real application in Smart Cities by exploiting Big Data technologies, thus providing real time analytics to municipalities and end users. Our future works intend to integrate different data sources, such as sensors, and more social media, in order to provide deeper insights for urban sensing. Furthermore, our system needs to scale up and validate performances against user needs. Next step will be the development of a service orchestration layer to provide complete validation of our Service Oriented Architecture. References [Baldwin et al, 2012] Baldwin, T., Cook, P., Han, B., Harwood, A., Karunasekera, S., & Moshtaghi, M. (2012, April). A support platform for event detection using social intelligence. 13th Conference of the European Chapter of the Association for Computational Linguistics (pp ). [Boettcher et Lee, 2012] Boettcher, A., & Lee, D. (2012, November). EventRadar: A Real-Time Local Event Detection Scheme Using Twitter Stream. In Green Computing and Communications (GreenCom), IEEE International Conference on (pp ). [Chen et al, 2013] Chen, T., Kaafar, M. A., & Boreli, R. (2013). The Where and When of Finding New Friends: Analysis of a Location-based Social Discovery Network. International Conference On Weblogs And Social Media [Cheng et al, 2011] Cheng, Z., Caverlee, J., Lee, K., & Sui, D. Z. (2011). Exploring Millions of Footprints in Location Sharing Services. ICWSM, 2011, [Cranshaw et al, 2012] Cranshaw, J., Schwartz, R., Hong, J. I., & Sadeh, N. M. (2012, June). The Livehoods Project: Utilizing Social Media to Understand the Dynamics of a City. In ICWSM. [Egger et al, 2008] Egger, M., Smith, G. D., & Altman, D. (Eds.). (2008). Systematic reviews in health care: meta-analysis in context. Wiley. [Ferrari et al, 2011] Ferrari, L., Rosi, A., Mamei, M., & Zambonelli, F. (2011, November). Extracting urban patterns from location-based social networks. 3rd ACM SIGSPATIAL International Workshop on Location-Based Social Networks (pp. 9-16) [Frias-Martinez et al, 2012] Frias-Martinez, V., Soto, V., Hohwald, H., & Frias- Martinez, E. (2012, September). Characterizing Urban Landscapes using Geolocated Tweets IEEE International Conference on Social Computing (pp ).
10 Congresso Nazionale AICA 2013 [Joseph et al, 2012] Joseph, K., Tan, C. H., & Carley, K. M. (2012, September). Beyond local, categories and friends: clustering foursquare users with latent topics. In Proceedings of the 2012 ACM Conference on Ubiquitous Computing (pp ) [Lee, 2012] Lee, C. H. (2012). Mining spatio-temporal information on microblogging streams using a density-based online clustering method. Expert Systems with Applications, 39(10), [Lee et al, 2011] Lee, R., Wakamiya, S., & Sumiya, K. (2011). Discovery of unusual regional social activities using geo-tagged microblogs. World Wide Web, 14(4), [Li et Chen, 2009] Li, N., & Chen, G. (2009, August). Analysis of a location-based social network. IEEE International Conference on CSE'09 (Vol. 4, pp ).. [Mai et Hranac, 2013] Mai, E., & Hranac, R. (2013, January). Twitter Interactions as a Data Source for Transportation Incidents. In Transportation Research Board 92nd Annual Meeting (No ). [Rosler et Liebig, 2013] Rösler, R., & Liebig, T. (2013). Using Data from Location Based Social Networks for Urban Activity Clustering. In Geographic Information Science at the Heart of Europe (pp ). Springer International Publishing. [Sagl et al, 2012] Sagl, G., Resch, B., Hawelka, B., & Beinat, E. (2012). From Social Sensor Data to Collective Human Behaviour Patterns: Analysing Spatio-Temporal Dynamics in Urban Environments. GI-Forum: Geovisualization, Society and Learning. [Scellato et al, 2010] Scellato, S., Mascolo, C., Musolesi, M., & Latora, V. (2010, June). Distance matters: geo-social metrics for online social networks. In Proceedings of the 3rd conference on Online social networks (pp. 8-8). USENIX Association. [Silva et al, 2012] Silva, T. H., Melo, P. O., Almeida, J. M., Salles, J., & Loureiro, A. A. (2012, November). Visualizing the invisible image of cities. In Green Computing and Communications (GreenCom), 2012 IEEE International Conference on (pp ) [Stodder, 2013] Stodder, D. (2013). Data Visualization and Discovery for Better Business Decisions. TDWI Best Practices Report. TDWI Research [Villatoro et al, 2013] Villatoro, D., Serna, J., Rodríguez, V., & Torrent-Moreno, M. (2013). The TweetBeat of the City: Microblogging for Discovering Behavioural Patterns during the MWC2012. Citizen in Sensor Networks (pp ). Springer Berlin [Wakamiya et al, 2011] Wakamiya, S., Lee, R., & Sumiya, K. (2011, November). Crowd-based urban characterization: extracting crowd behavioral patterns in urban areas from twitter. ACM SIGSPATIAL Workshop on LBSNs (pp ) [Wakamiya et al, 2012] Wakamiya, S., Lee, R., & Sumiya, K. Measuring Crowdsourced Cognitive Distance between Urban Clusters with Twitter for Socio-cognitive Map Generation. International Conference On Emerging Databases [Watanabe et al, 2011] Watanabe, K., Ochi, M., Okabe, M., & Onai, R. (2011, October). Jasmine: a real-time local-event detection system based on geolocation information of microblogs. ACM conference on Information and knowledge management [Zhang et al, 2011] Zhang, D., Guo, B., & Yu, Z. (2011). The emergence of social and community intelligence. Computer, 44(7),
PhoCA: An extensible service-oriented tool for Photo Clustering Analysis
paper:5 PhoCA: An extensible service-oriented tool for Photo Clustering Analysis Yuri A. Lacerda 1,2, Johny M. da Silva 2, Leandro B. Marinho 1, Cláudio de S. Baptista 1 1 Laboratório de Sistemas de Informação
Tracking System for GPS Devices and Mining of Spatial Data
Tracking System for GPS Devices and Mining of Spatial Data AIDA ALISPAHIC, DZENANA DONKO Department for Computer Science and Informatics Faculty of Electrical Engineering, University of Sarajevo Zmaja
5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014
5 Keys to Unlocking the Big Data Analytics Puzzle Anurag Tandon Director, Product Marketing March 26, 2014 1 A Little About Us A global footprint. A proven innovator. A leader in enterprise analytics for
SAP Solution Brief SAP HANA. Transform Your Future with Better Business Insight Using Predictive Analytics
SAP Brief SAP HANA Objectives Transform Your Future with Better Business Insight Using Predictive Analytics Dealing with the new reality Dealing with the new reality Organizations like yours can identify
Trends and Research Opportunities in Spatial Big Data Analytics and Cloud Computing NCSU GeoSpatial Forum
Trends and Research Opportunities in Spatial Big Data Analytics and Cloud Computing NCSU GeoSpatial Forum Siva Ravada Senior Director of Development Oracle Spatial and MapViewer 2 Evolving Technology Platforms
Hadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, 2012. Viswa Sharma Solutions Architect Tata Consultancy Services
Hadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, 2012 Viswa Sharma Solutions Architect Tata Consultancy Services 1 Agenda What is Hadoop Why Hadoop? The Net Generation is here Sizing the
Demonstration of SAP Predictive Analysis 1.0, consumption from SAP BI clients and best practices
September 10-13, 2012 Orlando, Florida Demonstration of SAP Predictive Analysis 1.0, consumption from SAP BI clients and best practices Vishwanath Belur, Product Manager, SAP Predictive Analysis Learning
BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES
BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES Relational vs. Non-Relational Architecture Relational Non-Relational Rational Predictable Traditional Agile Flexible Modern 2 Agenda Big Data
Transforming the Telecoms Business using Big Data and Analytics
Transforming the Telecoms Business using Big Data and Analytics Event: ICT Forum for HR Professionals Venue: Meikles Hotel, Harare, Zimbabwe Date: 19 th 21 st August 2015 AFRALTI 1 Objectives Describe
Recommendations in Mobile Environments. Professor Hui Xiong Rutgers Business School Rutgers University. Rutgers, the State University of New Jersey
1 Recommendations in Mobile Environments Professor Hui Xiong Rutgers Business School Rutgers University ADMA-2014 Rutgers, the State University of New Jersey Big Data 3 Big Data Application Requirements
www.pwc.com/oracle Next presentation starting soon Business Analytics using Big Data to gain competitive advantage
www.pwc.com/oracle Next presentation starting soon Business Analytics using Big Data to gain competitive advantage If every image made and every word written from the earliest stirring of civilization
Big Data Analytics in Mobile Environments
1 Big Data Analytics in Mobile Environments 熊 辉 教 授 罗 格 斯 - 新 泽 西 州 立 大 学 2012-10-2 Rutgers, the State University of New Jersey Why big data: historical view? Productivity versus Complexity (interrelatedness,
Where is... How do I get to...
Big Data, Fast Data, Spatial Data Making Sense of Location Data in a Smart City Hans Viehmann Product Manager EMEA ORACLE Corporation August 19, 2015 Copyright 2014, Oracle and/or its affiliates. All rights
The 4 Pillars of Technosoft s Big Data Practice
beyond possible Big Use End-user applications Big Analytics Visualisation tools Big Analytical tools Big management systems The 4 Pillars of Technosoft s Big Practice Overview Businesses have long managed
Spatio-Temporal Patterns of Passengers Interests at London Tube Stations
Spatio-Temporal Patterns of Passengers Interests at London Tube Stations Juntao Lai *1, Tao Cheng 1, Guy Lansley 2 1 SpaceTimeLab for Big Data Analytics, Department of Civil, Environmental &Geomatic Engineering,
QLIKVIEW DEPLOYMENT FOR BIG DATA ANALYTICS AT KING.COM
QLIKVIEW DEPLOYMENT FOR BIG DATA ANALYTICS AT KING.COM QlikView Technical Case Study Series Big Data June 2012 qlikview.com Introduction This QlikView technical case study focuses on the QlikView deployment
Big Data Analytics. Copyright 2011 EMC Corporation. All rights reserved.
Big Data Analytics 1 Priority Discussion Topics What are the most compelling business drivers behind big data analytics? Do you have or expect to have data scientists on your staff, and what will be their
Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database
Managing Big Data with Hadoop & Vertica A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database Copyright Vertica Systems, Inc. October 2009 Cloudera and Vertica
PROGRAM DIRECTOR: Arthur O Connor Email Contact: URL : THE PROGRAM Careers in Data Analytics Admissions Criteria CURRICULUM Program Requirements
Data Analytics (MS) PROGRAM DIRECTOR: Arthur O Connor CUNY School of Professional Studies 101 West 31 st Street, 7 th Floor New York, NY 10001 Email Contact: Arthur O Connor, [email protected] URL:
Data Mining + Business Intelligence. Integration, Design and Implementation
Data Mining + Business Intelligence Integration, Design and Implementation ABOUT ME Vijay Kotu Data, Business, Technology, Statistics BUSINESS INTELLIGENCE - Result Making data accessible Wider distribution
Keywords Big Data; OODBMS; RDBMS; hadoop; EDM; learning analytics, data abundance.
Volume 4, Issue 11, November 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Analytics
ANALYTICS CENTER LEARNING PROGRAM
Overview of Curriculum ANALYTICS CENTER LEARNING PROGRAM The following courses are offered by Analytics Center as part of its learning program: Course Duration Prerequisites 1- Math and Theory 101 - Fundamentals
Chapter 6 - Enhancing Business Intelligence Using Information Systems
Chapter 6 - Enhancing Business Intelligence Using Information Systems Managers need high-quality and timely information to support decision making Copyright 2014 Pearson Education, Inc. 1 Chapter 6 Learning
Microsoft Big Data Solutions. Anar Taghiyev P-TSP E-mail: [email protected];
Microsoft Big Data Solutions Anar Taghiyev P-TSP E-mail: [email protected]; Why/What is Big Data and Why Microsoft? Options of storage and big data processing in Microsoft Azure. Real Impact of Big
Analyzing Big Data with AWS
Analyzing Big Data with AWS Peter Sirota, General Manager, Amazon Elastic MapReduce @petersirota What is Big Data? Computer generated data Application server logs (web sites, games) Sensor data (weather,
A Knowledge Management Framework Using Business Intelligence Solutions
www.ijcsi.org 102 A Knowledge Management Framework Using Business Intelligence Solutions Marwa Gadu 1 and Prof. Dr. Nashaat El-Khameesy 2 1 Computer and Information Systems Department, Sadat Academy For
EO Data by using SAP HANA Spatial Hinnerk Gildhoff, Head of HANA Spatial, SAP Satellite Masters Conference 21 th October 2015 Public
Leveraging Geospatial Technologies EO Data by using SAP HANA Spatial Hinnerk Gildhoff, Head of HANA Spatial, SAP Satellite Masters Conference 21 th October 2015 Public Disclaimer This presentation outlines
Big Data and Analytics: Challenges and Opportunities
Big Data and Analytics: Challenges and Opportunities Dr. Amin Beheshti Lecturer and Senior Research Associate University of New South Wales, Australia (Service Oriented Computing Group, CSE) Talk: Sharif
Vendor briefing Business Intelligence and Analytics Platforms Gartner 15 capabilities
Vendor briefing Business Intelligence and Analytics Platforms Gartner 15 capabilities April, 2013 gaddsoftware.com Table of content 1. Introduction... 3 2. Vendor briefings questions and answers... 3 2.1.
How To Make Sense Of Data With Altilia
HOW TO MAKE SENSE OF BIG DATA TO BETTER DRIVE BUSINESS PROCESSES, IMPROVE DECISION-MAKING, AND SUCCESSFULLY COMPETE IN TODAY S MARKETS. ALTILIA turns Big Data into Smart Data and enables businesses to
Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap
Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap 3 key strategic advantages, and a realistic roadmap for what you really need, and when 2012, Cognizant Topics to be discussed
Are You Ready for Big Data?
Are You Ready for Big Data? Jim Gallo National Director, Business Analytics February 11, 2013 Agenda What is Big Data? How do you leverage Big Data in your company? How do you prepare for a Big Data initiative?
Rapid Visualization with Big Data Analytics. Ravi Chalaka VP, Solution and Social Innovation Marketing
Rapid Visualization with Big Data Analytics Ravi Chalaka VP, Solution and Social Innovation Marketing Imagine the Future Innovative cities that dramatically enhance the wellbeing of its citizens Safer
An Analysis on Density Based Clustering of Multi Dimensional Spatial Data
An Analysis on Density Based Clustering of Multi Dimensional Spatial Data K. Mumtaz 1 Assistant Professor, Department of MCA Vivekanandha Institute of Information and Management Studies, Tiruchengode,
Leveraging Big Data Technologies to Support Research in Unstructured Data Analytics
Leveraging Big Data Technologies to Support Research in Unstructured Data Analytics BY FRANÇOYS LABONTÉ GENERAL MANAGER JUNE 16, 2015 Principal partenaire financier WWW.CRIM.CA ABOUT CRIM Applied research
Information Management course
Università degli Studi di Milano Master Degree in Computer Science Information Management course Teacher: Alberto Ceselli Lecture 01 : 06/10/2015 Practical informations: Teacher: Alberto Ceselli ([email protected])
This Symposium brought to you by www.ttcus.com
This Symposium brought to you by www.ttcus.com Linkedin/Group: Technology Training Corporation @Techtrain Technology Training Corporation www.ttcus.com Big Data Analytics as a Service (BDAaaS) Big Data
Smart Cities Solution Overview Innovation Center Network, Research & Innovation. SAP SE Reiner Bildmayer
Smart Cities Solution Overview Innovation Center Network, Research & Innovation SAP SE Reiner Bildmayer Why Cities need to be Run Better Challenges and Opportunities ~50% of the world s population currently
How To Understand Business Intelligence
An Introduction to Advanced PREDICTIVE ANALYTICS BUSINESS INTELLIGENCE DATA MINING ADVANCED ANALYTICS An Introduction to Advanced. Where Business Intelligence Systems End... and Predictive Tools Begin
Machine Data Analytics with Sumo Logic
Machine Data Analytics with Sumo Logic A Sumo Logic White Paper Introduction Today, organizations generate more data in ten minutes than they did during the entire year in 2003. This exponential growth
MLg. Big Data and Its Implication to Research Methodologies and Funding. Cornelia Caragea TARDIS 2014. November 7, 2014. Machine Learning Group
Big Data and Its Implication to Research Methodologies and Funding Cornelia Caragea TARDIS 2014 November 7, 2014 UNT Computer Science and Engineering Data Everywhere Lots of data is being collected and
Big Data Are You Ready? Jorge Plascencia Solution Architect Manager
Big Data Are You Ready? Jorge Plascencia Solution Architect Manager Big Data: The Datafication Of Everything Thoughts Devices Processes Thoughts Things Processes Run the Business Organize data to do something
3/17/2009. Knowledge Management BIKM eclassifier Integrated BIKM Tools
Paper by W. F. Cody J. T. Kreulen V. Krishna W. S. Spangler Presentation by Dylan Chi Discussion by Debojit Dhar THE INTEGRATION OF BUSINESS INTELLIGENCE AND KNOWLEDGE MANAGEMENT BUSINESS INTELLIGENCE
Exploring Big Data in Social Networks
Exploring Big Data in Social Networks [email protected] ([email protected]) INWEB National Science and Technology Institute for Web Federal University of Minas Gerais - UFMG May 2013 Some thoughts about
Big Data Challenges and Success Factors. Deloitte Analytics Your data, inside out
Big Data Challenges and Success Factors Deloitte Analytics Your data, inside out Big Data refers to the set of problems and subsequent technologies developed to solve them that are hard or expensive to
Crime Hotspots Analysis in South Korea: A User-Oriented Approach
, pp.81-85 http://dx.doi.org/10.14257/astl.2014.52.14 Crime Hotspots Analysis in South Korea: A User-Oriented Approach Aziz Nasridinov 1 and Young-Ho Park 2 * 1 School of Computer Engineering, Dongguk
Big Data Efficiencies That Will Transform Media Company Businesses
Big Data Efficiencies That Will Transform Media Company Businesses TV, digital and print media companies are getting ever-smarter about how to serve the diverse needs of viewers who consume content across
Text Mining Approach for Big Data Analysis Using Clustering and Classification Methodologies
Text Mining Approach for Big Data Analysis Using Clustering and Classification Methodologies Somesh S Chavadi 1, Dr. Asha T 2 1 PG Student, 2 Professor, Department of Computer Science and Engineering,
Are You Ready for Big Data?
Are You Ready for Big Data? Jim Gallo National Director, Business Analytics April 10, 2013 Agenda What is Big Data? How do you leverage Big Data in your company? How do you prepare for a Big Data initiative?
COMP9321 Web Application Engineering
COMP9321 Web Application Engineering Semester 2, 2015 Dr. Amin Beheshti Service Oriented Computing Group, CSE, UNSW Australia Week 11 (Part II) http://webapps.cse.unsw.edu.au/webcms2/course/index.php?cid=2411
The STC for Event Analysis: Scalability Issues
The STC for Event Analysis: Scalability Issues Georg Fuchs Gennady Andrienko http://geoanalytics.net Events Something [significant] happened somewhere, sometime Analysis goal and domain dependent, e.g.
RESEARCH ON THE FRAMEWORK OF SPATIO-TEMPORAL DATA WAREHOUSE
RESEARCH ON THE FRAMEWORK OF SPATIO-TEMPORAL DATA WAREHOUSE WANG Jizhou, LI Chengming Institute of GIS, Chinese Academy of Surveying and Mapping No.16, Road Beitaiping, District Haidian, Beijing, P.R.China,
NetView 360 Product Description
NetView 360 Product Description Heterogeneous network (HetNet) planning is a specialized process that should not be thought of as adaptation of the traditional macro cell planning process. The new approach
Big Data at Cloud Scale
Big Data at Cloud Scale Pushing the limits of flexible & powerful analytics Copyright 2015 Pentaho Corporation. Redistribution permitted. All trademarks are the property of their respective owners. For
International Journal of Advanced Engineering Research and Applications (IJAERA) ISSN: 2454-2377 Vol. 1, Issue 6, October 2015. Big Data and Hadoop
ISSN: 2454-2377, October 2015 Big Data and Hadoop Simmi Bagga 1 Satinder Kaur 2 1 Assistant Professor, Sant Hira Dass Kanya MahaVidyalaya, Kala Sanghian, Distt Kpt. INDIA E-mail: [email protected]
Deploy. Friction-free self-service BI solutions for everyone Scalable analytics on a modern architecture
Friction-free self-service BI solutions for everyone Scalable analytics on a modern architecture Apps and data source extensions with APIs Future white label, embed or integrate Power BI Deploy Intelligent
Interactive Information Visualization of Trend Information
Interactive Information Visualization of Trend Information Yasufumi Takama Takashi Yamada Tokyo Metropolitan University 6-6 Asahigaoka, Hino, Tokyo 191-0065, Japan [email protected] Abstract This paper
VIEWPOINT. High Performance Analytics. Industry Context and Trends
VIEWPOINT High Performance Analytics Industry Context and Trends In the digital age of social media and connected devices, enterprises have a plethora of data that they can mine, to discover hidden correlations
Location-Based Social Networks: Users
Chapter 8 Location-Based Social Networks: Users Yu Zheng Abstract In this chapter, we introduce and define the meaning of location-based social network (LBSN) and discuss the research philosophy behind
How To Handle Big Data With A Data Scientist
III Big Data Technologies Today, new technologies make it possible to realize value from Big Data. Big data technologies can replace highly customized, expensive legacy systems with a standard solution
White Paper. How Streaming Data Analytics Enables Real-Time Decisions
White Paper How Streaming Data Analytics Enables Real-Time Decisions Contents Introduction... 1 What Is Streaming Analytics?... 1 How Does SAS Event Stream Processing Work?... 2 Overview...2 Event Stream
BIG DATA TRENDS AND TECHNOLOGIES
BIG DATA TRENDS AND TECHNOLOGIES THE WORLD OF DATA IS CHANGING Cloud WHAT IS BIG DATA? Big data are datasets that grow so large that they become awkward to work with using onhand database management tools.
Data Warehousing and Data Mining in Business Applications
133 Data Warehousing and Data Mining in Business Applications Eesha Goel CSE Deptt. GZS-PTU Campus, Bathinda. Abstract Information technology is now required in all aspect of our lives that helps in business
Introducing Oracle Exalytics In-Memory Machine
Introducing Oracle Exalytics In-Memory Machine Jon Ainsworth Director of Business Development Oracle EMEA Business Analytics 1 Copyright 2011, Oracle and/or its affiliates. All rights Agenda Topics Oracle
Sensors talk and humans sense Part II
Sensors talk and humans sense Part II Athena Vakali Palic, 6 th September 2013 OSWINDS group Department of Informatics Aristotle University of Thessaloniki http://oswinds.csd.auth.gr SEN2SOC Architecture
Customized Efficient Collection of Big Data for Advertising Services
, pp.36-41 http://dx.doi.org/10.14257/astl.2015.94.09 Customized Efficient Collection of Big Data for Advertising Services Jun-Soo Yun 1, Jin-Tae Park 1, Hyun-Seo Hwang 1, Il-Young Moon 1 1 1600 Chungjeol-ro,
Big Data and Analytics: Getting Started with ArcGIS. Mike Park Erik Hoel
Big Data and Analytics: Getting Started with ArcGIS Mike Park Erik Hoel Agenda Overview of big data Distributed computation User experience Data management Big data What is it? Big Data is a loosely defined
International Journal of Advancements in Research & Technology, Volume 3, Issue 5, May-2014 18 ISSN 2278-7763. BIG DATA: A New Technology
International Journal of Advancements in Research & Technology, Volume 3, Issue 5, May-2014 18 BIG DATA: A New Technology Farah DeebaHasan Student, M.Tech.(IT) Anshul Kumar Sharma Student, M.Tech.(IT)
Course 803401 DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
Oman College of Management and Technology Course 803401 DSS Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization CS/MIS Department Information Sharing
Big Data Mining Services and Knowledge Discovery Applications on Clouds
Big Data Mining Services and Knowledge Discovery Applications on Clouds Domenico Talia DIMES, Università della Calabria & DtoK Lab Italy [email protected] Data Availability or Data Deluge? Some decades
The Lab and The Factory
The Lab and The Factory Architecting for Big Data Management April Reeve DAMA Wisconsin March 11 2014 1 A good speech should be like a woman's skirt: long enough to cover the subject and short enough to
Using Social Media Data to Assess Spatial Crime Hotspots
Using Social Media Data to Assess Spatial Crime Hotspots 1 Introduction Nick Malleson 1 and Martin Andreson 2 1 School of Geography, University of Leeds 2 School of Criminology, Simon Fraser University,
Adobe Insight, powered by Omniture
Adobe Insight, powered by Omniture Accelerating government intelligence to the speed of thought 1 Challenges that analysts face 2 Analysis tools and functionality 3 Adobe Insight 4 Summary Never before
PRACTICAL DATA MINING IN A LARGE UTILITY COMPANY
QÜESTIIÓ, vol. 25, 3, p. 509-520, 2001 PRACTICAL DATA MINING IN A LARGE UTILITY COMPANY GEORGES HÉBRAIL We present in this paper the main applications of data mining techniques at Electricité de France,
How To Use Data Mining For Knowledge Management In Technology Enhanced Learning
Proceedings of the 6th WSEAS International Conference on Applications of Electrical Engineering, Istanbul, Turkey, May 27-29, 2007 115 Data Mining for Knowledge Management in Technology Enhanced Learning
BEYOND BI: Big Data Analytic Use Cases
BEYOND BI: Big Data Analytic Use Cases Big Data Analytics Use Cases This white paper discusses the types and characteristics of big data analytics use cases, how they differ from traditional business intelligence
Spatio-Temporal Networks:
Spatio-Temporal Networks: Analyzing Change Across Time and Place WHITE PAPER By: Jeremy Peters, Principal Consultant, Digital Commerce Professional Services, Pitney Bowes ABSTRACT ORGANIZATIONS ARE GENERATING
INTELLIGENT BUSINESS STRATEGIES WHITE PAPER
INTELLIGENT BUSINESS STRATEGIES WHITE PAPER Improving Access to Data for Successful Business Intelligence Part 2: Supporting Multiple Analytical Workloads in a Changing Analytical Landscape By Mike Ferguson
Web Archiving and Scholarly Use of Web Archives
Web Archiving and Scholarly Use of Web Archives Helen Hockx-Yu Head of Web Archiving British Library 15 April 2013 Overview 1. Introduction 2. Access and usage: UK Web Archive 3. Scholarly feedback on
Microsoft Big Data. Solution Brief
Microsoft Big Data Solution Brief Contents Introduction... 2 The Microsoft Big Data Solution... 3 Key Benefits... 3 Immersive Insight, Wherever You Are... 3 Connecting with the World s Data... 3 Any Data,
Data Analysis on Location-Based Social Networks
Data Analysis on Location-Based Social Networks Huiji Gao and Huan Liu Abstract The rapid growth of location-based social networks (LBSNs) has greatly enriched people s urban experience through social
Software Engineering for Big Data. CS846 Paulo Alencar David R. Cheriton School of Computer Science University of Waterloo
Software Engineering for Big Data CS846 Paulo Alencar David R. Cheriton School of Computer Science University of Waterloo Big Data Big data technologies describe a new generation of technologies that aim
Business Intelligence. A Presentation of the Current Lead Solutions and a Comparative Analysis of the Main Providers
60 Business Intelligence. A Presentation of the Current Lead Solutions and a Comparative Analysis of the Main Providers Business Intelligence. A Presentation of the Current Lead Solutions and a Comparative
locuz.com Big Data Services
locuz.com Big Data Services Big Data At Locuz, we help the enterprise move from being a data-limited to a data-driven one, thereby enabling smarter, faster decisions that result in better business outcome.
Advanced Analytics & Reporting. Enterprise Cloud Advanced Analytics & Reporting Solution
& Reporting Enterprise Cloud & Reporting Solution & Reporting Rivo transforms your data and provides you with powerful insights into current events, retrospectives on what has happened and predictions
PERSONALIZED WEB MAP CUSTOMIZED SERVICE
CO-436 PERSONALIZED WEB MAP CUSTOMIZED SERVICE CHEN Y.(1), WU Z.(1), YE H.(2) (1) Zhengzhou Institute of Surveying and Mapping, ZHENGZHOU, CHINA ; (2) North China Institute of Water Conservancy and Hydroelectric
OLAP Theory-English version
OLAP Theory-English version On-Line Analytical processing (Business Intelligence) [Ing.J.Skorkovský,CSc.] Department of corporate economy Agenda The Market Why OLAP (On-Line-Analytic-Processing Introduction
SPATIAL DATA CLASSIFICATION AND DATA MINING
, pp.-40-44. Available online at http://www. bioinfo. in/contents. php?id=42 SPATIAL DATA CLASSIFICATION AND DATA MINING RATHI J.B. * AND PATIL A.D. Department of Computer Science & Engineering, Jawaharlal
Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies
Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies Big Data: Global Digital Data Growth Growing leaps and bounds by 40+% Year over Year! 2009 =.8 Zetabytes =.08
Big Data Executive Survey
Big Data Executive Full Questionnaire Big Date Executive Full Questionnaire Appendix B Questionnaire Welcome The survey has been designed to provide a benchmark for enterprises seeking to understand the
Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
