Tutorial, IEEE SERVICE 2014 Anchorage, Alaska
|
|
- Lesley Paul
- 8 years ago
- Views:
Transcription
1 Tutorial, IEEE SERVICE 2014 Anchorage, Alaska Big Data Science: Fundamental, Techniques, and Challenges (Data Mining on Big Data) By Neil Y. Yen Presented by Incheon Paik University of Aizu Japan Data Mining on Big Data 1
2 Background Data everywhere Growth of communication channels s (10-20 times / day) messenger (3-4 hours / day) social media (80% of a day) and coming Power of social media an integrated portal to interact with Changes on human behaviors information-sharing, experience crowdsourcing, and knowledge cultivation Note. Statistics summarized from IDC at 2
3 Background Data everywhere Wikipedia ( 30 million pages ; 80 million edits ) YouTube ( 100 million videos ; 150 million accesses ) Blogosphere ( 250 million blogs ; 500 million views ) number of posts are decreasing Twitter ( 30 billion tweets ; 5 million shares ) Facebook ( 900 million objects ; 250 million uploads ) Yahoo Answer / 知 恵 袋 ( 1.7 billion questions ; 900 million answers ) Flickr ( 5 billion photos) * 22% in 3.2 billions of Internet Users Note. Statistics summarized from IDC at Last update on January 01,
4 Background Data everywhere 4
5 Background Data everywhere in general case Platform, Person-oriented, Comprehensive Entry, Industry-oriented, Focus Note. Statistics summarized from IDC at 5
6 More Data More Information ( a foreseeable growth of the Internet and media ) More Data More Complexity ( scalability, integrity, consistency of data ) More Data More Heterogeneity ( different methods for data processing ) 6
7 Part I: Data Mining on Social network Introduction to social network Emerging models for social network Mining the social network(s) 7
8 Introduction to social network Primary participants Individuals (or objects) Node(s) Connections (or correlations) Link(s) Social networks can be interpreted as phenomenon derived by individuals with diverse interactions among them. The Small World: Six degrees of separation by S. Milgram (1967) 8
9 Introduction to social network Primary participants Individuals (or objects) Node(s) Connections (or correlations) Link(s) From another point of view, the Earth is an electronic nervous system, implementing by a conceptual network with nodes and links: nodes such as laptops, smartphones, satellites, etc. links such as cable lines, signals, etc. that make the node connected Communication networks: Many non-identical components with diverse connections between them 9
10 Introduction to social network Consider many kinds of networks: social, technological, business, economic, content, These networks tend to share certain informal properties: large scale, continual growth distributed, democratic growth: vertices decide who to link to mixture of local and long-distance connections abstract notions of distance: geographical, content, social, Main concerns Do natural networks share quantitative universals? What would these universals be? How can they be well modeled, analyzed, and explained? All the phenomenon follows the theories of social network, and can always be easily explained through link analysis 10
11 Introduction to social network Connected participants: how many, and how large Network diameter: maximum (worst-case) or average exclude infinite distances? (disconnected components) the small-world phenomenon Clustering: to what extent that links tend to cluster locally what is the balance between local and long-distance connections what roles do the two types of links play Degree distribution: what is the typical degree in the network what is the overall distribution 11
12 Introduction to social network Probabilistic and/or statistical models towards well management the generation of network(s) Various parameters to be concerned: network size degree of vertex the connections Statements are always statistical in nature: with high probability, diameter is small on average, degree distribution has heavy tail 12
13 Part I: Data Mining on Social network Introduction to social network Emerging models for social network Mining the social network(s) 13
14 Emerging models for social network Random graphs Erdös-Rényi model (1960): Few components and small diameter No high clustering and heavy-tailed degree distributions A well-studied and understood mathematical model in general case Random graphs Watts & Strogatz model (1998): Few components, small diameter and high clustering No heavy-tailed degree distributions Scale-free Networks: Few components, small diameter and heavy-tailed distribution No high clustering Hierarchical networks: Few components, small diameter, high clustering, heavy-tailed 14
15 Emerging models for social network Case I: The Internet Nodes: computers, routers Links: physical lines cluster end device connector link 15
16 Emerging models for social network Case II: The Actor Network Nodes: actors Links: the rest casts conceptual connected Flatliners (1990) A Few Good Men (1992) Sleepers (1996) connected The River Wild (1994) Apollo 13 (1995) In the Cut (2003) 16
17 Emerging models for social network Case III: Co-authorship Network Nodes: authors Links: coauthor/coedit on academic publications Retrieved from Microsoft Academic Search 17
18 Emerging models for social network Case IV: Academic Citation Network Nodes: authors Links: cite academic publications Retrieved from Microsoft Academic Search 18
19 Emerging models for social network Case V: Food Network R.J. Williams, N.D. Martinez Nature (2000) Nodes: trophic species Links: interactions among selected trophic species Case VI: Food Network Liljeros et al. Nature (2001) Nodes: human beings categorized by sexual property Links: sexual relationships You can find many kinds of network structure as long as interactions existing among potential participants. 19
20 Part I: Data Mining on Social network Introduction to social network Emerging models for social network Mining the social network(s) 20
21 Mining the social network(s) Heterogeneous, multi-relational data represented as a graph Nodes as objects Heterogeneous objects need to be concerned Attributes of objects matter Considering sub-classes of objects and their corresponding labels Edges as links Different types of link may exist on same graph Weighted graph, dual-weighted graph, or others Links represent relationships and interactions between objects All we expect to know is the meaning of links Understanding the meaning(s) of link can help identify the relationship between objects 21
22 Mining the social network(s) Conventional approaches applied in machine learning and data mining consider that a random sample of homogeneous objects from single relation However, the real-world datasets are supposed to be multi-relational, heterogeneous, and semi-structured, which are totally different from the traditional assumptions So, the link mining represents an emerging field of research that concentrates the intersection of network and link analysis, hypertext and web mining, graph mining, relational learning and inductive logic programming Simply speaking, it is a multi-disciplinary field of study although most of its core concepts are derived from the existing methods. 22
23 Mining the social network(s) taxonomy of link mining Object-Related Tasks Link-based object ranking Link-based object classification Object clustering (group detection) Object identification (entity resolution) Link-Related Tasks Link prediction Link re-construction Link understanding 23
24 Mining the social network(s) methods to link mining Properties: Scale free [Barabasi 99], Clustering [Watts-Strogatz 98], Navigation [Adamic- Adar 03, LibenNowell 05], Bipartite cores [Kumar et al. 99], Network Motifs [Milo et al. 02], Communities [Nawman 99], Conductance [Mihail-Papadimitriou 06], Hub and authorities [Page et al. 98, Kleinberg 99] PageRank [Page et al. 99], Hyperlink-Induced Topic Search [Kleinberg 99], EigenRumor [Fujimura 05] Models: Preferential attachment [Barabasi 99], Small-world [Watts-Strogatz 98], Copying model [Kleinberg et al. 01], Heuristically tradeoffs [Fabrikant et al. 02], Congestion [Mihail et al. 03], Searchability [Kleinberg 02], Bowtie [Broder et al. 00], Transitstub [Zegura 97], Jellyfish [Tauro et al. 01] Path-Oriented: Neighborhood Selection, Swarm Intelligence Efficiency-Oriented: Greedy approach, SSON (Semantic Social Overlay Network), ESLP (Efficient Social-like Peer-to-peer network) 24
25 Mining the social network(s) Link is defined as the relationship among data Two kinds of linked networks homogeneous vs. heterogeneous Homogeneous networks Single object type and single link type Single model social networks (e.g., friends) WWW: a collection of linked Web pages Heterogeneous networks Multiple object and link types Medical network: patients, doctors, disease, contacts, treatments Bibliographic network: publications, authors, venues 25
26 Mining the social network(s) Link-based Ranking is primarily to exploit the link structure in a graph and to order or prioritize the set of objects within the graph Web information analysis PageRank and HITS are typical approaches inspired by link-based ranking Link-based ranking is considered a core technique in mining the network structure (so as in social network analysis) It is applied to rank participants in terms of centrality Degree centrality vs. Eigen vector/power centrality Rank objects relative to one or more relevant objects in the graph vs. ranks object over time in dynamic graphs 26
27 Mining the social network(s) the PageRank Algorithm by Brin & Page (1998) PageRank is essentially citation counting, but improves over simple counting Considering the indirect citations Smoothing of citations PageRank can also be interpreted as random surfing P(B) P(C) referring to P(A) A B C D deriving from P(D) E F 27
28 Mining the social network(s) the PageRank Algorithm by Brin & Page (1998) Random surfing model: at any page, With prob., randomly jumping to a page With prob. (1 ), randomly picking a link to follow d 3 d 1 M 0 0 1/ 2 1/ / 2 1/ Transition matrix Same as /N d 2 d 4 1 p ( d ) (1 ) m p ( d ) p ( d ) t 1 i ji t j t k d IN ( d ) k N 1 p( di ) [ (1 ) mki ] p( dk ) N T p ( I (1 ) M ) p k j i I ij = 1/N Stationary ( stable ) distribution, so we ignore time Initial value p(d)=1/n Iterate until converge 28
29 Mining the social network(s) Another model, link-based classification, is to predict the category of an object based on its attributes, links and the attributes of correlated objects among graph(s) Here we may need to take the multi-modal, multi-layered graph and their corresponding attributes to design the methods for link and object mining Web: Predict the category of a web page, based on words that occur on the page, links between pages, anchor text, html tags, etc. Citation: Predict the topic of a paper, based on word occurrence, citations, co-citations Communication: Predict whether a communication contact is by , phone call or mail 29
30 Mining the social network(s) Group detection Cluster the nodes in the graph into groups that share common characteristics Web Identifying communities Citation identifying research communities Entity resolution To predict when two objects are the same, based on their attributes and their links Web predict when two sites are mirrors of each other Citation predicting when two citations are referring to the same paper Epidemics predicting when two disease strains are the same or similar Biology learning when two names refer to the same protein 30
31 Mining the social network(s) Methods in entity resolution was taken as pair-wise resolution problem: resolved based on the similarity of their attributes (i.e., association rule or model in data mining) All these methods consider the importance on links Links in entity resolution Collective resolution: one resolution decision affects another if connection exists among them Probabilistic models interact with different entity recognition decisions 31
32 Mining the social network(s) link prediction Predict whether the relationship exists between two participants in graph based on attributes and all correlated links Web: predict if there will be a link between two pages Citation: predicting if a paper will cite another paper Epidemics: predicting who a patient s contacts are Applied Methods Often viewed as a binary classification problem Local conditional probability model, based on structural and attribute features Difficulty: sparseness of existing links Collective prediction, e.g., Markov random field model 32
33 Mining the social network(s) link estimation Make prediction to the number of links of a connected participant Web: predict the authority of a page based on the number of in-links; identifying hubs based on the number of out-links Citation: predicting the impact of a paper based on the number of citations Epidemics: predicting the number of people that will be infected based on the infectiousness of a disease Make prediction to the number of participants reachable by a given participant Web: predicting number of pages retrieved by crawling a site Citation: predicting the number of citations of a particular author in a specific journal 33
34 Conclusion Big Data Big Opportunity? or Big Problem? What is your target or subjective? How will it be done? but do not forget the human Making Balance is a challenging issue Infrastructure (storage), Management (governance, analysis), Search (value discovery), Security (transparency v.s. privacy), Applications (human-centered) Next? building the strategic alliances industry, academia, and government worldwide making opportunities before intending to find them (think before acting, and sometimes act before thinking) 34
Complex Networks Analysis: Clustering Methods
Complex Networks Analysis: Clustering Methods Nikolai Nefedov Spring 2013 ISI ETH Zurich nefedov@isi.ee.ethz.ch 1 Outline Purpose to give an overview of modern graph-clustering methods and their applications
More informationGraph Mining and Social Network Analysis
Graph Mining and Social Network Analysis Data Mining and Text Mining (UIC 583 @ Politecnico di Milano) References Jiawei Han and Micheline Kamber, "Data Mining: Concepts and Techniques", The Morgan Kaufmann
More informationIC05 Introduction on Networks &Visualization Nov. 2009. <mathieu.bastian@gmail.com>
IC05 Introduction on Networks &Visualization Nov. 2009 Overview 1. Networks Introduction Networks across disciplines Properties Models 2. Visualization InfoVis Data exploration
More informationGraphs over Time Densification Laws, Shrinking Diameters and Possible Explanations
Graphs over Time Densification Laws, Shrinking Diameters and Possible Explanations Jurij Leskovec, CMU Jon Kleinberg, Cornell Christos Faloutsos, CMU 1 Introduction What can we do with graphs? What patterns
More informationPractical Graph Mining with R. 5. Link Analysis
Practical Graph Mining with R 5. Link Analysis Outline Link Analysis Concepts Metrics for Analyzing Networks PageRank HITS Link Prediction 2 Link Analysis Concepts Link A relationship between two entities
More informationGraph Mining Techniques for Social Media Analysis
Graph Mining Techniques for Social Media Analysis Mary McGlohon Christos Faloutsos 1 1-1 What is graph mining? Extracting useful knowledge (patterns, outliers, etc.) from structured data that can be represented
More informationCollective Behavior Prediction in Social Media. Lei Tang Data Mining & Machine Learning Group Arizona State University
Collective Behavior Prediction in Social Media Lei Tang Data Mining & Machine Learning Group Arizona State University Social Media Landscape Social Network Content Sharing Social Media Blogs Wiki Forum
More informationIntroduction to Networks and Business Intelligence
Introduction to Networks and Business Intelligence Prof. Dr. Daning Hu Department of Informatics University of Zurich Sep 17th, 2015 Outline Network Science A Random History Network Analysis Network Topological
More informationAsking Hard Graph Questions. Paul Burkhardt. February 3, 2014
Beyond Watson: Predictive Analytics and Big Data U.S. National Security Agency Research Directorate - R6 Technical Report February 3, 2014 300 years before Watson there was Euler! The first (Jeopardy!)
More informationOnline Social Networks and Network Economics. Aris Anagnostopoulos, Online Social Networks and Network Economics
Online Social Networks and Network Economics Who? Dr. Luca Becchetti Prof. Elias Koutsoupias Prof. Stefano Leonardi What will We Cover? Possible topics: Structure of social networks Models for social networks
More informationSocial Network Mining
Social Network Mining Data Mining November 11, 2013 Frank Takes (ftakes@liacs.nl) LIACS, Universiteit Leiden Overview Social Network Analysis Graph Mining Online Social Networks Friendship Graph Semantics
More informationAnalyzing the Facebook graph?
Logistics Big Data Algorithmic Introduction Prof. Yuval Shavitt Contact: shavitt@eng.tau.ac.il Final grade: 4 6 home assignments (will try to include programing assignments as well): 2% Exam 8% Big Data
More informationSocial Network Analysis: Introduzione all'analisi di reti sociali
Social Network Analysis: Introduzione all'analisi di reti sociali Michele Coscia Dipartimento di Informatica Università di Pisa www.di.unipi.it/~coscia Piano Lezioni Introduzione Misure + Modelli di Social
More informationMALLET-Privacy Preserving Influencer Mining in Social Media Networks via Hypergraph
MALLET-Privacy Preserving Influencer Mining in Social Media Networks via Hypergraph Janani K 1, Narmatha S 2 Assistant Professor, Department of Computer Science and Engineering, Sri Shakthi Institute of
More informationBig Graph Processing: Some Background
Big Graph Processing: Some Background Bo Wu Colorado School of Mines Part of slides from: Paul Burkhardt (National Security Agency) and Carlos Guestrin (Washington University) Mines CSCI-580, Bo Wu Graphs
More informationBig Data Analytics of Multi-Relationship Online Social Network Based on Multi-Subnet Composited Complex Network
, pp.273-284 http://dx.doi.org/10.14257/ijdta.2015.8.5.24 Big Data Analytics of Multi-Relationship Online Social Network Based on Multi-Subnet Composited Complex Network Gengxin Sun 1, Sheng Bin 2 and
More informationGeneral Network Analysis: Graph-theoretic. COMP572 Fall 2009
General Network Analysis: Graph-theoretic Techniques COMP572 Fall 2009 Networks (aka Graphs) A network is a set of vertices, or nodes, and edges that connect pairs of vertices Example: a network with 5
More informationDepartment of Cognitive Sciences University of California, Irvine 1
Mark Steyvers Department of Cognitive Sciences University of California, Irvine 1 Network structure of word associations Decentralized search in information networks Analogy between Google and word retrieval
More informationHealthcare Analytics. Aryya Gangopadhyay UMBC
Healthcare Analytics Aryya Gangopadhyay UMBC Two of many projects Integrated network approach to personalized medicine Multidimensional and multimodal Dynamic Analyze interactions HealthMask Need for sharing
More informationMachine Learning over Big Data
Machine Learning over Big Presented by Fuhao Zou fuhao@hust.edu.cn Jue 16, 2014 Huazhong University of Science and Technology Contents 1 2 3 4 Role of Machine learning Challenge of Big Analysis Distributed
More informationExploring Big Data in Social Networks
Exploring Big Data in Social Networks virgilio@dcc.ufmg.br (meira@dcc.ufmg.br) INWEB National Science and Technology Institute for Web Federal University of Minas Gerais - UFMG May 2013 Some thoughts about
More informationAN INTRODUCTION TO SOCIAL NETWORK DATA ANALYTICS
Chapter 1 AN INTRODUCTION TO SOCIAL NETWORK DATA ANALYTICS Charu C. Aggarwal IBM T. J. Watson Research Center Hawthorne, NY 10532 charu@us.ibm.com Abstract The advent of online social networks has been
More informationGraph models for the Web and the Internet. Elias Koutsoupias University of Athens and UCLA. Crete, July 2003
Graph models for the Web and the Internet Elias Koutsoupias University of Athens and UCLA Crete, July 2003 Outline of the lecture Small world phenomenon The shape of the Web graph Searching and navigation
More information1 o Semestre 2007/2008
Departamento de Engenharia Informática Instituto Superior Técnico 1 o Semestre 2007/2008 Outline 1 2 3 4 5 Outline 1 2 3 4 5 Exploiting Text How is text exploited? Two main directions Extraction Extraction
More informationAn Alternative Web Search Strategy? Abstract
An Alternative Web Search Strategy? V.-H. Winterer, Rechenzentrum Universität Freiburg (Dated: November 2007) Abstract We propose an alternative Web search strategy taking advantage of the knowledge on
More informationPart 1: Link Analysis & Page Rank
Chapter 8: Graph Data Part 1: Link Analysis & Page Rank Based on Leskovec, Rajaraman, Ullman 214: Mining of Massive Datasets 1 Exam on the 5th of February, 216, 14. to 16. If you wish to attend, please
More informationSocial Network Analysis
Social Network Analysis Challenges in Computer Science April 1, 2014 Frank Takes (ftakes@liacs.nl) LIACS, Leiden University Overview Context Social Network Analysis Online Social Networks Friendship Graph
More informationSocial Networks and Social Media
Social Networks and Social Media Social Media: Many-to-Many Social Networking Content Sharing Social Media Blogs Microblogging Wiki Forum 2 Characteristics of Social Media Consumers become Producers Rich
More informationMLg. Big Data and Its Implication to Research Methodologies and Funding. Cornelia Caragea TARDIS 2014. November 7, 2014. Machine Learning Group
Big Data and Its Implication to Research Methodologies and Funding Cornelia Caragea TARDIS 2014 November 7, 2014 UNT Computer Science and Engineering Data Everywhere Lots of data is being collected and
More informationData Mining on Social Networks. Dionysios Sotiropoulos Ph.D.
Data Mining on Social Networks Dionysios Sotiropoulos Ph.D. 1 Contents What are Social Media? Mathematical Representation of Social Networks Fundamental Data Mining Concepts Data Mining Tasks on Digital
More informationRandom graphs and complex networks
Random graphs and complex networks Remco van der Hofstad Honours Class, spring 2008 Complex networks Figure 2 Ye a s t p ro te in in te ra c tio n n e tw o rk. A m a p o f p ro tein p ro tein in tera c
More informationOverview of the Stateof-the-Art. Networks. Evolution of social network studies
Overview of the Stateof-the-Art in Social Networks INF5370 spring 2013 Evolution of social network studies 1950-1970: mathematical studies of networks formed by the actual human interactions Pandemics,
More informationEfficient target control of complex networks based on preferential matching
Efficient target control of complex networks based on preferential matching Xizhe Zhang 1, Huaizhen Wang 1, Tianyang Lv 2,3 1 (School of Computer Science and Engineering, Northeastern University, Shenyang110819,
More informationProtein Protein Interaction Networks
Functional Pattern Mining from Genome Scale Protein Protein Interaction Networks Young-Rae Cho, Ph.D. Assistant Professor Department of Computer Science Baylor University it My Definition of Bioinformatics
More information1. Write the number of the left-hand item next to the item on the right that corresponds to it.
1. Write the number of the left-hand item next to the item on the right that corresponds to it. 1. Stanford prison experiment 2. Friendster 3. neuron 4. router 5. tipping 6. small worlds 7. job-hunting
More informationWalk-Based Centrality and Communicability Measures for Network Analysis
Walk-Based Centrality and Communicability Measures for Network Analysis Michele Benzi Department of Mathematics and Computer Science Emory University Atlanta, Georgia, USA Workshop on Innovative Clustering
More informationExtracting Information from Social Networks
Extracting Information from Social Networks Aggregating site information to get trends 1 Not limited to social networks Examples Google search logs: flu outbreaks We Feel Fine Bullying 2 Bullying Xu, Jun,
More informationThe Shape of the Network. The Shape of the Internet. Why study topology? Internet topologies. Early work. More on topologies..
The Shape of the Internet Slides assembled by Jeff Chase Duke University (thanks to and ) The Shape of the Network Characterizing shape : AS-level topology: who connects to whom Router-level topology:
More informationRANKING WEB PAGES RELEVANT TO SEARCH KEYWORDS
ISBN: 978-972-8924-93-5 2009 IADIS RANKING WEB PAGES RELEVANT TO SEARCH KEYWORDS Ben Choi & Sumit Tyagi Computer Science, Louisiana Tech University, USA ABSTRACT In this paper we propose new methods for
More informationUSING SPECTRAL RADIUS RATIO FOR NODE DEGREE TO ANALYZE THE EVOLUTION OF SCALE- FREE NETWORKS AND SMALL-WORLD NETWORKS
USING SPECTRAL RADIUS RATIO FOR NODE DEGREE TO ANALYZE THE EVOLUTION OF SCALE- FREE NETWORKS AND SMALL-WORLD NETWORKS Natarajan Meghanathan Jackson State University, 1400 Lynch St, Jackson, MS, USA natarajan.meghanathan@jsums.edu
More informationInternational Journal of Engineering Research-Online A Peer Reviewed International Journal Articles are freely available online:http://www.ijoer.
RESEARCH ARTICLE SURVEY ON PAGERANK ALGORITHMS USING WEB-LINK STRUCTURE SOWMYA.M 1, V.S.SREELAXMI 2, MUNESHWARA M.S 3, ANIL G.N 4 Department of CSE, BMS Institute of Technology, Avalahalli, Yelahanka,
More informationGraph Processing and Social Networks
Graph Processing and Social Networks Presented by Shu Jiayu, Yang Ji Department of Computer Science and Engineering The Hong Kong University of Science and Technology 2015/4/20 1 Outline Background Graph
More informationSome questions... Graphs
Uni Innsbruck Informatik - 1 Uni Innsbruck Informatik - 2 Some questions... Peer-to to-peer Systems Analysis of unstructured P2P systems How scalable is Gnutella? How robust is Gnutella? Why does FreeNet
More informationA discussion of Statistical Mechanics of Complex Networks P. Part I
A discussion of Statistical Mechanics of Complex Networks Part I Review of Modern Physics, Vol. 74, 2002 Small Word Networks Clustering Coefficient Scale-Free Networks Erdös-Rényi model cover only parts
More informationBig Data Technology Motivating NoSQL Databases: Computing Page Importance Metrics at Crawl Time
Big Data Technology Motivating NoSQL Databases: Computing Page Importance Metrics at Crawl Time Edward Bortnikov & Ronny Lempel Yahoo! Labs, Haifa Class Outline Link-based page importance measures Why
More informationData Mining Cluster Analysis: Basic Concepts and Algorithms. Lecture Notes for Chapter 8. Introduction to Data Mining
Data Mining Cluster Analysis: Basic Concepts and Algorithms Lecture Notes for Chapter 8 by Tan, Steinbach, Kumar 1 What is Cluster Analysis? Finding groups of objects such that the objects in a group will
More informationEfficient Identification of Starters and Followers in Social Media
Efficient Identification of Starters and Followers in Social Media Michael Mathioudakis Department of Computer Science University of Toronto mathiou@cs.toronto.edu Nick Koudas Department of Computer Science
More informationInformation Management course
Università degli Studi di Milano Master Degree in Computer Science Information Management course Teacher: Alberto Ceselli Lecture 01 : 06/10/2015 Practical informations: Teacher: Alberto Ceselli (alberto.ceselli@unimi.it)
More informationHow To Create A Cluster Ensemble
A Universal Similarity Model for Transactional Data Clustering 1 Mr.G.Kamalraja 2 Mrs.K.Prathiba 3 Ms.C.M.Parameshwari 1 Assistant Professor, Department Of IT,, Excel Engineering college, Namakkal. 2 Assistant
More informationDynamical Clustering of Personalized Web Search Results
Dynamical Clustering of Personalized Web Search Results Xuehua Shen CS Dept, UIUC xshen@cs.uiuc.edu Hong Cheng CS Dept, UIUC hcheng3@uiuc.edu Abstract Most current search engines present the user a ranked
More informationBig Data and Analytics: Challenges and Opportunities
Big Data and Analytics: Challenges and Opportunities Dr. Amin Beheshti Lecturer and Senior Research Associate University of New South Wales, Australia (Service Oriented Computing Group, CSE) Talk: Sharif
More informationSanjeev Kumar. contribute
RESEARCH ISSUES IN DATAA MINING Sanjeev Kumar I.A.S.R.I., Library Avenue, Pusa, New Delhi-110012 sanjeevk@iasri.res.in 1. Introduction The field of data mining and knowledgee discovery is emerging as a
More informationProtect Network Neutrality against Intellectual Property Rights A Legal and Social Network Perspective
Protect Network Neutrality against Intellectual Property Rights A Legal and Social Network Perspective Shinto Teramoto 1 Kyushu University Abstract. The ideal communication network in a democratic and
More informationSearch engines: ranking algorithms
Search engines: ranking algorithms Gianna M. Del Corso Dipartimento di Informatica, Università di Pisa, Italy ESP, 25 Marzo 2015 1 Statistics 2 Search Engines Ranking Algorithms HITS Web Analytics Estimated
More informationLink Mining: A New Data Mining Challenge
Link Mining: A New Data Mining Challenge Lise Getoor Dept. of Computer Science/UMIACS University of Maryland College Park, MD 20742 getoor@cs.umd.edu ABSTRACT A key challenge for data mining is tackling
More informationCOMP9321 Web Application Engineering
COMP9321 Web Application Engineering Semester 2, 2015 Dr. Amin Beheshti Service Oriented Computing Group, CSE, UNSW Australia Week 11 (Part II) http://webapps.cse.unsw.edu.au/webcms2/course/index.php?cid=2411
More informationCS224W Project Report: Finding Top UI/UX Design Talent on Adobe Behance
CS224W Project Report: Finding Top UI/UX Design Talent on Adobe Behance Susanne Halstead, Daniel Serrano, Scott Proctor 6 December 2014 1 Abstract The Behance social network allows professionals of diverse
More informationGraph Theory and Complex Networks: An Introduction. Chapter 08: Computer networks
Graph Theory and Complex Networks: An Introduction Maarten van Steen VU Amsterdam, Dept. Computer Science Room R4.20, steen@cs.vu.nl Chapter 08: Computer networks Version: March 3, 2011 2 / 53 Contents
More informationSix Degrees of Separation in Online Society
Six Degrees of Separation in Online Society Lei Zhang * Tsinghua-Southampton Joint Lab on Web Science Graduate School in Shenzhen, Tsinghua University Shenzhen, Guangdong Province, P.R.China zhanglei@sz.tsinghua.edu.cn
More informationResearch Article A Comparison of Online Social Networks and Real-Life Social Networks: A Study of Sina Microblogging
Mathematical Problems in Engineering, Article ID 578713, 6 pages http://dx.doi.org/10.1155/2014/578713 Research Article A Comparison of Online Social Networks and Real-Life Social Networks: A Study of
More informationIdentification of Influencers - Measuring Influence in Customer Networks
Submitted to Decision Support Systems manuscript DSS Identification of Influencers - Measuring Influence in Customer Networks Christine Kiss, Martin Bichler Internet-based Information Systems, Dept. of
More informationA comparative study of social network analysis tools
Membre de Membre de A comparative study of social network analysis tools David Combe, Christine Largeron, Előd Egyed-Zsigmond and Mathias Géry International Workshop on Web Intelligence and Virtual Enterprises
More informationClustering Big Data. Anil K. Jain. (with Radha Chitta and Rong Jin) Department of Computer Science Michigan State University November 29, 2012
Clustering Big Data Anil K. Jain (with Radha Chitta and Rong Jin) Department of Computer Science Michigan State University November 29, 2012 Outline Big Data How to extract information? Data clustering
More information1 Six Degrees of Separation
Networks: Spring 2007 The Small-World Phenomenon David Easley and Jon Kleinberg April 23, 2007 1 Six Degrees of Separation The small-world phenomenon the principle that we are all linked by short chains
More informationAlgorithms for representing network centrality, groups and density and clustered graph representation
COSIN IST 2001 33555 COevolution and Self-organization In dynamical Networks Algorithms for representing network centrality, groups and density and clustered graph representation Deliverable Number: D06
More informationSocial Media Mining. Network Measures
Klout Measures and Metrics 22 Why Do We Need Measures? Who are the central figures (influential individuals) in the network? What interaction patterns are common in friends? Who are the like-minded users
More informationDATA ANALYSIS IN PUBLIC SOCIAL NETWORKS
International Scientific Conference & International Workshop Present Day Trends of Innovations 2012 28 th 29 th May 2012 Łomża, Poland DATA ANALYSIS IN PUBLIC SOCIAL NETWORKS Lubos Takac 1 Michal Zabovsky
More informationTopic-Sensitive Influencer Mining in Interest-Based Social Media Networks via Hypergraph Learning
796 IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 16, NO. 3, APRIL 2014 Topic-Sensitive Influencer Mining in Interest-Based Social Media Networks via Hypergraph Learning Quan Fang, Jitao Sang, Changsheng Xu, Fellow,
More informationIntinno: A Web Integrated Digital Library and Learning Content Management System
Intinno: A Web Integrated Digital Library and Learning Content Management System Synopsis of the Thesis to be submitted in Partial Fulfillment of the Requirements for the Award of the Degree of Master
More informationTemporal Dynamics of Scale-Free Networks
Temporal Dynamics of Scale-Free Networks Erez Shmueli, Yaniv Altshuler, and Alex Sandy Pentland MIT Media Lab {shmueli,yanival,sandy}@media.mit.edu Abstract. Many social, biological, and technological
More informationIJCSES Vol.7 No.4 October 2013 pp.165-168 Serials Publications BEHAVIOR PERDITION VIA MINING SOCIAL DIMENSIONS
IJCSES Vol.7 No.4 October 2013 pp.165-168 Serials Publications BEHAVIOR PERDITION VIA MINING SOCIAL DIMENSIONS V.Sudhakar 1 and G. Draksha 2 Abstract:- Collective behavior refers to the behaviors of individuals
More informationNAVIGATING SCIENTIFIC LITERATURE A HOLISTIC PERSPECTIVE. Venu Govindaraju
NAVIGATING SCIENTIFIC LITERATURE A HOLISTIC PERSPECTIVE Venu Govindaraju BIOMETRICS DOCUMENT ANALYSIS PATTERN RECOGNITION 8/24/2015 ICDAR- 2015 2 Towards a Globally Optimal Approach for Learning Deep Unsupervised
More informationDegrees of Separation in Social Networks
Proceedings, The Fourth International Symposium on Combinatorial Search (SoCS-2011) Degrees of Separation in Social Networks Reza Bakhshandeh Shiraz University Shiraz, Iran bakhshandeh@cse.shirazu.ac.ir
More informationSemantic Search in E-Discovery. David Graus & Zhaochun Ren
Semantic Search in E-Discovery David Graus & Zhaochun Ren This talk Introduction David Graus! Understanding e-mail traffic David Graus! Topic discovery & tracking in social media Zhaochun Ren 2 Intro Semantic
More informationCS 2750 Machine Learning. Lecture 1. Machine Learning. http://www.cs.pitt.edu/~milos/courses/cs2750/ CS 2750 Machine Learning.
Lecture Machine Learning Milos Hauskrecht milos@cs.pitt.edu 539 Sennott Square, x5 http://www.cs.pitt.edu/~milos/courses/cs75/ Administration Instructor: Milos Hauskrecht milos@cs.pitt.edu 539 Sennott
More informationGraph Algorithms and Graph Databases. Dr. Daisy Zhe Wang CISE Department University of Florida August 27th 2014
Graph Algorithms and Graph Databases Dr. Daisy Zhe Wang CISE Department University of Florida August 27th 2014 1 Google Knowledge Graph -- Entities and Relationships 2 Graph Data! Facebook Social Network
More informationAn Introduction to Data Mining. Big Data World. Related Fields and Disciplines. What is Data Mining? 2/12/2015
An Introduction to Data Mining for Wind Power Management Spring 2015 Big Data World Every minute: Google receives over 4 million search queries Facebook users share almost 2.5 million pieces of content
More informationWeb Archiving and Scholarly Use of Web Archives
Web Archiving and Scholarly Use of Web Archives Helen Hockx-Yu Head of Web Archiving British Library 15 April 2013 Overview 1. Introduction 2. Access and usage: UK Web Archive 3. Scholarly feedback on
More informationThe ebay Graph: How Do Online Auction Users Interact?
The ebay Graph: How Do Online Auction Users Interact? Yordanos Beyene, Michalis Faloutsos University of California, Riverside {yordanos, michalis}@cs.ucr.edu Duen Horng (Polo) Chau, Christos Faloutsos
More informationMeasurement and Analysis of Online Social Networks
Measurement and Analysis of Online Social Networks Alan Mislove Massimiliano Marcon Krishna P. Gummadi Peter Druschel Bobby Bhattacharjee Max Planck Institute for Software Systems Rice University University
More informationLocation Analytics for Financial Services. An Esri White Paper October 2013
Location Analytics for Financial Services An Esri White Paper October 2013 Copyright 2013 Esri All rights reserved. Printed in the United States of America. The information contained in this document is
More informationSemantic Search in Portals using Ontologies
Semantic Search in Portals using Ontologies Wallace Anacleto Pinheiro Ana Maria de C. Moura Military Institute of Engineering - IME/RJ Department of Computer Engineering - Rio de Janeiro - Brazil [awallace,anamoura]@de9.ime.eb.br
More informationDistance Degree Sequences for Network Analysis
Universität Konstanz Computer & Information Science Algorithmics Group 15 Mar 2005 based on Palmer, Gibbons, and Faloutsos: ANF A Fast and Scalable Tool for Data Mining in Massive Graphs, SIGKDD 02. Motivation
More informationWeb Mining. Margherita Berardi LACAM. Dipartimento di Informatica Università degli Studi di Bari berardi@di.uniba.it
Web Mining Margherita Berardi LACAM Dipartimento di Informatica Università degli Studi di Bari berardi@di.uniba.it Bari, 24 Aprile 2003 Overview Introduction Knowledge discovery from text (Web Content
More informationOct 15, 2004 www.dcs.bbk.ac.uk/~gmagoulas/teaching.html 3. Internet : the vast collection of interconnected networks that all use the TCP/IP protocols
E-Commerce Infrastructure II: the World Wide Web The Internet and the World Wide Web are two separate but related things Oct 15, 2004 www.dcs.bbk.ac.uk/~gmagoulas/teaching.html 1 Outline The Internet and
More informationDistributed Database for Environmental Data Integration
Distributed Database for Environmental Data Integration A. Amato', V. Di Lecce2, and V. Piuri 3 II Engineering Faculty of Politecnico di Bari - Italy 2 DIASS, Politecnico di Bari, Italy 3Dept Information
More informationExploring Different Aspects of Social Network Analysis Using Web Mining Techniques
Exploring Different Aspects of Social Network Analysis Using Web Mining Techniques 1. Hilal Ahmad Khanday, Dr. Rana Hashmy 2 1, 2 Department of Computer Sciences, University of Kashmir Abstract A social
More informationNetwork Big Data: Facing and Tackling the Complexities Xiaolong Jin
Network Big Data: Facing and Tackling the Complexities Xiaolong Jin CAS Key Laboratory of Network Data Science & Technology Institute of Computing Technology Chinese Academy of Sciences (CAS) 2015-08-10
More informationCloud Computing and the Future of Internet Services. Wei-Ying Ma Principal Researcher, Research Area Manager Microsoft Research Asia
Cloud Computing and the Future of Internet Services Wei-Ying Ma Principal Researcher, Research Area Manager Microsoft Research Asia Computing as Utility Grid Computing Web Services in the Cloud What is
More informationKeywords Big Data; OODBMS; RDBMS; hadoop; EDM; learning analytics, data abundance.
Volume 4, Issue 11, November 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Analytics
More informationStatistical Models in Data Mining
Statistical Models in Data Mining Sargur N. Srihari University at Buffalo The State University of New York Department of Computer Science and Engineering Department of Biostatistics 1 Srihari Flood of
More informationStudying Recommendation Algorithms by Graph Analysis
Studying Recommendation Algorithms by Graph Analysis Batul J. Mirza Department of Computer Science Virginia Tech Blacksburg, VA 24061 bmirza@csgrad.cs.vt.edu Naren Ramakrishnan Department of Computer Science
More informationNetwork Analysis Basics and applications to online data
Network Analysis Basics and applications to online data Katherine Ognyanova University of Southern California Prepared for the Annenberg Program for Online Communities, 2010. Relational data Node (actor,
More informationarxiv:cs.dm/0204001 v1 30 Mar 2002
A Steady State Model for Graph Power Laws David Eppstein Joseph Wang arxiv:cs.dm/0000 v 0 Mar 00 Abstract Power law distribution seems to be an important characteristic of web graphs. Several existing
More informationSome Economics of Cultural PSI: the Micro Perspective
Some Economics of Cultural PSI: the Micro Perspective Massimiliano Nuccio Research Affiliate ASK Bocconi Research Centre Bocconi University Milan - 10 October 2014 1 Agenda Which new sources of data can
More informationContent Delivery Networks. Shaxun Chen April 21, 2009
Content Delivery Networks Shaxun Chen April 21, 2009 Outline Introduction to CDN An Industry Example: Akamai A Research Example: CDN over Mobile Networks Conclusion Outline Introduction to CDN An Industry
More informationA Performance Evaluation of Open Source Graph Databases. Robert McColl David Ediger Jason Poovey Dan Campbell David A. Bader
A Performance Evaluation of Open Source Graph Databases Robert McColl David Ediger Jason Poovey Dan Campbell David A. Bader Overview Motivation Options Evaluation Results Lessons Learned Moving Forward
More informationWeb Graph Analyzer Tool
Web Graph Analyzer Tool Konstantin Avrachenkov INRIA Sophia Antipolis 2004, route des Lucioles, B.P.93 06902, France Email: K.Avrachenkov@sophia.inria.fr Danil Nemirovsky St.Petersburg State University
More informationThe average distances in random graphs with given expected degrees
Classification: Physical Sciences, Mathematics The average distances in random graphs with given expected degrees by Fan Chung 1 and Linyuan Lu Department of Mathematics University of California at San
More informationSearch Taxonomy. Web Search. Search Engine Optimization. Information Retrieval
Information Retrieval INFO 4300 / CS 4300! Retrieval models Older models» Boolean retrieval» Vector Space model Probabilistic Models» BM25» Language models Web search» Learning to Rank Search Taxonomy!
More information