CS 1944: Sophomore Seminar Big Data and Machine Learning

Size: px
Start display at page:

Download "CS 1944: Sophomore Seminar Big Data and Machine Learning"

Transcription

1 CS 1944: Sophomore Seminar Big Data and Machine Learning B. Aditya Prakash Assistant Professor Nov 3, 2015

2 About me Assistant Professor, CS Member, Discovery Analy>cs Center Previously Ph.D. in Computer Science, Carnegie Mellon University B.Tech in Computer Science and Engg, Indian Ins>tute of Technology (IIT) Bombay Internships at Sprint, Yahoo, MicrosoN Research Prakash

3 Prakash

4 Data contains value and knowledge Prakash

5 Data and Business Data1and1business* 4* Recommended'linksA +79%''clicksA Personalized'' News'InterestsA +250%'clicksA Top'SearchesA +43%'clicksA vs.1randomly1selected* vs.1editorial1onedsizedfitsdall* vs.1editor1selected* Prakash 2015 Source: A. Machhanavajjhala 5

6 Data and Science Data1and1science* 5* Red:1official1numbers1from1Center1for1Disease1Control1and1Prevention;1weekly11 Black:1based1on1Google1search1logs;1daily1(potentially1instantaneously)* Detecting'influenza'epidemics'using'search' engine'query'data1 nature07634.html Prakash

7 Data and Government Data1and1government* 6* president/2013/06/14/1d71fe2e-d391-11e2- b05f-3ea3f0e7bb5a_story.html business/economy/democratspush-to-redeploy-obamas-voterdatabase/2012/11/20/ d14793a4-2e83-11e2-89d4-040c a_story.html Democratizing-Data edward-snowden-nsa-files-timeline Prakash 2015 Source: A. Machhanavajjhala 7

8 Data and Culture Data1and1culture* 7* Word1frequencies1in1 EnglishDlanguage1 books1in1google s1 database /03/20/what-are-you-inthe-mood-for-emotionaltrends-in-20th-century-books/ Prakash 2015 Source: A. Machhanavajjhala 8

9 8* Data1and1 1 your1favorite1subject* Sports* Journalism* Prakash

10 Good news: Demand for Data Mining Prakash

11 How to extract value from data? Manipulate Data CS, Domain exper>se Analyze Data Math, CS, Stat Communicate your results CS, Domain Exper>se Prakash

12 13* CommunicaEon is important! Communicating1results* The"British"government"spends"" 13"billion"a"year"on"universities. F So?* Try1instead1 bubbletree-map.html#/~/total/education/university On"average,"1"in"every"15"Europeans"" is"totally"illiterate. F True* But1about111in1every1141is1under171years1old!* Prakash

13 What is Data Mining? Given lots of data Discover paherns and models that are: Valid: hold on new data with some certainty Useful: should be possible to act on the item Unexpected: non-obvious to the system Understandable: humans should be able to interpret the pa]ern Prakash

14 Data Mining Tasks DescripEve methods Find human-interpretable pa]erns that describe the data Example: Clustering PredicEve methods Use some variables to predict unknown or future values of other variables Example: Recommender systems Prakash

15 Comp. Systems Theory & Algo. Biology Physics Social Science ML & Stats. Big data Econ. 15 Prakash 2015

16 Data at CS, VT Knowledge, Informa>on and Data h]p:// kid People: Fox, Harrison, Huang, Lu (in NVA), Ramakrishnan (in NVA), Rozovskaya, Prakash Prakash

17 Courses Background in some areas: CS3414 (Numerical Methods); also prob/stat 4000 level 4244 Internet SoNware Development 4604 Database Management Systems 4624 Capstone (Mul>media, Informa>on Access) 4634 Design of Informa>on (Capstone) 4804 AI 4984 Computa>onal Linguis>cs (Capstone) Prakash

18 Discovery AnalyEcs Center Prakash

19 MY RESEARCH Prakash

20 Networks are everywhere! Facebook Network [2010] Human Disease Network [Barabasi 2007] Gene Regulatory Network [Decourty 2008] The Internet [2005] Prakash

21 What else do they have in common? Prakash

22 High School DaEng Network Bearman et. al. Am. Jnl. of Sociology, Image: Mark Newman Blue: Male Pink: Female Interes>ng observa>ons? Prakash

23 The Internet Skewed Degrees Robustness Prakash

24 Karate Club Network Prakash

25 Dynamical Processes over networks are also everywhere! Prakash

26 Why do we care? Social collabora>on Informa>on Diffusion Viral Marke>ng Epidemiology and Public Health Cyber Security Human mobility Games and Virtual Worlds Ecology Prakash 2015

27 Why do we care? (1: Epidemiology) Dynamical Processes over networks [AJPH 2007] Diseases over contact networks Prakash 2015 CDC data: Visualiza>on of the first 35 tuberculosis (TB) pa>ents and their 1039 contacts 27

28 Why do we care? (1: Epidemiology) Dynamical Processes over networks Each circle is a hospital ~3000 hospitals More than 30,000 pa>ents transferred [US-MEDICARE NETWORK 2005] Prakash 2015 Problem: Given k units of disinfectant, whom to immunize? 28

29 Prakash Hospital-acquired inf. took 99K+ lives, cost $5B+ (all per year) Why do we care? (1: Epidemiology) ~6x fewer! [US-MEDICARE NETWORK 2005] CURRENT PRACTICE OUR METHOD

30 Why do we care? (2: Online Diffusion) > 800m users, ~$1B revenue [WSJ 2010] ~100m achve users > 50m users Prakash

31 Why do we care? (2: Online Diffusion) Dynamical Processes over networks Buy Versace! Followers Celebrity Prakash 2015 Social Media Marke>ng 31

32 Social ßà Biological Contagion AutomaEcally learn models Prakash

33 Why do we care? (3: To change the world?) Dynamical Processes over networks Social networks and CollaboraHve AcHon Prakash

34 High Impact MulEple Seangs epidemic out-breaks Q. How to squash rumors faster? products/viruses Q. How do opinions spread? transmit s/w patches Q. How to market beser? Prakash

35 Dynamical Processes = (a lot of) Networks + (some) Time-Series Prakash

36 Research Theme ANALYSIS Understanding DATA Large real-world POLICY/ ACTION Managing networks & processes Prakash

37 Research Theme Public Health ANALYSIS Will an epidemic happen? DATA Modeling # pa>ent transfers Prakash 2015 POLICY/ ACTION How to control out-breaks? 37

38 Research Theme Social Media ANALYSIS # cascades in future? DATA Modeling Tweets spreading Prakash 2015 POLICY/ ACTION How to market be]er? 38

39 A QuesEon How many of you think your friends have more friends than you? J A recent Facebook study Examined all of FB s users: 721 million people with 69 billion friendships. about 10 percent of the world s popula>on! Found that user s friend count was less than the average friend count of his or her friends, 93 percent of the >me. Users had an average of 190 friends, while their friends averaged 635 friends of their own. Prakash

40 Possible Reasons? You are a loner? Your friends are extroverts? There are more extroverts than introverts in the world? Prakash

41 Example Average number of friends? Source: S. Strogatz, NYT 2012 Prakash

42 Example Average number of friends = ( ) / 4 = 2 Source: S. Strogatz, NYT 2012 Prakash

43 Example Average number of friends = ( ) / 4 = 2 Average number of friends of friends Source: S. Strogatz, NYT 2012 Prakash

44 Example Average number of friends = ( ) / 4 = 2 Average number of friends of friends = ( )/8 = ((1x1) + (3x3) + (2x2) + (2x2))/8 Source: S. Strogatz, NYT 2012 Prakash

45 Example Average number of friends = ( ) / 4 = 2 Average number of friends of friends = ( )/8 = ((1x1) + (3x3) + (2x2) + (2x2))/8 = 2.25! Source: S. Strogatz, NYT 2012 Prakash

46 Actually it is (almost) always true! Proof? Prakash

47 Actually it is (almost) always true! Proof? E[X] = x i / N Prakash

48 Actually it is (almost) always true! Proof? E[X] = x i / N Var[X] = E[(X E[X]) 2 ] = E[X 2 ] E[X] 2 Prakash

49 Actually it is (almost) always true! Proof? E[X] = x i / N Var[X] = E[(X E[X]) 2 ] = E[X 2 ] E[X] 2 E[X 2 ] E[X] = E[X]+ Var[X] E[X] Prakash

50 Actually it is (almost) always true! Proof? EssenEally, it is true if there is any spread in # of friends (nonzero variance)! E[X] = x i / N Var[X] = E[(X E[X]) 2 ] = E[X 2 ] E[X] 2 E[X 2 ] E[X] = E[X]+ Var[X] E[X] Prakash

51 Immuniza>on ImplicaEons Figure 1. Network Illustrating Structural Parameters. This real network of 105 students shows variation in structural attributes and topological position. Each circle represents a person and each line represents a friendship tie. Nodes A and B have different degree, a measure that indicates the number of ties. Nodes with higher degree also tend to exhibit higher centrality (node A with six friends is more central than B and C who both only have four friends). If contagions infect people at random at the beginning of an epidemic, central individuals are likely to be infected sooner because they lie a shorter number of steps (on average) from all other individuals in the network. Finally, although nodes B and C have the same degree, they differ in transitivity (the probability that any two of one s friends are friends with each other). Node B exhibits high transitivity with many friends that know one another. In contrast, node C s friends are not connected to one another and therefore they offer more independent possibilities acquaintance immuniza>on for becoming infected earlier in the epidemic. doi: /journal.pone g001 Immunize friend-of-friend Early warning of outbreaks Again, monitor friends of friends the variance of the degree distribution divided by m. Hence, when there is variance in degree in a population, and especially when there is high variance, the mean number of contacts for the friends will be greater (and potentially much greater) than the mean for the random sample. This is sometimes known as the friendship paradox ( your friends have more friends than you do ) [15 19]. While the idea of immunizing such friends of randomly chosen people has previously been explored in a stimulating theoretical paper [12], to our knowledge, a method that uses nominated friends as sensors for early detection of an outbreak has not previously been proposed, nor has it been tested on any sort of real outbreak. To evaluate the effectiveness of nominated friends as social network sensors, we therefore monitored the spread of flu at Harvard College from September 1 to December 31, In the fall of 2009, both seasonal flu (which typically kills 41,000 Americans each year [20]) and the H1N1 strain were prevalent in the US, though the great majority of cases in 2009 have been attributed to the latter.[1] It is estimated that this H1N1 epidemic, which began roughly in April 2009, infected over 50 million Americans. Unlike seasonal flu, which typically affects individuals older than 65, H1N1 tends to affect young people. Nationally, according to the CDC, the epidemic peaked in late October 2009, and vaccination only became widely available in December Whether another outbreak of H1N1 will occur (for example, in areas and populations that have heretofore been spared) is a Figure 2. Theoretical expectations of differences in contagion between central individuals and the population as a whole. A contagious process passes through two phases, one in which the number of infected individuals exponentially increases as the contagion spreads, and one in which incidence exponentially decreases as susceptible individuals become increasingly scarce. These dynamics can be modeled by a logistic function. Central individuals lie on more paths in a network compared to the average person in a population and are therefore more likely to be infected early by a contagion that randomly infects some individuals and then spreads from person to person within the network. This shifts the S- shaped logistic cumulative incidence function forward in time for central individuals compared to the population as a whole (left panel). It also shifts the peak infection rate forward (right panel). doi: /journal.pone g002 Prakash

52 Thanks---QuesEons? B. Aditya Prakash 3160 F Torgersen Hall badityap@cs.vt.edu See my homepage for more details and papers: h]p:// Prakash

COS 116 The Computational Universe Laboratory 9: Virus and Worm Propagation in Networks

COS 116 The Computational Universe Laboratory 9: Virus and Worm Propagation in Networks COS 116 The Computational Universe Laboratory 9: Virus and Worm Propagation in Networks You learned in lecture about computer viruses and worms. In this lab you will study virus propagation at the quantitative

More information

Pandemic Risk Assessment

Pandemic Risk Assessment Research Note Pandemic Risk Assessment By: Katherine Hagan Copyright 2013, ASA Institute for Risk & Innovation Keywords: pandemic, Influenza A, novel virus, emergency response, monitoring, risk mitigation

More information

ECDC SURVEILLANCE REPORT

ECDC SURVEILLANCE REPORT ECDC SURVEILLANCE REPORT Pandemic (H1N1) 2009 Weekly report: Individual case reports EU/EEA countries 31 July 2009 Summary The pandemic A(H1N1) 2009 is still spreading despite the fact that the regular

More information

Follow links Class Use and other Permissions. For more information, send email to: permissions@pupress.princeton.edu

Follow links Class Use and other Permissions. For more information, send email to: permissions@pupress.princeton.edu COPYRIGHT NOTICE: Mark Newman, Albert-László Barabási, and Duncan J. Watts: The Structure and Dynamics of Networks is published by Princeton University Press and copyrighted, 2006, by Princeton University

More information

Big Data and Complex Networks Analytics. Timos Sellis, CSIT Kathy Horadam, MGS

Big Data and Complex Networks Analytics. Timos Sellis, CSIT Kathy Horadam, MGS Big Data and Complex Networks Analytics Timos Sellis, CSIT Kathy Horadam, MGS Big Data What is it? Most commonly accepted definition, by Gartner (the 3 Vs) Big data is high-volume, high-velocity and high-variety

More information

Introduction to infectious disease epidemiology

Introduction to infectious disease epidemiology Introduction to infectious disease epidemiology Mads Kamper-Jørgensen Associate professor, University of Copenhagen, maka@sund.ku.dk Public health science 24 September 2013 Slide number 1 Practicals Elective

More information

What is Big Data? The three(or four) Vs in Big Data In 2013 the total amount of stored information is estimated to be Volume.

What is Big Data? The three(or four) Vs in Big Data In 2013 the total amount of stored information is estimated to be Volume. 8/26/2014 CS581 Big Data - Fall 2014 1 8/26/2014 CS581 Big Data - Fall 2014 2 CS535/CS581A BIG DATA What is Big Data? PART 0. INTRODUCTION 1. INTRODUCTION TO BIG DATA 2. COURSE INTRODUCTION PART 0. INTRODUCTION

More information

Exploring Big Data in Social Networks

Exploring Big Data in Social Networks Exploring Big Data in Social Networks virgilio@dcc.ufmg.br (meira@dcc.ufmg.br) INWEB National Science and Technology Institute for Web Federal University of Minas Gerais - UFMG May 2013 Some thoughts about

More information

ISSH 2011 ABM Track - Hands-on Exercise

ISSH 2011 ABM Track - Hands-on Exercise ISSH 2011 ABM Track - Hands-on Exercise Authors: Shawn Brown, University of Pittsburgh, stbrown@psc.edu John Grefenstette, University of Pittsburgh, gref@pitt.edu Nathan Stone, Pittsburgh Supercomputing

More information

Math 425 (Fall 08) Solutions Midterm 2 November 6, 2008

Math 425 (Fall 08) Solutions Midterm 2 November 6, 2008 Math 425 (Fall 8) Solutions Midterm 2 November 6, 28 (5 pts) Compute E[X] and Var[X] for i) X a random variable that takes the values, 2, 3 with probabilities.2,.5,.3; ii) X a random variable with the

More information

IC05 Introduction on Networks &Visualization Nov. 2009. <mathieu.bastian@gmail.com>

IC05 Introduction on Networks &Visualization Nov. 2009. <mathieu.bastian@gmail.com> IC05 Introduction on Networks &Visualization Nov. 2009 Overview 1. Networks Introduction Networks across disciplines Properties Models 2. Visualization InfoVis Data exploration

More information

Making Sense of Big Data. Dr. Thomas E. Potok Computa2onal Data Analy2cs Group Leader Oak Ridge Na2onal Laboratory potokte@ornl.

Making Sense of Big Data. Dr. Thomas E. Potok Computa2onal Data Analy2cs Group Leader Oak Ridge Na2onal Laboratory potokte@ornl. Making Sense of Big Data Dr. Thomas E. Potok Computa2onal Data Analy2cs Group Leader Oak Ridge Na2onal Laboratory potokte@ornl.gov 865-574- 0834 ORNL s Big Data Legacy Science National Security Energy

More information

Network Analytics in Marketing

Network Analytics in Marketing Network Analytics in Marketing Prof. Dr. Daning Hu Department of Informatics University of Zurich Nov 13th, 2014 Introduction: Network Analytics in Marketing Marketing channels and business networks have

More information

Planning for Pandemic Flu. pandemic flu table-top exercise

Planning for Pandemic Flu. pandemic flu table-top exercise Planning for Pandemic Flu Lawrence Dickson - University of Edinburgh Background Previous phase of health and safety management audit programme raised topic of Business Continuity Management Limited ability

More information

Overview. Why this policy? Influenza. Vaccine or mask policies. Other approaches Conclusion. epidemiology transmission vaccine

Overview. Why this policy? Influenza. Vaccine or mask policies. Other approaches Conclusion. epidemiology transmission vaccine Overview Why this policy? Influenza epidemiology transmission vaccine Vaccine or mask policies development and implementation Other approaches Conclusion Influenza or mask policy Receive the influenza

More information

Congrats to Game Winners. How can computation use data to solve problems? What topics have we covered in CS 202? Part 1: Completed!

Congrats to Game Winners. How can computation use data to solve problems? What topics have we covered in CS 202? Part 1: Completed! CS 202: Introduction to Computation " UNIVERSITY of WISCONSIN-MADISON Computer Sciences Department Professor Andrea Arpaci-Dusseau How can computation use data to solve problems? Congrats to Game Winners

More information

Key Facts about Influenza (Flu) & Flu Vaccine

Key Facts about Influenza (Flu) & Flu Vaccine Key Facts about Influenza (Flu) & Flu Vaccine mouths or noses of people who are nearby. Less often, a person might also get flu by touching a surface or object that has flu virus on it and then touching

More information

What s Up at CISE. Jeannette M. Wing

What s Up at CISE. Jeannette M. Wing What s Up at CISE Assistant Director Computer and Information Science and Engineering Directorate and President s Professor of Computer Science Carnegie Mellon University BMSA 2 November 2007 Outline Looking

More information

EPSRC Cross-SAT Big Data Workshop: Well Sorted Materials

EPSRC Cross-SAT Big Data Workshop: Well Sorted Materials EPSRC Cross-SAT Big Data Workshop: Well Sorted Materials 5th August 2015 Contents Introduction 1 Dendrogram 2 Tree Map 3 Heat Map 4 Raw Group Data 5 For an online, interactive version of the visualisations

More information

Data Science at the University of Virginia

Data Science at the University of Virginia 1/ 16 Data Science at the University of Virginia Donald E. Brown, Director Data Science Institute University of Virginia Charlottesville, VA USA brown@virginia.edu Data Science Institute 1 3/ 16 Society

More information

College of Public Health Course Bulletin. Credit Hours

College of Public Health Course Bulletin. Credit Hours Course abbreviation and number ex. PUBHLTH 3550) Course description Credit Hours Prerequisites or Exclusions Public Health (PUBHLTH) PUBHLTH 1100 Survey of Public Health 1 none PUBHLTH 2010 Introduction

More information

June Zhang (Zhong-Ju Zhang)

June Zhang (Zhong-Ju Zhang) (Zhong-Ju Zhang) Carnegie Mellon University Dept. Electrical and Computer Engineering, 5000 Forbes Ave. Pittsburgh, PA 15213 Phone: 678-899-2492 E-Mail: junez@andrew.cmu.edu http://users.ece.cmu.edu/~junez

More information

Plumas County Public Health Agency. Preparing the Community for Public Health Emergencies

Plumas County Public Health Agency. Preparing the Community for Public Health Emergencies Plumas County Public Health Agency Preparing the Community for Public Health Emergencies Safeguarding Your Investment Local businesses have invested significant time and resources into being successful.

More information

Network Maps for End Users: Collect, Analyze, Visualize and Communicate Network Insights with Zero Coding

Network Maps for End Users: Collect, Analyze, Visualize and Communicate Network Insights with Zero Coding Network Maps for End Users: Collect, Analyze, Visualize and Communicate Network Insights with Zero Coding A project from the Social Media Research Founda8on: h:p://www.smrfounda8on.org About Me Introduc8ons

More information

Graph theoretic approach to analyze amino acid network

Graph theoretic approach to analyze amino acid network Int. J. Adv. Appl. Math. and Mech. 2(3) (2015) 31-37 (ISSN: 2347-2529) Journal homepage: www.ijaamm.com International Journal of Advances in Applied Mathematics and Mechanics Graph theoretic approach to

More information

Introduction to Networks and Business Intelligence

Introduction to Networks and Business Intelligence Introduction to Networks and Business Intelligence Prof. Dr. Daning Hu Department of Informatics University of Zurich Sep 17th, 2015 Outline Network Science A Random History Network Analysis Network Topological

More information

The Imperative of Big Data in Public Health Transformation

The Imperative of Big Data in Public Health Transformation The Imperative of Big Data in Public Health Transformation Charles Safran, MD Chief, Division of Clinical Informatics, BIDMC Associate Professor of Medicine, Harvard Medical School 1 2 Volume Velocity

More information

Using Real Data in an SIR Model

Using Real Data in an SIR Model Using Real Data in an SIR Model D. Sulsky June 21, 2012 In most epidemics it is difficult to determine how many new infectives there are each day since only those that are removed, for medical aid or other

More information

Principles of Disease and Epidemiology. Copyright 2010 Pearson Education, Inc.

Principles of Disease and Epidemiology. Copyright 2010 Pearson Education, Inc. Principles of Disease and Epidemiology Pathology, Infection, and Disease Disease: An abnormal state in which the body is not functioning normally Pathology: The study of disease Etiology: The study of

More information

Summary of infectious disease epidemiology course

Summary of infectious disease epidemiology course Summary of infectious disease epidemiology course Mads Kamper-Jørgensen Associate professor, University of Copenhagen, maka@sund.ku.dk Public health science 3 December 2013 Slide number 1 Aim Possess knowledge

More information

Six Degrees: The Science of a Connected Age. Duncan Watts Columbia University

Six Degrees: The Science of a Connected Age. Duncan Watts Columbia University Six Degrees: The Science of a Connected Age Duncan Watts Columbia University Outline The Small-World Problem What is a Science of Networks? Why does it matter? Six Degrees Six degrees of separation between

More information

Influenza Surveillance Weekly Report CDC MMWR Week 16: Apr 17 to 23, 2016

Influenza Surveillance Weekly Report CDC MMWR Week 16: Apr 17 to 23, 2016 Office of Surveillance & Public Health Preparedness Program of Public Health Informatics Influenza Surveillance Weekly Report CDC MMWR Week 16: Apr 17 to 23, 2016 Influenza Activity by County, State, and

More information

Search for the optimal strategy to spread a viral video: An agent-based model optimized with genetic algorithms

Search for the optimal strategy to spread a viral video: An agent-based model optimized with genetic algorithms Search for the optimal strategy to spread a viral video: An agent-based model optimized with genetic algorithms Michal Kvasnička 1 Abstract. Agent-based computational papers on viral marketing have been

More information

CAS CS 565, Data Mining

CAS CS 565, Data Mining CAS CS 565, Data Mining Course logistics Course webpage: http://www.cs.bu.edu/~evimaria/cs565-10.html Schedule: Mon Wed, 4-5:30 Instructor: Evimaria Terzi, evimaria@cs.bu.edu Office hours: Mon 2:30-4pm,

More information

B490 Mining the Big Data. 0 Introduction

B490 Mining the Big Data. 0 Introduction B490 Mining the Big Data 0 Introduction Qin Zhang 1-1 Data Mining What is Data Mining? A definition : Discovery of useful, possibly unexpected, patterns in data. 2-1 Data Mining What is Data Mining? A

More information

Ontario Pandemic Influenza Plan for Continuity of Electricity Operations

Ontario Pandemic Influenza Plan for Continuity of Electricity Operations Planning Guideline GDE-162 Ontario Pandemic Influenza Plan for Continuity of Electricity Operations Planning Guideline Issue 4.0 October 13, 2015 Emergency Preparedness Task Force This planning guide provides

More information

Outline. What is Big data and where they come from? How we deal with Big data?

Outline. What is Big data and where they come from? How we deal with Big data? What is Big Data Outline What is Big data and where they come from? How we deal with Big data? Big Data Everywhere! As a human, we generate a lot of data during our everyday activity. When you buy something,

More information

PREPARING FOR A PANDEMIC. Lessons from the Past Plans for the Present and Future

PREPARING FOR A PANDEMIC. Lessons from the Past Plans for the Present and Future PREPARING FOR A PANDEMIC Lessons from the Past Plans for the Present and Future Pandemics Are Inevitable TM And their impact can be devastating 1918 Spanish Flu 20-100 million deaths worldwide 600,000

More information

Big Data Analytics of Multi-Relationship Online Social Network Based on Multi-Subnet Composited Complex Network

Big Data Analytics of Multi-Relationship Online Social Network Based on Multi-Subnet Composited Complex Network , pp.273-284 http://dx.doi.org/10.14257/ijdta.2015.8.5.24 Big Data Analytics of Multi-Relationship Online Social Network Based on Multi-Subnet Composited Complex Network Gengxin Sun 1, Sheng Bin 2 and

More information

Summary of infectious disease epidemiology course

Summary of infectious disease epidemiology course Summary of infectious disease epidemiology course Mads Kamper-Jørgensen Associate professor, University of Copenhagen, maka@sund.ku.dk Public health science 3 December 2013 Slide number 1 Aim Possess knowledge

More information

VIRAL MARKETING. Teacher s Guide Getting Started. Benjamin Dickman Brookline, MA

VIRAL MARKETING. Teacher s Guide Getting Started. Benjamin Dickman Brookline, MA Teacher s Guide Getting Started Benjamin Dickman Brookline, MA Purpose In this two-day lesson, students will model viral marketing. Viral marketing refers to a marketing strategy in which people pass on

More information

Pu#ng together a bioinforma1cs team: 2014 compared with 1997

Pu#ng together a bioinforma1cs team: 2014 compared with 1997 Pu#ng together a bioinforma1cs team: 2014 compared with 1997 BIG DATA and Healthcare Analy3cs Melbourne, Thursday 3 rd April 2014 Terry Speed, Walter & Eliza Hall Ins3tute of Medical Research 1 Overview

More information

IOWA DEPARTMENT OF PUBLIC HEALTH

IOWA DEPARTMENT OF PUBLIC HEALTH Thomas J. Vilsack, Governor Sally J. Pederson, Lt. Governor IOWA DEPARTMENT OF PUBLIC HEALTH Mary Mincer Hansen, R.N., Ph.D., Director Patricia Quinlisk, M.D., State Medical Director Division of Acute

More information

PANDEMIC INFLUENZA RESPONSE PLAN OFFICE OF ENVIRONMENTAL HEALTH AND SAFETY

PANDEMIC INFLUENZA RESPONSE PLAN OFFICE OF ENVIRONMENTAL HEALTH AND SAFETY PANDEMIC INFLUENZA RESPONSE PLAN OFFICE OF ENVIRONMENTAL HEALTH AND SAFETY REVISED JANUARY 2010 TABLE OF CONTENTS Background 3 Purpose.3 Local Public Health Leadership 3 World Health Organization (WHO)

More information

WHY IS THIS IMPORTANT?

WHY IS THIS IMPORTANT? CHAPTER 1 WHAT IS MICROBIOLOGY AND WHY IS IT IMPORTANT? WHO / TDR / Crump WHY IS THIS IMPORTANT? Microbiology is more relevant than ever in today s world. Infectious diseases are a leading health-related

More information

Big Data and Health. Google Maps: Satellite View. Four Characteristics of Big Data. Personalized Medicine 2/5/2016. Health Big Data Ecosystem

Big Data and Health. Google Maps: Satellite View. Four Characteristics of Big Data. Personalized Medicine 2/5/2016. Health Big Data Ecosystem Big Data and Health Google Maps: Satellite View CLS&I Study Group January 28, 2016 Gary Marchant 3 Years Ago Today Four Characteristics of Big Data Cost efficiently processing the growing Volume 2010 50x

More information

Norovirus Outbreak Among Residents of an Assisted Living Facility, Houston County, Alabama 2010 (AL1003NRV 35a)

Norovirus Outbreak Among Residents of an Assisted Living Facility, Houston County, Alabama 2010 (AL1003NRV 35a) Norovirus Outbreak Among Residents of an Assisted Living Facility, Houston County, Alabama 2010 (AL1003NRV 35a) Introduction On March 9, 2010, Public Health Area (PHA) 10 surveillance nurse contacted the

More information

Social Network Analysis: Introduzione all'analisi di reti sociali

Social Network Analysis: Introduzione all'analisi di reti sociali Social Network Analysis: Introduzione all'analisi di reti sociali Michele Coscia Dipartimento di Informatica Università di Pisa www.di.unipi.it/~coscia Piano Lezioni Introduzione Misure + Modelli di Social

More information

Tim Hsu. Updated Fall 2012

Tim Hsu. Updated Fall 2012 Updated Fall 2012 You ve heard about teaching math... A bachelor s degree in math, plus additional training (single-subject credential) qualifies you to teach high school math. A master s degree (2 extra

More information

How To Use A Webmail On A Pc Or Macodeo.Com

How To Use A Webmail On A Pc Or Macodeo.Com Big data workloads and real-world data sets Gang Lu Institute of Computing Technology, Chinese Academy of Sciences BigDataBench Tutorial MICRO 2014 Cambridge, UK INSTITUTE OF COMPUTING TECHNOLOGY 1 Five

More information

Big Data. What is Big Data? Over the past years. Big Data. Big Data: Introduction and Applications

Big Data. What is Big Data? Over the past years. Big Data. Big Data: Introduction and Applications Big Data Big Data: Introduction and Applications August 20, 2015 HKU-HKJC ExCEL3 Seminar Michael Chau, Associate Professor School of Business, The University of Hong Kong Ample opportunities for business

More information

CHAPTER 6 SECURE PACKET TRANSMISSION IN WIRELESS SENSOR NETWORKS USING DYNAMIC ROUTING TECHNIQUES

CHAPTER 6 SECURE PACKET TRANSMISSION IN WIRELESS SENSOR NETWORKS USING DYNAMIC ROUTING TECHNIQUES CHAPTER 6 SECURE PACKET TRANSMISSION IN WIRELESS SENSOR NETWORKS USING DYNAMIC ROUTING TECHNIQUES 6.1 Introduction The process of dispersive routing provides the required distribution of packets rather

More information

Algorithmic Trading Session 6 Trade Signal Generation IV Momentum Strategies. Oliver Steinki, CFA, FRM

Algorithmic Trading Session 6 Trade Signal Generation IV Momentum Strategies. Oliver Steinki, CFA, FRM Algorithmic Trading Session 6 Trade Signal Generation IV Momentum Strategies Oliver Steinki, CFA, FRM Outline Introduction What is Momentum? Tests to Discover Momentum Interday Momentum Strategies Intraday

More information

Dynamics of information spread on networks. Kristina Lerman USC Information Sciences Institute

Dynamics of information spread on networks. Kristina Lerman USC Information Sciences Institute Dynamics of information spread on networks Kristina Lerman USC Information Sciences Institute Information spread in online social networks Diffusion of activation on a graph, where each infected (activated)

More information

Chapter 20: Analysis of Surveillance Data

Chapter 20: Analysis of Surveillance Data Analysis of Surveillance Data: Chapter 20-1 Chapter 20: Analysis of Surveillance Data Sandra W. Roush, MT, MPH I. Background Ongoing analysis of surveillance data is important for detecting outbreaks and

More information

Asking Questions About Public Health: The Role of Epidemiology. Sapp SENCER 8-3-13 1

Asking Questions About Public Health: The Role of Epidemiology. Sapp SENCER 8-3-13 1 Asking Questions About Public Health: The Role of Epidemiology Sapp SENCER 8-3-13 1 Why Teach Epidemiology? Public health perspective Scientific method (reasoning & research skills) Critical thinking &

More information

Nodes, Ties and Influence

Nodes, Ties and Influence Nodes, Ties and Influence Chapter 2 Chapter 2, Community Detec:on and Mining in Social Media. Lei Tang and Huan Liu, Morgan & Claypool, September, 2010. 1 IMPORTANCE OF NODES 2 Importance of Nodes Not

More information

Automated Discovery and Visualization of Communication Networks from Social Media

Automated Discovery and Visualization of Communication Networks from Social Media Automated Discovery and Visualization of Communication Networks from Social Media Anatoliy Gruzd @gruzd gruzd@dal.ca Associate Professor, School of Information Management Director, Social Media Lab Faculty

More information

Computing Load Aware and Long-View Load Balancing for Cluster Storage Systems

Computing Load Aware and Long-View Load Balancing for Cluster Storage Systems 215 IEEE International Conference on Big Data (Big Data) Computing Load Aware and Long-View Load Balancing for Cluster Storage Systems Guoxin Liu and Haiying Shen and Haoyu Wang Department of Electrical

More information

Take Steps to Control TB TUBERCULOSIS. When You HaveHIV

Take Steps to Control TB TUBERCULOSIS. When You HaveHIV Take Steps to Control TB TUBERCULOSIS When You HaveHIV What s Inside: People with HIV are are also at risk for getting TB. Read this brochure today to learn about TB and HIV. 4PAGE 5PAGE About TB infection

More information

Cost effective Outbreak Detection in Networks

Cost effective Outbreak Detection in Networks Cost effective Outbreak Detection in Networks Jure Leskovec Joint work with Andreas Krause, Carlos Guestrin, Christos Faloutsos, Jeanne VanBriesen, and Natalie Glance Diffusion in Social Networks One of

More information

Software Engineering for Big Data. CS846 Paulo Alencar David R. Cheriton School of Computer Science University of Waterloo

Software Engineering for Big Data. CS846 Paulo Alencar David R. Cheriton School of Computer Science University of Waterloo Software Engineering for Big Data CS846 Paulo Alencar David R. Cheriton School of Computer Science University of Waterloo Big Data Big data technologies describe a new generation of technologies that aim

More information

TRACKS INFECTIOUS DISEASE EPIDEMIOLOGY

TRACKS INFECTIOUS DISEASE EPIDEMIOLOGY Dr. Shruti Mehta, Director The development of antibiotics, improved access to safe food, clean water, sewage disposal and vaccines has led to dramatic progress in controlling infectious diseases. Despite

More information

The Reality Pertussis can be a serious illness, part icularly for babies and young children.

The Reality Pertussis can be a serious illness, part icularly for babies and young children. Sounds of Pertussis Pertussis, also known as whooping cough, is a poten tially deadly infection that can strike at any age, but is particularly dangerous for babies. The sounds of pertussis are like no

More information

Optimization and Inference for Cyber Security in Complex Engineered Networks

Optimization and Inference for Cyber Security in Complex Engineered Networks Optimization and Inference for Cyber Security in Complex Engineered Networks Chee Wei Tan City University of Hong Kong 28 August, 2014 2014 IEEE SPS Summer School on IoT and M2M National Taiwan University

More information

Online Social Networks and Network Economics. Aris Anagnostopoulos, Online Social Networks and Network Economics

Online Social Networks and Network Economics. Aris Anagnostopoulos, Online Social Networks and Network Economics Online Social Networks and Network Economics Who? Dr. Luca Becchetti Prof. Elias Koutsoupias Prof. Stefano Leonardi What will We Cover? Possible topics: Structure of social networks Models for social networks

More information

Social Influence Analysis in Social Networking Big Data: Opportunities and Challenges. Presenter: Sancheng Peng Zhaoqing University

Social Influence Analysis in Social Networking Big Data: Opportunities and Challenges. Presenter: Sancheng Peng Zhaoqing University Social Influence Analysis in Social Networking Big Data: Opportunities and Challenges Presenter: Sancheng Peng Zhaoqing University 1 2 3 4 35 46 7 Contents Introduction Relationship between SIA and BD

More information

Pandemic Influenza Vaccines: Lessons Learned from the H1N1 Influenza Pandemic

Pandemic Influenza Vaccines: Lessons Learned from the H1N1 Influenza Pandemic Pandemic Influenza Vaccines: Lessons Learned from the H1N1 Influenza Pandemic Nancy J. Cox, Ph.D. Director, Influenza Division Director WHO Collaborating Center for Influenza NCIRD, Centers for Disease

More information

Bill Minor Ventura Foods, LLC PLANNING FOR A PANDEMIC

Bill Minor Ventura Foods, LLC PLANNING FOR A PANDEMIC Bill Minor Ventura Foods, LLC PLANNING FOR A PANDEMIC Today s Topics What is a pandemic A tale of two pandemics Why plan for a pandemic Possible effects of a severe pandemic Developing a pandemic plan

More information

A Correlation of. to the. South Carolina Data Analysis and Probability Standards

A Correlation of. to the. South Carolina Data Analysis and Probability Standards A Correlation of to the South Carolina Data Analysis and Probability Standards INTRODUCTION This document demonstrates how Stats in Your World 2012 meets the indicators of the South Carolina Academic Standards

More information

Immunology Ambassador Guide (updated 2014)

Immunology Ambassador Guide (updated 2014) Immunology Ambassador Guide (updated 2014) Immunity and Disease We will talk today about the immune system and how it protects us from disease. Also, we ll learn some unique ways that our immune system

More information

Tutorial: Big Data Algorithms and Applications Under Hadoop KUNPENG ZHANG SIDDHARTHA BHATTACHARYYA

Tutorial: Big Data Algorithms and Applications Under Hadoop KUNPENG ZHANG SIDDHARTHA BHATTACHARYYA Tutorial: Big Data Algorithms and Applications Under Hadoop KUNPENG ZHANG SIDDHARTHA BHATTACHARYYA http://kzhang6.people.uic.edu/tutorial/amcis2014.html August 7, 2014 Schedule I. Introduction to big data

More information

Data Mining on Social Networks. Dionysios Sotiropoulos Ph.D.

Data Mining on Social Networks. Dionysios Sotiropoulos Ph.D. Data Mining on Social Networks Dionysios Sotiropoulos Ph.D. 1 Contents What are Social Media? Mathematical Representation of Social Networks Fundamental Data Mining Concepts Data Mining Tasks on Digital

More information

MSc Data Science at the University of Sheffield. Started in September 2014

MSc Data Science at the University of Sheffield. Started in September 2014 MSc Data Science at the University of Sheffield Started in September 2014 Gianluca Demar?ni Lecturer in Data Science at the Informa?on School since 2014 Ph.D. in Computer Science at U. Hannover, Germany

More information

SARS Experiences in China:

SARS Experiences in China: SARS Experiences in China: Public Health Ethics Issues Jesse Huang,MB,MHPE,MPH,MBA Assistant President/Dean for Continuing Education Professor of Epidemiology Chinese Academy of Medical Sciences Peking

More information

Executive summary. Executive summary 8

Executive summary. Executive summary 8 Executive summary Q fever is a zoonosis an infectious disease that can be transmitted from animals to humans caused by the bacterium Coxiella burnetii (C. burnetii). Until 2006, Q fever was a rare disease

More information

ANALYTICS PREDICTIVE. Tool of Providence or the End of Coincidence? He who does not expect the unexpected will not find it out.

ANALYTICS PREDICTIVE. Tool of Providence or the End of Coincidence? He who does not expect the unexpected will not find it out. PREDICTIVE ANALYTICS Tool of Providence or the End of Coincidence? He who does not expect the unexpected will not find it out. Unless you expect the unexpected you will ever find truth, for it is hard

More information

Healthcare data analytics. Da-Wei Wang Institute of Information Science wdw@iis.sinica.edu.tw

Healthcare data analytics. Da-Wei Wang Institute of Information Science wdw@iis.sinica.edu.tw Healthcare data analytics Da-Wei Wang Institute of Information Science wdw@iis.sinica.edu.tw Outline Data Science Enabling technologies Grand goals Issues Google flu trend Privacy Conclusion Analytics

More information

Adult Vaccination Frequently Asked Questions: The Basics

Adult Vaccination Frequently Asked Questions: The Basics The Basics Why should I get vaccinated? Vaccination is the best way to protect against infections that can make you sick and be passed on to those around you. 1 What kinds of side effects will I get from

More information

http://www.who.int/csr/disease/avian_influenza/phase/en 4 http://new.paho.org/hq/index.php?option=com_content&task=view&id=1283&itemid=569

http://www.who.int/csr/disease/avian_influenza/phase/en 4 http://new.paho.org/hq/index.php?option=com_content&task=view&id=1283&itemid=569 Food and Agriculture Organization of the United Nations International Food Safety Authorities Network (INFOSAN) (Update) 30 April 2009 INFOSAN Information Note No. 2/2009 Human-animal interface aspects

More information

PREPARING YOUR ORGANIZATION FOR PANDEMIC FLU. Pandemic Influenza:

PREPARING YOUR ORGANIZATION FOR PANDEMIC FLU. Pandemic Influenza: PREPARING YOUR ORGANIZATION FOR PANDEMIC FLU Pandemic Influenza: What Business and Organization Leaders Need to Know About Pandemic Influenza Planning State of Alaska Frank H. Murkowski, Governor Department

More information

Graph Mining Techniques for Social Media Analysis

Graph Mining Techniques for Social Media Analysis Graph Mining Techniques for Social Media Analysis Mary McGlohon Christos Faloutsos 1 1-1 What is graph mining? Extracting useful knowledge (patterns, outliers, etc.) from structured data that can be represented

More information

Program in Public Health Course Descriptions

Program in Public Health Course Descriptions Program in Public Health Course Descriptions June 19, 2012 All courses are 3 credits unless indicated www.publichealth.msu.edu HM 101: Public Health 101 (undergraduate course) Provides an overview of public

More information

Harnessing the Potential of Data Scientists and Big Data for Scientific Discovery

Harnessing the Potential of Data Scientists and Big Data for Scientific Discovery Harnessing the Potential of Data Scientists and Big Data for Scientific Discovery Ed Lazowska, University of Washington Saul Perlmu=er, UC Berkeley Yann LeCun, New York University Josh Greenberg, Alfred

More information

Social Network Mining

Social Network Mining Social Network Mining Data Mining November 11, 2013 Frank Takes (ftakes@liacs.nl) LIACS, Universiteit Leiden Overview Social Network Analysis Graph Mining Online Social Networks Friendship Graph Semantics

More information

A simple decision tool to help optimize the control strategy 2 weeks into a Danish FMD epidemic

A simple decision tool to help optimize the control strategy 2 weeks into a Danish FMD epidemic A simple decision tool to help optimize the control strategy 2 weeks into a Danish FMD epidemic Preben Willeberg DVM, PhD, Dr.med.vet., Dr.med.vet.h.c. Senior Veterinary Global Health Specialist Center

More information

Math 370/408, Spring 2008 Prof. A.J. Hildebrand. Actuarial Exam Practice Problem Set 5 Solutions

Math 370/408, Spring 2008 Prof. A.J. Hildebrand. Actuarial Exam Practice Problem Set 5 Solutions Math 370/408, Spring 2008 Prof. A.J. Hildebrand Actuarial Exam Practice Problem Set 5 Solutions About this problem set: These are problems from Course 1/P actuarial exams that I have collected over the

More information

Source Anonymity in Sensor Networks

Source Anonymity in Sensor Networks Source Anonymity in Sensor Networks Bertinoro PhD. Summer School, July 2009 Radha Poovendran Network Security Lab Electrical Engineering Department University of Washington, Seattle, WA http://www.ee.washington.edu/research/nsl

More information

MLg. Big Data and Its Implication to Research Methodologies and Funding. Cornelia Caragea TARDIS 2014. November 7, 2014. Machine Learning Group

MLg. Big Data and Its Implication to Research Methodologies and Funding. Cornelia Caragea TARDIS 2014. November 7, 2014. Machine Learning Group Big Data and Its Implication to Research Methodologies and Funding Cornelia Caragea TARDIS 2014 November 7, 2014 UNT Computer Science and Engineering Data Everywhere Lots of data is being collected and

More information

International Journal of Advanced Computer Technology (IJACT) ISSN:2319-7900 PRIVACY PRESERVING DATA MINING IN HEALTH CARE APPLICATIONS

International Journal of Advanced Computer Technology (IJACT) ISSN:2319-7900 PRIVACY PRESERVING DATA MINING IN HEALTH CARE APPLICATIONS PRIVACY PRESERVING DATA MINING IN HEALTH CARE APPLICATIONS First A. Dr. D. Aruna Kumari, Ph.d, ; Second B. Ch.Mounika, Student, Department Of ECM, K L University, chittiprolumounika@gmail.com; Third C.

More information

Pandemic. PlanningandPreparednesPacket

Pandemic. PlanningandPreparednesPacket Pandemic PlanningandPreparednesPacket I m p o r t a n t I n f o r m a t i o n F r o m N e w Yo r k S t a t e s H e a l t h C o m m i s s i o n e r February 15, 2006 Dear New York State Employer: As you

More information

Bacteria vs. Virus: What s the Difference? Grade 11-12

Bacteria vs. Virus: What s the Difference? Grade 11-12 Bacteria vs. Virus: What s the Difference? Grade 11-12 Subject: Biology Topic: Bacteria, viruses, and the differences between them. The role that water plays in spreading bacteria and viruses, and the

More information

Planning for 2009 H1N1 Influenza. A Preparedness Guide for Small Business

Planning for 2009 H1N1 Influenza. A Preparedness Guide for Small Business 09 Planning for 2009 H1N1 Influenza A Preparedness Guide for Small Business Table of Contents 02 Foreword 03 Introduction 04 How to Write Your Plan 05 Keeping Healthy: 10 Tips for Businesses 06 Keeping

More information

Social Networks Analysis, Models and Effects

Social Networks Analysis, Models and Effects This study was funded by CNPq CONTRIBUTIONS OF SOCIAL NETWORK ANALYSIS TO UNDERSTANDING LEPROSY TRANSMISSION L. S. Kerr, J. V. Miranda, S. R. Pinho, R. F. Andrade, C. C. Frota, L. C. Rodrigues, M. L. Barreto,

More information

Opportuni)es and Challenges of Textual Big Data for the Humani)es

Opportuni)es and Challenges of Textual Big Data for the Humani)es Opportuni)es and Challenges of Textual Big Data for the Humani)es Dr. Adam Wyner, Department of Compu)ng Prof. Barbara Fennell, Department of Linguis)cs THiNK Network Knowledge Exchange in the Humani)es

More information

BSc BUSINESS AND MANAGEMENT (NN12) MODULES FOR ASSOCIATE AND ERASMUS STUDENTS

BSc BUSINESS AND MANAGEMENT (NN12) MODULES FOR ASSOCIATE AND ERASMUS STUDENTS Sc USINESS ND MNGEMENT (NN12) MODULES FOR SSOCITE ND ERSMUS STUDENTS Level 4 modules MODULE NME Fundamentals of Management MODULE CODE US001 MODULE ORGNISER Mr Ron Holland This module aims to provide an

More information

Benchmark Research Is Conduc2ng Several Clinical Trials In Your Area: Which Study Is Right For You? Why should I par2cipate in a clinical trial?

Benchmark Research Is Conduc2ng Several Clinical Trials In Your Area: Which Study Is Right For You? Why should I par2cipate in a clinical trial? Y FAMIL NIGHT FUN T BASKE WAY A GIVE 4 PAGE The Benchmark Research Is Conduc2ng Several Clinical Trials In Your Area: Which Study Is Right For You? FLU VACCINE- Infants, Children, Adults, Senior Adults

More information

Network Analysis For Sustainability Management

Network Analysis For Sustainability Management Network Analysis For Sustainability Management 1 Cátia Vaz 1º Summer Course in E4SD Outline Motivation Networks representation Structural network analysis Behavior network analysis 2 Networks Over the

More information

Hialeah Nursing and Rehabilitation Center Combines Technology and Best Practices to Improve Infection Control Specific to C.diff

Hialeah Nursing and Rehabilitation Center Combines Technology and Best Practices to Improve Infection Control Specific to C.diff RESEARCH ARTICLE Page 1 of 5 Hialeah Nursing and Rehabilitation Center Combines Technology and Best Practices to Improve Infection Control Specific to C.diff ABSTRACT RB Health Partners, Inc., June 24,

More information