a Data Science initiative @ Univ. Piraeus [GR] The Data Science Lab members June 2015 What is Data Science source: quora.com! Looking at data! Tools and methods used to analyze large amounts of data! Anything you can do to get knowledge out of data! finding and gathering data,! data mining and preprocessing,! EDA, statistics, machine learning,! natural language processing,! data visualization, 2
A short of history source: wikipedia.org! (1997) C.F. Jeff Wu gave the inaugural lecture entitled Statistics = Data Science? at the Univ. Michigan, and advocated that statistics be renamed data science and statisticians data scientists.! (2001) W.S. Cleveland introduced the notion of data science as an independent discipline, extending the field of statistics to incorporate advances in computing with data! (2008) D.J. Patil and J. Hammerbacher coined the term "data scientist" to define their jobs at LinkedIn and Facebook, resp. 3 Data Science is multi-disciplinary image source: http://blog.andreamostosi.name 4
Data Scientist is a key player has been called as the sexiest job of the 21st century (*) (*) T.H. Davenport & D.J. Patil (2012) HBR 5 image source: http://dsaa2015.lip6.fr Who we are! 8 faculty from! 4 departments Statistics & Insurance Sc. Maritime Studies Digital Systems Informatics 6
What we aim at! 3 pillars Education: Research: Networking:! Train our students to become highly-skilled Data Scientists! Advance research in Data Science / Big Data Analytics! Develop skills and competencies in modern Big Data exploration! Co-advise PhD students and postdoc researchers! Establish a sustainable network of researchers and practitioners in the field! Our overall goal: to join UniPi forces towards excellence in this emerging interdisciplinary domain!! 7 Main research topics! Big data management! Mining massive datasets! Data, text, audio analytics! Mobility data exploration! Social network analysis! Semantic integration! Data privacy! Data visualization 8
Projects and prototypes 9 DataStories seminar series 10
Useful links - I! Events:! IEEE Int. Conf. on Data Science and Advanced Analytics! IEEE Int. Conf. on Big Data!! Journals:! IEEE Trans. Big Data (IEEE)! Data Science & Analytics (Springer)! Big Data Research (Elsevier)! 11 Useful links - II! Textbooks, Monographs, Reports:! H.V. Jagadish et al. (2014) Big Data and its Technical Challenges! W.H. Inmon & D. Linstedt (2014) Data Architecture: A Primer for the Data Scientist: Big Data, Data Warehouse and Data Vault! J. Hopcroft & R. Kannan (2014) Foundations of Data Science! A.M. Townsend (2013) Smart Cities: Big Data, Civic Hackers, and the Quest for a New Utopia! M. Stonebraker (2012) What Does Big Data Mean?! M. Loukides (2010) What is Data Science?! T. Hey et al. (eds.) (2009) The Fourth Paradigm: Data-Intensive Scientific Discovery! 12
Contact us! Are you a student?! We are seeking motivated students (at MSc or PhD level) to work with on DataSci-related topics! Are you a colleague from academia or a person from industry or public sector?! We are happy to present our activities and discuss potential collaboration If interested, send an e-mail @ datastories@unipi.gr In any case, stay tuned @ www.datastories.org 13 2015 diploma theses proposals! Empirical Comparison of Complex Event Recognition Engines! Complex Event Recognition for Maritime Surveillance! Reasoning with ontologies for real time event recognition! ChoroChronos! Cleaning and Prediction of Trajectory Data! Context aware recommendation techniques! Hermoupolis: On the Simulation of our LifeSteps! Hospital Performance Based on Financial and Clinical Data! Partitioning of RDF graphs! Reverse Top k Queries for Fast Data! Semantic data integration! Sentiment Analysis on Twitter Data! Spatio Temporal Textual Data Management on Twitter Data! Health Data Analytics! Data Clustering on the Million Song Dataset 14