Paris-Saclay Center for Data Science

1 BALÁZS KÉGL, DR / CNRS, LAL & LRI, CNRS & University Paris-Sud
ARNAK DALALYAN, Pr / ENSAE, Laboratoire de Statistique
CÉCILE GERMAIN, Pr / UPSud, LRI
ALEXANDRE GRAMFORT, MdC / Telecom ParisTech, LTCI
AKIN KAZAKÇI, MdC / Mines ParisTech, CGS

2 Meta

3 I will not talk about science

4 I will talk about management (of) (data) science

5 WHERE DOES IT COME FROM?
My eight years of experience interfacing between high-energy physics and data science
Our one-year experience of running PSCDS
Extensive discussions with management scientists and the MI/Mastodons/MaDICS for the last year

6 UNIVERSITÉ PARIS-SACLAY
19 founding partners

7 UNIVERSITÉ PARIS-SACLAY
+ horizontal multi-disciplinary and multi-partner initiatives (lidex) to create cohesion

8 A multi-disciplinary initiative to define, structure, and manage the data science ecosystem at the Université
researchers in 35 laboratories
datascience-paris-saclay.fr
[Diagram: machine learning, information retrieval, signal processing, data visualization, databases; Tool building: software engineering, clouds/grids, high-performance computing, optimization; Domain science: human society, life, brain, earth, universe; Domain scientist, Software engineer]
Biology & bioinformatics: IBISC/UEvry, LRI/UPSud, Hepatinov, CESP/UPSud-UVSQ-Inserm, IGM-I2BC/UPSud, MIA/Agro, MIAj-MIG/INRA, LMAS/Centrale
Chemistry: EA4041/UPSud
Earth sciences: LATMOS/UVSQ, GEOPS/UPSud, IPSL/UVSQ, LSCE/UVSQ, LMD/Polytechnique
Economy: LM/ENSAE, RITM/UPSud, LFA/ENSAE
Machine learning: LRI/UPSud, LTCI/Telecom, CMLA/Cachan, LS/ENSAE, LIX/Polytechnique, MIA/Agro, CMA/Polytechnique, LSS/Supélec, CVN/Centrale, LMAS/Centrale, DTIM/ONERA, IBISC/UEvry
Neuroscience: UNICOG/Inserm, U1000/Inserm, NeuroSpin/CEA
Particle physics, astrophysics & cosmology: LPP/Polytechnique, DMPH/ONERA, CosmoStat/CEA, IAS/UPSud, AIM/CEA, LAL/UPSud
Signal processing: LTCI/Telecom, CMA/Polytechnique, CVN/Centrale, LSS/Supélec, CMLA/Cachan, LIMSI, DTIM/ONERA
Statistics: LMO/UPSud, LS/ENSAE, LSS/Supélec, CMA/Polytechnique, LMAS/Centrale, MIA/AgroParisTech
Visualization: INRIA, LIMSI

9 DATA SCIENCE
Design of automated methods to analyze massive and complex data to extract useful information

10 CENTER FOR DATA SCIENCE ≠ DATA CENTER
We are focusing on inference: data → knowledge
Interfacing with HPC, cloud, storage, production

11 PARAMETERS
2 years: April June 2016, 1.2M
+1 year, conditional on evaluation
Light management: executive committee of 17 members
Work groups: management, strategy (around objectives); thematic (around scientific themes), open to everyone to propose and to participate

12 THE DATA SCIENCE LANDSCAPE
Data science: statistics, machine learning, information retrieval, signal processing, data visualization, databases
Tool building: software engineering, clouds/grids, high-performance computing, optimization
Domain science: energy and physical sciences, health and life sciences, Earth and environment, economy and society, brain
Roles: data scientist, data engineer, applied scientist, software engineer, domain scientist, data trainer

13 CHALLENGES
Manpower: especially at the interfaces; industrial brain-drain
Incentives: data scientists are not incentivized to work on domain science; scientists are not incentivized to work on tools
Access: no well-developed channels to identify the right experts for a given problem
Tools: few tools that can help domain scientists and data scientists to collaborate efficiently

14 TOOLS
We are designing and learning to manage tools to accompany data science projects with different needs

15 TOOLS: LANDSCAPE TO ECOSYSTEM
Data science: statistics, machine learning, information retrieval, signal processing, data visualization, databases
Tool building: software engineering, clouds/grids, high-performance computing, optimization
Data domains: energy and physical sciences, health and life sciences, Earth and environment, economy and society, brain
Roles: data scientist, data engineer, software engineer, applied scientist, domain expert, data trainer
Tools: coding sprints, Open Software Initiative, code consolidator and engineering projects, interdisciplinary projects, matchmaking tool, design and innovation strategy workshops, data challenges, data science bootcamps, IT platform for linked data, annotation tool, SaaS data science platform

16 POSTDOCS, THESES, SABBATICALS
Common selection criteria:
scientific quality: expected results both in domain science and data science
relevance and feasibility: (real) scientific data, available at the start of the project
interdisciplinarity (PIs both from domain and data sciences, different LABEXs)
community building: organizing and participating in thematic days, bootcamps, workshops

17 ENGINEERING AND CODE CONSOLIDATING PROJECTS
No research: implementation, maintenance, integration
1-year engineering projects
3-6 month code consolidating projects: a postdoc or PhD student drops research during the project and implements his/her research code in professional software, or integrates it into a toolbox

18 IT PLATFORM FOR LINKED DATA
A window to open data at
We are not storing or handling existing large data sets
Rather indexing, linking, and mapping, embedding in the worldwide linked data (RDF) ecosystem
Storing small data sets of small teams is possible
Subsets of large sets for prototyping
Or simply store metadata plus pointer
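As an illustration of the "metadata plus pointer" option, here is a minimal sketch that writes a data set description as RDF triples in N-Triples syntax, using only the Python standard library. The vocabulary terms (Dublin Core `title`, DCAT `Dataset` and `landingPage`) are an assumption made for this example, and the `example.org` identifiers and the `dataset_record` helper are hypothetical; the slides do not say which vocabularies or tools the platform uses.

```python
# Sketch only: describes a data set by metadata and a pointer, without storing the data.
# Vocabulary choice (Dublin Core terms, DCAT) is an assumption, not taken from the slides.
DCT = "http://purl.org/dc/terms/"
DCAT = "http://www.w3.org/ns/dcat#"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"


def iri(value):
    """Wrap an IRI in angle brackets, as N-Triples requires."""
    return f"<{value}>"


def literal(text):
    """Quote a plain string literal, escaping backslashes, quotes, and newlines."""
    escaped = text.replace("\\", "\\\\").replace('"', '\\"').replace("\n", "\\n")
    return f'"{escaped}"'


def dataset_record(dataset_iri, title, landing_page):
    """Return N-Triples lines: the data set's type, its title, and a pointer to it."""
    triples = [
        (iri(dataset_iri), iri(RDF_TYPE), iri(DCAT + "Dataset")),
        (iri(dataset_iri), iri(DCT + "title"), literal(title)),
        (iri(dataset_iri), iri(DCAT + "landingPage"), iri(landing_page)),
    ]
    return "\n".join(f"{s} {p} {o} ." for s, p, o in triples)


if __name__ == "__main__":
    # Hypothetical identifiers, for illustration only.
    print(dataset_record(
        "http://example.org/dataset/demo",
        "Demo data set",
        "http://example.org/where-the-data-lives",
    ))
```

The record holds no data at all, only a description and a link, which is what lets large collections stay where they already live.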

20 BOOTCAMPS
Single-day coding sessions participants
Goals:
training PhD students, postdocs, engineers, senior researchers for hands-on data science (problem types, tools)
solving (prototyping) real data science problems
networking, knowing each other

25 DATA CHALLENGES
A data challenge is a recently developed, unconventional dissemination and communication tool:
a scientific or industrial data producer arrives with a well-defined problem and a corresponding annotated data set
defines a quantitative goal
makes the problem and part of the data set (the training set) public on a dedicated site
data science experts then take the public training data and submit solutions for a test set with hidden annotations
submissions are evaluated numerically using the quantitative measure
contestants are listed on a leaderboard
after a predefined time, typically a couple of months, the final results are revealed and the winners are awarded
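The evaluation loop described above can be sketched in a few lines of Python. This is a toy illustration under stated assumptions, not the mechanism of any real challenge site: the `Challenge` class, the `accuracy` metric, and the rule of keeping each team's best score are all hypothetical choices made for the example.

```python
from dataclasses import dataclass, field
from typing import Callable


def accuracy(y_true, y_pred):
    """Fraction of test examples labeled correctly (stands in for the quantitative goal)."""
    if len(y_true) != len(y_pred):
        raise ValueError("a submission must label every test example")
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


@dataclass
class Challenge:
    hidden_labels: list                  # test annotations, never shown to contestants
    metric: Callable = accuracy          # assumed: higher is better
    scores: dict = field(default_factory=dict)

    def submit(self, team, predictions):
        """Score a submission against the hidden labels and record the team's best."""
        score = self.metric(self.hidden_labels, predictions)
        self.scores[team] = max(score, self.scores.get(team, float("-inf")))
        return score

    def leaderboard(self):
        """Teams sorted by best score, highest first."""
        return sorted(self.scores.items(), key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    challenge = Challenge(hidden_labels=[1, 0, 1, 1])
    challenge.submit("team_a", [1, 0, 0, 1])
    challenge.submit("team_b", [1, 0, 1, 1])
    for team, score in challenge.leaderboard():
        print(team, score)
```

Keeping the annotations on the organizer's side is what makes the comparison fair: contestants only ever see their score, never the test labels.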

26 DATA CHALLENGES
The HiggsML challenge on Kaggle
teams, huge publicity
significant improvement on baseline
18-month preparation, yet partially missing the target

27 DATA CHALLENGES
Challenges are useful for:
generating visibility in the data science community about novel application domains
benchmarking state-of-the-art techniques fairly on well-defined problems
finding talented data scientists
Limitations:
not necessarily adapted to solving complex and open-ended data science problems in realistic environments
emphasizes competition

28 DESIGN AND INNOVATION STRATEGY WORKSHOPS

29 DESIGN AND INNOVATION STRATEGY WORKSHOPS
Putting domain scientists, data scientists, and management scientists in the same room
Getting them to understand each other
Keeping them collectively creative
The goal: identifying and defining projects
low-hanging fruit
breakthrough projects
long-term vision

30 DESIGN AND INNOVATION STRATEGY WORKSHOPS
C-K design theory: innovative design = interaction and joint expansion of concepts and knowledge

31 DESIGN AND INNOVATION STRATEGY WORKSHOPS
DKCP process: linearizing C-K dynamics
Initialisation; [K] Knowledge sharing workshops; [C] IFM-Design workshops; [P] Project building; [RUN]

32 TAKE HOME MESSAGE NO1
If you are interested in adapting any of these tools to your project or site, feel free to contact us. We would be happy to share our experience.

33 WHAT CNRS CAN DO
We need data engineers and trainers to support research
ideally: 75% research scientists, 25% research engineers
We are lucky in France that the position exists in public research
Tasks:
integrating research code into general-purpose professional software (e.g., scikit-learn, Torch)
providing an interface between computational infrastructure (e.g., clouds) and scientists
training scientists to use the tools

34 WHAT CNRS CAN DO
Incentives

35 WHAT CNRS CAN DO
Most data scientists, like other scientists, are trained and incentivized to do research in highly specialized domains. They seek scientific visibility in their international community, which is equally highly specialized, because their career advancement is almost entirely based on peer-reviewed publications. Even when they have the expertise, they have little incentive to venture into the tool builder (data engineer) role, since software authorship has little value in their evaluation and can only serve them implicitly through the visibility they gain in the community of tool users. By the same token, they have little incentive to venture into domain sciences and to tackle economic or societal challenges. A domain science or industrial project may require new techniques that can then be published in data science venues, but this is not guaranteed at all. It usually takes a heavy investment of time and effort to understand domain problems, so excursions into domain sciences are highly risky. Even when such collaborations are established, the data scientist has a strong prior to use his/her favorite methodology, which is not necessarily the best solution for a given problem. Finally, the data scientist has little incentive to bring the project to full fruition, and he/she often runs away with an abstract data science problem (and solution) extracted from the project. Symmetrically, domain scientists and industrial domain experts have no incentive to advance data science and to develop and publish new techniques, as long as their data science problems get solved. When they venture into tool development, they have little incentive to develop general-purpose tools.

37 TAKE HOME MESSAGE NO2
Affirmative action
Overweight (e.g., double-count) out-of-domain papers in every evaluation
Make software tool ownership count

38 TAKE HOME MESSAGE NO3
Co-locality is important. It would be desirable that data science for scientific data in France be grouped into 5-6 sites of similar size. Resources and experience can and should be shared.

39 THANK YOU!

Institutes for Data Science: New York University University of Washington University of California, Berkeley

Institutes for Data Science: New York University University of Washington University of California, Berkeley Advancing scientific discovery through collaboration across research domains Institutes for Data Science: New York University University of Washington University of California, Berkeley Data Science growing

More information

An adaptable domain specific dissemination infrastructure for enhancing the visibility of complementary and thematically related research information

An adaptable domain specific dissemination infrastructure for enhancing the visibility of complementary and thematically related research information An adaptable domain specific dissemination infrastructure for enhancing the visibility of complementary and thematically related research information Engin Sagbas; 1 York Sure 1, 2 1 GESIS Leibniz Institute

More information

CitationBase: A social tagging management portal for references

CitationBase: A social tagging management portal for references CitationBase: A social tagging management portal for references Martin Hofmann Department of Computer Science, University of Innsbruck, Austria m_ho@aon.at Ying Ding School of Library and Information Science,

More information

High Performance Computing Initiatives

High Performance Computing Initiatives High Performance Computing Initiatives Eric Stahlberg September 1, 2015 DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Cancer Institute Frederick National Laboratory is

More information

BIG DATA IN THE CLOUD : CHALLENGES AND OPPORTUNITIES MARY- JANE SULE & PROF. MAOZHEN LI BRUNEL UNIVERSITY, LONDON

BIG DATA IN THE CLOUD : CHALLENGES AND OPPORTUNITIES MARY- JANE SULE & PROF. MAOZHEN LI BRUNEL UNIVERSITY, LONDON BIG DATA IN THE CLOUD : CHALLENGES AND OPPORTUNITIES MARY- JANE SULE & PROF. MAOZHEN LI BRUNEL UNIVERSITY, LONDON Overview * Introduction * Multiple faces of Big Data * Challenges of Big Data * Cloud Computing

More information

Ten Mistakes to Avoid

Ten Mistakes to Avoid EXCLUSIVELY FOR TDWI PREMIUM MEMBERS TDWI RESEARCH SECOND QUARTER 2014 Ten Mistakes to Avoid In Big Data Analytics Projects By Fern Halper tdwi.org Ten Mistakes to Avoid In Big Data Analytics Projects

More information

TraMOOC Project Overview Presentation. Overview Presentation

TraMOOC Project Overview Presentation. Overview Presentation TraMOOC Project Overview Presentation Overview Presentation Table of contents TraMOOC in a nutshell SUMMARY Project Motivation WHY? Project Objectives WHAT? Work Description HOW? The TraMOOC Platform RESULT?

More information

COGNITIVE SCIENCE AND NEUROSCIENCE

COGNITIVE SCIENCE AND NEUROSCIENCE COGNITIVE SCIENCE AND NEUROSCIENCE Overview Cognitive Science and Neuroscience is a multi-year effort that includes NSF s participation in the Administration s Brain Research through Advancing Innovative

More information

SMART LOIRE VALLEY PROGRAMME* FELLOWSHIP PROGRAMME Call for applications and guidelines: 2016/2017 campaign

SMART LOIRE VALLEY PROGRAMME* FELLOWSHIP PROGRAMME Call for applications and guidelines: 2016/2017 campaign SMART LOIRE VALLEY PROGRAMME* FELLOWSHIP PROGRAMME Call for applications and guidelines: 2016/2017 campaign Monday 9 th November 2015 to Monday 8 th February 2016 (17:00 CET Paris time) *This project has

More information

CYBERINFRASTRUCTURE FRAMEWORK $143,060,000 FOR 21 ST CENTURY SCIENCE, ENGINEERING, +$14,100,000 / 10.9% AND EDUCATION (CIF21)

CYBERINFRASTRUCTURE FRAMEWORK $143,060,000 FOR 21 ST CENTURY SCIENCE, ENGINEERING, +$14,100,000 / 10.9% AND EDUCATION (CIF21) CYBERINFRASTRUCTURE FRAMEWORK $143,060,000 FOR 21 ST CENTURY SCIENCE, ENGINEERING, +$14,100,000 / 10.9% AND EDUCATION (CIF21) Overview The Cyberinfrastructure Framework for 21 st Century Science, Engineering,

More information

Broad and Integrative Knowledge. Applied and Collaborative Learning. Civic and Global Learning

Broad and Integrative Knowledge. Applied and Collaborative Learning. Civic and Global Learning 1 2 3 4 5 Specialized Knowledge Broad and Integrative Knowledge Intellectual Skills Applied and Collaborative Learning Civic and Global Learning The Degree Qualifications Profile (DQP) provides a baseline

More information

Institut Curie Co-fund PhD program IC-3i-PhD

Institut Curie Co-fund PhD program IC-3i-PhD Institut Curie Co-fund PhD program IC-3i-PhD Institut Curie Hospital group Research Center Private foundation, created in 1909 accepting public donations A leading player in the fight against cancer Institutional

More information

HPC technology and future architecture

HPC technology and future architecture HPC technology and future architecture Visual Analysis for Extremely Large-Scale Scientific Computing KGT2 Internal Meeting INRIA France Benoit Lange benoit.lange@inria.fr Toàn Nguyên toan.nguyen@inria.fr

More information

Press pack. Paris, 20 June 2012. Investing for the future

Press pack. Paris, 20 June 2012. Investing for the future Press pack Paris, 20 June 2012 Investing for the future The "INFINI DRIVE" project for the development of electric vehicle recharging infrastructures has been selected by ADEME (French Environment and

More information

Doctoral Education @Aix-Marseille University 8th EUA-CDE Workshop Regional Engagement and Doctoral Education

Doctoral Education @Aix-Marseille University 8th EUA-CDE Workshop Regional Engagement and Doctoral Education @Aix-Marseille University 8th EUA-CDE Workshop Regional Engagement and Mossadek Talby* & Christophe Muller** Directors of the Doctoral Schools *352: Physics and Materials Science **353: Engineer Science

More information

Funding Opportunities Starter Grants

Funding Opportunities Starter Grants Funding Opportunities Starter Grants Marc R. Moon, M.D. Joseph C. Bancroft Professor of Surgery Division of Cardiothoracic Surgery & Center for Diseases of the Thoracic Aorta Washington University School

More information

Creating a Chemistry of Sciences with Big Data Building the Data Science Institute at Imperial College London

Creating a Chemistry of Sciences with Big Data Building the Data Science Institute at Imperial College London Creating a Chemistry of Sciences with Big Data Building the Data Science Institute at Imperial College London Y. Guo, D. Johnson Data Science Institute, Imperial College London y.guo@imperial.ac.uk, david.johnson@imperial.ac.uk

More information

PROPOSED ACTION PLAN FOR GUIDING ASPIRATION #6 LEAD IN INNOVATION, ENTREPRENEURSHIP, AND CREATIVITY, (CIE)

PROPOSED ACTION PLAN FOR GUIDING ASPIRATION #6 LEAD IN INNOVATION, ENTREPRENEURSHIP, AND CREATIVITY, (CIE) PROPOSED ACTION PLAN FOR GUIDING ASPIRATION #6 LEAD IN INNOVATION, ENTREPRENEURSHIP, AND CREATIVITY, (CIE) UTA values and will encourage a culture of innovation, entrepreneurship, and creativity. We will

More information

Collaborative Computational Projects: Networking and Core Support

Collaborative Computational Projects: Networking and Core Support Collaborative Computational Projects: Networking and Core Support Call type: Invitation for proposals Closing date: 16:00 07 October 2014 Related themes: Engineering, ICT, Mathematical sciences, Physical

More information

The Harvard CSE Curriculum, Its Advantages and Disadvantages

The Harvard CSE Curriculum, Its Advantages and Disadvantages powering 21 st century discovery and innovation computational science and engineering at harvard university Institute for Applied Computational Science IACS graduate programs in computational science and

More information

Doctor of Philosophy in Computer Science

Doctor of Philosophy in Computer Science Doctor of Philosophy in Computer Science Background/Rationale The program aims to develop computer scientists who are armed with methods, tools and techniques from both theoretical and systems aspects

More information

THE M.SC. PROGRAMS OF THE FACULTY OF SCIENCE GENERAL INFORMATION THE SCHOOL OF M.SC. STUDIES

THE M.SC. PROGRAMS OF THE FACULTY OF SCIENCE GENERAL INFORMATION THE SCHOOL OF M.SC. STUDIES THE M.SC. PROGRAMS OF THE FACULTY OF SCIENCE GENERAL INFORMATION THE SCHOOL OF M.SC. STUDIES The Faculty of Science at the Hebrew University of Jerusalem invites outstanding Bachelor s-degree-level graduates

More information

Sarah A. Rajala Ernest W. & Mary Ann Deavenport, Jr. Chair and Dean Bagley College of Engineering Mississippi State University Mississippi State, MS

Sarah A. Rajala Ernest W. & Mary Ann Deavenport, Jr. Chair and Dean Bagley College of Engineering Mississippi State University Mississippi State, MS Sarah A. Rajala Ernest W. & Mary Ann Deavenport, Jr. Chair and Dean Bagley College of Engineering Mississippi State University Mississippi State, MS 39762 USA November 8, 2012 Background: North Carolina

More information

Creative Approaches to Fostering Interdisciplinarity in Graduate Programming

Creative Approaches to Fostering Interdisciplinarity in Graduate Programming Creative Approaches to Fostering Interdisciplinarity in Graduate Programming Open Graduate Education Peter M. Weber Brown University CGS Annual Meeting Washington, D.C. December 5, 2014 Interdisciplinary

More information

CHAPTER 1 INTRODUCTION

CHAPTER 1 INTRODUCTION 1 CHAPTER 1 INTRODUCTION Exploration is a process of discovery. In the database exploration process, an analyst executes a sequence of transformations over a collection of data structures to discover useful

More information

Educating the Neuroscience Workforce of the Future

Educating the Neuroscience Workforce of the Future Photo credits (from left) : U.S. Army Corps of Engineers, Intel Free Press, Kate Ter Haar, Woodley Wonder Works, Ohio Sea Grant, U.S. Army RDECOM, Trevor Prentice Educating the Neuroscience Workforce of

More information

Agropolis Fondation - Fondazione Cariplo 2013 Joint Call for Proposals (CfP) CERES [Ref. CfP 2013-01]

Agropolis Fondation - Fondazione Cariplo 2013 Joint Call for Proposals (CfP) CERES [Ref. CfP 2013-01] Agropolis Fondation - Fondazione Cariplo 2013 Joint Call for Proposals (CfP) CERES [Ref. CfP 2013-01] TERMS OF REFERENCE I. Rationale Cereals play a significant role in the economy and nutrition in both

More information

Building Science and Engineering Talent. SEA Qualification Statement

Building Science and Engineering Talent. SEA Qualification Statement "A Unique Resource for the Nation" Building Science and Engineering Talent SEA Qualification Statement Background and Need Science, mathematics, and engineering education in many countries is essential

More information

Faculty of of Science

Faculty of of Science Faculty of of Science At Ryerson, we believe science is all about discovery and results. We call our approach connected science an approach that forms unique bonds between disciplines to solve some of

More information

Delivering High Performance Computing to the Masses 1. High Performance Computing. Case Study. Delivering High Performance Computing to the Masses

Delivering High Performance Computing to the Masses 1. High Performance Computing. Case Study. Delivering High Performance Computing to the Masses High Performance Computing Delivering High Performance Computing to the Masses 1 Case Study. Delivering High Performance Computing to the Masses This publication may not be reproduced, in whole or in part,

More information

MED 2400 MEDICAL INFORMATICS FUNDAMENTALS

MED 2400 MEDICAL INFORMATICS FUNDAMENTALS MED 2400 MEDICAL INFORMATICS FUNDAMENTALS NEW YORK CITY COLLEGE OF TECHNOLOGY The City University Of New York School of Arts and Sciences Biological Sciences Department Course title: Course code: MED 2400

More information

Research units. Laboratory of Microbial Molecular Genetics LMGM

Research units. Laboratory of Microbial Molecular Genetics LMGM Research units HCERES report on research unit: Laboratory of Microbial Molecular Genetics LMGM Under the supervision of the following institutions and research bodies: Université Toulouse 3 - Paul Sabatier

More information