The Moore/Sloan Data Science Environments: Advancing Data- Intensive Discovery
|
|
|
- Laurence Weaver
- 10 years ago
- Views:
Transcription
1 The Moore/Sloan Data Science Environments: Advancing Data- Intensive Discovery Ed Lazowska Bill & Melinda Gates Chair in Computer Science & Engineering Founding Director, escience InsHtute University of Washington Chesapeake Large Scale AnalyHcs Conference October 2015 hrp://lazowska.cs.washington.edu/clsac.pdf, pptx
2 Today A reminder of the extraordinary progress that Computer Science has achieved Big Data and Smart Everything Jim Gray s Fourth Paradigm : smart discovery / data- intensive discovery / escience The University of Washington escience InsItute, and the Moore/Sloan Data Science Environments A 21 st century view of Computer Science RecommendaIons for the support of 21 st century cyberinfrastructure
3 Processing capacity Storage capacity Network bandwidth Sensors Every aspect of compuing has experienced exponenial improvement Astonishingly, even algorithms in some cases!
4 You can exploit these improvements in two ways Constant capability at exponenially decreasing cost ExponenIally increasing capability at constant cost RAM Disk Flash Storage Price / MB, USD (semi- log plot) John McCallum / Havard Blok Microprocessor Performance, MIPS (semi- log plot) Ray Kurzweil
5 Today, these exponenial improvements in technology and algorithms are enabling a big data revoluion A proliferaion of sensors Think about the sensors on your phone More generally, the creaion of almost all informaion in digital form It doesn t need to be transcribed in order to be processed DramaIc cost reducions in storage You can afford to keep all the data DramaIc increases in network bandwidth You can move the data to where it s needed
6 DramaIc cost reducions and scalability improvements in computaion With Amazon Web Services, 1000 computers for 1 day costs the same as 1 computer for 1000 days DramaIc algorithmic breakthroughs Machine learning, data mining fundamental advances in computer science and staisics Ever more powerful models producing ever- increasing volumes of data that must be analyzed
7 Big Data is enabling computer scienists to put the smarts into everything Smart homes Smart cars Smart health Smart robots Smart crowds and human- computer systems Smart educaion Smart interacion (virtual and augmented reality) Smart ciies Smart discovery
8 Shwetak Patel, University of Washington 2011 MacArthur Fellow Smart homes (the leaf nodes of the smart grid)
9 Smart cars DARPA Grand Challenge DARPA Urban Challenge Google Self- Driving Car AdapIve cruise control Self- parking
10 Smart health Larry Smarr quanified self Evidence- based medicine P4 medicine
11 Smart robots
12 Smart crowds and human- computer systems Zoran Popovic UW Computer Science & Engineering David Baker UW Biochemistry
13 Zoran Popovic UW Computer Science & Engineering Smart educaion
14 Smart interacion
15 Smart ciies
16 Smart discovery (data- intensive discovery, or escience) Nearly every field of discovery is transiioning from data poor to data rich Astronomy: LSST Oceanography: OOI Physics: LHC Sociology: The Web Biology: Sequencing Economics: POS terminals Neuroscience: EEG, fmri
17 The Fourth Paradigm 1. Empirical + experimental 2. TheoreIcal 3. ComputaIonal 4. Data- Intensive Jim Gray Each augments, vs. supplants, its predecessors another arrow in the quiver
18 UW escience InsItute All across our campus, the process of discovery will increasingly rely on researchers ability to extract knowledge from vast amounts of data... In order to remain at the forefront, UW must be a leader in advancing these techniques and technologies, and in making [them] accessible to researchers in the broadest imaginable range of fields. (2007)
19 Major sources of support for our core effort University of Washington $725,000/year for staff support $600,000/year for faculty support NaIonal Science FoundaIon $2.8 million over 5 years for graduate program development and Ph.D. student funding (IGERT) Gordon and Beny Moore FoundaIon and Alfred P. Sloan FoundaIon $37.8 million over 5 years to UW, Berkeley, NYU Washington Research FoundaIon $9.3 million over 5 years for faculty recruiing packages, postdocs Also $7.1 million to the closely- aligned InsItute for Neuroengineering
20 Genesis of the Moore/Sloan Data Science Environments project The FoundaIons have a focus on novel advances in the physical, life, environmental, and social sciences They recognized the emergence of data- intensive discovery as an important new approach that would lead to new advances They perceived a number of impediments to success They sought partners who were prepared to work together in a distributed collaboraive experiment focused on tackling these impediments
21 Vision
22 Data science methodology UW s original core faculty team Cecilia Aragon Human Centered Design & Engr. Magda Balazinska Computer Science & Engineering Emily Fox StaIsIcs Carlos Guestrin CSE Bill Howe CSE Jeff Heer CSE Ed Lazowska CSE Life sciences David Beck Chemical Engr. Tom Daniel Biology Bill Noble Genome Sciences Environmental sciences Ginger Armbrust Oceanography Randy LeVeque Applied MathemaIcs Thomas Richardson StaIsIcs, CSSS Social sciences Physical sciences Werner Stuetzle StaIsIcs Josh Blumenstock ischool Mark Ellis Geography Tyler McCormick Sociology, CSSS Andy Connolly Astronomy John Vidale Earth & Space Sciences
23 Data science methodology UW s original core faculty team Cecilia Aragon Human Centered Design & Engr. Magda Balazinska Computer Science & Engineering Emily Fox StaIsIcs Carlos Guestrin CSE Bill Howe CSE Jeff Heer CSE Ed Lazowska CSE Life sciences David Beck Chemical Engr. Tom Daniel Biology Bill Noble Genome Sciences Environmental sciences Ginger Armbrust Oceanography Randy LeVeque Applied MathemaIcs Thomas Richardson StaIsIcs, CSSS Social sciences Physical sciences Werner Stuetzle StaIsIcs Josh Blumenstock ischool Mark Ellis Geography Tyler McCormick Sociology, CSSS Andy Connolly Astronomy John Vidale Earth & Space Sciences
24 Science example: AstroDB Cosmology at Scale Andrew Connolly (Astronomy), Magda Balazinska (Computer Science & Engineering) Large SynopIc Survey Telescope Survey half the sky every 3 nights (1000- fold increase in data vs. Sloan Digital Sky Survey) Enabled by a 3.2 Gigapixel camera with a 3.5 degree field 15 TB/night (100 PB over 10 years), 20 billion objects, and 20 trillion measurements Will enable dramaically improved resoluion, Ime- series analysis SDSS LSST Credit: Andy Connolly, University of Washington
25 Science quesions Finding the unusual Supernova, GRBs Probes of Dark Energy Finding moving sources Asteroids and comets Origins of the solar system Mapping the Milky Way Tidal streams Probes of Dark Maner Measuring shapes of galaxies GravitaIonal lensing The nature of Dark Energy How do we do science at petabyte scale? Credit: Andy Connolly, University of Washington
26 How do we do science at petabyte scale? Science quesions map to computaional quesions Finding the unusual Supernova, GRBs Probes of Dark Energy Finding moving sources Asteroids and comets Origins of the solar system Mapping the Milky Way Tidal streams Probes of Dark Maner Measuring shapes of galaxies GravitaIonal lensing The nature of Dark Energy Finding the unusual Anomaly detecion Density esimaions Finding moving sources Tracking algorithms Kalman filters Mapping the Milky Way Clustering techniques CorrelaIon funcions Measuring shapes of galaxies Image processing Data intensive analysis Credit: Andy Connolly, University of Washington
27 Science example: Devices + Neuroscience + Data Science Tom Daniel & Bing Brunton (Biology), Adrienne Fairhall (Physiology & Biophysics) Credit: Tom Daniel, University of Washington
28 What features do animals extract to solve problems? Neural acivity How is informaion synthesized to drive decisions? Complex environments Motor acivity Behavioral output How does acion affect subsequent sensaion? How do muscles work together to perform acions? Credit: Tom Daniel, University of Washington
29 Science example: Role of microbes in marine ecosystems Ginger Armbrust (Oceanography), Bill Howe (CSE + escience InsItute) Microbial community visualized with DNA stain Challenges: IntegraIon across different data types Distributed and remote labs 100 μm Credit: Ginger Armbrust, University of Washington
30 Credit: Ginger Armbrust, University of Washington
31 IntegraIng across physics, biology, and chemistry Query across data sets in real- Ime: not just faster different! Dan Halperin, Research ScienIst, escience InsItute KonstanIn Weitz Graduate student, CSE Credit: Ginger Armbrust, University of Washington
32 ConnecIng across distributed labs SeaFlow instrument Satellite link manual Lab computer Ship computer Processed data automated Other ship data streams Completely automated Lab computer Cloud SQLShare Web display collaborator computers Credit: Ginger Armbrust, University of Washington
33 Science Example: Data Science for Social Good / Urban Science Summer projects (from among 11 proposals): OpImizing Paratransit RouIng In collaboraion with King County Metro and UW s Taskar Center for Accessible Technology Assessing Community Well- Being through Open Data & Social Media In collaboraion with Third Place Technologies Open Sidewalks Sidewalk Maps for Low- Mobility CiIzens In collaboraion with UW s Taskar Center for Accessible Technology Predictors of Permanent Housing for Homeless Families In collaboraion with the Bill & Melinda Gates FoundaIon, Building Changes, and King, Pierce, and Snohomish CounIes WA 16 undergraduate and graduate students (from among 144 applicants) 6 ALVA socioeconomically disadvantaged high school students 8 escience InsItute Data ScienIsts
34 Predictors of Permanent Housing for Homeless Families The Bill & Melinda Gates FoundaIon and Building Changes have partnered with King, Pierce, and Snohomish CounIes WA to make homelessness in these counies rare, brief, and one- Ime When homeless families engage in services and programs, what factors are most likely to lead to a successful exit? The DSSG team: Developed algorithms to idenify families Developed algorithms to idenify episodes of homelessness including back- to- back or overlapping enrollments in individual programs Devised innovaive ways to visualize and analyze the ways families transiion between programs Project Leads: Neil Roche & Anjana Sundaram, Bill & Melinda Gates FoundaIon DSSG Fellows: Joan Wang, Jason Portenoy, Fabliha Ibnat, Chris Suberlak ALVA High School Students: Cameron Holt, Xilalit Sanchez escience InsHtute Data ScienHst Mentors: Ariel Rokem, Bryna Hazelton
35 Novel Analyses of Family Trajectories through Programs An example using Pierce County data Common trajectories lead to different outcomes: A highly successful exit from an episode would mean that the family found a permanent housing solution Another successful exit involves continued receipt of government subsidies Other exits are exits back into homelessness, or to other, unknown destinations
36 Using the D3 technology developed in Jeff Heer s group, the DSSG team created interacive Sankey diagrams and other visualizaions to facilitate exploraion of the data by stakeholders. (This diagram shows the proporional flow from one program to another, as well as the eventual outcome.)
37 A closer look at the Moore/Sloan Data Science Environments Launched late fall 2013
38 Career paths and alternaive metrics UW flagship acivity: Establish two new roles on campus: Data Science Fellows and Data ScienIsts Recruited / recruiing data scienists and put processes into place Typically Ph.D.- educated; fully supported by DSE; research posiion with emphasis on taking responsibility for core aciviies (e.g., incubator projects) Recruited / recruiing research scienists and put processes into place Typically Ph.D.- educated; parially supported by DSE; research posiion with emphasis on specific science goals Designated 33 faculty and staff as Data Science Fellows dino We cribbed Berkeley s excellent idea Recruited 6 Provost s IniIaIve faculty members dino Provost provided 6 faculty half- posiions Individuals who are truly π- shaped strength and commitment both to advancing data science methodology and to applying it at the forefront of a specific field Astronomy, Biology, Mechanical Engineering, Sociology, Applied MathemaIcs, StaIsIcs + Computer Science & Engineering Recruited 2 cohorts of 6 Data Science Postdoctoral Fellows dino Each is co- mentored by methodology and applicaions faculty
39 EducaIon and training UW flagship acivity: Establish new graduate program tracks in data science IGERT Ph.D. program in Big Data / Data Science 6 departments have added a transcript- recognized Advanced Data Science OpIon to their Ph.D. programs Data science classes count toward Ph.D. (no extra work) Regular Data Science OpIon coming soon Prepares students to use advanced data science tools, vs. creaing them Started IGERT seminar as the escience Community Seminar Put in place a detailed program evaluaion plan with Data2Insight 3rd cohort of IGERT Ph.D. students, from a variety of departments, arriving this fall Each student is co- mentored by methodology and applicaions faculty Undergraduate transcriptable opion staring this fall Fall 2016 launch of a Data Science Masters degree
40 Workshops and Bootcamps MulIple Soxware Carpentry Bootcamps (Python, R, etc.) AstroData Hack Week Many others Two vibrant seminar series escience Community Seminar (weekly, centered on IGERT students and Data Science Postdoctoral Fellows) Data Science Seminar (external disinguished lectures targeing the campus at large) EducaIon working group is acively tracking all relevant curricular aciviies campus- wide
41 Soxware tools, environments, and support Incubator program Our experiment at achieving scalability A lightweight 2- page proposal process several Imes each year I have an interesing science problem I m stumped by the data science aspects If you cracked it, others would benefit I m going to send you the following person half- Ime for 3 months to provide the labor; you provide the guidance Preceded by an informaion session to clarify expectaions and commitments AcIviIes take place in the Data Science Studio, staffed by our Data ScienIsts We coach soxware hygiene as well as methodology Running two cohorts annually Data Science for Social Good was a special case Incubator cohort Weekly code reviews UW flagship acivity: Establish an incubator seed grant program Leadership in the open source science community Keynotes at PyData ContribuIons to mainstream projects (e.g., scikit- learn (machine learning in Python))
42 Drop- in Office Hours escience InsItute Data ScienIsts UW- IT Academic & CollaboraIve ApplicaIons Team, Research CompuIng Team, Network Design & Architecture Team AWS ScienIfic CompuIng Team Center for StaIsIcs and the Social Sciences StaIsIcal ConsulIng Service UW Libraries Research Data Management Team Google Cloud Plazorm Team Specific broadly applicable tools democraize access to big data and big data infrastructure SQLShare: Database- as- a- Service for scienists and engineers Myria: Easy Scalable- AnalyIcs- as- a- Service with database DNA
43 UW campus- wide monthly reproducibility seminars and working group meeings NaIonal workshops at UW (2014), Berkeley (2015), NYU (2016) Broad involvement from academia, industry, non- profits Drax guidelines for reproducible research Weekly tutorials on research hygiene topics E.g. GitHub, KnitR, ipython Notebook Reproducibility and open science UW flagship acivity: Establish a campus- wide community around reproducible research Template for recording & categorizing research publicaions on reproducibility spectrum Self- cerificaion & badging of research groups for reproducibility
44 Working spaces and culture UW flagship acivity: Establish a Data Science Studio Washington Research FoundaIon Data Science Studio
45 Ethnography and evaluaion UW flagship acivity: Establish a research program in the data science of data science Ethnography and evaluaion integrated into a wide range of Data Science Environment aciviies Project overall (beginning with in- depth baseline interviews with paricipants from grad students through faculty) IGERT (Data Science tracks in muliple Ph.D. programs) Workshops (e.g. Soxware Carpentry, NSF- sponsored Data Science Workshop, M9 Interdisciplinary Workshop), Bootcamps (e.g. Python, R), Hack Weeks (e.g. AstroData Hack) Incubator projects ( regular + Data Science for Social Good) Case studies across Astronomy and Oceanography Developed ethnography research quesions E.g., who does data science, how are they networked, forms of social interacion and organizaion, intellectual groupings, career reward structures, collaboraive tool use in scienific workflows, data science values and ethics, etc. Established baseline for evaluaion, and determined evaluaion quesions
46 General role as a catalyst Annual campus- wide all call data science research poster sessions Various special interest group lunches held periodically to build community (e.g., Big Social Data ) Played a central role in launching Urban@UW A Switzerland to thwart anempts at data science land grabs
47 Similarly at NYU and UC Berkeley Pursuing the same goals Lead in advancing data science methodologies Lead in pu ng these methodologies to work in discovery Lead in creaing environments where data science can flourish Exploring a variety of approaches InteracIng extensively Bi- weekly one- hour teleconferences of the universiies project leadership teams and FoundaIon staff Frequent interacion among each Working Group s members from the three universiies Joint events (AstroData Hack Week, annual Moore/Sloan Data Science Summit, ) Visits Open sharing of successes and importantly failures
48 A 21 st century view of Computer Science: A field unique in its societal impact Medicine & Global Health Energy & Sustainability Transportation Education Scientific Discovery Neural Engineering natural language processing sensors Elder Care mobile computing data science robotics CORE CSE AI, systems, theory, languages, etc. cloud computing computer vision machine learning Accessibility human computer interaction Technology Policy and Societal Implications Security, Privacy, & Safety Advancing the Developing World Interacting with the Physical World: The Internet of Things
49 Is this stuff computer science? Medicine & Global Health Energy & Sustainability Transportation Technology Policy and Societal Implications Education Security, Privacy, & Safety Scientific Discovery Neural Engineering Elder Care Accessibility Advancing the Developing World Interacting with the Physical World: The Internet of Things
50 The last electrical engineer I am worried about the future of our profession. I see the world as an inverted pyramid. It balances precariously on the narrow point at the bonom. This point is being impressed into the ground by the heavy weight at the wide top of the inverted pyramid where all the applicaions reside. Electrical engineering will be in danger of shrinking into a neutron star of infinite weight and importance, but invisible to the known universe. Somewhere in the basement of Intel or its successor the last electrical engineer will sit. Bob Lucky IEEE Spectrum May 1998
51 Computer Science: The ever- expanding sphere Credit: Alfred Spector, Google (ret.)
52 Support for 21 st century cyberinfrastructure Many fields of discovery are becoming informaion fields, not just computaional fields The intellectual approaches of Computer Science are as important to advances as is cyberinfrastructure New approaches will enable new discoveries First we do faster then we do different/smarter/beuer MeeIng evolving cyberinfrastructure needs requires investment in intellectual as well as physical infrastructure We have a crazy obsession with buying shiny objects the bigger and more expensive, the bener
53 NaIonally and insituionally, there are various policies that distort behavior and that should be changed One example: Use of commercial cloud resources essenial to cost- effeciveness and scalability is discouraged by Indirect cost on outsourced services (and not on equipment purchases) This is totally nuts! NSF MRI viewed as a pot separate from Directorates/Divisions InsItuIonal subsidies (power, cooling, space) We re invesing 9:1 in hardware over soxware 1 it ought to be the reverse! 1 According to Ed Seidel when he was at NSF
54 We have a dogged resistance to uilizing commercial soxware, services, and systems We purchase our own We operate our own We roll our own Oxen with amateurs Why? Outmoded policies Subsidies Defense of turf Can a commercial RDBMS host large- scale science data? PoliIcs People whose paychecks depend on convincing you that your needs are so special that no commercial offering could possibly be suitable Failure to do hard- nosed cost- benefit analyses
55 Key anributes of the commercial cloud: EssenIally infinite capacity You pay for exactly what you use (instantaneous expansion and contracion) Zero capital cost 1,000 processors for 1 day costs the same (or less) as 1 processor for 1,000 days (totally revoluionary!) 7x24x365 operaions support, auxiliary power, redundant network connecions, geographical diversity For many services, someone else handles backup, someone else handles soxware updates Sharing and collaboraion are easy It coninuously gets bigger, faster, less expensive, more capable Credit: Werner Vogels, Amazon
56 Some possible acions Eliminate subsidies (or at least be transparent about them)! Space, power, cooling, backup, upgrades Eliminate overhead on outsourced cloud services AUribute NSF MRIs to Directorates/Divisions UW has done this, unilaterally Take steps to encourage and evolve data- intensive discovery that are at least as aggressive as the steps taken decades ago to encourage numerical computaional science Establish the use of commercial cloud services as the strong default for science at all scales. Every request to purchase compuing equipment that won t fit on a desktop should be rigorously jusified. Invest in intellectual infrastructure, so]ware infrastructure, and outsourced services, not big shiny objects!
57 Do not allow a group without a rock- solid track record to be responsible for the creaion of complex mission- criical soxware infrastructure (e.g., for MREFCs) Major naional faciliies to the extent that these are necessary at all should be used only by applicaions that truly require them Take addiional steps to encourage reproducible research and the useful/usable sharing of code and data Recognize that data has both value and cost. How should the costs be covered?
58 Is this a great Ime or what? hrp://lazowska.cs.washington.edu/clsac.pdf, pptx
Why the Big Deal about Big Data?
Why the Big Deal about Big Data? Ed Lazowska Bill & Melinda Gates Chair in Computer Science & Engineering Founding Director, escience Institute University of Washington Technology Alliance Insight to Impact
Big Data and Science: Myths and Reality
Big Data and Science: Myths and Reality H.V. Jagadish http://www.eecs.umich.edu/~jag Six Myths about Big Data It s all hype It s all about size It s all analysis magic Reuse is easy It s the same as Data
An Engagement Model for Master Data Consumers
An Engagement Model for Master Data Consumers 2015-05- 21 David Loshin Knowledge Integrity, Inc. 1 What is Master Data? Master data encompasses the models represening the core business enity objects used
How To Teach Data Science
The Past, Present, and Future of Data Science Education Kirk Borne @KirkDBorne http://kirkborne.net George Mason University School of Physics, Astronomy, & Computational Sciences Outline Research and Application
CPS 216: Advanced Database Systems (Data-intensive Computing Systems) Shivnath Babu
CPS 216: Advanced Database Systems (Data-intensive Computing Systems) Shivnath Babu A Brief History Relational database management systems Time 1975-1985 1985-1995 1995-2005 Let us first see what a relational
Institutes for Data Science: New York University University of Washington University of California, Berkeley
Advancing scientific discovery through collaboration across research domains Institutes for Data Science: New York University University of Washington University of California, Berkeley Data Science growing
Big Data, Enormous Opportunity
Big Data, Enormous Opportunity Ed Lazowska Bill & Melinda Gates Chair in Computer Science & Engineering University of Washington Critical Conversations Lecture Series University at Buffalo The State University
Big Data Challenges in Bioinformatics
Big Data Challenges in Bioinformatics BARCELONA SUPERCOMPUTING CENTER COMPUTER SCIENCE DEPARTMENT Autonomic Systems and ebusiness Pla?orms Jordi Torres [email protected] Talk outline! We talk about Petabyte?
CYBERINFRASTRUCTURE FRAMEWORK FOR 21 st CENTURY SCIENCE AND ENGINEERING (CIF21)
CYBERINFRASTRUCTURE FRAMEWORK FOR 21 st CENTURY SCIENCE AND ENGINEERING (CIF21) Goal Develop and deploy comprehensive, integrated, sustainable, and secure cyberinfrastructure (CI) to accelerate research
Big-Data Computing: Creating revolutionary breakthroughs in commerce, science, and society
Big-Data Computing: Creating revolutionary breakthroughs in commerce, science, and society Randal E. Bryant Carnegie Mellon University Randy H. Katz University of California, Berkeley Version 8: December
Big Data a threat or a chance?
Big Data a threat or a chance? Helwig Hauser University of Bergen, Dept. of Informatics Big Data What is Big Data? well, lots of data, right? we come back to this in a moment. certainly, a buzz-word but
Multiple Disciplines - Faculty Positions
Multiple Disciplines - Faculty Positions University of Toronto Location: Ontario Date posted: 2013-10-07 University of Toronto For details on these and other career opportunities, please visit http://www.uoftcareers.utoronto.ca
The University of Alabama at Birmingham. Information Technology. Strategic Plan 2011 2013
The University of Alabama at Birmingham Information Technology Strategic Plan 2011 2013 Table of Contents Message from the Vice President... 3 About UAB... 4 About UAB Information Technology Meeting needs
Astrophysics with Terabyte Datasets. Alex Szalay, JHU and Jim Gray, Microsoft Research
Astrophysics with Terabyte Datasets Alex Szalay, JHU and Jim Gray, Microsoft Research Living in an Exponential World Astronomers have a few hundred TB now 1 pixel (byte) / sq arc second ~ 4TB Multi-spectral,
The National Consortium for Data Science (NCDS)
The National Consortium for Data Science (NCDS) A Public-Private Partnership to Advance Data Science Ashok Krishnamurthy PhD Deputy Director, RENCI University of North Carolina, Chapel Hill What is NCDS?
Big Data Analytics for Space Exploration, Entrepreneurship and Policy Opportunities. Tiffani Crawford, PhD
Big Analytics for Space Exploration, Entrepreneurship and Policy Opportunities Tiffani Crawford, PhD Big Analytics Characteristics Large quantities of many data types Structured Unstructured Human Machine
Learning from Big Data in
Learning from Big Data in Astronomy an overview Kirk Borne George Mason University School of Physics, Astronomy, & Computational Sciences http://spacs.gmu.edu/ From traditional astronomy 2 to Big Data
High Performance Computing
High Parallel Computing Hybrid Program Coding Heterogeneous Program Coding Heterogeneous Parallel Coding Hybrid Parallel Coding High Performance Computing Highly Proficient Coding Highly Parallelized Code
UNIVERSITY OF MIAMI SCHOOL OF BUSINESS ADMINISTRATION MISSION, VISION & STRATEGIC PRIORITIES. Approved by SBA General Faculty (April 2012)
UNIVERSITY OF MIAMI SCHOOL OF BUSINESS ADMINISTRATION MISSION, VISION & STRATEGIC PRIORITIES Approved by SBA General Faculty (April 2012) Introduction In 1926, we embarked on a noble experiment the creation
Purdue University Department of Computer Science West Lafayette, IN Strategic Plan 2010-2015
Purdue University Department of Computer Science West Lafayette, IN Strategic Plan 2010-2015 Final Version 5.0: May 3, 2010 Computer science is a discipline that involves the understanding and design of
Good morning. It is a pleasure to be with you here today to talk about the value and promise of Big Data.
Good morning. It is a pleasure to be with you here today to talk about the value and promise of Big Data. 1 Advances in information technologies are transforming the fabric of our society and data represent
In- Memory Data Grid. White Paper GridGain Systems, 2013. In- Memory Data Grid - White Paper
In- Memory Data Grid White Paper GridGain Systems, 2013 Copyright 2007-2014 GridGain Systems, Inc. Page 1 of 16 Table of Contents: Re- Imagining UlImate Performance... 4 In- Memory Data Grid at a Glance
How To Manage Research Data At Columbia
An experience/position paper for the Workshop on Research Data Management Implementations *, March 13-14, 2013, Arlington Rajendra Bose, Ph.D., Manager, CUIT Research Computing Services Amy Nurnberger,
Challenges in e-science: Research in a Digital World
Challenges in e-science: Research in a Digital World Thom Dunning National Center for Supercomputing Applications National Center for Supercomputing Applications University of Illinois at Urbana-Champaign
Big Data Hope or Hype?
Big Data Hope or Hype? David J. Hand Imperial College, London and Winton Capital Management Big data science, September 2013 1 Google trends on big data Google search 1 Sept 2013: 1.6 billion hits on big
Example application (1) Telecommunication. Lecture 1: Data Mining Overview and Process. Example application (2) Health
Lecture 1: Data Mining Overview and Process What is data mining? Example applications Definitions Multi disciplinary Techniques Major challenges The data mining process History of data mining Data mining
Splunk/Ironstream and z/os IT Ops
Copyright 2015 Splunk Inc. Splunk/Ironstream and z/os IT Ops John Reda VP Customer Experience Syncsort Incorporated Disclaimer During the course of this presentaion, we may make forward looking statements
Information Technology R&D and U.S. Innovation
Information Technology R&D and U.S. Innovation Peter Harsha Computing Research Association Ed Lazowska University of Washington Version 9: December 18, 2008 1 Peter Lee Carnegie Mellon University Advances
GYAN VIHAR SCHOOL OF ENGINEERING & TECHNOLOGY M. TECH. CSE (2 YEARS PROGRAM)
GYAN VIHAR SCHOOL OF ENGINEERING & TECHNOLOGY M. TECH. CSE (2 YEARS PROGRAM) Need, objectives and main features of the Match. (CSE) Curriculum The main objective of the program is to develop manpower for
The Challenge of Data in an Era of Petabyte Surveys Andrew Connolly University of Washington
The Challenge of Data in an Era of Petabyte Surveys Andrew Connolly University of Washington We acknowledge support from NSF IIS-0844580 and NASA 08-AISR08-0081 The science of big data sets Big Questions
COGNITIVE SCIENCE AND NEUROSCIENCE
COGNITIVE SCIENCE AND NEUROSCIENCE Overview Cognitive Science and Neuroscience is a multi-year effort that includes NSF s participation in the Administration s Brain Research through Advancing Innovative
Computational Science and Informatics (Data Science) Programs at GMU
Computational Science and Informatics (Data Science) Programs at GMU Kirk Borne George Mason University School of Physics, Astronomy, & Computational Sciences http://spacs.gmu.edu/ Outline Graduate Program
Statistics for BIG data
Statistics for BIG data Statistics for Big Data: Are Statisticians Ready? Dennis Lin Department of Statistics The Pennsylvania State University John Jordan and Dennis K.J. Lin (ICSA-Bulletine 2014) Before
Applications of Deep Learning to the GEOINT mission. June 2015
Applications of Deep Learning to the GEOINT mission June 2015 Overview Motivation Deep Learning Recap GEOINT applications: Imagery exploitation OSINT exploitation Geospatial and activity based analytics
SYRACUSE UNIVERSITY: ACADEMIC STRATEGIC PLAN
SYRACUSE UNIVERSITY: ACADEMIC STRATEGIC PLAN These are extraordinary times for higher education and for students aspiring to succeed in a rapidly changing world. Technology is affecting the way we learn
Online Computer Science Degree Programs. Bachelor s and Associate s Degree Programs for Computer Science
Online Computer Science Degree Programs EDIT Online computer science degree programs are typically offered as blended programs, due to the internship requirements for this field. Blended programs will
Leading with Services. Raghunandan K S Managing Director Services, CISCO India & SAARC
Leading with Raghunandan K S Managing Director, CISCO India & SAARC SUPPLIERS CUSTOMERS Product Focused Much of future Supplier Growth will be determined in this white space Business Outcome Focused 2012
Undergraduate Studies Department of Astronomy
WIYN 3.5-meter Telescope at Kitt Peak near Tucson, AZ Undergraduate Studies Department of Astronomy January 2014 Astronomy at Indiana University General Information The Astronomy Department at Indiana
Sanjeev Kumar. contribute
RESEARCH ISSUES IN DATAA MINING Sanjeev Kumar I.A.S.R.I., Library Avenue, Pusa, New Delhi-110012 [email protected] 1. Introduction The field of data mining and knowledgee discovery is emerging as a
Participants: Introduction:
National Conversation A Trusted Cyber Future Discussion Led by Dan Massey, CSD Program Manager Moderator: Joe Gersch (Secure 64) Department of Homeland Security Science and Technology Directorate (DHS
Cloud Based E-Government: Benefits and Challenges
Cloud Based E-Government: Benefits and Challenges Saleh Alshomrani 1 and Shahzad Qamar 2 1 Faculty of Computing and IT, King Abdulaziz University, Jeddah, Saudi Arabia 2 Faculty of Computing and IT, North
Big Data Analytics. Prof. Dr. Lars Schmidt-Thieme
Big Data Analytics Prof. Dr. Lars Schmidt-Thieme Information Systems and Machine Learning Lab (ISMLL) Institute of Computer Science University of Hildesheim, Germany 33. Sitzung des Arbeitskreises Informationstechnologie,
DEVELOPMENT OF A KNOWLEDGE ECONOMY
DEVELOPMENT OF A KNOWLEDGE ECONOMY HIGHER EDUCATION 6.128 In the global knowledge economy the competitiveness of nations is increasingly determined by their capacity to generate, assimilate and apply new
COLUMBIA UNIVERSITY COLLEGE OF PHYSICIANS & SURGEONS SPURS. A Biomedical Research Program (For Students at CUNY Senior Colleges)
A Biomedical Research Program (For Students at CUNY Senior Colleges) The Summer Program for Under-Represented Students ( ), a Biomedical Research Program, is designed to provide under-represented minority
Institutionalizing Change to Improve Doctoral Completion
Institutionalizing Change to Improve Doctoral Completion Ph.D. Completion Project Interventions CGS July 2010 Dr. Judith Stoddart, Assistant Dean Dr. Karen Klomparens, Dean, Overall Goals Raising awareness
Concept and Project Objectives
3.1 Publishable summary Concept and Project Objectives Proactive and dynamic QoS management, network intrusion detection and early detection of network congestion problems among other applications in the
STUDY AT ONE OF THE WORLD S BEST UNIVERSITIES
STUDY AT ONE OF THE WORLD S BEST UNIVERSITIES WHY AT UOW Psychology at UOW connects you with people, programs and technology to enhance your learning experience. 1. RECOGNISED AUSTRALIA-WIDE When peak
research center concept
research center concept research center concept 1 Overview As a key part of the Russian Skolkovo Initiative, the Skolkovo Foundation, MIT, and others are assisting in the creation of the Skolkovo Institute
Make the Most of Big Data to Drive Innovation Through Reseach
White Paper Make the Most of Big Data to Drive Innovation Through Reseach Bob Burwell, NetApp November 2012 WP-7172 Abstract Monumental data growth is a fact of life in research universities. The ability
Master s Degree Programs. Global Technology Leadership
Master s Degree Programs Global Technology Leadership Why BU? Consistently ranked among the most prestigious graduate engineering programs in the U.S. Study with the Best Work with renowned faculty and
Dear Prospective Applicant:
Dear Prospective Applicant: Thank you for your interest in the MIT MSRED Program. While we will try to provide a quick overview, for the most up-to-date information on the MSRED and upcoming events, please
Budget Proposal Narrative Division: Academic Affairs
Budget Proposal Narrative Division: Academic Affairs Big Data/Analytics at Western Statement of Purpose: (What is the problem or opportunity being addressed? How will you address this problem or opportunity?)
Dale Masterson Director, Engineering Student Services
Dale Masterson Director, Engineering Student Services Have a solid foundation in math and science! Are analytical thinkers who want to solve problems Enjoy figuring out how things work work! Enjoy utilizing
Origins of the Cosmos Summer 2016. Pre-course assessment
Origins of the Cosmos Summer 2016 Pre-course assessment In order to grant two graduate credits for the workshop, we do require you to spend some hours before arriving at Penn State. We encourage all of
Deep Learning Meets Heterogeneous Computing. Dr. Ren Wu Distinguished Scientist, IDL, Baidu [email protected]
Deep Learning Meets Heterogeneous Computing Dr. Ren Wu Distinguished Scientist, IDL, Baidu [email protected] Baidu Everyday 5b+ queries 500m+ users 100m+ mobile users 100m+ photos Big Data Storage Processing
Creating signal from noise:
Creating signal from noise: Applying Big Data and advanced analytics Presented by: Stacy Coggeshall, Tufts Health Plan Chris Coloian, CEO Predilytics, Inc. OCTOBER 23-25, 2013 SCOTTSDALE, AZ Discussion
Potential Career Tracks Associated With the APS Undergraduate Major
Potential Career Tracks Associated With the APS Undergraduate Major This document is an outline of the various career paths that recent APS graduates have taken. We encourage all majors to read these descriptions
Program: Organizational Leadership. Department: Psychology. College: Arts & Sciences. Year: 2014-2015
Program: Organizational Leadership Department: Psychology College: Arts & Sciences Year: 2014-2015 Primary Faculty: Stacie Holloway 513-556-0176 [email protected] Faculty : Donna Chrobot Mason
Cloud computing is a marketing term that means different things to different people. In this presentation, we look at the pros and cons of using
Cloud computing is a marketing term that means different things to different people. In this presentation, we look at the pros and cons of using Amazon Web Services rather than setting up a physical server
The College of Science Graduate Programs integrate the highest level of scholarship across disciplinary boundaries with significant state-of-the-art
GRADUATE PROGRAMS The College of Science Graduate Programs integrate the highest level of scholarship across disciplinary boundaries with significant state-of-the-art research centers and experiential
Graduate Research and Education: New Initiatives at ORNL and the University of Tennessee
Graduate Research and Education: New Initiatives at ORNL and the University of Tennessee Presented to Joint Workshop on Large-Scale Computer Simulation Jim Roberto Associate Laboratory Director Graduate
Master of Integrated Innovation for Products & Services (9 months)
Master of Integrated Innovation for Products & Services (9 months) 2015-2016 Course Descriptions Fall 2015 60 units Fundamental Courses (Two of the three courses below are required) Engineering Design
LSST and the Cloud: Astro Collaboration in 2016 Tim Axelrod LSST Data Management Scientist
LSST and the Cloud: Astro Collaboration in 2016 Tim Axelrod LSST Data Management Scientist DERCAP Sydney, Australia, 2009 Overview of Presentation LSST - a large-scale Southern hemisphere optical survey
See more info at: http://www.ucop.edu/cosmos/faqs.html
California State Summer School for Mathematics & Science (COSMOS) Three UC campuses, Davis, Irvine, and Santa Cruz, welcome high school students who excel in mathematics and science to a fourweek residential
UNIVERSITY OF HAWAI I AT MĀNOA ASTRONOMY & ASTROPHYSICS. The University of Hawai i is an equal opportunity/affirmative action institution.
UNIVERSITY OF HAWAI I AT MĀNOA ASTRONOMY & The University of Hawai i is an equal opportunity/affirmative action institution. A M E SSAG E F R O M THE PROGRAM U N D E R G R A D U AT E P R O G R A M S I
Student Handbook 2015-2016. Master of Information Systems Management (MISM)
Student Handbook 2015-2016 Master of Information Systems Management (MISM) Table of Contents Contents 1 Masters of information systems Management (MISM) Curriculum... 3 1.1 Required Courses... 3 1.2 Specialization
SECURE AND TRUSTWORTHY CYBERSPACE (SaTC)
SECURE AND TRUSTWORTHY CYBERSPACE (SaTC) Overview The Secure and Trustworthy Cyberspace (SaTC) investment is aimed at building a cybersecure society and providing a strong competitive edge in the Nation
The Bachelor of Science program in Environmental Science is a broad, science-based
The Bachelor of Science program in Environmental Science is a broad, science-based curriculum designed to prepare students for a variety of environmentally-related technical careers, as well as for graduate
Education and Workforce Development in the High End Computing Community
Education and Workforce Development in the High End Computing Community The position of NITRD s High End Computing Interagency Working Group (HEC-IWG) Overview High end computing (HEC) plays an important
