CISE Overview and Big Data


1 CISE Overview and Big Data Suzi Iacono CISE Directorate National Science Foundation SI^2 Workshop January 17, 2013 Image Credit: Exploratorium.

2 Economic Impact of IT Growth of IT industry coupled with productivity gains across the entire economy has had enormous impact. IT industries accounted for 25% of US economic growth since In 2010, IT industries grew 16% and contributed 5% to overall US GDP Use and production of IT accounted for ~2/3 of the post-1995 growth in labor productivity. IT sector generates jobs: IT jobs have grown 125x faster than employment as a whole between 2001 and 2011, and in 2011, IT workers earned 74% more than the average worker. IT diversifies regional economies to include idea-driven creative industries. Sources: NRC (2009). Assessing the Impacts of Changes in the IT R&D Ecosystem.; NRC (2012). Continuing Innovation in Information Technology.; ITIF (2012). Looking for Jobs? Look to IT in 2010 and Beyond.

3 CISE Directorate Computing and Communication Foundations (CCF) Susanne Hambrusch Computer and Network Systems (CNS) Keith Marzullo Information and Intelligent Systems (IIS) Howard Wactlar 70% CISE Core Programs: Algorithmic Foundations; Communication and Information Foundations; Software and Hardware Foundations; Computer Systems Research; Networking Technology and Systems; Human-Centered Computing; Information Integration and Informatics; Robust Intelligence. CISE Cross-Cutting Programs 30% Cross-Foundation Programs

4 CISE Directorate Computing and Communication Foundations (CCF) Susanne Hambrusch Computer and Network Systems (CNS) Keith Marzullo Information and Intelligent Systems (IIS) Howard Wactlar Office of Cyberinfrastructure (OCI) Alan Blatecky 70% CISE Core Programs: Algorithmic Foundations; Communication and Information Foundations; Software and Hardware Foundations; Computer Systems Research; Networking Technology and Systems; Human-Centered Computing; Information Integration and Informatics; Robust Intelligence. CISE Cross-Cutting Programs 30% Cross-Foundation Programs

5 Word Cloud created from CISE FY 2011 award titles.

6

7 Who is the CISE community? PI and Co-PI Departments for FY 2011 Awards Funded by CISE: Computer Science & Information Science & Computer Engineering (CISE), 65%; Sciences & Humanities, 21%; Engineering (excluding Computer Engineering), 11%; Interdisciplinary Centers, 3%

8 Research Frontiers Data Explosion; Smart Systems: Sensing, Analysis and Decision; Expanding the Limits of Computation; Secure Cyberspace; Universal Connectivity; Augmenting Human Capabilities

9 Advances in information technologies are transforming the fabric of our society and data represents a transformative new currency for science, engineering, education and commerce. Image Credit: CCC and SIGACT CATCS

10 Where do the data come from? Why do we have a national initiative?

11 The Big Data Landscape I: Big Science Science gathers data at an ever-increasing rate across all scales and complexities of natural phenomena. Sloan Digital Sky Survey in 2000 collected more data in its first few weeks than had been amassed in the entire history of astronomy. Within a decade, over 140 terabytes of information collected. Large Hadron Collider generates scores of petabytes a year. The proposed Large Synoptic Survey Telescope (3.3 gigapixel digital camera) will generate 40 terabytes of data nightly. By 2015, the world will generate the equivalent of approximately 93 million Libraries of Congress.
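
As a rough sense of scale for the nightly telescope figure on the slide above, the back-of-the-envelope sketch below converts 40 terabytes per night into petabytes per year. It assumes, purely for illustration, that data are taken every night of the year and that decimal units apply (1 PB = 1000 TB); neither assumption comes from the slide, and the function name is hypothetical.

```python
TB_PER_NIGHT = 40        # figure quoted on the slide
NIGHTS_PER_YEAR = 365    # assumption: data are taken every night
TB_PER_PB = 1000         # assumption: decimal units


def yearly_petabytes(tb_per_night, nights=NIGHTS_PER_YEAR):
    """Convert a nightly data volume in TB into a yearly volume in PB."""
    return tb_per_night * nights / TB_PER_PB


if __name__ == "__main__":
    print(yearly_petabytes(TB_PER_NIGHT))
```

Under these assumptions the yearly volume is in the tens of petabytes, the same order of magnitude the slide attributes to the Large Hadron Collider.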

12 The Big Data Landscape II: Smart Sensing, Reasoning and Decisionmaking Environment Sensing; Emergency Response; People-Centric Sensing; Pervasive Computing; Social Informatics; Smart Health Care. Percepts (sensors), Agent (Reasoning), Actions (controllers). Situation Awareness: Humans as sensors feed multimodal data streams. Evaluate, Sense, Intervene, Identify, Assess. Personal Sensing, Public Sensing, Social Sensing. Credit: Photo by US Geological Survey. Source: Sajal Das, Keith Marzullo

13 The Big Data Landscape III: New Paradigms for Communications Today 1988 Remarkable Pace of Innovation MOBILE SOCIAL NETWORKS VIDEO BLOGS VOIP

14 The Big Data Landscape IV: The Long Tail of Science Hundreds of thousands of scientists and engineers work individually or in small, distributed, disconnected groups all generating data that collectively represent an enormous, largely untapped scientific resource From running simulations, experiments, etc. Making heterogeneous data across many areas of science more homogeneous could give way to breakthroughs across all areas of science and engineering Estimated 40 exabytes of unique new information generated worldwide in 2010 Only 5% of the information created is structured, however, in a standard format of words or numbers; the rest are unstructured text, voice, images, etc.

15 How Big is Big? Big Data: Datasets whose size is beyond the ability of typical database software tools to capture, store, manage, and analyze -McKinsey Global Institute, Big data: the next frontier for innovation, competition, and productivity, May Image Credit: Sigrid Knemeyer

16 Not Just Volumes of Data The science of big data is not just about volumes and velocity of data, but also Heterogeneity and diversity Levels of granularity Media formats Scientific disciplines Complexity Uncertainty Incompleteness Representation types

17 Why is Big Data Important? Critical to transforming how science is done and to accelerating the pace of discovery in almost every science and engineering discipline Transformative implications for commerce and economy Potential for addressing some of society's most pressing challenges Image Credit: Chi Birmingham

18 Paradigm Shift: from Hypothesis-driven to Data-driven Discovery The Economist, The data deluge and how to handle it: a 14-page special report (Feb 25, 2010). The Fourth Paradigm: Data-Intensive Scientific Discovery (2009, Microsoft Corporation). http://www.sciencemag.org/site/special/data/ http://www.economist.com/node/ http://research.microsoft.com/en-us/collaboration/fourthparadigm/

19 The Age of Data: From Data to Knowledge to Action Data-driven discovery is revolutionizing scientific exploration and engineering innovations Automatic extraction of new knowledge about the physical, biological and cyber world continues to accelerate Multi-cores, concurrent and parallel algorithms, virtualization and advanced server architectures will enable data mining and machine learning, and discovery and visualization of Big Data

20 Potential for Transformational Science & Engineering: From Data to Knowledge to Action Integration of discipline (or media format) specific data, examine for relationships Disaster informatics 3D toxic fume images Simulations of gas spread Maps of census concentrations First responder on-the-ground findings Evacuation routing

21 Examples of Research Challenges More data are being collected than we can store Analyze the data as it becomes available Decide what to archive and what to discard Many data sets are too large to download Analyze the data wherever it resides Many data sets are too poorly organized to be usable Better organize and retrieve data Many data sets are heterogeneous in type, structure, semantics, organization, granularity, accessibility Integrate and customize access to federate data Utility of data is limited by our ability to interpret and use it Extract and visualize actionable knowledge Evaluate results Large and linked datasets may be exploited to identify individuals Design management and analysis with built-in privacy preserving characteristics
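
The first challenge above pairs "more data than we can store" with "analyze the data as it becomes available". One common way to do that is to keep constant-size summary statistics that are updated per item, so raw values can be discarded once seen. The sketch below uses Welford's online algorithm for a running mean and variance; it is an illustrative example, not a method taken from the slides, and the names `RunningStats` and `summarize` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class RunningStats:
    """Constant-memory summary of a numeric stream (Welford's algorithm)."""
    count: int = 0
    mean: float = 0.0
    m2: float = 0.0  # running sum of squared deviations from the mean

    def update(self, x: float) -> None:
        # Fold one new observation into the summary; x is not stored.
        self.count += 1
        delta = x - self.mean
        self.mean += delta / self.count
        self.m2 += delta * (x - self.mean)

    @property
    def variance(self) -> float:
        # Sample variance; defined as 0.0 until two values have been seen.
        return self.m2 / (self.count - 1) if self.count > 1 else 0.0


def summarize(stream):
    """Consume any iterable of numbers in one pass and return its summary."""
    stats = RunningStats()
    for x in stream:
        stats.update(x)
    return stats


if __name__ == "__main__":
    s = summarize(iter([2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]))
    print(s.count, s.mean, s.variance)
```

Because the summary never grows with the stream, the same pattern applies whether the input is a file too large to download or a sensor feed that is never archived.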

22 A National Imperative PCAST calls on the Federal government to increase R&D investments for collecting, storing, preserving, managing, analyzing, and sharing the increasing quantities of data. Furthermore, PCAST observed that the potential to gain new insights to move from data to knowledge to action has tremendous potential to transform all areas of national priority. Source: PCAST (December 2010), Report to the President and Congress: Designing a Digital Future a periodic congressionally-mandated review of the Federal Networking and Information Technology Research and Development (NITRD) Program.

23 Administration's Big Data Research and Development Initiative Big Data Senior Steering Group chartered in spring 2011 under the Networking and Information Technology R&D (NITRD) Program Members from DARPA, DOD OSD, DHS, DOE-Science, HHS, NARA, NASA, NIST, NOAA, NSA, OFR, USGS, etc. Co-chaired by NSF (and NIH) Initial charge was to come up with a plan, a strategy Image Credit: Fuqing Zhang and Yonghui Weng, Pennsylvania State University; Frank Marks, NOAA; Gregory P. Johnson, Romy Schneider, John Cazes, Karl Schulz, Bill Barth, The University of Texas at Austin

24 Big Data Membership
Biven, Laura: DOE Science
Blatecky, Alan: NSF (National Science Foundation)
Collica, Leslie: NIST (National Institute of Standards and Technology)
Deift, Abby: NSF (National Science Foundation)
Downing, Gregory: HHS (Department of Health and Human Services)
Espina, Pedro: OSTP (White House Office of Science and Technology Policy)
Gerr, Neil: DARPA (Defense Advanced Research Projects Agency)
Gundersen, Linda: USGS (U.S. Geological Survey)
Hall, Alan: NOAA (National Oceanic and Atmospheric Administration)
Iacono, Suzanne: NSF (National Science Foundation)
Jakubek, David: OSD (Office of the Secretary of Defense), PATL
Kaufman, Daniel: DARPA (Defense Advanced Research Projects Agency)
Larson, Phillip P.: OSTP (White House Office of Science and Technology Policy)
Lee, Tsengdar: NASA (National Aeronautics and Space Administration)
Lipman, David: NIH (National Institutes of Health) / NLM (NIH's National Library of Medicine) / NCBI
Little, Michael: NASA (National Aeronautics and Space Administration)
Luker, Mark: NCO (National Coordination Office for NITRD) / NITRD (Networking and Information Technology Research and Development)
Marth, Lisa: NIST (National Institute of Standards and Technology)
Muoio, Patricia A.: DNI
Pantula, Sastry: NSF (National Science Foundation)
Preuss, Don: NIH (National Institutes of Health) / NLM (NIH's National Library of Medicine) / NCBI
Quade, Brittany: NSF (National Science Foundation)
Romine, Charles: NIST (National Institute of Standards and Technology)
Smith, Darren: NOAA (National Oceanic and Atmospheric Administration)
Spengler, Sylvia: NSF (National Science Foundation)
Statler, Tom: NSF (National Science Foundation)
Strawn, George: NCO (National Coordination Office for NITRD) / NITRD (Networking and Information Technology Research and Development)
Suskin, Mark: NSF (National Science Foundation)
Villani, Jennifer: NIH (National Institutes of Health) / NIGMS
Wigen, Wendy: NCO (National Coordination Office for NITRD) / NITRD (Networking and Information Technology Research and Development)
Zhao, Fen: NSF (National Science Foundation)
Nowell, Lucy: DOE Science

25 Big Data Membership
Bristol, Sky: USGS (U.S. Geological Survey)
Kielman, Joseph: DHS (Department of Homeland Security)
Petters, Jonathan: DOE Science
Carver, Doris: NSF (National Science Foundation)
Adolfie, Laura: OSD (Office of the Secretary of Defense)
Chadduck, Robert: NSF (National Science Foundation)
Crowder, Grace: NSA (National Security Agency)
Dunn, Michelle: NIH (National Institutes of Health)
Florance, Valerie: NIH (National Institutes of Health)
Frehill, Lisa: OSD (Office of the Secretary of Defense)
Helland, Barbara: DOE (Department of Energy), Science
Hoang, Thuc: DOE (Department of Energy), NNSA
Kannan, Nandini: NSF (National Science Foundation)
Warnow, Tandy: NSF (National Science Foundation)
Dean, David: DOE Science
Lyster, Peter: NIH (National Institutes of Health)
Millemaci, John: OSD (Office of the Secretary of Defense)
Pearce, Claudia: NSA (National Security Agency)
Allen, Marc
Pearl, Jennifer: NSF (National Science Foundation)
Blaszkowsky, David: Treasury (Department of the Treasury), OFR
Szykman, James: EPA (Environmental Protection Agency)
Tompkins, Jerry: NSA (National Security Agency)
Flood, Mark: Treasury (Department of the Treasury), OFR
Holm, Jeanne: Data.gov
Misawa, Eduardo: NSF (National Science Foundation)

26 Big Data Launch Federal Big Data R&D Initiative launched by White House OSTP on March 29, 2012 at AAAS Federal Announcements: NSF Subra Suresh NIH Francis Collins USGS Marcia McNutt DoD Zach Lemnios DARPA - Ken Gabriel DOE William Brinkman Panel Discussion: Moderator - Steve Lohr, New York Times Daphne Koller, Stanford University James Manyika, McKinsey & Company Lucila Ohno-Machado, UC San Diego Alex Szalay, Johns Hopkins University Image Credit: National Science Foundation More information available at:

27 Strategy to Address Big Data Foundational research to develop new techniques and technologies to derive knowledge from data; New cyberinfrastructure to manage, curate, and serve data to research communities; Policy; New approaches for education and workforce development; New types of inter-disciplinary collaborations, grand challenges, and competitions

28 Core Techniques and Technologies for Advancing Big Data Science & Engineering (BIG DATA) Program Solicitation: NSF Foundational research to extract knowledge from data Foundational research to advance the core techniques and technologies for managing, analyzing, visualizing, and extracting useful information from large, diverse, distributed and heterogeneous data sets. Image Credit: Jurgen Schulze, Calit2, UC-San Diego Cross-Directorate Program: NSF Wide Multi-agency Commitment: NSF and NIH

29 BIG DATA Research Thrusts Collection, Storage, and Management of "Big Data": 3 awards. Foundations of big data management; Mitigating tradeoffs among speed of data ingestion, quicker answers and the freshness of data through the design of new storage devices with extreme capacities. Data Analytics: 4 awards. Novel machine learning where multi-dimensional vector data points are replaced by distributions; Design and test mathematical and statistical techniques for large-scale heterogeneous data in DNA repositories; Data analytics problems in next generation sequencing; Theory and algorithms for coupled tensors and associated software toolkits to make analysis possible. Research in Data Sharing and Collaboration: 1 award (+1 shared with data collection). Open source tools for infrastructure for improving discovery through use of social analytic data; Databridge linking data, human interactions, and usage practices for the long-tail of science. Credit: Fermilab Photo. Eight mid-scale (up to $1M a year) awards out of over 136 projects announced on Oct. 3.

30 Award Citations DCM: Dan Suciu University of WA A formal foundation for big data management Michael Bender SUNY at Stony Brook & Martin Farach-Colton Rutgers University Eliminating the data ingestion bottleneck in big data applications Arcot Rajasekar University of North Carolina, Chapel Hill & Gary King Harvard University & Justin Zhan North Carolina Agricultural & Technical State University Databridge A sociometric system for long-tail science data collections

31 Award Citations Data Analytics Eli Upfal Brown University Analytic approaches to massive data computation with applications to genomics Aarti Singh Carnegie Mellon University Distribution-based machine learning for high dimensional datasets Srinivas Aluru Iowa State University & Wuchun Feng Virginia Polytechnic Institute & State University & Oyekunle Olukotun Stanford University Genomes Galore Core techniques, libraries, and domain specific languages for high throughput DNA sequencing

32 Award Citations Data Analytics (continued) Christos Faloutsos Carnegie Mellon University & Nikolaos Sidiropoulos University of Minnesota Twin Cities Big Tensor Mining: Theory Scalable Algorithms and Applications

33 Award Citations E-Science Collaboration Environments Thorsten Joachims Cornell University & Paul Kantor Rutgers University Discovery and social analysis for large-scale scientific literature

34 Ideation Contest Launch Opportunity to expand the innovation ecosystem Joint among NASA, NSF and DOE Office of Science A contest focused on How to make heterogeneous data seem more homogeneous? 5 judges 5 criteria Launched on Challenge.gov and the Top Coder platform on Oct. 3 with a two week window

35 Ongoing Big Data Programs at NSF Dear Colleague Letters: Encourage CIF21 IGERTs to educate and support a new generation of researchers able to address fundamental Big Data challenges: Data-Intensive Education-Related Research Funding Opportunities announcing an Ideas Lab, for which cross disciplinary participation will be solicited, to generate transformative ideas for using large datasets to enhance the effectiveness of teaching and learning environments: Data Citation to the Geosciences Community to encourage transparency and increased opportunities for the use and analysis of data sets:

36 Earthcube: GEO Science Infrastructure EAGER awards announced as part of White House Big Data Launch Integrates geosciences data and high-performance computing technologies in an open, adaptable and sustainable framework to enable transformative research and education in Earth System Science Innovative Model: Community designed, community owned, community governed Interdisciplinary research: Building and sustaining new communities Workshops to bring together (GEO, SBE, CISE) communities EAGER awards to seed new research

37 A Complex Policy Setting Researchers want data. Public policy requires access to data. Public policy also requires protection of privacy and intellectual property and other sensitive information. Much more to be done: Policy on data management and data access.

38 Data Privacy Never more important than today However, not all data contain people's identities (as in data landscape III) Not a Big Brother scenario Government (NSF) invests in privacy research Values in design research community: Identity cloaking, anonymization Do-not-track cookie management Obfuscation, blurring Privacy preserving data mining, search, payment Just-in-time crypto Secure data distribution. Privacy in technology; privacy inspired technology

39 Emerging Frontiers Data Explosion; Smart Systems: Sensing, Analysis and Decision; Expanding the Limits of Computation; Secure Cyberspace; Universal Connectivity; Augmenting Human Capabilities

40 Processor Performance Plateaued Around the Year 2004 [Figure: Microprocessor Performance "Expectation Gap" over Time (projected); horizontal axis: Year of Introduction.] Image Credit: USC BMES ERC. Credit: Graph reprinted with permission from The Future of Computing Performance: Game Over or Next Level? National Academy of Sciences (2011).

41 Impact of Single-Processor Performance Plateau Accentuated by emergence of massive data sets, scientists have an increasing appetite and need for speed and performance. Important new science questions in physics, materials, biology, health and medicine, and climate change require increased processing power. Support of national defense and intelligence community will need increasingly more processing power. Applications include training simulations, autonomous robotic vehicles, airport security, surveillance, video analytics, infrastructure defense against cyber attacks, and data analysis for intelligence. Both consumer and enterprise needs are increasing. Applications include search and data mining, real-time decision-making, web services, digital content creation, speech recognition, and simulation and modeling for product design.

42 Research to Expand the Limits of Computation Exploiting Parallelism and Scalability: XPS (NSF 13-507). Happening now: Architectural innovations with multi-core and many-core; Domain-specific integrated circuits; Energy-efficient computing and new processor architectures. Mid-term solutions: Research agenda based on parallelism, concurrency, and scalability; Algorithmic innovations exploiting parallelism; Software systems leading to improved performance. Long-term solutions: New materials (e.g., carbon nano-tubes, graphene based devices); Non-charge transfer devices (e.g., electron spin); Bio, nano, and quantum devices.

43 Computing Research Agenda on Parallelism, Concurrency, and Scalability Computational models and programming languages to enable new ways of thinking parallel and expression of parallelism at every scale. Algorithms and algorithmic paradigms that allow reasoning about parallel performance and scalability. Software systems capable of handling both small and extreme-scale data systems and aware of communication and energy use. Synthesis tools that generate efficient parallel codes from high-level descriptions. Scalable and energy-efficient architectures ranging from sensors to clouds while addressing programmability, reliability, and security. A new cross-layer approach integrating both software and hardware through new programming languages, models, algorithms, compilers, runtime systems and architectures. http://www.nap.edu/catalog.php?record_id=

44 Advanced Computational Infrastructure Anticipate and invest in diverse and innovative national scale shared resources, outreach and education complementing campus and other national investments Leverage and invest in collaborative flexible fabrics dynamically connecting scientific communities with computational resources and services at all scales (campus, regional, national, international) CIPRES: Cyberinfrastructure for Phylogenetic Research; XSEDE

45 Opportunities for the Future Our investments in research and education have already returned exceptional dividends to the Nation. Many of tomorrow's breakthroughs will occur as a result of new techniques and technologies for advancing computing science and engineering. In turn, scientific discovery and technological innovation are at the core of our response to national and societal challenges from environment, energy, transportation, sustainability, and healthcare to cyber security and national defense.

46 Thanks!

Big Data R&D Initiative

Big Data R&D Initiative Big Data R&D Initiative Howard Wactlar CISE Directorate National Science Foundation NIST Big Data Meeting June, 2012 Image Credit: Exploratorium. The Landscape: Smart Sensing, Reasoning and Decision Environment

More information

Suzi Iacono CISE Directorate National Science Foundation

Suzi Iacono CISE Directorate National Science Foundation Big Data R&D Initiative Suzi Iacono CISE Directorate National Science Foundation National Academies of Science Integrating Environmental Health Data to Advance Discovery January 11, 2013 Image Credit:

More information

Big Data R&D Initiative

Big Data R&D Initiative Big Data R&D Initiative Mhyron Gutmann Directorate for the Social, Behavioral and Economic Sciences National Science Foundation Image Credit: Exploratorium. Digital Preservation 2012 July 25, 2012 Advances

More information

Core Techniques and Technologies for Advancing Big Data Science & Engineering (BIGDATA) NSF 12-499

Core Techniques and Technologies for Advancing Big Data Science & Engineering (BIGDATA) NSF 12-499 Core Techniques and Technologies for Advancing Big Data Science & Engineering (BIGDATA) NSF 12-499 Vasant Honavar Program Director Information & Intelligent Systems (IIS) Division Computer and Information

More information

National Big Data R&D Initiative

National Big Data R&D Initiative National Big Data R&D Initiative Suzi Iacono, PhD National Science Foundation Co-chair NITRD Big Data Senior Steering Group for CASC Spring Meeting April 23, 2014 Why is Big Data Important? Transformative

More information

MEETING SUMMARY. Wednesday, November 28, 2012:

MEETING SUMMARY. Wednesday, November 28, 2012: MEETING SUMMARY Advisory Committee Directorate for Computer and Information Science and Engineering November 28 29, 2012 National Science Foundation 4201 Wilson Boulevard, Arlington, VA 22230 The fall

More information

NITRD and Big Data. George O. Strawn NITRD

NITRD and Big Data. George O. Strawn NITRD NITRD and Big Data George O. Strawn NITRD Caveat auditor The opinions expressed in this talk are those of the speaker, not the U.S. government Outline What is Big Data? Who is NITRD? NITRD's Big Data Research

More information

Good morning. It is a pleasure to be with you here today to talk about the value and promise of Big Data.

Good morning. It is a pleasure to be with you here today to talk about the value and promise of Big Data. Good morning. It is a pleasure to be with you here today to talk about the value and promise of Big Data. 1 Advances in information technologies are transforming the fabric of our society and data represent

More information

CYBERINFRASTRUCTURE FRAMEWORK FOR 21 ST CENTURY SCIENCE, ENGINEERING, AND EDUCATION (CIF21)

CYBERINFRASTRUCTURE FRAMEWORK FOR 21 ST CENTURY SCIENCE, ENGINEERING, AND EDUCATION (CIF21) CYBERINFRASTRUCTURE FRAMEWORK FOR 21 ST CENTURY SCIENCE, ENGINEERING, AND EDUCATION (CIF21) Overview The Cyberinfrastructure Framework for 21 st Century Science, Engineering, and Education (CIF21) investment

More information

Symposium on the Interagency Strategic Plan for Big Data: Focus on R&D

Symposium on the Interagency Strategic Plan for Big Data: Focus on R&D Symposium on the Interagency Strategic Plan for Big Data: Focus on R&D NAS Board on Research Data and Information October 23, 2014 Big Data Senior Steering Group (BDSSG) Allen Dearry, NIH, Co-Chair Suzi

More information

Big Data. George O. Strawn NITRD

Big Data. George O. Strawn NITRD Big Data George O. Strawn NITRD Caveat auditor The opinions expressed in this talk are those of the speaker, not the U.S. government Outline What is Big Data? NITRD's Big Data Research Initiative Big Data

More information

BIG DATA Funding Opportunities

BIG DATA Funding Opportunities BIG DATA Funding Opportunities Jill Morris Morris.856@osu.edu 688-5423 Institute for Population Research The Ohio State University NSF bigdata@nsf.gov NSF Big Data Initiatives Core Techniques and Technologies

More information

CASC Spring Meeting 2014 Federal Agency Panel Update on Big Data

CASC Spring Meeting 2014 Federal Agency Panel Update on Big Data CASC Spring Meeting 2014 Federal Agency Panel Update on Big Data Robert Chadduck Program Director, Data & CI CISE Division of Advanced Cyberinfrastructure 23 April 2014 ACI data focused CI - A view towards

More information

CYBERINFRASTRUCTURE FRAMEWORK $143,060,000 FOR 21 ST CENTURY SCIENCE, ENGINEERING, +$14,100,000 / 10.9% AND EDUCATION (CIF21)

CYBERINFRASTRUCTURE FRAMEWORK $143,060,000 FOR 21 ST CENTURY SCIENCE, ENGINEERING, +$14,100,000 / 10.9% AND EDUCATION (CIF21) CYBERINFRASTRUCTURE FRAMEWORK $143,060,000 FOR 21 ST CENTURY SCIENCE, ENGINEERING, +$14,100,000 / 10.9% AND EDUCATION (CIF21) Overview The Cyberinfrastructure Framework for 21 st Century Science, Engineering,

More information

CYBERINFRASTRUCTURE FRAMEWORK FOR 21 st CENTURY SCIENCE AND ENGINEERING (CIF21)

CYBERINFRASTRUCTURE FRAMEWORK FOR 21 st CENTURY SCIENCE AND ENGINEERING (CIF21) CYBERINFRASTRUCTURE FRAMEWORK FOR 21 st CENTURY SCIENCE AND ENGINEERING (CIF21) Goal Develop and deploy comprehensive, integrated, sustainable, and secure cyberinfrastructure (CI) to accelerate research

More information

CYBERINFRASTRUCTURE FRAMEWORK FOR 21 ST CENTURY SCIENCE, ENGINEERING, AND EDUCATION (CIF21) $100,070,000 -$32,350,000 / -24.43%

CYBERINFRASTRUCTURE FRAMEWORK FOR 21 ST CENTURY SCIENCE, ENGINEERING, AND EDUCATION (CIF21) $100,070,000 -$32,350,000 / -24.43% CYBERINFRASTRUCTURE FRAMEWORK FOR 21 ST CENTURY SCIENCE, ENGINEERING, AND EDUCATION (CIF21) $100,070,000 -$32,350,000 / -24.43% Overview The Cyberinfrastructure Framework for 21 st Century Science, Engineering,

More information

NITRD: National Big Data Strategic Plan. Summary of Request for Information Responses

NITRD: National Big Data Strategic Plan. Summary of Request for Information Responses NITRD: National Big Data Strategic Plan Summary of Request for Information Responses Introduction: Demographics Summary of Responses Next generation Capabilities Data to Knowledge to Action Access to Big

More information

CC-NIE PI Workshop Plenary

CC-NIE PI Workshop Plenary CC-NIE PI Workshop Plenary Farnam Jahanian April 30, 2014 Image Credit: Exploratorium. Pervasive Impact We are at the center of an ongoing societal transformation and will be for decades to come. Advances

More information

COGNITIVE SCIENCE AND NEUROSCIENCE

COGNITIVE SCIENCE AND NEUROSCIENCE COGNITIVE SCIENCE AND NEUROSCIENCE Overview Cognitive Science and Neuroscience is a multi-year effort that includes NSF s participation in the Administration s Brain Research through Advancing Innovative

More information

Information Technology R&D and U.S. Innovation

Information Technology R&D and U.S. Innovation Information Technology R&D and U.S. Innovation Peter Harsha Computing Research Association Ed Lazowska University of Washington Version 9: December 18, 2008 1 Peter Lee Carnegie Mellon University Advances

More information

Overcoming the Technical and Policy Constraints That Limit Large-Scale Data Integration

Overcoming the Technical and Policy Constraints That Limit Large-Scale Data Integration Overcoming the Technical and Policy Constraints That Limit Large-Scale Data Integration Revised Proposal from The National Academies Summary An NRC-appointed committee will plan and organize a cross-disciplinary

More information

SECURE AND TRUSTWORTHY CYBERSPACE (SaTC)

SECURE AND TRUSTWORTHY CYBERSPACE (SaTC) SECURE AND TRUSTWORTHY CYBERSPACE (SaTC) Overview The Secure and Trustworthy Cyberspace (SaTC) investment is aimed at building a cybersecure society and providing a strong competitive edge in the Nation

More information

The Past, Present, and Future of Data Science Education

The Past, Present, and Future of Data Science Education The Past, Present, and Future of Data Science Education Kirk Borne @KirkDBorne http://kirkborne.net George Mason University School of Physics, Astronomy, & Computational Sciences Outline Research and Application

More information

curation, analyses and interpretation of massive datasets opportunities are varied across disciplines

curation, analyses and interpretation of massive datasets opportunities are varied across disciplines ! Efficiency in scientific discovery through curation, analyses and interpretation of massive datasets! Uptake level and concentration on Big Data opportunities are varied across disciplines The nature

More information

New Jersey Big Data Alliance

New Jersey Big Data Alliance Rutgers Discovery Informatics Institute (RDI 2 ) New Jersey s Center for Advanced Computation New Jersey Big Data Alliance Manish Parashar Director, Rutgers Discovery Informatics Institute (RDI 2 ) Professor,

More information

Information Visualization WS 2013/14 11 Visual Analytics

Information Visualization WS 2013/14 11 Visual Analytics 1 11.1 Definitions and Motivation Lot of research and papers in this emerging field: Visual Analytics: Scope and Challenges of Keim et al. Illuminating the path of Thomas and Cook 2 11.1 Definitions and

More information

Data Intensive Scalable Computing. Harnessing the Power of Cloud Computing

Data Intensive Scalable Computing. Harnessing the Power of Cloud Computing Data Intensive Scalable Computing Harnessing the Power of Cloud Computing Randal E. Bryant February, 2009 Our world is awash in data. Millions of devices generate digital data, an estimated one zettabyte

More information

Testimony of. Before the

Testimony of Farnam Jahanian, Ph.D. Assistant Director Computer and Information Science and Engineering Directorate National Science Foundation Before the Committee on Science, Space, and Technology Subcommittee

Government Technology Trends to Watch in 2014: Big Data

OVERVIEW The federal government manages a wide variety of civilian, defense and intelligence programs and services, which both produce and require

The National Consortium for Data Science (NCDS)

A Public-Private Partnership to Advance Data Science Ashok Krishnamurthy PhD Deputy Director, RENCI University of North Carolina, Chapel Hill What is NCDS?

Big-Data Computing: Creating revolutionary breakthroughs in commerce, science, and society

Randal E. Bryant Carnegie Mellon University Randy H. Katz University of California, Berkeley Version 8: December

Challenges in e-science: Research in a Digital World

Thom Dunning National Center for Supercomputing Applications University of Illinois at Urbana-Champaign

NASA Earth Science Research in Data and Computational Science Technologies Report of the ESTO/AIST Big Data Study Roadmap Team September 2015

I. Background Over the next decade, the dramatic growth of

Data Centric Computing Revisited

Piyush Chaudhary Technical Computing Solutions SPXXL/SCICOMP Summer 2013 Bottom line: It is a time of Powerful Information Data volume is on the rise Dimensions of data

Center for Dynamic Data Analytics (CDDA) An NSF Supported Industry / University Cooperative Research Center (I/UCRC)

Photo courtesy of Justin Reuter University Consortium

Big Data Challenges in Bioinformatics

BARCELONA SUPERCOMPUTING CENTER COMPUTER SCIENCE DEPARTMENT Autonomic Systems and ebusiness Platforms Jordi Torres Jordi.Torres@bsc.es Talk outline: We talk about Petabyte?

In December 2011, the White House Office of Science. Introducing the federal cybersecurity R&D strategic plan. Leaping ahead on cybersecurity

Introducing the federal cybersecurity R&D strategic plan Douglas Maughan, Bill Newhouse, and Tomas Vagoun In December 2011, the White House Office of Science and Technology Policy (OSTP) released the document,

Big Data a threat or a chance?

Helwig Hauser University of Bergen, Dept. of Informatics Big Data What is Big Data? well, lots of data, right? we come back to this in a moment. certainly, a buzz-word but

SDN Security Challenges. Anita Nikolich National Science Foundation Program Director, Advanced Cyberinfrastructure July 2015

Cybersecurity Enhancement Act 2014 Public-Private Collaboration on Security (NIST

An analysis of Big Data ecosystem from an HCI perspective.

Jay Sanghvi Rensselaer Polytechnic Institute For: Theory and Research in Technical Communication and HCI Rensselaer Polytechnic Institute Wednesday,

Training for Big Data

Learnings from the CATS Workshop Raghu Ramakrishnan Technical Fellow, Microsoft Head, Big Data Engineering Head, Cloud Information Services Lab Store any kind of data What is Big

White Paper. Version 1.2 May 2015 RAID Incorporated

Introduction The abundance of Big Data, structured, partially-structured and unstructured massive datasets, which are too large to be processed effectively

Panel on Big Data Challenges and Opportunities

Dr. Chaitan Baru Senior Advisor for Data Science, Directorate for Computer & Information Science & Engineering National Science Foundation NSF's Perspective

Standards for Big Data in the Cloud

International Cloud Symposium 15/10/2013 Carola Carstens (Project Officer) DG CONNECT, Unit G3 Data Value Chain European Commission Outline 1) Data Value Chain Unit

RISK AND RESILIENCE $58,000,000 +$38,000,000 / 190.0%

Overview The economic competitiveness and societal well-being of the United States depend on the affordability, availability, quality, and reliability

Danny Wang, Ph.D. Vice President of Business Strategy and Risk Management Republic Bank

Agenda» Overview» What is Big Data?» Accelerates advances in computer & technologies» Revolutionizes data measurement»

Panel on Emerging Cyber Security Technologies. Robert F. Brammer, Ph.D., VP and CTO. Northrop Grumman Information Systems.

Panel Moderator 27 May 2010

3rd International Symposium on Big Data and Cloud Computing Challenges (ISBCC-2016) March 10-11, 2016 VIT University, Chennai, India

Call for Papers Cloud computing has emerged as a de facto computing

Data Driven Discovery In the Social, Behavioral, and Economic Sciences

Simon Appleford, Marshall Scott Poole, Kevin Franklin, Peter Bajcsy, Alan B. Craig, Institute for Computing in the Humanities, Arts,

Make the Most of Big Data to Drive Innovation Through Research

White Paper Bob Burwell, NetApp November 2012 WP-7172 Abstract Monumental data growth is a fact of life in research universities. The ability

Standard Big Data Architecture and Infrastructure

Wo Chang Digital Data Advisor Information Technology Laboratory (ITL) National Institute of Standards and Technology (NIST) wchang@nist.gov May 20, 2016

Science Gateways What are they and why are they having such a tremendous impact on science? Nancy Wilkins-Diehr wilkinsn@sdsc.edu

What is a science gateway? science gateway /sī əns gāt wā/ n. 1. an

Big Data to Knowledge (BD2K)

potential funding agency synergies Jennie Larkin, PhD Office of the Associate Director of Data Science National Institutes of Health idash-pscanner meeting UCSD September 16, 2014

Proposal for the Theme on Big Data. Analytics. Qiang Yang, HKUST Jiannong Cao, PolyU Qi-man Shao, CUHK. May 2015

Motivation The world's technological per-capita capacity to store information doubled every

MEDICAL DATA MINING. Timothy Hays, PhD. Health IT Strategy Executive Dynamics Research Corporation (DRC) December 13, 2012

Healthcare in America Is a VERY Large Domain with Enormous Opportunities for Data

Analyzing Big Data: The Path to Competitive Advantage

White Paper by Marcia Kaplan Contents Introduction How Big is Big Data?

National and Transnational Security Implications of Big Data in the Life Sciences

Prepared by the American Association for the Advancement of Science in conjunction with the Federal Bureau of Investigation and the United Nations Interregional Crime and Justice Research Institute

Integrating Data Life Cycle into Mission Life Cycle. Arcot Rajasekar rajasekar@unc.edu sekar@diceresearch.org

Technology of Interest Provide an end-to-end capability for Exa-scale data orchestration From

Are You Ready for Big Data?

Jim Gallo National Director, Business Analytics February 11, 2013 Agenda What is Big Data? How do you leverage Big Data in your company? How do you prepare for a Big Data initiative?

CASC Autumn Meeting 2015 Data Activities Panel 14 October 2015. ACI Data Program : an Integrative, Evolving Portfolio with a View towards the Horizon

Robert Chadduck Program Director, Data & CI Program

Are You Ready for Big Data?

Jim Gallo National Director, Business Analytics April 10, 2013 Agenda What is Big Data? How do you leverage Big Data in your company? How do you prepare for a Big Data initiative?

Exploring the roles and responsibilities of data centres and institutions in curating research data a preliminary briefing.

Dr Liz Lyon, UKOLN, University of Bath Introduction and Objectives UKOLN is undertaking

Indexed Terms: Big Data, benefits, characteristics, definition, problems, unstructured data

Managing Data through Big Data: A Review Harsimran Singh Anand Assistant Professor, PG Dept of Computer Science & IT, DAV College, Amritsar Email id: harsimran_anand@yahoo.com ABSTRACT Big Data

Knowledge Discovery from patents using KMX Text Analytics

Dr. Anton Heijs anton.heijs@treparel.com Treparel Abstract In this white paper we discuss how the KMX technology of Treparel can help searchers

Computing at a Cross-Roads: Big Data, Big Compute, and the Long Tail. William Gropp www.cs.illinois.edu/~wgropp

What this talk is A request for help, in keeping with the topics of this meeting The US NSF

How Big Data is Different

FALL 2012 VOL.54 NO.1 Thomas H. Davenport, Paul Barth and Randy Bean

USGS Community for Data Integration

Community of Science: Strategies for Coordinating Integration of Data Kevin T. Gallagher USGS Core Science Systems January 11, 2013 U.S. Department of the Interior U.S.

CIS492 Special Topics: Cloud Computing د. منذر الطزاونة

Big Data Definition No single standard definition Big Data is data whose scale, diversity, and complexity require new architecture, techniques, algorithms,

NetApp Big Content Solutions: Agile Infrastructure for Big Data

White Paper Ingo Fuchs, NetApp April 2012 WP-7161 Executive Summary Enterprises are entering a new era of scale, in which the amount of data

Learning from Big Data in

Learning from Big Data in Astronomy an overview Kirk Borne George Mason University School of Physics, Astronomy, & Computational Sciences http://spacs.gmu.edu/ From traditional astronomy to Big Data

Grand Challenges, Federal Priorities and Funding an NSF/CISE view

Jim Kurose Assistant Director, NSF Computer & Information Science & Engineering Distinguished Professor College of Information and Computer

Critical. Center (C 3 ) George Markowsky

Critical Cyber Infrastructure Center (C3) George Markowsky School of Computing & Information Science Cybersecurity and the Protection of Critical Digital Infrastructure The Problem Digital infrastructures,

Cloud Computing for Research Roger Barga Cloud Computing Futures, Microsoft Research

Trends: Data on an Exponential Scale Scientific data doubles every year Combination of inexpensive sensors + exponentially

Dr. Raju Namburu Computational Sciences Campaign U.S. Army Research Laboratory. The Nation's Premier Laboratory for Land Forces UNCLASSIFIED

21st Century Research Continuum Theory Theory embodied in computation Hypotheses tested through experiment SCIENTIFIC METHODS

Towards a Thriving Data Economy: Open Data, Big Data, and Data Ecosystems

Volker Markl volker.markl@tu-berlin.de dima.tu-berlin.de dfki.de/web/research/iam/ bbdc.berlin Based on my 2014 Vision Paper On

The Packard Fellowships for Science and Engineering

2016 Guidelines The Packard Fellowships for Science and Engineering program invests in future leaders who have the freedom to take risks, explore new

Databases & Data Infrastructure. Kerstin Lehnert

Access to Data is Needed to allow verification of research results to allow re-use of data The road to reuse is perilous (1) Accessibility Discovery,

High Performance Computing Initiatives

Eric Stahlberg September 1, 2015 DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Cancer Institute Frederick National Laboratory is

SECURE AND TRUSTWORTHY CYBERSPACE (SaTC) $124,250,000 +$1,500,000 / 1.2%

Overview The Secure and Trustworthy Cyberspace (SaTC) investment is aimed at building a cybersecure society and providing a strong

DataBridge http://databridge.web.unc.edu/ Arcot Rajasekar rajasekar@unc.edu The University of North Carolina at Chapel Hill

Data Bridge: A Social Network for Long Tail Science Data Outline of the

IEEE International Conference on Computing, Analytics and Security Trends CAST-2016 (19-21 December, 2016) Call for Paper

CAST-2015 provides an opportunity for researchers, academicians, scientists and

Data Requirements from NERSC Requirements Reviews

Richard Gerber and Katherine Yelick Lawrence Berkeley National Laboratory Summary Department of Energy Scientists represented by the NERSC user community

Data : Big & Open Big Data Open Data. François Bancilhon Data Publica & INRIA/Mobile Services Initiative twitter.com/fbancilhon

A deluge of data Lots of Data Open Data Big Data A wealth of data Lots of

Collaborations between Official Statistics and Academia in the Era of Big Data

World Statistics Day October 20-21, 2015 Budapest Vijay Nair University of Michigan Past-President of ISI vnn@umich.edu What

Connecting Researchers, Data & HPC

Nick Nystrom Director, Strategic Applications & Bridges PI nystrom@psc.edu July 1, 2015 2015 Pittsburgh Supercomputing Center The Shift to Big Data New Emphases Pan-STARRS

Understanding Big Data Analytics for Research

Hye-Chung Kum Texas A&M Health Science Center, Dept. of Health Policy & Management University of North Carolina at Chapel Hill, Dept. of Computer Science (kum@tamhsc.edu)

THE FEDERAL BIG DATA RESEARCH AND DEVELOPMENT STRATEGIC PLAN. THE NETWORKING AND INFORMATION TECHNOLOGY RESEARCH AND DEVELOPMENT PROGRAM April 2016

MAY 2016 About this Document This report was developed

Center for Dynamic Data Analytics (CDDA) An NSF Supported Industry / University Cooperative Research Center (I/UCRC) Vision and Mission

Photo courtesy of Justin Reuter CDDA Mission Mission of our CDDA

DATA MANAGEMENT FOR THE INTERNET OF THINGS

February, 2015 Peter Krensky, Research Analyst, Analytics & Business Intelligence Report Highlights Data challenges Managing data at the edge Time

Time: 9:50-12:00 pm on Oct. 9, 2013 Location: TBA. Bios of Panelists

Panel: Key Issues in Big Data Panelists: 1) Dr. Roger R. Schell, USC 2) Dr. Amr Awadallah, Cloudera, Inc. 3) Dr. Peter G. Neumann, SRI 4) Dr. Tomoyuki Higuchi 5) Dr. Sylvia Osborn, University of Western

EL Program: Smart Manufacturing Systems Design and Analysis

Program Manager: Dr. Sudarsan Rachuri Associate Program Manager: K C Morris Strategic Goal: Smart Manufacturing, Construction, and Cyber-Physical

Computational Science and Informatics (Data Science) Programs at GMU

Kirk Borne George Mason University School of Physics, Astronomy, & Computational Sciences http://spacs.gmu.edu/ Outline Graduate Program

The Tonnabytes Big Data Challenge: Transforming Science and Education. Kirk Borne George Mason University

Ever since we first began to explore our world humans have asked questions and have collected evidence

The Research Data Revolution. 2015 Harvard/Purdue Data Symposium Sayeed Choudhury

Data Conservancy (DC) One of five awards through US National Science Foundation's (NSF) DataNet program $10 million award

Is Big Data a Big Deal? What Big Data Does to Science

Netherlands escience Center Wilco Hazeleger Student @ Wageningen University and Reading University Meteorology PhD @ Utrecht University,

Data Management. Facility Access Challenges: Rudi Eigenmann NEES Operations Headquarters NEEScomm Center Purdue University

George E. Brown Jr. Network for Earthquake Engineering Simulation Facility Access Challenges: Data Management Why Data Management?

US Federal Cyber Security Research Program November 15, 2012 New England Advanced Cyber Security Center Workshop Bill Newhouse (NIST)

william.newhouse@nist.gov NITRD Structure for US Federal Cybersecurity

ANALYTICS STRATEGY: creating a roadmap for success

Companies in the capital and commodity markets are looking at analytics for opportunities to improve revenue and cost savings. Yet, many firms are struggling

Information Management course

Università degli Studi di Milano Master Degree in Computer Science Teacher: Alberto Ceselli Lecture 01: 06/10/2015 Practical information: Teacher: Alberto Ceselli (alberto.ceselli@unimi.it)

Considering the Way Forward for Data Science and International Climate Science

Improving Data Mobility and Management for International Climate Science July 14-16, 2014 Boulder, CO Sara J. Graves, Ph.D.