Theo JD Bothma Department of Informa1on Science
|
|
- Brian Perry
- 8 years ago
- Views:
Transcription
1 Theo JD Bothma Department of Informa1on Science Reflec1ons on the role of corpora and big data in e- lexicography in rela1on to end user informa1on needs CILC th Interna1onal Conference on Corpus Linguis1cs Valladolid,, 5 7 March 2015
2 Overview Introduc)on Clarifica)on of terms Access to linked corpora Access to frequency pa9erns Technical issues Conclusion
3 Introduc1on Focus of presenta)on Func)on theory Filtering of data Data on demand Lexicotainment
4 Focus of presenta1on How corpora and big data can be used to supplement dic)onary data Specifically for end user informa)on needs On demand Why users would need such data Research that needs to be done To provide useful tools Focus on end user
5 Outside scope of presenta1on The use of corpora by lexicographers to create dic)onaries Predefined corpora or web as corpus Many papers at the conference The use of corpora and big data in digital humani)es research Lexicographic research The use of big data by commercial en))es
6 Func1on theory Communica)ve situa)ons where a need to solve a communica)on problem may occur Text recep)on Text produc)on Transla)on Cogni)ve situa)ons where a need for knowledge may occur Opera)ve situa)ons Interpre)ve situa)ons
7 Filtering of data...uncovering the needs users have in the last 20 percent of the look- ups, i.e. in one out of five consulta)ons...discover the needs that only show up in one out of a hundred or one out of a thousand consulta)ons (Tarp, 2009a)...ar)cles that are especially adapted individualiza)on of the lexical product, adap)ng to the concrete needs of a concrete user (Tarp, 2009b)
8 Data on demand One cannot develop separate dic)onaries for 1 in a 1000 queries Data on demand Filtering data through search and presenta)on op)ons More bu9on Link to internal data Link to external data, including corpora Link to lexicographic user support tools
9 Lexicotainment Commercial publishers develop many data on demand op)ons for online dic)onaries OWen interac)ve There must be a need for such func)ons Why develop them if not used? OWen / usually free Marke)ng Vast array of gadgets / tools / types of informa)on
10 Word of the day Blog Language )ps Quizzes Crosswords Trending Interna)onal Local Slide shows Dic1onary.com
11
12 Dic1onary.com local lookups
13 Merriam- Webster
14 Merriam- Webster (2)
15 Clarifica1on of terms / Examples Corpus Big data Structured / unstructured data Data analy)cs Text / data mining
16 Corpus No defini)on required Oxford English Corpus...over 2 billion words of real 21st century English. It is not only size that ma9ers, though: it is the size of the corpus coupled with the careful selec)on and development of its contents which means that it is a resource unlike any other in the world. (h9p:// the- oxford- english- corpus) UMBC WebBase Corpus over three billion words, 48GB (h9p://ebiquity.umbc.edu/resource/html/id/351) Library of Congress 10 TB, probably about 3 Petabytes (3,000 TB) if all mul)media is included (h9p://blogs.loc.gov/digitalpreserva)on/2012/03/how- many- libraries- of- congress- does- it- take/)
17 Google Books corpus The total collec)on contains more than 6% of all books ever published. (Lin, Y et al. 2012)
18 Big data Big data is data that exceeds the processing capacity of conven)onal database systems. The data is too big, moves too fast, or doesn t fit the strictures of your database architectures. To gain value from this data, you must choose an alterna)ve way to process it. (Edd Dumbill Making sense of big data. Big data 1(1). h9p://online.liebertpub.com/doi/pdf/ /big )
19 Big data (2) Big data is high- volume, - velocity and - variety informa)on assets that demand cost- effec)ve, innova)ve forms of informa)on processing for enhanced insight and decision making. (Beyer, MA & Laney, D "The Importance of 'Big Data': A Defini)on. Gartner....the real- )me and high frequency nature of the data is also key. For example, nowcas)ng is used extensively and adds considerable power to predic)on. Similarly the high frequency of data allows users to test theories in near real- )me and to a level never before possible. (The Challenges and awards of big data )
20 Examples The NSA very casually dropped a number: Every six hours, the agency collects as much data as is stored in the en)re Library of Congress. (h9p:// six- hours- nsa- gathers- much- data- stored- en)re- library- congress) ebay.com uses two data warehouses at 7.5 petabytes and 40PB as well as a 40PB Hadoop cluster for search, consumer recommenda)ons, and merchandising. (Tay, L Inside ebay s 90PB data warehouse. h9p:// ebay8217s- 90pb- data- warehouse.aspx)
21 Square Kilometre Array The SKA represents the ul)mate Big Data challenge (h9p:// DOME.pdf) The project is expected to deliver up to an exabyte a day of raw data, compressed to some 10 petabytes of data in images for storage. "This telescope will generate the same amount of data in a day as the en)re planet does in a year. We es)mate that there will be more data flowing inside the telescope network than the en)re internet in 2020." (h9p:// ska_telescope_generate_more_data_than_en)re_internet_2020/) The data collected by the SKA in a single day would take nearly two million years to playback on an ipod. (h9ps://
22 h9p://datacook.blogspot.com/
23 Structured / unstructured data Structured data Data that resides in a fixed field within a record or file. Unstructured data All those things that can't be so readily classified and fit into a neat box: photos and graphic images, videos, streaming instrument data, webpages, pdf files, PowerPoint presenta)ons, s, blog entries, wikis and word processing documents. Semi- structured data A cross between the two. Tags or other types of markers are used to iden)fy certain elements within the data, but the data doesn t have a rigid structure. (h9p://
24 Data analy1cs Predic)ve analy)cs, data mining, text mining, forecas)ng and data op)miza)on. (h9p:// Big data uses induc)ve sta)s)cs and concepts from nonlinear system iden)fica)on to infer laws (regressions, nonlinear rela)onships, and causal effects) from large sets of data with low informa)on density to reveal rela)onships, dependencies and perform predic)ons of outcomes and behaviors (h9p://en.wikipedia.org/wiki/big_data#science)
25 Text / data mining Data Mining is an analy)c process designed to explore data (usually large amounts of data - typically business or market related - also known as "big data") in search of consistent pa9erns and/or systema)c rela)onships between variables, and then to validate the findings by applying the detected pa9erns to new subsets of data. The ul)mate goal of data mining is predic)on - and predic)ve data mining is the most common type of data mining and one that has the most direct business applica)ons. (StatSoW, Inc. (2013). Electronic Sta)s)cs Textbook. Tulsa, OK: StatSoW. h9p://
26 Big data in e- lexicography Yes Volumes Database requirements Processing requirements Research Digital humani)es Word studies No Speed of change Nature of user requirements Ac)onable
27 Two examples Access to linked corpora Access to frequency pa9erns Data available on demand Examples from exis)ng dic)onaries / tools Characteris)cs Why end users would want to do this Problems
28
29
30
31
32 Frequency pa]erns Only one commercial applica)on discussed: Google Ngram viewer Mul)ple databases Some tagged for PoS and syntac)c dependencies 12 language universal part- of- speech tags and unlabeled head- modifier dependencies (Lin, Y et al Syntac)c Annota)ons for the Google Books Ngram Corpus. In Proceedings of the 50th Annual Mee7ng of the Associa7on for Computa7onal Linguis7cs, pp , Jeju, Republic of Korea, 8-14 July Associa)on for Computa)onal Linguis)cs)
33 Racialism / Racism OED: Racialism = racism n. An earlier term than racism n., but now largely superseded by it Examples form Racism Examples from Not dis)nguished from racialism Approximately similar entries for racialist / racist
34 racialism / racism
35 Walkman / Ipod
36
37 catch up with X / catch X up On demand
38 Why end users would want to do this Cogni)ve / lexicotainment Simply interested to learn more about word usage and word history See word in context over )me Understand word usage be9er Text produc)on Decide between alterna)ves Situate word in historical context when wri)ng a text
39 Problems No genre- specific search op)ons General trade, fic)on, academic, newspaper, etc. No usage- tagged search op)on Formal, colloquial and slang, regional, etc. Direct speech Date of wri)ng not dis)nguished from date of context
40 Corpora and frequency tables Posi)ve Examples of actual usage Genre- specific dis)nc)ons DWDS Limited availability for Google Ngram viewer Fic)on / non- fic)on Bri)sh / American Drill- down to context Nega)ve Time dis)nc)ons Limited drill- down op)ons Limited granularity
41 Racialism / Racism OED no help both occur Example from fic)on: Text wri9en in 2012, set in USA in 1941 (vol. 2 of trilogy) Racialist counted as occurrence in 2012 Racialist used in direct speech, racism in narra)ve Text wri9en in 2014, set in USA in 1961 (vol. 3) Uses racism exclusively (direct speech and narra)ve) Current UK TV programme racialist Does corpus reflect usage in the 1940s / 1960s / 2000s?
42 Walkman / ipod Evident rise of ipod vs Walkman Does the higher use of Walkman even in 2008 reflect actual technology prolifera)on? To what extent do books (vs other media) reflect actual usage?
43 Conjugated / Inflected forms Excellent feature Aggregated usage? PoS tagging Future tagging? Syntac)c (cf. Google Books) Seman)c Etc.
44 Catch X up / Catch up with X Mixed bag un)l 1930s ThereaWer clear preference for catch up with X Current Bri)sh colloquial catch you up? Is there a dis)nc)on between formal and colloquial? Would a different corpus reveal a different pa9ern? Is there a difference between the two items in direct speech compared to narra)ve?
45 Technical issues Corpus and data set selec)on Corpus clean- up Corpus markup Search func)ons and filtering Data presenta)on and usability issues
46 Corpus and data set selec1on Decide on intended use Lexicographer, researcher, professional, end user Decide on characteris)cs Contemporary / diachronic Genre- specific / general Formal / Informal (e.g. social media) Size Selec)ve or as large as possible Hardware / sowware Copyright
47 Corpus clean- up Digi)sa)on Image clean- up OCR Quality control of both images and OCR Digi)zed materials Remove noise HTML Long- term preserva)on and cura)on of originals
48 Corpus markup Linguis)c processing Tokeniza)on PoS tagging Lemma)za)on Addi)onal gramma)cal markup Metadata markup Standard Bibliographic data Gramma)cal tagging Addi)onal markup Genre Date of seyng Direct speech vs narra)ve
49 Filtering and search func1ons Selec)on of corpus or mul)ple corpora Based on markup Metadata Gramma)cal Fine grained Subset of corpora Date Genre Etc. Complex combina)on of criteria
50 Data presenta1on and usability issues Incorporated into dic)onary interface How Display only required data Drill- down to actual texts on demand From corpus examples From frequency tables User studies
51 Examples Show all examples of: catch PRON up and catch up with PRON In a newspaper corpus Da)ng between 1940 and 1950 Show all examples of: racialist and racist Bri)sh fic)on compared to Bri)sh newspapers Between 1940 and 1960 Dis)nguished between direct speech and narra)ve
52 Conclusion Many addi)onal tools possible Mul)discplinary research Impact on markup Database technologies Hardware requirements User and usability studies Develop prototypes Innova)on Think cri)cally about the way forward
53 Lexicographers can considerably enhance the user experience by making non- tradi)onal data available to their end users through exploi)ng the technologies and data accessible through corpora, big data sets and the internet
54 Thank you! Ques1ons / comments? theo.bothma@up.ac.za
Opportuni)es and Challenges of Textual Big Data for the Humani)es
Opportuni)es and Challenges of Textual Big Data for the Humani)es Dr. Adam Wyner, Department of Compu)ng Prof. Barbara Fennell, Department of Linguis)cs THiNK Network Knowledge Exchange in the Humani)es
More informationData Warehousing. Yeow Wei Choong Anne Laurent
Data Warehousing Yeow Wei Choong Anne Laurent Databases Databases are developed on the IDEA that DATA is one of the cri>cal materials of the Informa>on Age Informa>on, which is created by data, becomes
More informationProject Management Introduc1on
Project Management Introduc1on Session 1 Part I Introduc1on By Amal Le Collen, PMP Dr. Lauren1u Neamtu, PMP Session outline 1. PART I: Introduc1on 1. The Purpose of the PMBOK Guide 2. What is a project?
More informationHigh Performance Compu2ng and Big Data. High Performance compu2ng Curriculum UvA- SARA h>p://www.hpc.uva.nl/
High Performance Compu2ng and Big Data High Performance compu2ng Curriculum UvA- SARA h>p://www.hpc.uva.nl/ Big data was big news in 2012 and probably in 2013 too. The Harvard Business Review talks about
More informationIns+tuto Superior Técnico Technical University of Lisbon. Big Data. Bruno Lopes Catarina Moreira João Pinho
Ins+tuto Superior Técnico Technical University of Lisbon Big Data Bruno Lopes Catarina Moreira João Pinho Mo#va#on 2 220 PetaBytes Of data that people create every day! 2 Mo#va#on 90 % of Data UNSTRUCTURED
More informationUsing Social Media to Drive Recommender Systems for Mobile Apps. - GRP Presenta=on - Jovian Lin (A0026542M)
Using Social Media to Drive Recommender Systems for Mobile Apps - GRP Presenta=on - Jovian Lin (A0026542M) Structure of Presenta=on Introduc=on Why Recommender Systems (RS)? Problems in Recommending Our
More informationHow To Use Splunk For Android (Windows) With A Mobile App On A Microsoft Tablet (Windows 8) For Free (Windows 7) For A Limited Time (Windows 10) For $99.99) For Two Years (Windows 9
Copyright 2014 Splunk Inc. Splunk for Mobile Intelligence Bill Emme< Director, Solu?ons Marke?ng Panos Papadopoulos Director, Product Management Disclaimer During the course of this presenta?on, we may
More informationApplication of Supply Chain Concepts to the Analysis Process
Application of Supply Chain Concepts to the Analysis Process Rob Handfield, PhD Bank of America University Distinguished Professor of Supply Chain Management Executive Director, Supply Chain Resource Cooperative
More informationBig Data. The Big Picture. Our flexible and efficient Big Data solu9ons open the door to new opportuni9es and new business areas
Big Data The Big Picture Our flexible and efficient Big Data solu9ons open the door to new opportuni9es and new business areas What is Big Data? Big Data gets its name because that s what it is data that
More informationDiscovering Computers Fundamentals, 2010 Edition. Living in a Digital World
Discovering Computers Fundamentals, 2010 Edition Living in a Digital World Objec&ves Overview Discuss the importance of project management, feasibility assessment, documenta8on, and data and informa8on
More informationHoneycomb Crea/ve Works is financed by the European Union s European Regional Development Fund through the INTERREG IVA Cross- border Programme
Honeycomb Crea/ve Works is financed by the European Union s European Regional Development Fund through the INTERREG IVA Cross- border Programme managed by the Special EU Programmes Body. Web Analy*cs In
More informationBIG DATA AND INVESTIGATIVE ANALYTICS
The New Fron+er BIG DATA AND INVESTIGATIVE ANALYTICS A Publication of Infobright Table of Contents Introduc+on 3 Chapter 1: What Is Inves+ga+ve Analy+cs?. 4 Chapter 2: Top Five Requirements for Inves+ga+ve
More informationTexas Digital Government Summit. Data Analysis Structured vs. Unstructured Data. Presented By: Dave Larson
Texas Digital Government Summit Data Analysis Structured vs. Unstructured Data Presented By: Dave Larson Speaker Bio Dave Larson Solu6ons Architect with Freeit Data Solu6ons In the IT industry for over
More informationSocial Media Analy.cs (SMA)
Social Media Analy.cs (SMA) Emanuele Della Valle DEIB - Politecnico di Milano emanuele.dellavalle@polimi.it hap://emanueledellavalle.org What's social media? haps://www.youtube.com/watch?v=sgniiud_oqg
More informationFounda'onal IT Governance A Founda'onal Framework for Governing Enterprise IT Adapted from the ISACA COBIT 5 Framework
Founda'onal IT Governance A Founda'onal Framework for Governing Enterprise IT Adapted from the ISACA COBIT 5 Framework Steven Hunt Enterprise IT Governance Strategist NASA Ames Research Center Michael
More informationExtrac'ng People s Hobby and Interest Informa'on from Social Media Content
Extrac'ng People s Hobby and Interest Informa'on from Social Media Content Thomas Forss, Shuhua Liu and Kaj- Mikael Björk Dept of Business Administra?on and Analy?cs Arcada University of Applied Sciences
More informationAn Introduc@on to Big Data, Apache Hadoop, and Cloudera
An Introduc@on to Big Data, Apache Hadoop, and Cloudera Ian Wrigley, Curriculum Manager, Cloudera 1 The Mo@va@on for Hadoop 2 Tradi@onal Large- Scale Computa@on Tradi*onally, computa*on has been processor-
More informationIntroduc)on to the IoT- A methodology
10/11/14 1 Introduc)on to the IoTA methodology Olivier SAVRY CEA LETI 10/11/14 2 IoTA Objec)ves Provide a reference model of architecture (ARM) based on Interoperability Scalability Security and Privacy
More informationData Management in the Cloud: Limitations and Opportunities. Annies Ductan
Data Management in the Cloud: Limitations and Opportunities Annies Ductan Discussion Outline: Introduc)on Overview Vision of Cloud Compu8ng Managing Data in The Cloud Cloud Characteris8cs Data Management
More informationMSc Data Science at the University of Sheffield. Started in September 2014
MSc Data Science at the University of Sheffield Started in September 2014 Gianluca Demar?ni Lecturer in Data Science at the Informa?on School since 2014 Ph.D. in Computer Science at U. Hannover, Germany
More informationUNIFIED, END- TO- END EDISCOVERY
ac.onable informa.on governance Partners Providing Excellence in: UNIFIED, END- TO- END EDISCOVERY 2011 IBM Corpora.on Meet the Presenters Amir Jaibaji Vice President, Product Management StoredIQ Kevin
More informationAn Open Dynamic Big Data Driven Applica3on System Toolkit
An Open Dynamic Big Data Driven Applica3on System Toolkit Craig C. Douglas University of Wyoming and KAUST This research is supported in part by the Na3onal Science Founda3on and King Abdullah University
More informationHow To Use A Webmail On A Pc Or Macodeo.Com
Big data workloads and real-world data sets Gang Lu Institute of Computing Technology, Chinese Academy of Sciences BigDataBench Tutorial MICRO 2014 Cambridge, UK INSTITUTE OF COMPUTING TECHNOLOGY 1 Five
More informationStream Deployments in the Real World: Enhance Opera?onal Intelligence Across Applica?on Delivery, IT Ops, Security, and More
Copyright 2015 Splunk Inc. Stream Deployments in the Real World: Enhance Opera?onal Intelligence Across Applica?on Delivery, IT Ops, Security, and More Stela Udovicic Sr. Product Marke?ng Manager Clayton
More informationThe importance of supply chain
Guidelines for Improving Supply Chain Management at Bulgarian Enterprises Chief Assist. Prof. Miroslava Rakovska, Ph.D. Department of Business Logistics, UNWE tеl. (02) 9435248, 0888704178 e-mail: mirar@unwe.acad.bg
More informationPu?ng B2B Research to the Legal Test
With the global leader in sampling and data services Pu?ng B2B Research to the Legal Test Ashlin Quirk, SSI General Counsel 2014 Survey Sampling Interna6onal 1 2014 Survey Sampling Interna6onal Se?ng the
More informationThe Library (Big) Data scien4st
The Library (Big) Data scien4st IFLA/ALA webinar: Big Data: new roles and opportuni4es for new librarians June 15 th 2016 IFLA Big Data Special Interest Group (SIG) Wouter Klapwijk, Stellenbosch University,
More informationNextGen Infrastructure for Big DATA Analytics.
NextGen Infrastructure for Big DATA Analytics. So What is Big Data? Data that exceeds the processing capacity of conven4onal database systems. The data is too big, moves too fast, or doesn t fit the structures
More informationWelcome! Accelera'ng Pa'ent- Centered Outcomes Research and Methodological Research. Andrea Heckert, PhD, MPH Program Officer, Science
Accelera'ng Pa'ent- Centered Outcomes Research and Methodological Research Emily Evans, PhD, MPH Program Officer, Science Andrea Heckert, PhD, MPH Program Officer, Science June 22, 2015 Welcome! Emily
More informationKeeping Pace with Big Data
- A Data Mining Perspec>ve Huan Liu, Tempe, AZ hep://www.public.asu.edu/~huanliu NSF Workshop on Big Data Analy6cs for Infrastructure and Building Resilience and Sustainability, Beijing, China Sept 19-20,
More informationHands On- Google Grants Google Adwords for Non- Pro5its
Hands On- Google Grants Google Adwords for Non- Pro5its Search Adver5sing Approach and Strategy Katherine Cleland ClelandMarke5ng 1 Why Google Adwords? Online Search has replaced Yellow Pages 80% of online
More informationBig Data /Data Science Data Intensive (Science) Technologies
Big Data /Data Science Data Intensive (Science) Technologies Adam Belloum Ins:tute of Informa:cs University of Amsterdam a.s.z.belloum@uva.nl High Performance compu:ng Curriculum, Jan 2015 hmp://www.hpc.uva.nl/
More informationHow To Understand The Big Data Paradigm
Big Data and Its Empiricist Founda4ons Teresa Scantamburlo The evolu4on of Data Science The mechaniza4on of induc4on The business of data The Big Data paradigm (data + computa4on) Cri4cal analysis Tenta4ve
More informationTRANSLATING TECHNOLOGY INTO BUSINESS. Let s make money from Big Data!
TRANSLATING TECHNOLOGY INTO BUSINESS Let s make money from Big Data! JUNE, 2014 About Transla.ng Technology into Business B Spot helps clients transform technology ideas into business concepts. As part
More informationThe DATA Difference Targe.ng for Stronger ROI!
The DATA Difference Targe.ng for Stronger ROI! Presented by: Dr. John Leininger Department of Graphic Communica
More informationPower to the People: Analy0cs for All
Arijit Sengupta CEO, BeyondCore, Inc. Power to the People: Analy0cs for All " Ten patents related to Advanced Analytics, Privacy/Security and BPaaS. " Previously worked at Oracle, Microsoft, Yankee Group
More information1 Actuate Corpora-on 2013. Big Data Business Analy/cs
1 Big Data Business Analy/cs Introducing BIRT Analy3cs Provides analysts and business users with advanced visual data discovery and predictive analytics to make better, more timely decisions in the age
More informationNetwork Maps for End Users: Collect, Analyze, Visualize and Communicate Network Insights with Zero Coding
Network Maps for End Users: Collect, Analyze, Visualize and Communicate Network Insights with Zero Coding A project from the Social Media Research Founda8on: h:p://www.smrfounda8on.org About Me Introduc8ons
More informationMigrating to Hosted Telephony. Your ultimate guide to migrating from on premise to hosted telephony. www.ucandc.com
Migrating to Hosted Telephony Your ultimate guide to migrating from on premise to hosted telephony Intro What is covered in this guide? A professional and reliable business telephone system is a central
More informationSynchronous and asynchronous video conferencing tools in an online-course:! Supporting a community of inquiry!
Synchronous and asynchronous video conferencing tools in an online-course:! Supporting a community of inquiry! David Wicks, Seattle Pacific University! Andrew Lumpe, Seattle Pacific University! Janiess
More informationFixed Scope Offering (FSO) for Oracle SRM
Fixed Scope Offering (FSO) for Oracle SRM Agenda iapps Introduc.on Execu.ve Summary Business Objec.ves Solu.on Proposal Scope - Business Process Scope Applica.on Implementa.on Methodology Time Frames Team,
More informationDNS Big Data Analy@cs
Klik om de s+jl te bewerken Klik om de models+jlen te bewerken! Tweede niveau! Derde niveau! Vierde niveau DNS Big Data Analy@cs Vijfde niveau DNS- OARC Fall 2015 Workshop October 4th 2015 Maarten Wullink,
More informationXML, Seman9c Web and Content Analy9cs
XML, Seman9c Web and Content Analy9cs XML Prague Pre- conference 2014 Felix Sasaki DFKI / W3C Fellow 1 What do you need to follow this session? Ideal: a computer with internet access, to be able to provide
More informationMission. To provide higher technological educa5on with quality, preparing. competent professionals, with sound founda5ons in science, technology
Mission To provide higher technological educa5on with quality, preparing competent professionals, with sound founda5ons in science, technology and innova5on, commi
More informationHunk & Elas=c MapReduce: Big Data Analy=cs on AWS
Copyright 2014 Splunk Inc. Hunk & Elas=c MapReduce: Big Data Analy=cs on AWS Dritan Bi=ncka BD Solu=ons Architecture Disclaimer During the course of this presenta=on, we may make forward looking statements
More informationUrban Big Data Centre
Urban Big Data Centre Piyushimita Thakuriah (Vonu) Director, UBDC Professor and Ch2M Chair of Transport UNIVERSITY OF GLASGOW November 12, 2015 July 10, 2015 UBDC Partners Funded by ESRC Big Data Network
More informationConfessions of a new (agile) software project manager. Laura Akerman
Confessions of a new (agile) software project manager Laura Akerman Agile at Emory Libraries: our own flavor Itera9ons are 2 weeks long. A minor release = a Milestone = 2-3 itera9ons (ideally). We use
More informationData Mining. Supervised Methods. Ciro Donalek donalek@astro.caltech.edu. Ay/Bi 199ab: Methods of Computa@onal Sciences hcp://esci101.blogspot.
Data Mining Supervised Methods Ciro Donalek donalek@astro.caltech.edu Supervised Methods Summary Ar@ficial Neural Networks Mul@layer Perceptron Support Vector Machines SoLwares Supervised Models: Supervised
More informationPhone Systems Buyer s Guide
Phone Systems Buyer s Guide Contents How Cri(cal is Communica(on to Your Business? 3 Fundamental Issues 4 Phone Systems Basic Features 6 Features for Users with Advanced Needs 10 Key Ques(ons for All Buyers
More informationNZ On Air Digital Strategy 2012-2015
NZ On Air Digital Strategy 2012-2015 Defining digital Digital has various meanings that originate from different sources. In its purest sense it is simply the dis9nc9on from analogue. Broadcast content
More informationSo#ware quality assurance - introduc4on. Dr Ana Magazinius
So#ware quality assurance - introduc4on Dr Ana Magazinius 1 What is quality? 2 What is a good quality car? 2 and 2 2 minutes 3 characteris4cs 3 What is quality? 4 What is quality? How good or bad something
More informationSurfing the Data Tsunami: A New Paradigm for Big Data Processing and Analytics
Surfing the Data Tsunami: A New Paradigm for Big Data Processing and Analytics Dr. Liangxiu Han Future Networks and Distributed Systems Group (FUNDS) School of Computing, Mathematics and Digital Technology,
More informationData Governance Framework: Bank of Canada
Data Governance Framework: Bank of Canada The views and opinions expressed herein are those of the author and do not necessarily reflect the official policy or posi8on of the Bank of Canada or any agency
More informationData archiving and reproducible research for ecology and evolu6on. March 23 rd 2010 Ian Dworkin
Data archiving and reproducible research for ecology and evolu6on. March 23 rd 2010 Ian Dworkin Outline of the ques6ons for the workshop 1. Why should I share my data? 2. When should I share my data? 3.
More informationelearning: present and future
elearning: present and future Defini2on E- learning can be defined as the use of computer and Internet technologies to deliver a broad array of solu2ons to enable learning and improve performance. (FAO)
More informationDTCC Data Quality Survey Industry Report
DTCC Data Quality Survey Industry Report November 2013 element 22 unlocking the power of your data Contents 1. Introduction 3 2. Approach and participants 4 3. Summary findings 5 4. Findings by topic 6
More informationHow to write an effec-ve DIGITAL MARKETING STRATEGY. Secrets from the professionals
How to write an effec-ve DIGITAL MARKETING STRATEGY Secrets from the professionals Wri-ng an effec-ve digital media strategy comes down to three things: content, connec-ons and consistency. When building
More informationMaking Sense of Big Data. Dr. Thomas E. Potok Computa2onal Data Analy2cs Group Leader Oak Ridge Na2onal Laboratory potokte@ornl.
Making Sense of Big Data Dr. Thomas E. Potok Computa2onal Data Analy2cs Group Leader Oak Ridge Na2onal Laboratory potokte@ornl.gov 865-574- 0834 ORNL s Big Data Legacy Science National Security Energy
More informationBPO. Accerela*ng Revenue Enhancements Through Sales Support Services
BPO Accerela*ng Revenue Enhancements Through Sales Support Services What is BPO? Business Process Outsorcing (BPO) is the process of outsourcing specific business func6ons to a third- party service provider
More informationCS 5150 So(ware Engineering System Architecture: Introduc<on
Cornell University Compu1ng and Informa1on Science CS 5150 So(ware Engineering System Architecture: Introduc
More informationB2B Offerings. Helping businesses op2mize. Infolob s amazing b2b offerings helps your company achieve maximum produc2vity
B2B Offerings Helping businesses op2mize Infolob s amazing b2b offerings helps your company achieve maximum produc2vity What is B2B? B2B is shorthand for the sales prac4ce called business- to- business
More informationPredictions for the Digital Workplace 2015
Predictions for the Digital Workplace 2015 Jim Lundy CEO and Lead Analyst David Mario Smith Research Director, Lead Analyst Speakers for Today Jim Lundy David Smith CEO, Lead Analyst Research Director,
More informationBig Data and Health Insurance Product Selec6on (and a few other applica6on) Jonathan Kolstad UC Berkeley and NBER
Big Data and Health Insurance Product Selec6on (and a few other applica6on) Jonathan Kolstad UC Berkeley and NBER Introduc6on Applica6ons of behavioral economics in health SeIng where behavioral assump6ons
More information.nl ENTRADA. CENTR-tech 33. November 2015 Marco Davids, SIDN Labs. Klik om de s+jl te bewerken
Klik om de s+jl te bewerken Klik om de models+jlen te bewerken Tweede niveau Derde niveau Vierde niveau.nl ENTRADA Vijfde niveau CENTR-tech 33 November 2015 Marco Davids, SIDN Labs Wie zijn wij? Mijlpalen
More informationPromo%ng Your OCS Business through Digital & Social Media. Presented by: John Healy
Promo%ng Your OCS Business through Digital & Social Media Presented by: John Healy Who Is John Healy? jhealy@healyco.com 25+ years in marke;ng, communica;ons & PR consul;ng Specialty: Helping clients balance
More informationThe Elusive U,lity Customer: How Big Data & Analy,cs Connects U,li,es & Their Customers
The Place Analy,cs Leaders Turn to for Answers Member.U(lityAnaly(cs.com The Elusive U,lity Customer: How Big & Analy,cs Connects U,li,es & Their Customers Mike Smith Vice President, U(lity Analy(cs Ins(tute
More informationECEC 22013. Accelera@ng Europe s Cloud Future. Chambre du Commerce September. October. 14. Mai 2013, Konzerthaus. 1st 2014
ECEC 22013 014 ECDC Partnerschaft in in der der Cloud Cloud Partnerschaft Accelera@ng Europe s Cloud Future Europe Congress & A ward Ceremony 14. Mai 2013, Konzerthaus Chambre du Commerce 14. Mai 2013,
More informationThe Data Reservoir. 10 th September 2014. Mandy Chessell FREng CEng FBCS Dis4nguished Engineer, Master Inventor Chief Architect, Informa4on Solu4ons
Mandy Chessell FREng CEng FBCS Dis4nguished Engineer, Master Inventor Chief Architect, Solu4ons The Reservoir 10 th September 2014 A growing demand Business Teams want Open access to more informa4on More
More informationOffensive & Defensive & Forensic Techniques for Determining Web User Iden<ty
Offensive & Defensive & Forensic Techniques for Determining Web User Iden
More informationWhat will I learn as an Computer Engineering student?
What will I learn as an Computer Engineering student? Department of Electrical and Computer Engineering Tu8s School of Engineering Trying to decide on a major? Most college course descrip?ons are full
More informationSBML SBGN SBML Just my 2 cents. Alice C. Villéger COMBINE 2010
SBML SBGN SBML Just my 2 cents Alice C. Villéger COMBINE 2010 Disclaimer Fuzzy talk work in progress last minute slides Someone else has been working on very similar stuff and should really have been talking
More informationTeaching Analy-cs, Big Data and Sustainability: An IS perspec-ve
Teaching Analy-cs, Big Data and Sustainability: An IS perspec-ve Raja Sooriamurthi / Randy Weinberg Informa(on Systems Program Carnegie Mellon University {raja,rweinberg}@cmu.edu Presenta-on Outline The
More informationBeyond Strategy: Building Your Mobile Capabili6es
Beyond Strategy: Building Your Mobile Capabili6es TASSCC Technology Educa6on Conference April 10, 2015 Presented by: Raj Polikepa6 Director of App Development Texas.gov Agenda ê Objec6ves of Mobile Strategy
More informationFrom Big Data to Value
From Big Data to Value The Power of Master Data Management 2.0 Sergio Juarez SVP Elemica EMEA & LATAM Reveal Oct 2014 Agenda Master Data Management Why Now? What To Do? How To Do It? What s Next? Today
More informationGovernance as Leadership: Reframing the Work of Nonprofit Boards
Governance as Leadership: Reframing the Work of Nonprofit Boards Tradi
More informationThis presenta,on covers the essen,al informa,on about IT services and facili,es which all new students will need to get started.
This presenta,on covers the essen,al informa,on about IT services and facili,es which all new students will need to get started. 1 Most of the informa,on is covered in more depth on the Informa,on Services
More informationSplunk for Data Science
Copyright 2014 Splunk Inc. Splunk for Data Science Tom LaGa=a Data Scien@st, Splunk Olivier de Garrigues Sr Prof Services Consultant, Splunk Disclaimer During the course of this presenta@on, we may make
More informationWe are pleased to offer the following program to Woodstock Area Educators:
DATE: Spring 2016 TO: RE: Woodstock Area Educators Upcoming Cohort Programs Presently, many teachers are enrolled in cohort graduate programs through partnerships between local regional offices of education,
More informationBig Data in medical image processing
Big Data in medical image processing Konstan3n Bychenkov, CEO Aligned Research Group LLC bychenkov@alignedresearch.com Big data in medicine Genomic Research Popula3on Health Images M- Health hips://cloud.google.com/genomics/v1beta2/reference/
More informationHP Vertica at MIT Sloan Sports Analytics Conference March 1, 2013 Will Cairns, Senior Data Scientist, HP Vertica
HP Vertica at MIT Sloan Sports Analytics Conference March 1, 2013 Will Cairns, Senior Data Scientist, HP Vertica So What s the market s definition of Big Data? Datasets whose volume, velocity, variety
More informationAdvanced Project Management Training Course
Advanced Project Management Training Course 1-34 Advanced Project Management Crea/ng the Scope Baseline 2-34 Crea/ng the Scope Baseline Module 1 Introduction Module 2 Creating the Project Charter Module
More informationSocial Media for Business - Primer. Becky Livingston President & CEO Penheel Marke:ng April 2014
Social Media for Business - Primer Becky Livingston President & CEO Penheel Marke:ng April 2014 Becky Livingston ü Speaker ü Author ü Educator ü Social Media & Digital Marke:ng Consultant ü 25 years marke:ng
More information2015-16 ITS Strategic Plan Enabling an Unbounded University
2015-16 ITS Strategic Plan Enabling an Unbounded University Update: July 31, 2015 IniAaAve: Agility Through Technology Vision Mission Enable Unbounded Learning Support student success through the innovaave
More informationThe system approach in human resources. Functional Analysis of the System for Human Resources Management. Introduction. Arcles
Functional Analysis of the System for Human Resources Management Assoc. Prof. Margarita Harizanova, Ph.D. Chief Assist. Prof. Nadya Mironova, Ph.D. Assist. Prof. Tatyana Shtetinska Summary: The arcle presents
More informationEffec%ve AX 2012 Upgrade Project Planning and Microso< Sure Step. Arbela Technologies
Effec%ve AX 2012 Upgrade Project Planning and Microso< Sure Step Arbela Technologies Why Upgrade? What to do? How to do it? Tools and templates Agenda Sure Step 2012 Ax2012 Upgrade specific steps Checklist
More informationKaseya Fundamentals Workshop DAY THREE. Developed by Kaseya University. Powered by IT Scholars
Kaseya Fundamentals Workshop DAY THREE Developed by Kaseya University Powered by IT Scholars Kaseya Version 6.5 Last updated March, 2014 Day Two Overview Day Two Lab Review Patch Management Configura;on
More informationMega Modeling for Scien/fic Big Data Processing
Mega Modeling for Scien/fic Big Data Processing Stefano Ceri, Emanuele Della Valle (Politecnico di Milano) Dino Pedreschi, Roberto Trasar/ (ISTI- CNR and University of Pisa) 1 The context 2 Scenario BIG
More informationHome Selling Marke/ng Proposal
Home Selling Marke/ng Proposal Presented by: Nate Harimoto & Shane Haas Aviara Real Estate 2555 Townsgate Road, Suite 200 Westlake Village, CA 91361 www.aviararealestate.net Nate Office & Fax: (805) 418-2675
More informationScalus Winter School Storage Systems
Scalus Winter School Storage Systems Flash Memory André Brinkmann Flash Memory Floa:ng gate of a flash cell is electrically isolated Applying high voltages between source and drain accelerates electrons
More informationThe Adop)on Pa-erns of Mobile Telephones by Micro and Small Enterprises in Ghana
The Adop)on Pa-erns of Mobile Telephones by Micro and Small Enterprises in Ghana Godfred Kwasi Frempong Science and Technology Policy Research Ins)tute gkfrempong@csir- stepri.org Introduc*on Development
More informationAn Integrated Approach to Manage IT Network Traffic - An Overview Click to edit Master /tle style
An Integrated Approach to Manage IT Network Traffic - An Overview Click to edit Master /tle style Agenda A quick look at ManageEngine Tradi/onal Traffic Analysis Techniques & Tools Changing face of Network
More informationRethink. Recruitment. McFrank & Williams Adver3sing Agency
Rethink. Recruitment. McFrank & Williams Adver3sing Agency Introduction 1 Recruitment Solutions & Tools with our proprietary technologies Be#er results require different methods McFrank & Williams Advertising
More informationPhysiotherapy & Occupational Therapy
SOS PRESENTS... Physiotherapy & Occupational Therapy Presented by: Avideh Khalili & Helia Ghazinejad Slides will be available at bethune.yorku.ca/ events Agenda Introduc7on Applica9on Process PT OT Opportuni9es
More informationScalus A)ribute Workshop. Paris, April 14th 15th
Scalus A)ribute Workshop Paris, April 14th 15th Content Mo=va=on, objec=ves, and constraints Scalus strategy Scenario and architectural views How the architecture works Mo=va=on for this MCITN Storage
More informationSan Jacinto College Banner & Enterprise Applica5on Review Task Force Report. November 01, 2011 FINAL
San Jacinto College Banner & Enterprise Applica5on Review Task Force Report November 01, 2011 FINAL 1 Content Review goal and approach 3 Barriers to effec5ve use of Banner: Consultant observa5ons 10 Consultant
More informationAdvanced Fraud Detection & Prevention Through Big Data
Advanced Fraud Detection & Prevention Through Big Data Mark Johnson Director, Engineered Systems, Oracle Public Sector 1 Copyright 2011, Oracle and/or its affiliates. All rights reserved. The following
More informationExpanding Assessment of Analy3cal Skills among Biology Majors: From Introductory labs to Upper Division Elec3ves
Expanding Assessment of Analy3cal Skills among Biology Majors: From Introductory labs to Upper Division Elec3ves Presented by Kathleen McAuley PI: Serena Moseman- Val3erra, Ph.D. Department of Biological
More informationSuppor&ng a social media research environment by mining big textual data. Sophia Ananiadou Na-onal Centre for Text Mining www.nactem.ac.
Suppor&ng a social media research environment by mining big textual data Sophia Ananiadou Na-onal Centre for Text Mining www.nactem.ac.uk Mo-va-on Much social media data consists of unstructured, noisy
More informationThe model of SWOT-analysis is the most
Ten Mistakes at the Usage of the SWOT-Analysis in the Strategic Marketing Planning in the Healthcare Institutions Chief Assist. Prof. Alexander Valkov, Ph.D. Department of Public Administration and Regional
More information