Theo JD Bothma Department of Informa1on Science

Size: px
Start display at page:

Download "Theo JD Bothma Department of Informa1on Science theo.bothma@up.ac.za"

Transcription

1 Theo JD Bothma Department of Informa1on Science Reflec1ons on the role of corpora and big data in e- lexicography in rela1on to end user informa1on needs CILC th Interna1onal Conference on Corpus Linguis1cs Valladolid,, 5 7 March 2015

2 Overview Introduc)on Clarifica)on of terms Access to linked corpora Access to frequency pa9erns Technical issues Conclusion

3 Introduc1on Focus of presenta)on Func)on theory Filtering of data Data on demand Lexicotainment

4 Focus of presenta1on How corpora and big data can be used to supplement dic)onary data Specifically for end user informa)on needs On demand Why users would need such data Research that needs to be done To provide useful tools Focus on end user

5 Outside scope of presenta1on The use of corpora by lexicographers to create dic)onaries Predefined corpora or web as corpus Many papers at the conference The use of corpora and big data in digital humani)es research Lexicographic research The use of big data by commercial en))es

6 Func1on theory Communica)ve situa)ons where a need to solve a communica)on problem may occur Text recep)on Text produc)on Transla)on Cogni)ve situa)ons where a need for knowledge may occur Opera)ve situa)ons Interpre)ve situa)ons

7 Filtering of data...uncovering the needs users have in the last 20 percent of the look- ups, i.e. in one out of five consulta)ons...discover the needs that only show up in one out of a hundred or one out of a thousand consulta)ons (Tarp, 2009a)...ar)cles that are especially adapted individualiza)on of the lexical product, adap)ng to the concrete needs of a concrete user (Tarp, 2009b)

8 Data on demand One cannot develop separate dic)onaries for 1 in a 1000 queries Data on demand Filtering data through search and presenta)on op)ons More bu9on Link to internal data Link to external data, including corpora Link to lexicographic user support tools

9 Lexicotainment Commercial publishers develop many data on demand op)ons for online dic)onaries OWen interac)ve There must be a need for such func)ons Why develop them if not used? OWen / usually free Marke)ng Vast array of gadgets / tools / types of informa)on

10 Word of the day Blog Language )ps Quizzes Crosswords Trending Interna)onal Local Slide shows Dic1onary.com

11

12 Dic1onary.com local lookups

13 Merriam- Webster

14 Merriam- Webster (2)

15 Clarifica1on of terms / Examples Corpus Big data Structured / unstructured data Data analy)cs Text / data mining

16 Corpus No defini)on required Oxford English Corpus...over 2 billion words of real 21st century English. It is not only size that ma9ers, though: it is the size of the corpus coupled with the careful selec)on and development of its contents which means that it is a resource unlike any other in the world. (h9p:// the- oxford- english- corpus) UMBC WebBase Corpus over three billion words, 48GB (h9p://ebiquity.umbc.edu/resource/html/id/351) Library of Congress 10 TB, probably about 3 Petabytes (3,000 TB) if all mul)media is included (h9p://blogs.loc.gov/digitalpreserva)on/2012/03/how- many- libraries- of- congress- does- it- take/)

17 Google Books corpus The total collec)on contains more than 6% of all books ever published. (Lin, Y et al. 2012)

18 Big data Big data is data that exceeds the processing capacity of conven)onal database systems. The data is too big, moves too fast, or doesn t fit the strictures of your database architectures. To gain value from this data, you must choose an alterna)ve way to process it. (Edd Dumbill Making sense of big data. Big data 1(1). h9p://online.liebertpub.com/doi/pdf/ /big )

19 Big data (2) Big data is high- volume, - velocity and - variety informa)on assets that demand cost- effec)ve, innova)ve forms of informa)on processing for enhanced insight and decision making. (Beyer, MA & Laney, D "The Importance of 'Big Data': A Defini)on. Gartner....the real- )me and high frequency nature of the data is also key. For example, nowcas)ng is used extensively and adds considerable power to predic)on. Similarly the high frequency of data allows users to test theories in near real- )me and to a level never before possible. (The Challenges and awards of big data )

20 Examples The NSA very casually dropped a number: Every six hours, the agency collects as much data as is stored in the en)re Library of Congress. (h9p:// six- hours- nsa- gathers- much- data- stored- en)re- library- congress) ebay.com uses two data warehouses at 7.5 petabytes and 40PB as well as a 40PB Hadoop cluster for search, consumer recommenda)ons, and merchandising. (Tay, L Inside ebay s 90PB data warehouse. h9p:// ebay8217s- 90pb- data- warehouse.aspx)

21 Square Kilometre Array The SKA represents the ul)mate Big Data challenge (h9p:// DOME.pdf) The project is expected to deliver up to an exabyte a day of raw data, compressed to some 10 petabytes of data in images for storage. "This telescope will generate the same amount of data in a day as the en)re planet does in a year. We es)mate that there will be more data flowing inside the telescope network than the en)re internet in 2020." (h9p:// ska_telescope_generate_more_data_than_en)re_internet_2020/) The data collected by the SKA in a single day would take nearly two million years to playback on an ipod. (h9ps://

22 h9p://datacook.blogspot.com/

23 Structured / unstructured data Structured data Data that resides in a fixed field within a record or file. Unstructured data All those things that can't be so readily classified and fit into a neat box: photos and graphic images, videos, streaming instrument data, webpages, pdf files, PowerPoint presenta)ons, s, blog entries, wikis and word processing documents. Semi- structured data A cross between the two. Tags or other types of markers are used to iden)fy certain elements within the data, but the data doesn t have a rigid structure. (h9p://

24 Data analy1cs Predic)ve analy)cs, data mining, text mining, forecas)ng and data op)miza)on. (h9p:// Big data uses induc)ve sta)s)cs and concepts from nonlinear system iden)fica)on to infer laws (regressions, nonlinear rela)onships, and causal effects) from large sets of data with low informa)on density to reveal rela)onships, dependencies and perform predic)ons of outcomes and behaviors (h9p://en.wikipedia.org/wiki/big_data#science)

25 Text / data mining Data Mining is an analy)c process designed to explore data (usually large amounts of data - typically business or market related - also known as "big data") in search of consistent pa9erns and/or systema)c rela)onships between variables, and then to validate the findings by applying the detected pa9erns to new subsets of data. The ul)mate goal of data mining is predic)on - and predic)ve data mining is the most common type of data mining and one that has the most direct business applica)ons. (StatSoW, Inc. (2013). Electronic Sta)s)cs Textbook. Tulsa, OK: StatSoW. h9p://

26 Big data in e- lexicography Yes Volumes Database requirements Processing requirements Research Digital humani)es Word studies No Speed of change Nature of user requirements Ac)onable

27 Two examples Access to linked corpora Access to frequency pa9erns Data available on demand Examples from exis)ng dic)onaries / tools Characteris)cs Why end users would want to do this Problems

28

29

30

31

32 Frequency pa]erns Only one commercial applica)on discussed: Google Ngram viewer Mul)ple databases Some tagged for PoS and syntac)c dependencies 12 language universal part- of- speech tags and unlabeled head- modifier dependencies (Lin, Y et al Syntac)c Annota)ons for the Google Books Ngram Corpus. In Proceedings of the 50th Annual Mee7ng of the Associa7on for Computa7onal Linguis7cs, pp , Jeju, Republic of Korea, 8-14 July Associa)on for Computa)onal Linguis)cs)

33 Racialism / Racism OED: Racialism = racism n. An earlier term than racism n., but now largely superseded by it Examples form Racism Examples from Not dis)nguished from racialism Approximately similar entries for racialist / racist

34 racialism / racism

35 Walkman / Ipod

36

37 catch up with X / catch X up On demand

38 Why end users would want to do this Cogni)ve / lexicotainment Simply interested to learn more about word usage and word history See word in context over )me Understand word usage be9er Text produc)on Decide between alterna)ves Situate word in historical context when wri)ng a text

39 Problems No genre- specific search op)ons General trade, fic)on, academic, newspaper, etc. No usage- tagged search op)on Formal, colloquial and slang, regional, etc. Direct speech Date of wri)ng not dis)nguished from date of context

40 Corpora and frequency tables Posi)ve Examples of actual usage Genre- specific dis)nc)ons DWDS Limited availability for Google Ngram viewer Fic)on / non- fic)on Bri)sh / American Drill- down to context Nega)ve Time dis)nc)ons Limited drill- down op)ons Limited granularity

41 Racialism / Racism OED no help both occur Example from fic)on: Text wri9en in 2012, set in USA in 1941 (vol. 2 of trilogy) Racialist counted as occurrence in 2012 Racialist used in direct speech, racism in narra)ve Text wri9en in 2014, set in USA in 1961 (vol. 3) Uses racism exclusively (direct speech and narra)ve) Current UK TV programme racialist Does corpus reflect usage in the 1940s / 1960s / 2000s?

42 Walkman / ipod Evident rise of ipod vs Walkman Does the higher use of Walkman even in 2008 reflect actual technology prolifera)on? To what extent do books (vs other media) reflect actual usage?

43 Conjugated / Inflected forms Excellent feature Aggregated usage? PoS tagging Future tagging? Syntac)c (cf. Google Books) Seman)c Etc.

44 Catch X up / Catch up with X Mixed bag un)l 1930s ThereaWer clear preference for catch up with X Current Bri)sh colloquial catch you up? Is there a dis)nc)on between formal and colloquial? Would a different corpus reveal a different pa9ern? Is there a difference between the two items in direct speech compared to narra)ve?

45 Technical issues Corpus and data set selec)on Corpus clean- up Corpus markup Search func)ons and filtering Data presenta)on and usability issues

46 Corpus and data set selec1on Decide on intended use Lexicographer, researcher, professional, end user Decide on characteris)cs Contemporary / diachronic Genre- specific / general Formal / Informal (e.g. social media) Size Selec)ve or as large as possible Hardware / sowware Copyright

47 Corpus clean- up Digi)sa)on Image clean- up OCR Quality control of both images and OCR Digi)zed materials Remove noise HTML Long- term preserva)on and cura)on of originals

48 Corpus markup Linguis)c processing Tokeniza)on PoS tagging Lemma)za)on Addi)onal gramma)cal markup Metadata markup Standard Bibliographic data Gramma)cal tagging Addi)onal markup Genre Date of seyng Direct speech vs narra)ve

49 Filtering and search func1ons Selec)on of corpus or mul)ple corpora Based on markup Metadata Gramma)cal Fine grained Subset of corpora Date Genre Etc. Complex combina)on of criteria

50 Data presenta1on and usability issues Incorporated into dic)onary interface How Display only required data Drill- down to actual texts on demand From corpus examples From frequency tables User studies

51 Examples Show all examples of: catch PRON up and catch up with PRON In a newspaper corpus Da)ng between 1940 and 1950 Show all examples of: racialist and racist Bri)sh fic)on compared to Bri)sh newspapers Between 1940 and 1960 Dis)nguished between direct speech and narra)ve

52 Conclusion Many addi)onal tools possible Mul)discplinary research Impact on markup Database technologies Hardware requirements User and usability studies Develop prototypes Innova)on Think cri)cally about the way forward

53 Lexicographers can considerably enhance the user experience by making non- tradi)onal data available to their end users through exploi)ng the technologies and data accessible through corpora, big data sets and the internet

54 Thank you! Ques1ons / comments? theo.bothma@up.ac.za

Opportuni)es and Challenges of Textual Big Data for the Humani)es

Opportuni)es and Challenges of Textual Big Data for the Humani)es Opportuni)es and Challenges of Textual Big Data for the Humani)es Dr. Adam Wyner, Department of Compu)ng Prof. Barbara Fennell, Department of Linguis)cs THiNK Network Knowledge Exchange in the Humani)es

More information

Data Warehousing. Yeow Wei Choong Anne Laurent

Data Warehousing. Yeow Wei Choong Anne Laurent Data Warehousing Yeow Wei Choong Anne Laurent Databases Databases are developed on the IDEA that DATA is one of the cri>cal materials of the Informa>on Age Informa>on, which is created by data, becomes

More information

Project Management Introduc1on

Project Management Introduc1on Project Management Introduc1on Session 1 Part I Introduc1on By Amal Le Collen, PMP Dr. Lauren1u Neamtu, PMP Session outline 1. PART I: Introduc1on 1. The Purpose of the PMBOK Guide 2. What is a project?

More information

High Performance Compu2ng and Big Data. High Performance compu2ng Curriculum UvA- SARA h>p://www.hpc.uva.nl/

High Performance Compu2ng and Big Data. High Performance compu2ng Curriculum UvA- SARA h>p://www.hpc.uva.nl/ High Performance Compu2ng and Big Data High Performance compu2ng Curriculum UvA- SARA h>p://www.hpc.uva.nl/ Big data was big news in 2012 and probably in 2013 too. The Harvard Business Review talks about

More information

Ins+tuto Superior Técnico Technical University of Lisbon. Big Data. Bruno Lopes Catarina Moreira João Pinho

Ins+tuto Superior Técnico Technical University of Lisbon. Big Data. Bruno Lopes Catarina Moreira João Pinho Ins+tuto Superior Técnico Technical University of Lisbon Big Data Bruno Lopes Catarina Moreira João Pinho Mo#va#on 2 220 PetaBytes Of data that people create every day! 2 Mo#va#on 90 % of Data UNSTRUCTURED

More information

Using Social Media to Drive Recommender Systems for Mobile Apps. - GRP Presenta=on - Jovian Lin (A0026542M)

Using Social Media to Drive Recommender Systems for Mobile Apps. - GRP Presenta=on - Jovian Lin (A0026542M) Using Social Media to Drive Recommender Systems for Mobile Apps - GRP Presenta=on - Jovian Lin (A0026542M) Structure of Presenta=on Introduc=on Why Recommender Systems (RS)? Problems in Recommending Our

More information

How To Use Splunk For Android (Windows) With A Mobile App On A Microsoft Tablet (Windows 8) For Free (Windows 7) For A Limited Time (Windows 10) For $99.99) For Two Years (Windows 9

How To Use Splunk For Android (Windows) With A Mobile App On A Microsoft Tablet (Windows 8) For Free (Windows 7) For A Limited Time (Windows 10) For $99.99) For Two Years (Windows 9 Copyright 2014 Splunk Inc. Splunk for Mobile Intelligence Bill Emme< Director, Solu?ons Marke?ng Panos Papadopoulos Director, Product Management Disclaimer During the course of this presenta?on, we may

More information

Application of Supply Chain Concepts to the Analysis Process

Application of Supply Chain Concepts to the Analysis Process Application of Supply Chain Concepts to the Analysis Process Rob Handfield, PhD Bank of America University Distinguished Professor of Supply Chain Management Executive Director, Supply Chain Resource Cooperative

More information

Big Data. The Big Picture. Our flexible and efficient Big Data solu9ons open the door to new opportuni9es and new business areas

Big Data. The Big Picture. Our flexible and efficient Big Data solu9ons open the door to new opportuni9es and new business areas Big Data The Big Picture Our flexible and efficient Big Data solu9ons open the door to new opportuni9es and new business areas What is Big Data? Big Data gets its name because that s what it is data that

More information

Discovering Computers Fundamentals, 2010 Edition. Living in a Digital World

Discovering Computers Fundamentals, 2010 Edition. Living in a Digital World Discovering Computers Fundamentals, 2010 Edition Living in a Digital World Objec&ves Overview Discuss the importance of project management, feasibility assessment, documenta8on, and data and informa8on

More information

Honeycomb Crea/ve Works is financed by the European Union s European Regional Development Fund through the INTERREG IVA Cross- border Programme

Honeycomb Crea/ve Works is financed by the European Union s European Regional Development Fund through the INTERREG IVA Cross- border Programme Honeycomb Crea/ve Works is financed by the European Union s European Regional Development Fund through the INTERREG IVA Cross- border Programme managed by the Special EU Programmes Body. Web Analy*cs In

More information

BIG DATA AND INVESTIGATIVE ANALYTICS

BIG DATA AND INVESTIGATIVE ANALYTICS The New Fron+er BIG DATA AND INVESTIGATIVE ANALYTICS A Publication of Infobright Table of Contents Introduc+on 3 Chapter 1: What Is Inves+ga+ve Analy+cs?. 4 Chapter 2: Top Five Requirements for Inves+ga+ve

More information

Texas Digital Government Summit. Data Analysis Structured vs. Unstructured Data. Presented By: Dave Larson

Texas Digital Government Summit. Data Analysis Structured vs. Unstructured Data. Presented By: Dave Larson Texas Digital Government Summit Data Analysis Structured vs. Unstructured Data Presented By: Dave Larson Speaker Bio Dave Larson Solu6ons Architect with Freeit Data Solu6ons In the IT industry for over

More information

Social Media Analy.cs (SMA)

Social Media Analy.cs (SMA) Social Media Analy.cs (SMA) Emanuele Della Valle DEIB - Politecnico di Milano emanuele.dellavalle@polimi.it hap://emanueledellavalle.org What's social media? haps://www.youtube.com/watch?v=sgniiud_oqg

More information

Founda'onal IT Governance A Founda'onal Framework for Governing Enterprise IT Adapted from the ISACA COBIT 5 Framework

Founda'onal IT Governance A Founda'onal Framework for Governing Enterprise IT Adapted from the ISACA COBIT 5 Framework Founda'onal IT Governance A Founda'onal Framework for Governing Enterprise IT Adapted from the ISACA COBIT 5 Framework Steven Hunt Enterprise IT Governance Strategist NASA Ames Research Center Michael

More information

Extrac'ng People s Hobby and Interest Informa'on from Social Media Content

Extrac'ng People s Hobby and Interest Informa'on from Social Media Content Extrac'ng People s Hobby and Interest Informa'on from Social Media Content Thomas Forss, Shuhua Liu and Kaj- Mikael Björk Dept of Business Administra?on and Analy?cs Arcada University of Applied Sciences

More information

An Introduc@on to Big Data, Apache Hadoop, and Cloudera

An Introduc@on to Big Data, Apache Hadoop, and Cloudera An Introduc@on to Big Data, Apache Hadoop, and Cloudera Ian Wrigley, Curriculum Manager, Cloudera 1 The Mo@va@on for Hadoop 2 Tradi@onal Large- Scale Computa@on Tradi*onally, computa*on has been processor-

More information

Introduc)on to the IoT- A methodology

Introduc)on to the IoT- A methodology 10/11/14 1 Introduc)on to the IoTA methodology Olivier SAVRY CEA LETI 10/11/14 2 IoTA Objec)ves Provide a reference model of architecture (ARM) based on Interoperability Scalability Security and Privacy

More information

Data Management in the Cloud: Limitations and Opportunities. Annies Ductan

Data Management in the Cloud: Limitations and Opportunities. Annies Ductan Data Management in the Cloud: Limitations and Opportunities Annies Ductan Discussion Outline: Introduc)on Overview Vision of Cloud Compu8ng Managing Data in The Cloud Cloud Characteris8cs Data Management

More information

MSc Data Science at the University of Sheffield. Started in September 2014

MSc Data Science at the University of Sheffield. Started in September 2014 MSc Data Science at the University of Sheffield Started in September 2014 Gianluca Demar?ni Lecturer in Data Science at the Informa?on School since 2014 Ph.D. in Computer Science at U. Hannover, Germany

More information

UNIFIED, END- TO- END EDISCOVERY

UNIFIED, END- TO- END EDISCOVERY ac.onable informa.on governance Partners Providing Excellence in: UNIFIED, END- TO- END EDISCOVERY 2011 IBM Corpora.on Meet the Presenters Amir Jaibaji Vice President, Product Management StoredIQ Kevin

More information

An Open Dynamic Big Data Driven Applica3on System Toolkit

An Open Dynamic Big Data Driven Applica3on System Toolkit An Open Dynamic Big Data Driven Applica3on System Toolkit Craig C. Douglas University of Wyoming and KAUST This research is supported in part by the Na3onal Science Founda3on and King Abdullah University

More information

How To Use A Webmail On A Pc Or Macodeo.Com

How To Use A Webmail On A Pc Or Macodeo.Com Big data workloads and real-world data sets Gang Lu Institute of Computing Technology, Chinese Academy of Sciences BigDataBench Tutorial MICRO 2014 Cambridge, UK INSTITUTE OF COMPUTING TECHNOLOGY 1 Five

More information

Stream Deployments in the Real World: Enhance Opera?onal Intelligence Across Applica?on Delivery, IT Ops, Security, and More

Stream Deployments in the Real World: Enhance Opera?onal Intelligence Across Applica?on Delivery, IT Ops, Security, and More Copyright 2015 Splunk Inc. Stream Deployments in the Real World: Enhance Opera?onal Intelligence Across Applica?on Delivery, IT Ops, Security, and More Stela Udovicic Sr. Product Marke?ng Manager Clayton

More information

The importance of supply chain

The importance of supply chain Guidelines for Improving Supply Chain Management at Bulgarian Enterprises Chief Assist. Prof. Miroslava Rakovska, Ph.D. Department of Business Logistics, UNWE tеl. (02) 9435248, 0888704178 e-mail: mirar@unwe.acad.bg

More information

Pu?ng B2B Research to the Legal Test

Pu?ng B2B Research to the Legal Test With the global leader in sampling and data services Pu?ng B2B Research to the Legal Test Ashlin Quirk, SSI General Counsel 2014 Survey Sampling Interna6onal 1 2014 Survey Sampling Interna6onal Se?ng the

More information

The Library (Big) Data scien4st

The Library (Big) Data scien4st The Library (Big) Data scien4st IFLA/ALA webinar: Big Data: new roles and opportuni4es for new librarians June 15 th 2016 IFLA Big Data Special Interest Group (SIG) Wouter Klapwijk, Stellenbosch University,

More information

NextGen Infrastructure for Big DATA Analytics.

NextGen Infrastructure for Big DATA Analytics. NextGen Infrastructure for Big DATA Analytics. So What is Big Data? Data that exceeds the processing capacity of conven4onal database systems. The data is too big, moves too fast, or doesn t fit the structures

More information

Welcome! Accelera'ng Pa'ent- Centered Outcomes Research and Methodological Research. Andrea Heckert, PhD, MPH Program Officer, Science

Welcome! Accelera'ng Pa'ent- Centered Outcomes Research and Methodological Research. Andrea Heckert, PhD, MPH Program Officer, Science Accelera'ng Pa'ent- Centered Outcomes Research and Methodological Research Emily Evans, PhD, MPH Program Officer, Science Andrea Heckert, PhD, MPH Program Officer, Science June 22, 2015 Welcome! Emily

More information

Keeping Pace with Big Data

Keeping Pace with Big Data - A Data Mining Perspec>ve Huan Liu, Tempe, AZ hep://www.public.asu.edu/~huanliu NSF Workshop on Big Data Analy6cs for Infrastructure and Building Resilience and Sustainability, Beijing, China Sept 19-20,

More information

Hands On- Google Grants Google Adwords for Non- Pro5its

Hands On- Google Grants Google Adwords for Non- Pro5its Hands On- Google Grants Google Adwords for Non- Pro5its Search Adver5sing Approach and Strategy Katherine Cleland ClelandMarke5ng 1 Why Google Adwords? Online Search has replaced Yellow Pages 80% of online

More information

Big Data /Data Science Data Intensive (Science) Technologies

Big Data /Data Science Data Intensive (Science) Technologies Big Data /Data Science Data Intensive (Science) Technologies Adam Belloum Ins:tute of Informa:cs University of Amsterdam a.s.z.belloum@uva.nl High Performance compu:ng Curriculum, Jan 2015 hmp://www.hpc.uva.nl/

More information

How To Understand The Big Data Paradigm

How To Understand The Big Data Paradigm Big Data and Its Empiricist Founda4ons Teresa Scantamburlo The evolu4on of Data Science The mechaniza4on of induc4on The business of data The Big Data paradigm (data + computa4on) Cri4cal analysis Tenta4ve

More information

TRANSLATING TECHNOLOGY INTO BUSINESS. Let s make money from Big Data!

TRANSLATING TECHNOLOGY INTO BUSINESS. Let s make money from Big Data! TRANSLATING TECHNOLOGY INTO BUSINESS Let s make money from Big Data! JUNE, 2014 About Transla.ng Technology into Business B Spot helps clients transform technology ideas into business concepts. As part

More information

The DATA Difference Targe.ng for Stronger ROI!

The DATA Difference Targe.ng for Stronger ROI! The DATA Difference Targe.ng for Stronger ROI! Presented by: Dr. John Leininger Department of Graphic Communica

More information

Power to the People: Analy0cs for All

Power to the People: Analy0cs for All Arijit Sengupta CEO, BeyondCore, Inc. Power to the People: Analy0cs for All " Ten patents related to Advanced Analytics, Privacy/Security and BPaaS. " Previously worked at Oracle, Microsoft, Yankee Group

More information

1 Actuate Corpora-on 2013. Big Data Business Analy/cs

1 Actuate Corpora-on 2013. Big Data Business Analy/cs 1 Big Data Business Analy/cs Introducing BIRT Analy3cs Provides analysts and business users with advanced visual data discovery and predictive analytics to make better, more timely decisions in the age

More information

Network Maps for End Users: Collect, Analyze, Visualize and Communicate Network Insights with Zero Coding

Network Maps for End Users: Collect, Analyze, Visualize and Communicate Network Insights with Zero Coding Network Maps for End Users: Collect, Analyze, Visualize and Communicate Network Insights with Zero Coding A project from the Social Media Research Founda8on: h:p://www.smrfounda8on.org About Me Introduc8ons

More information

Migrating to Hosted Telephony. Your ultimate guide to migrating from on premise to hosted telephony. www.ucandc.com

Migrating to Hosted Telephony. Your ultimate guide to migrating from on premise to hosted telephony. www.ucandc.com Migrating to Hosted Telephony Your ultimate guide to migrating from on premise to hosted telephony Intro What is covered in this guide? A professional and reliable business telephone system is a central

More information

Synchronous and asynchronous video conferencing tools in an online-course:! Supporting a community of inquiry!

Synchronous and asynchronous video conferencing tools in an online-course:! Supporting a community of inquiry! Synchronous and asynchronous video conferencing tools in an online-course:! Supporting a community of inquiry! David Wicks, Seattle Pacific University! Andrew Lumpe, Seattle Pacific University! Janiess

More information

Fixed Scope Offering (FSO) for Oracle SRM

Fixed Scope Offering (FSO) for Oracle SRM Fixed Scope Offering (FSO) for Oracle SRM Agenda iapps Introduc.on Execu.ve Summary Business Objec.ves Solu.on Proposal Scope - Business Process Scope Applica.on Implementa.on Methodology Time Frames Team,

More information

DNS Big Data Analy@cs

DNS Big Data Analy@cs Klik om de s+jl te bewerken Klik om de models+jlen te bewerken! Tweede niveau! Derde niveau! Vierde niveau DNS Big Data Analy@cs Vijfde niveau DNS- OARC Fall 2015 Workshop October 4th 2015 Maarten Wullink,

More information

XML, Seman9c Web and Content Analy9cs

XML, Seman9c Web and Content Analy9cs XML, Seman9c Web and Content Analy9cs XML Prague Pre- conference 2014 Felix Sasaki DFKI / W3C Fellow 1 What do you need to follow this session? Ideal: a computer with internet access, to be able to provide

More information

Mission. To provide higher technological educa5on with quality, preparing. competent professionals, with sound founda5ons in science, technology

Mission. To provide higher technological educa5on with quality, preparing. competent professionals, with sound founda5ons in science, technology Mission To provide higher technological educa5on with quality, preparing competent professionals, with sound founda5ons in science, technology and innova5on, commi

More information

Hunk & Elas=c MapReduce: Big Data Analy=cs on AWS

Hunk & Elas=c MapReduce: Big Data Analy=cs on AWS Copyright 2014 Splunk Inc. Hunk & Elas=c MapReduce: Big Data Analy=cs on AWS Dritan Bi=ncka BD Solu=ons Architecture Disclaimer During the course of this presenta=on, we may make forward looking statements

More information

Urban Big Data Centre

Urban Big Data Centre Urban Big Data Centre Piyushimita Thakuriah (Vonu) Director, UBDC Professor and Ch2M Chair of Transport UNIVERSITY OF GLASGOW November 12, 2015 July 10, 2015 UBDC Partners Funded by ESRC Big Data Network

More information

Confessions of a new (agile) software project manager. Laura Akerman

Confessions of a new (agile) software project manager. Laura Akerman Confessions of a new (agile) software project manager Laura Akerman Agile at Emory Libraries: our own flavor Itera9ons are 2 weeks long. A minor release = a Milestone = 2-3 itera9ons (ideally). We use

More information

Data Mining. Supervised Methods. Ciro Donalek donalek@astro.caltech.edu. Ay/Bi 199ab: Methods of Computa@onal Sciences hcp://esci101.blogspot.

Data Mining. Supervised Methods. Ciro Donalek donalek@astro.caltech.edu. Ay/Bi 199ab: Methods of Computa@onal Sciences hcp://esci101.blogspot. Data Mining Supervised Methods Ciro Donalek donalek@astro.caltech.edu Supervised Methods Summary Ar@ficial Neural Networks Mul@layer Perceptron Support Vector Machines SoLwares Supervised Models: Supervised

More information

Phone Systems Buyer s Guide

Phone Systems Buyer s Guide Phone Systems Buyer s Guide Contents How Cri(cal is Communica(on to Your Business? 3 Fundamental Issues 4 Phone Systems Basic Features 6 Features for Users with Advanced Needs 10 Key Ques(ons for All Buyers

More information

NZ On Air Digital Strategy 2012-2015

NZ On Air Digital Strategy 2012-2015 NZ On Air Digital Strategy 2012-2015 Defining digital Digital has various meanings that originate from different sources. In its purest sense it is simply the dis9nc9on from analogue. Broadcast content

More information

So#ware quality assurance - introduc4on. Dr Ana Magazinius

So#ware quality assurance - introduc4on. Dr Ana Magazinius So#ware quality assurance - introduc4on Dr Ana Magazinius 1 What is quality? 2 What is a good quality car? 2 and 2 2 minutes 3 characteris4cs 3 What is quality? 4 What is quality? How good or bad something

More information

Surfing the Data Tsunami: A New Paradigm for Big Data Processing and Analytics

Surfing the Data Tsunami: A New Paradigm for Big Data Processing and Analytics Surfing the Data Tsunami: A New Paradigm for Big Data Processing and Analytics Dr. Liangxiu Han Future Networks and Distributed Systems Group (FUNDS) School of Computing, Mathematics and Digital Technology,

More information

Data Governance Framework: Bank of Canada

Data Governance Framework: Bank of Canada Data Governance Framework: Bank of Canada The views and opinions expressed herein are those of the author and do not necessarily reflect the official policy or posi8on of the Bank of Canada or any agency

More information

Data archiving and reproducible research for ecology and evolu6on. March 23 rd 2010 Ian Dworkin

Data archiving and reproducible research for ecology and evolu6on. March 23 rd 2010 Ian Dworkin Data archiving and reproducible research for ecology and evolu6on. March 23 rd 2010 Ian Dworkin Outline of the ques6ons for the workshop 1. Why should I share my data? 2. When should I share my data? 3.

More information

elearning: present and future

elearning: present and future elearning: present and future Defini2on E- learning can be defined as the use of computer and Internet technologies to deliver a broad array of solu2ons to enable learning and improve performance. (FAO)

More information

DTCC Data Quality Survey Industry Report

DTCC Data Quality Survey Industry Report DTCC Data Quality Survey Industry Report November 2013 element 22 unlocking the power of your data Contents 1. Introduction 3 2. Approach and participants 4 3. Summary findings 5 4. Findings by topic 6

More information

How to write an effec-ve DIGITAL MARKETING STRATEGY. Secrets from the professionals

How to write an effec-ve DIGITAL MARKETING STRATEGY. Secrets from the professionals How to write an effec-ve DIGITAL MARKETING STRATEGY Secrets from the professionals Wri-ng an effec-ve digital media strategy comes down to three things: content, connec-ons and consistency. When building

More information

Making Sense of Big Data. Dr. Thomas E. Potok Computa2onal Data Analy2cs Group Leader Oak Ridge Na2onal Laboratory potokte@ornl.

Making Sense of Big Data. Dr. Thomas E. Potok Computa2onal Data Analy2cs Group Leader Oak Ridge Na2onal Laboratory potokte@ornl. Making Sense of Big Data Dr. Thomas E. Potok Computa2onal Data Analy2cs Group Leader Oak Ridge Na2onal Laboratory potokte@ornl.gov 865-574- 0834 ORNL s Big Data Legacy Science National Security Energy

More information

BPO. Accerela*ng Revenue Enhancements Through Sales Support Services

BPO. Accerela*ng Revenue Enhancements Through Sales Support Services BPO Accerela*ng Revenue Enhancements Through Sales Support Services What is BPO? Business Process Outsorcing (BPO) is the process of outsourcing specific business func6ons to a third- party service provider

More information

CS 5150 So(ware Engineering System Architecture: Introduc<on

CS 5150 So(ware Engineering System Architecture: Introduc<on Cornell University Compu1ng and Informa1on Science CS 5150 So(ware Engineering System Architecture: Introduc

More information

B2B Offerings. Helping businesses op2mize. Infolob s amazing b2b offerings helps your company achieve maximum produc2vity

B2B Offerings. Helping businesses op2mize. Infolob s amazing b2b offerings helps your company achieve maximum produc2vity B2B Offerings Helping businesses op2mize Infolob s amazing b2b offerings helps your company achieve maximum produc2vity What is B2B? B2B is shorthand for the sales prac4ce called business- to- business

More information

Predictions for the Digital Workplace 2015

Predictions for the Digital Workplace 2015 Predictions for the Digital Workplace 2015 Jim Lundy CEO and Lead Analyst David Mario Smith Research Director, Lead Analyst Speakers for Today Jim Lundy David Smith CEO, Lead Analyst Research Director,

More information

Big Data and Health Insurance Product Selec6on (and a few other applica6on) Jonathan Kolstad UC Berkeley and NBER

Big Data and Health Insurance Product Selec6on (and a few other applica6on) Jonathan Kolstad UC Berkeley and NBER Big Data and Health Insurance Product Selec6on (and a few other applica6on) Jonathan Kolstad UC Berkeley and NBER Introduc6on Applica6ons of behavioral economics in health SeIng where behavioral assump6ons

More information

.nl ENTRADA. CENTR-tech 33. November 2015 Marco Davids, SIDN Labs. Klik om de s+jl te bewerken

.nl ENTRADA. CENTR-tech 33. November 2015 Marco Davids, SIDN Labs. Klik om de s+jl te bewerken Klik om de s+jl te bewerken Klik om de models+jlen te bewerken Tweede niveau Derde niveau Vierde niveau.nl ENTRADA Vijfde niveau CENTR-tech 33 November 2015 Marco Davids, SIDN Labs Wie zijn wij? Mijlpalen

More information

Promo%ng Your OCS Business through Digital & Social Media. Presented by: John Healy

Promo%ng Your OCS Business through Digital & Social Media. Presented by: John Healy Promo%ng Your OCS Business through Digital & Social Media Presented by: John Healy Who Is John Healy? jhealy@healyco.com 25+ years in marke;ng, communica;ons & PR consul;ng Specialty: Helping clients balance

More information

The Elusive U,lity Customer: How Big Data & Analy,cs Connects U,li,es & Their Customers

The Elusive U,lity Customer: How Big Data & Analy,cs Connects U,li,es & Their Customers The Place Analy,cs Leaders Turn to for Answers Member.U(lityAnaly(cs.com The Elusive U,lity Customer: How Big & Analy,cs Connects U,li,es & Their Customers Mike Smith Vice President, U(lity Analy(cs Ins(tute

More information

ECEC 22013. Accelera@ng Europe s Cloud Future. Chambre du Commerce September. October. 14. Mai 2013, Konzerthaus. 1st 2014

ECEC 22013. Accelera@ng Europe s Cloud Future. Chambre du Commerce September. October. 14. Mai 2013, Konzerthaus. 1st 2014 ECEC 22013 014 ECDC Partnerschaft in in der der Cloud Cloud Partnerschaft Accelera@ng Europe s Cloud Future Europe Congress & A ward Ceremony 14. Mai 2013, Konzerthaus Chambre du Commerce 14. Mai 2013,

More information

The Data Reservoir. 10 th September 2014. Mandy Chessell FREng CEng FBCS Dis4nguished Engineer, Master Inventor Chief Architect, Informa4on Solu4ons

The Data Reservoir. 10 th September 2014. Mandy Chessell FREng CEng FBCS Dis4nguished Engineer, Master Inventor Chief Architect, Informa4on Solu4ons Mandy Chessell FREng CEng FBCS Dis4nguished Engineer, Master Inventor Chief Architect, Solu4ons The Reservoir 10 th September 2014 A growing demand Business Teams want Open access to more informa4on More

More information

Offensive & Defensive & Forensic Techniques for Determining Web User Iden<ty

Offensive & Defensive & Forensic Techniques for Determining Web User Iden<ty Offensive & Defensive & Forensic Techniques for Determining Web User Iden

More information

What will I learn as an Computer Engineering student?

What will I learn as an Computer Engineering student? What will I learn as an Computer Engineering student? Department of Electrical and Computer Engineering Tu8s School of Engineering Trying to decide on a major? Most college course descrip?ons are full

More information

SBML SBGN SBML Just my 2 cents. Alice C. Villéger COMBINE 2010

SBML SBGN SBML Just my 2 cents. Alice C. Villéger COMBINE 2010 SBML SBGN SBML Just my 2 cents Alice C. Villéger COMBINE 2010 Disclaimer Fuzzy talk work in progress last minute slides Someone else has been working on very similar stuff and should really have been talking

More information

Teaching Analy-cs, Big Data and Sustainability: An IS perspec-ve

Teaching Analy-cs, Big Data and Sustainability: An IS perspec-ve Teaching Analy-cs, Big Data and Sustainability: An IS perspec-ve Raja Sooriamurthi / Randy Weinberg Informa(on Systems Program Carnegie Mellon University {raja,rweinberg}@cmu.edu Presenta-on Outline The

More information

Beyond Strategy: Building Your Mobile Capabili6es

Beyond Strategy: Building Your Mobile Capabili6es Beyond Strategy: Building Your Mobile Capabili6es TASSCC Technology Educa6on Conference April 10, 2015 Presented by: Raj Polikepa6 Director of App Development Texas.gov Agenda ê Objec6ves of Mobile Strategy

More information

From Big Data to Value

From Big Data to Value From Big Data to Value The Power of Master Data Management 2.0 Sergio Juarez SVP Elemica EMEA & LATAM Reveal Oct 2014 Agenda Master Data Management Why Now? What To Do? How To Do It? What s Next? Today

More information

Governance as Leadership: Reframing the Work of Nonprofit Boards

Governance as Leadership: Reframing the Work of Nonprofit Boards Governance as Leadership: Reframing the Work of Nonprofit Boards Tradi

More information

This presenta,on covers the essen,al informa,on about IT services and facili,es which all new students will need to get started.

This presenta,on covers the essen,al informa,on about IT services and facili,es which all new students will need to get started. This presenta,on covers the essen,al informa,on about IT services and facili,es which all new students will need to get started. 1 Most of the informa,on is covered in more depth on the Informa,on Services

More information

Splunk for Data Science

Splunk for Data Science Copyright 2014 Splunk Inc. Splunk for Data Science Tom LaGa=a Data Scien@st, Splunk Olivier de Garrigues Sr Prof Services Consultant, Splunk Disclaimer During the course of this presenta@on, we may make

More information

We are pleased to offer the following program to Woodstock Area Educators:

We are pleased to offer the following program to Woodstock Area Educators: DATE: Spring 2016 TO: RE: Woodstock Area Educators Upcoming Cohort Programs Presently, many teachers are enrolled in cohort graduate programs through partnerships between local regional offices of education,

More information

Big Data in medical image processing

Big Data in medical image processing Big Data in medical image processing Konstan3n Bychenkov, CEO Aligned Research Group LLC bychenkov@alignedresearch.com Big data in medicine Genomic Research Popula3on Health Images M- Health hips://cloud.google.com/genomics/v1beta2/reference/

More information

HP Vertica at MIT Sloan Sports Analytics Conference March 1, 2013 Will Cairns, Senior Data Scientist, HP Vertica

HP Vertica at MIT Sloan Sports Analytics Conference March 1, 2013 Will Cairns, Senior Data Scientist, HP Vertica HP Vertica at MIT Sloan Sports Analytics Conference March 1, 2013 Will Cairns, Senior Data Scientist, HP Vertica So What s the market s definition of Big Data? Datasets whose volume, velocity, variety

More information

Advanced Project Management Training Course

Advanced Project Management Training Course Advanced Project Management Training Course 1-34 Advanced Project Management Crea/ng the Scope Baseline 2-34 Crea/ng the Scope Baseline Module 1 Introduction Module 2 Creating the Project Charter Module

More information

Social Media for Business - Primer. Becky Livingston President & CEO Penheel Marke:ng April 2014

Social Media for Business - Primer. Becky Livingston President & CEO Penheel Marke:ng April 2014 Social Media for Business - Primer Becky Livingston President & CEO Penheel Marke:ng April 2014 Becky Livingston ü Speaker ü Author ü Educator ü Social Media & Digital Marke:ng Consultant ü 25 years marke:ng

More information

2015-16 ITS Strategic Plan Enabling an Unbounded University

2015-16 ITS Strategic Plan Enabling an Unbounded University 2015-16 ITS Strategic Plan Enabling an Unbounded University Update: July 31, 2015 IniAaAve: Agility Through Technology Vision Mission Enable Unbounded Learning Support student success through the innovaave

More information

The system approach in human resources. Functional Analysis of the System for Human Resources Management. Introduction. Arcles

The system approach in human resources. Functional Analysis of the System for Human Resources Management. Introduction. Arcles Functional Analysis of the System for Human Resources Management Assoc. Prof. Margarita Harizanova, Ph.D. Chief Assist. Prof. Nadya Mironova, Ph.D. Assist. Prof. Tatyana Shtetinska Summary: The arcle presents

More information

Effec%ve AX 2012 Upgrade Project Planning and Microso< Sure Step. Arbela Technologies

Effec%ve AX 2012 Upgrade Project Planning and Microso< Sure Step. Arbela Technologies Effec%ve AX 2012 Upgrade Project Planning and Microso< Sure Step Arbela Technologies Why Upgrade? What to do? How to do it? Tools and templates Agenda Sure Step 2012 Ax2012 Upgrade specific steps Checklist

More information

Kaseya Fundamentals Workshop DAY THREE. Developed by Kaseya University. Powered by IT Scholars

Kaseya Fundamentals Workshop DAY THREE. Developed by Kaseya University. Powered by IT Scholars Kaseya Fundamentals Workshop DAY THREE Developed by Kaseya University Powered by IT Scholars Kaseya Version 6.5 Last updated March, 2014 Day Two Overview Day Two Lab Review Patch Management Configura;on

More information

Mega Modeling for Scien/fic Big Data Processing

Mega Modeling for Scien/fic Big Data Processing Mega Modeling for Scien/fic Big Data Processing Stefano Ceri, Emanuele Della Valle (Politecnico di Milano) Dino Pedreschi, Roberto Trasar/ (ISTI- CNR and University of Pisa) 1 The context 2 Scenario BIG

More information

Home Selling Marke/ng Proposal

Home Selling Marke/ng Proposal Home Selling Marke/ng Proposal Presented by: Nate Harimoto & Shane Haas Aviara Real Estate 2555 Townsgate Road, Suite 200 Westlake Village, CA 91361 www.aviararealestate.net Nate Office & Fax: (805) 418-2675

More information

Scalus Winter School Storage Systems

Scalus Winter School Storage Systems Scalus Winter School Storage Systems Flash Memory André Brinkmann Flash Memory Floa:ng gate of a flash cell is electrically isolated Applying high voltages between source and drain accelerates electrons

More information

The Adop)on Pa-erns of Mobile Telephones by Micro and Small Enterprises in Ghana

The Adop)on Pa-erns of Mobile Telephones by Micro and Small Enterprises in Ghana The Adop)on Pa-erns of Mobile Telephones by Micro and Small Enterprises in Ghana Godfred Kwasi Frempong Science and Technology Policy Research Ins)tute gkfrempong@csir- stepri.org Introduc*on Development

More information

An Integrated Approach to Manage IT Network Traffic - An Overview Click to edit Master /tle style

An Integrated Approach to Manage IT Network Traffic - An Overview Click to edit Master /tle style An Integrated Approach to Manage IT Network Traffic - An Overview Click to edit Master /tle style Agenda A quick look at ManageEngine Tradi/onal Traffic Analysis Techniques & Tools Changing face of Network

More information

Rethink. Recruitment. McFrank & Williams Adver3sing Agency

Rethink. Recruitment. McFrank & Williams Adver3sing Agency Rethink. Recruitment. McFrank & Williams Adver3sing Agency Introduction 1 Recruitment Solutions & Tools with our proprietary technologies Be#er results require different methods McFrank & Williams Advertising

More information

Physiotherapy & Occupational Therapy

Physiotherapy & Occupational Therapy SOS PRESENTS... Physiotherapy & Occupational Therapy Presented by: Avideh Khalili & Helia Ghazinejad Slides will be available at bethune.yorku.ca/ events Agenda Introduc7on Applica9on Process PT OT Opportuni9es

More information

Scalus A)ribute Workshop. Paris, April 14th 15th

Scalus A)ribute Workshop. Paris, April 14th 15th Scalus A)ribute Workshop Paris, April 14th 15th Content Mo=va=on, objec=ves, and constraints Scalus strategy Scenario and architectural views How the architecture works Mo=va=on for this MCITN Storage

More information

San Jacinto College Banner & Enterprise Applica5on Review Task Force Report. November 01, 2011 FINAL

San Jacinto College Banner & Enterprise Applica5on Review Task Force Report. November 01, 2011 FINAL San Jacinto College Banner & Enterprise Applica5on Review Task Force Report November 01, 2011 FINAL 1 Content Review goal and approach 3 Barriers to effec5ve use of Banner: Consultant observa5ons 10 Consultant

More information

Advanced Fraud Detection & Prevention Through Big Data

Advanced Fraud Detection & Prevention Through Big Data Advanced Fraud Detection & Prevention Through Big Data Mark Johnson Director, Engineered Systems, Oracle Public Sector 1 Copyright 2011, Oracle and/or its affiliates. All rights reserved. The following

More information

Expanding Assessment of Analy3cal Skills among Biology Majors: From Introductory labs to Upper Division Elec3ves

Expanding Assessment of Analy3cal Skills among Biology Majors: From Introductory labs to Upper Division Elec3ves Expanding Assessment of Analy3cal Skills among Biology Majors: From Introductory labs to Upper Division Elec3ves Presented by Kathleen McAuley PI: Serena Moseman- Val3erra, Ph.D. Department of Biological

More information

Suppor&ng a social media research environment by mining big textual data. Sophia Ananiadou Na-onal Centre for Text Mining www.nactem.ac.

Suppor&ng a social media research environment by mining big textual data. Sophia Ananiadou Na-onal Centre for Text Mining www.nactem.ac. Suppor&ng a social media research environment by mining big textual data Sophia Ananiadou Na-onal Centre for Text Mining www.nactem.ac.uk Mo-va-on Much social media data consists of unstructured, noisy

More information

The model of SWOT-analysis is the most

The model of SWOT-analysis is the most Ten Mistakes at the Usage of the SWOT-Analysis in the Strategic Marketing Planning in the Healthcare Institutions Chief Assist. Prof. Alexander Valkov, Ph.D. Department of Public Administration and Regional

More information