Big Data Workloads and Real-World Data Sets
Gang Lu, Institute of Computing Technology, Chinese Academy of Sciences
BigDataBench Tutorial, MICRO 2014, Cambridge, UK

Five domains
- Search engine
- Social network
- E-commerce
- Multimedia
- Bioinformatics

Search Engine
- General search and vertical search
- Online server and offline analytics

Search Engine: Parsing
- Parsing: extract the text content and the outgoing links from the raw web pages.

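The parsing step can be illustrated with a minimal sketch, assuming the raw pages are HTML strings. It uses only Python's standard `html.parser`; the class name `PageParser` and the function `parse_page` are hypothetical and are not taken from BigDataBench.

```python
from html.parser import HTMLParser


class PageParser(HTMLParser):
    """Collects visible text and outgoing links from one HTML page."""

    def __init__(self):
        super().__init__()
        self.text_parts = []
        self.out_links = []
        self._skip_depth = 0  # greater than 0 while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1
        elif tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.out_links.append(href)

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is outside script/style blocks.
        if self._skip_depth == 0 and data.strip():
            self.text_parts.append(data.strip())


def parse_page(html):
    """Return (text, out_links) for a raw HTML string."""
    parser = PageParser()
    parser.feed(html)
    parser.close()
    return " ".join(parser.text_parts), parser.out_links
```

Calling `parse_page(raw_html)` returns the page text as one string and the `href` values of its anchors as a list; a production parser would also need to resolve relative URLs and handle malformed markup.
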
Search Engine: Indexing
- Indexing: the process of creating the mapping from each term to its list of document IDs.

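A term-to-document-ID mapping of this kind is commonly called an inverted index. The following is a minimal sketch, assuming documents are plain strings keyed by an integer ID and that terms are split on whitespace; the function name `build_inverted_index` is hypothetical.

```python
from collections import defaultdict


def build_inverted_index(docs):
    """Map each term to a sorted list of the IDs of documents containing it."""
    postings = defaultdict(set)
    for doc_id, text in docs.items():
        for term in text.lower().split():
            postings[term].add(doc_id)
    return {term: sorted(ids) for term, ids in postings.items()}


docs = {1: "big data benchmark", 2: "big graph"}
index = build_inverted_index(docs)
# index["big"] is [1, 2]; index["graph"] is [2]
```
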
Search Engine: PageRank
- PageRank: compute the importance of each page from the web link graph using PageRank.

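As a rough illustration of the computation, here is a small power-iteration sketch of PageRank. The damping factor of 0.85, the fixed iteration count, and the handling of pages without outgoing links are assumptions made for this example, not details from the slides.

```python
def pagerank(links, damping=0.85, iterations=50):
    """links maps each page to the list of pages it links to."""
    pages = set(links)
    for targets in links.values():
        pages.update(targets)
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        # Every page starts each round with the "random jump" share.
        new_rank = {p: (1.0 - damping) / n for p in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                share = damping * rank[page] / len(targets)
                for t in targets:
                    new_rank[t] += share
            else:
                # Dangling page: spread its rank evenly over all pages.
                share = damping * rank[page] / n
                for t in pages:
                    new_rank[t] += share
        rank = new_rank
    return rank


ranks = pagerank({"a": ["b", "c"], "b": ["c"], "c": ["a"]})
```

In this toy graph, page `c` receives links from both `a` and `b`, so it ends up with a higher rank than `b`. The ranks always sum to 1.
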
Search Engine: Search query
- Querying: the online web search server serving users' requests.

Search Engine: Sorting
- Sorting: sort the results according to the page ranks and the relevance between the query and the document.

Search Engine: Recommendation
- Recommendation: recommend related queries to users by mining the search log.

Search Engine: Statistic counting
- Statistic counting: count word frequencies to extract the keywords that represent the features of the page.

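A word-frequency count of this kind might look like the sketch below. The tiny stop-word list and the function name `top_keywords` are illustrative assumptions; the slides do not say how keywords are selected beyond counting frequencies.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stop-word list for the example.
STOP_WORDS = {"the", "a", "of", "and", "to", "in", "is"}


def top_keywords(text, k=3):
    """Return the k most frequent non-stop-words as (word, count) pairs."""
    words = re.findall(r"[a-z]+", text.lower())
    counts = Counter(w for w in words if w not in STOP_WORDS)
    return counts.most_common(k)


keywords = top_keywords("Big data and big workloads: the data sets of big data", k=2)
```

Here `big` and `data` each occur three times, so they are the two keywords returned.
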
Search Engine: Classification
- Classification: classify text content into different categories so that users can filter the results to a specific category they are interested in.

Search Engine: Filter & Semantic
- Filter: identify pages with a specific topic, which can be used for vertical search.
- Semantic extraction: extract semantic information.

Search Engine: Data access
- Data access operations: read, write, and scan the semantic information.

Social network
- Data sets
  - User table
  - Relation table
  - Article table
- Workloads
  - Offline analytics

Social network: Data schema
- User table
- Relation table
- Tweet table

Social network: Workloads
- Hot review topic: select the top N tweets by number of reviews.
- Hot transmit topic: select the tweets that are transmitted more than N times.
- Active user: select the top N persons who post the largest number of tweets.
- Leader of opinion: select the top ones whose numbers of reviews and transmits are both larger than N.

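The four selections above can be sketched over an in-memory list of tweets. The field names (`id`, `user`, `reviews`, `transmits`) are hypothetical, and reading "leader of opinion" as a per-user total is one interpretation of the slide, not a confirmed definition.

```python
from collections import Counter


def hot_review_topics(tweets, n):
    """Top n tweets by number of reviews."""
    return sorted(tweets, key=lambda t: t["reviews"], reverse=True)[:n]


def hot_transmit_topics(tweets, n):
    """Tweets transmitted more than n times."""
    return [t for t in tweets if t["transmits"] > n]


def active_users(tweets, n):
    """Top n users by number of tweets posted, as (user, count) pairs."""
    return Counter(t["user"] for t in tweets).most_common(n)


def opinion_leaders(tweets, n):
    """Users whose total reviews and total transmits both exceed n (assumed reading)."""
    reviews, transmits = Counter(), Counter()
    for t in tweets:
        reviews[t["user"]] += t["reviews"]
        transmits[t["user"]] += t["transmits"]
    return sorted(u for u in reviews if reviews[u] > n and transmits[u] > n)


tweets = [
    {"id": 1, "user": "ann", "reviews": 10, "transmits": 4},
    {"id": 2, "user": "bob", "reviews": 3, "transmits": 9},
    {"id": 3, "user": "ann", "reviews": 7, "transmits": 6},
]
```
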
Social network: Workloads (continued)
- Topic classify: classify the tweets into categories according to topic.
- Sentiment classify: classify the tweets as negative or positive according to sentiment.
- Friend recommendation: recommend friends to a person according to the relational graph.
- Community detection: detect clusters or communities in large social networks.
- Breadth-first search: sort persons according to the distance between two people.

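The breadth-first search workload can be illustrated with a short sketch that computes hop distances from one person and sorts everyone reachable by that distance. The adjacency-list representation and the function names are assumptions for the example.

```python
from collections import deque


def bfs_distances(graph, source):
    """Return the hop distance from source to every reachable person."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        person = queue.popleft()
        for friend in graph.get(person, []):
            if friend not in dist:
                dist[friend] = dist[person] + 1
                queue.append(friend)
    return dist


def sort_by_distance(graph, source):
    """Sort reachable persons by distance from source, ties broken by name."""
    dist = bfs_distances(graph, source)
    return sorted(dist, key=lambda p: (dist[p], p))


graph = {
    "ann": ["bob", "cat"],
    "bob": ["ann", "dan"],
    "cat": ["ann"],
    "dan": ["bob"],
}
order = sort_by_distance(graph, "ann")
```
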
Specification: E-commerce
- Order table
- Item table

E-commerce
- Data sets
  - Order table
  - Item table
- Workloads
  - Offline analytics

E-commerce: Workloads
- Select query: find the items whose sales amount is over 100 in a single order.
- Aggregation query: count the sales number of each goods.
- Join query: count the number of each goods that each buyer purchased within a certain period of time.
- Recommendation: predict the preferences of buyers and recommend goods to them.
- Sensitive classification: identify positive or negative reviews.
- Basic data operation: unit operations on the data.

The select, aggregation, and join workloads are similar to the queries used in A. Pavlo's SIGMOD '09 paper, but are specified by BigDataBench for the e-commerce environment.

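The select, aggregation, and join queries can be sketched against an in-memory SQLite database. The table and column names below (`orders`, `items`, `goods_number`, `goods_amount`, and so on) and the sample rows are assumptions made for illustration; the slides name only an Order table and an Item table.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE orders (order_id INTEGER PRIMARY KEY, buyer_id INTEGER, create_date TEXT);
CREATE TABLE items (item_id INTEGER PRIMARY KEY, order_id INTEGER, goods_id INTEGER,
                    goods_number INTEGER, goods_amount REAL);
INSERT INTO orders VALUES (1, 100, '2014-01-05'), (2, 101, '2014-02-10'), (3, 100, '2014-03-15');
INSERT INTO items VALUES
  (1, 1, 7, 2, 150.0),
  (2, 1, 8, 1, 40.0),
  (3, 2, 7, 1, 75.0),
  (4, 3, 8, 3, 120.0);
""")

# Select query: items whose sales amount in a single order is over 100.
select_rows = conn.execute(
    "SELECT item_id, goods_id, goods_amount FROM items "
    "WHERE goods_amount > 100 ORDER BY item_id"
).fetchall()

# Aggregation query: total number sold of each goods.
agg_rows = conn.execute(
    "SELECT goods_id, SUM(goods_number) FROM items "
    "GROUP BY goods_id ORDER BY goods_id"
).fetchall()

# Join query: number of each goods bought by each buyer within a date range.
join_rows = conn.execute(
    """SELECT o.buyer_id, i.goods_id, SUM(i.goods_number)
       FROM orders AS o JOIN items AS i ON o.order_id = i.order_id
       WHERE o.create_date BETWEEN ? AND ?
       GROUP BY o.buyer_id, i.goods_id
       ORDER BY o.buyer_id, i.goods_id""",
    ("2014-01-01", "2014-02-28"),
).fetchall()
```

Dates are stored as ISO-8601 text so that `BETWEEN` compares them correctly as strings.
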
Multimedia
[Figure: multimedia processing pipeline. Labels: Voice Data Extraction, Speech Recognition, Video Data, MPEG Decoder, Frame Data Extraction, Feature Extraction, Image Segmentation, Face Detection, Three-Dimensional Reconstruction, Tracing.]

Multimedia: Workloads
- MPEG decoder: decode video streams using the MPEG-2 standard.
- Feature extraction: for a given video frame, extract features that are invariant to scale, noise, and illumination.
- Speech recognition: for a given audio file, recognize the content of the file and determine whether it contains sensitive words.

Multimedia: Workloads (continued)
- Ray tracing: render a 2-dimensional video frame to a 3-dimensional scene.
- Image segmentation: segment the input video frame according to color, intensity, and texture, and extract the regions of interest.
- Face detection: detect whether a face exists in the input data and, if so, extract the face.
- Deep learning: classify the input images into different categories, and then detect human faces.

Bioinformatics
- Sequence assembly: assemble scattered and repetitive DNA fragments into the original long sequence.
- Sequence alignment: align the assembled DNA sequence to known sequences in the database, and detect disease.
[Figure: bioinformatics pipeline. Labels: Gene Sequencing, Genome Sequence Data, Sequence Assembly, Sequence Mapping, Sequence Alignment, Detection Result.]

Summary: Real data sets

Summary: Search Engine (various implementations)

Summary: Social network

Summary: E-commerce

Summary: Multimedia

Summary: Bioinformatics

Any questions?