Computational Linguistics and Learning from Big Data. Gabriel Doyle UCSD Linguistics
|
|
- Lora Heath
- 8 years ago
- Views:
Transcription
1 Computational Linguistics and Learning from Big Data Gabriel Doyle UCSD Linguistics
2 From not enough data to too much Finding people: 90s, 700 datapoints, 7 years People finding you: 00s, datapoints, 3 years People just talking: 10s, datapoints, 5 days
3 Big data Benefits Problems Cheap to collect Unsolicited Huge size Covers rare events Little control Noisy data Difficult to analyze
4 Need for intelligent analysis Big data is too big to analyze dumbly no one can read millions of tweets Analysis needed to establish relevance are they talking about what we re interested in? meaning what are they saying about it? use what does it mean to us?
5 Structured & Unstructured Data Surveys, focus groups, questionnaires, etc. yield structured data we know what we re asking we force the respondents to fit that structure Imposing structure is costly can only get answers to the questions we ask respondents can t tell us what they might think need to design & implement the structure
6 Structured & Unstructured Data The internet / social media / devices provide unstructured data People tell us what they want to say, not what we want to know Modern computational linguistic analyses can bridge the gap between our interests fewer constraints on data coming in low cost to speaker, medium cost to analyst
7 The dangers of simplistic analysis Don t want ads for cutlery on a story about a stabbing Eastland Mall in Pittsburgh s closed BUT Eastland Mall in Bloomington isn t I m not happy the food was expensive vs. I m happy the food was not expensive
8 Computational approaches Word-sense disambiguation Named-entity recognition Automated parsing Sentiment analysis Information extraction Topic modeling what are people talking about? what are people saying about it? putting it together
9 Word-sense disambiguation Language is ambiguous what does mean mean? Distinguish between multiple meanings of a word going to the park vs. will park my car connotations: chintzy cheap vs. frugal cheap can be done with supervision (e.g., WordNet) or unsupervised
10 Named-entity recognition Identifying names of people & things finding out what people are talking about Identifies & connects information about an object central to information extraction Can be tied to other modalities identifying people in photos from captions Berg et al 2004
11 Cross-modal named-entities
12 Named-entity recognition
13 Named-entity resources ANNIE, Stanford NER excellent performance on edited newsprint [90%+] poor performance on tweets & social media [40-70%] Derczynski & Bontcheva 2014 increased noise-tolerance, post-editing improves performance to 84% on tweets
14 Automated parsing Extracting the structure of a sentence
15 Automated parsing Core step for getting specific semantic information Structure of a sentence has a huge effect on meaning I m not happy the food was expensive I m happy the food was not expensive Existing parsers are really good, as long as the text isn t too bad
16 Sentiment analysis Basic idea: what emotion is being expressed here? who has the emotion? what s the emotion directed at? what reason is offered? Learning: train with known data and then extend to unknown e.g., given a set of reviews, what features do the good/bad have?
17 Sentiment analysis + parsing Socher et al 2013: sentiment percolates up a parse tree This movie doesn t care about [anything good]
18 Topic models Want to bundle documents/words into groups covering similar topics (Blei, Ng, & Jordan 03) Intuition: Words appearing in the same document are more likely to be related Documents built by choosing topics then choosing words from topics Topic model infers the topics per document & words per topic
19 Buying a computer Computers: 45% computer: 23% internet: 14% laptop: 12% Shopping: 13% store: 20% buy: 19% price: 11% Research: 19% When it came time to upgrade our computer, when I had to figure out the meanings of solidstate drives and quad-cores, I headed to the Internet to do my research, finding the right stores and the right sites to answer my questions
20 Topic models Good for general semantic classification grouping news stories, blog posts, etc. categorizing documents into known classes Many extensions, not just text timeseries data, author recognition connecting text to images (Costa Pereira et al 13) financial data (Doyle & Elkan 09) Pompeiian households (Mimno 09)
21 Information extraction Produces a structured representation of information ( knowledge base ) human-readable or machine-readable information as relations between entities throw(quarterback,pass) within- or across-document learning
22 IE example: learning football Hovy et al 2011: Unsupervised Discovery of Domain-Specific Knowledge from Text The last time the Detroit Lions won a game in the Metrodome, Scott Mitchell threw a touchdown pass to Herman Moore throw(scottmitchell,touchdown,hermanmoore) is.a(scottmitchell,quarterback) is.a(hermanmoore,widereceiver) throw(qb,touchdown,wr) Big, young, talented and inexperienced, Scott Mitchell, the former backup quarterback for the Miami Dolphins, was in prime position to profit Lions wide receiver Herman Moore reflects on the Detroit-Chicago rivalry
23 IE example: learning football Parse input using automated parser Use parse + named entities to build semantic structure Use multiple levels of semantic representation to identify general rules Learn on 33,000 New York Times articles 95% sensible propositions extracted
24 Overview Big data demands intelligent analysis methods are out there already plus new ones all the time Think through the problem you want to solve what data sources do you have? what information would you ask for if you could? what structure do you want to impose? which method(s) yield that structure?
25 Computational methods summary Automated parsing basic step in structuring natural language data won t fail, will buy vs. will fail, won t buy key to extracting specific information Word-sense disambiguation basic step for assessing what s being discussed toilet tank vs. military tank makes sure you re looking at relevant data
26 Computational methods summary Sentiment analysis general emotional assessment automatic ratings, user triage noisy due to irony, sarcasm, etc. Named-entity recognition figuring out the lexicon what do people talk about? building knowledge of things
27 Computational methods summary Topic models document-level semantic classification overall gist of an article good for multimedia linkages Information Extraction specific semantic structures Who s doing what to whom? establishing rules & knowledge
28 Overall summary Computational methods exist to structure large-scale unstructured data Identify what structure you want to get out find the class of methods that develop such structure combine multiple methods if necessary Test extensively! lots of noise in unstructured data
29 Starting-Point References NER: Derczynski & Bontcheva 2014, Passive-Aggressive Sequence Labeling with Discriminative Post-Editing for Recognizing Person Entities in Tweets NER/MM: Berg, Berg, Edwards, & Forsyth 2004, Who s in the Picture? Sentiment: Socher, Bauer, Manning, & Ng 2013, Parsing with Compositional Vector Grammars IE: Hovy, Zhang, Hovy, & Peñas 2011, Unsupervised Discovery of Domain-Specific Knowledge from Text
Machine Learning for Data Science (CS4786) Lecture 1
Machine Learning for Data Science (CS4786) Lecture 1 Tu-Th 10:10 to 11:25 AM Hollister B14 Instructors : Lillian Lee and Karthik Sridharan ROUGH DETAILS ABOUT THE COURSE Diagnostic assignment 0 is out:
More informationIntroduction. A. Bellaachia Page: 1
Introduction 1. Objectives... 3 2. What is Data Mining?... 4 3. Knowledge Discovery Process... 5 4. KD Process Example... 7 5. Typical Data Mining Architecture... 8 6. Database vs. Data Mining... 9 7.
More informationSentiment Analysis. D. Skrepetos 1. University of Waterloo. NLP Presenation, 06/17/2015
Sentiment Analysis D. Skrepetos 1 1 Department of Computer Science University of Waterloo NLP Presenation, 06/17/2015 D. Skrepetos (University of Waterloo) Sentiment Analysis NLP Presenation, 06/17/2015
More informationThe Truth About Sentiment & Natural Language Processing
The Truth About Sentiment & Natural Language Processing By Synthesio Summary Introduction.2 Artificial Intelligence s difficulties with sentiment.3 Human analysis is an obligatory step when analyzing web
More informationAutomatic Speech Recognition and Hybrid Machine Translation for High-Quality Closed-Captioning and Subtitling for Video Broadcast
Automatic Speech Recognition and Hybrid Machine Translation for High-Quality Closed-Captioning and Subtitling for Video Broadcast Hassan Sawaf Science Applications International Corporation (SAIC) 7990
More informationText Mining - Scope and Applications
Journal of Computer Science and Applications. ISSN 2231-1270 Volume 5, Number 2 (2013), pp. 51-55 International Research Publication House http://www.irphouse.com Text Mining - Scope and Applications Miss
More informationIdentifying Focus, Techniques and Domain of Scientific Papers
Identifying Focus, Techniques and Domain of Scientific Papers Sonal Gupta Department of Computer Science Stanford University Stanford, CA 94305 sonal@cs.stanford.edu Christopher D. Manning Department of
More informationWeb Mining. Margherita Berardi LACAM. Dipartimento di Informatica Università degli Studi di Bari berardi@di.uniba.it
Web Mining Margherita Berardi LACAM Dipartimento di Informatica Università degli Studi di Bari berardi@di.uniba.it Bari, 24 Aprile 2003 Overview Introduction Knowledge discovery from text (Web Content
More informationApplications of Deep Learning to the GEOINT mission. June 2015
Applications of Deep Learning to the GEOINT mission June 2015 Overview Motivation Deep Learning Recap GEOINT applications: Imagery exploitation OSINT exploitation Geospatial and activity based analytics
More informationIntroduction to Data Mining
Introduction to Data Mining 1 Why Data Mining? Explosive Growth of Data Data collection and data availability Automated data collection tools, Internet, smartphones, Major sources of abundant data Business:
More informationTop Notch Second Edition Level 3 Unit-by-Unit CEF Correlations
Top Notch Second Edition Level 3 Unit-by-Unit Correlations Full Course Correlation with International Standards and Exams LEVEL PTE ALTE UCLES IELTS TOEIC TOEFL (paper) TOEFL ibt Fundamentals A1 - - -
More informationLearning is a very general term denoting the way in which agents:
What is learning? Learning is a very general term denoting the way in which agents: Acquire and organize knowledge (by building, modifying and organizing internal representations of some external reality);
More informationDomain Adaptive Relation Extraction for Big Text Data Analytics. Feiyu Xu
Domain Adaptive Relation Extraction for Big Text Data Analytics Feiyu Xu Outline! Introduction to relation extraction and its applications! Motivation of domain adaptation in big text data analytics! Solutions!
More informationSI485i : NLP. Set 6 Sentiment and Opinions
SI485i : NLP Set 6 Sentiment and Opinions It's about finding out what people think... Can be big business Someone who wants to buy a camera Looks for reviews online Someone who just bought a camera Writes
More informationMARKETING AUTOMATION
MARKETING AUTOMATION Benchmarks for Small & Medium Businesses What marketing automation success will look like in a year ahead and how how small and medium businesses plan to achieve it. Ascend2 Research
More informationIntroduction to Pattern Recognition
Introduction to Pattern Recognition Selim Aksoy Department of Computer Engineering Bilkent University saksoy@cs.bilkent.edu.tr CS 551, Spring 2009 CS 551, Spring 2009 c 2009, Selim Aksoy (Bilkent University)
More informationHow To Understand The Value Of Big Data
Big Data Is Not Yet Another IT Project Krish Krishnan President, Sixth Sense Advisors Inc Bridge to Big Data Oct 23 rd 2012 Background Applications, OLTP Systems, Traditional Data Warehouse and Business
More informationBuilding a Question Classifier for a TREC-Style Question Answering System
Building a Question Classifier for a TREC-Style Question Answering System Richard May & Ari Steinberg Topic: Question Classification We define Question Classification (QC) here to be the task that, given
More informationA Comparative Study on Sentiment Classification and Ranking on Product Reviews
A Comparative Study on Sentiment Classification and Ranking on Product Reviews C.EMELDA Research Scholar, PG and Research Department of Computer Science, Nehru Memorial College, Putthanampatti, Bharathidasan
More informationText Analysis for Big Data. Magnus Sahlgren
Text Analysis for Big Data Magnus Sahlgren Data Size Style (editorial vs social) Language (there are other languages than English out there!) Data Size Style (editorial vs social) Language (there are
More informationInformation Management course
Università degli Studi di Milano Master Degree in Computer Science Information Management course Teacher: Alberto Ceselli Lecture 01 : 06/10/2015 Practical informations: Teacher: Alberto Ceselli (alberto.ceselli@unimi.it)
More informationTEXT ANALYTICS INTEGRATION
TEXT ANALYTICS INTEGRATION A TELECOMMUNICATIONS BEST PRACTICES CASE STUDY VISION COMMON ANALYTICAL ENVIRONMENT Structured Unstructured Analytical Mining Text Discovery Text Categorization Text Sentiment
More informationClustering Connectionist and Statistical Language Processing
Clustering Connectionist and Statistical Language Processing Frank Keller keller@coli.uni-sb.de Computerlinguistik Universität des Saarlandes Clustering p.1/21 Overview clustering vs. classification supervised
More informationText Analytics Beginner s Guide. Extracting Meaning from Unstructured Data
Text Analytics Beginner s Guide Extracting Meaning from Unstructured Data Contents Text Analytics 3 Use Cases 7 Terms 9 Trends 14 Scenario 15 Resources 24 2 2013 Angoss Software Corporation. All rights
More informationSemi-Supervised Learning for Blog Classification
Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence (2008) Semi-Supervised Learning for Blog Classification Daisuke Ikeda Department of Computational Intelligence and Systems Science,
More informationSearch and Information Retrieval
Search and Information Retrieval Search on the Web 1 is a daily activity for many people throughout the world Search and communication are most popular uses of the computer Applications involving search
More informationSentiment analysis on news articles using Natural Language Processing and Machine Learning Approach.
Sentiment analysis on news articles using Natural Language Processing and Machine Learning Approach. Pranali Chilekar 1, Swati Ubale 2, Pragati Sonkambale 3, Reema Panarkar 4, Gopal Upadhye 5 1 2 3 4 5
More informationAutomatic Knowledge Base Construction Systems. Dr. Daisy Zhe Wang CISE Department University of Florida September 3th 2014
Automatic Knowledge Base Construction Systems Dr. Daisy Zhe Wang CISE Department University of Florida September 3th 2014 1 Text Contains Knowledge 2 Text Contains Automatically Extractable Knowledge 3
More informationUsing Artificial Intelligence to Manage Big Data for Litigation
FEBRUARY 3 5, 2015 / THE HILTON NEW YORK Using Artificial Intelligence to Manage Big Data for Litigation Understanding Artificial Intelligence to Make better decisions Improve the process Allay the fear
More informationData Mining on Social Networks. Dionysios Sotiropoulos Ph.D.
Data Mining on Social Networks Dionysios Sotiropoulos Ph.D. 1 Contents What are Social Media? Mathematical Representation of Social Networks Fundamental Data Mining Concepts Data Mining Tasks on Digital
More informationMARKETING AUTOMATION
MARKETING AUTOMATION Benchmarks for small & medium businesses What marketing automation success will look like in a year ahead and how how small and medium businesses plan to achieve it. Ascend2 Research
More informationNAVIGATING SCIENTIFIC LITERATURE A HOLISTIC PERSPECTIVE. Venu Govindaraju
NAVIGATING SCIENTIFIC LITERATURE A HOLISTIC PERSPECTIVE Venu Govindaraju BIOMETRICS DOCUMENT ANALYSIS PATTERN RECOGNITION 8/24/2015 ICDAR- 2015 2 Towards a Globally Optimal Approach for Learning Deep Unsupervised
More informationCustomer Journey Mapping for B2B Success
Customer Journey Mapping for B2B Success GE Power & Water Rama V. Mahajanam September 15th, 2015 Imagination at work. Why Customer Experience? There is only one boss. The customer. And he can fire everybody
More informationNew Frontiers of Automated Content Analysis in the Social Sciences
Symposium on the New Frontiers of Automated Content Analysis in the Social Sciences University of Zurich July 1-3, 2015 www.aca-zurich-2015.org Abstract Automated Content Analysis (ACA) is one of the key
More informationUsing LSI for Implementing Document Management Systems Turning unstructured data from a liability to an asset.
White Paper Using LSI for Implementing Document Management Systems Turning unstructured data from a liability to an asset. Using LSI for Implementing Document Management Systems By Mike Harrison, Director,
More informationFinancial Trading System using Combination of Textual and Numerical Data
Financial Trading System using Combination of Textual and Numerical Data Shital N. Dange Computer Science Department, Walchand Institute of Rajesh V. Argiddi Assistant Prof. Computer Science Department,
More informationThe Data Mining Process
Sequence for Determining Necessary Data. Wrong: Catalog everything you have, and decide what data is important. Right: Work backward from the solution, define the problem explicitly, and map out the data
More informationMachine Learning and Data Mining. Fundamentals, robotics, recognition
Machine Learning and Data Mining Fundamentals, robotics, recognition Machine Learning, Data Mining, Knowledge Discovery in Data Bases Their mutual relations Data Mining, Knowledge Discovery in Databases,
More informationA Survey on Product Aspect Ranking
A Survey on Product Aspect Ranking Charushila Patil 1, Prof. P. M. Chawan 2, Priyamvada Chauhan 3, Sonali Wankhede 4 M. Tech Student, Department of Computer Engineering and IT, VJTI College, Mumbai, Maharashtra,
More informationONLINE RESUME PARSING SYSTEM USING TEXT ANALYTICS
ONLINE RESUME PARSING SYSTEM USING TEXT ANALYTICS Divyanshu Chandola 1, Aditya Garg 2, Ankit Maurya 3, Amit Kushwaha 4 1 Student, Department of Information Technology, ABES Engineering College, Uttar Pradesh,
More informationIntelligent Search for Answering Clinical Questions Coronado Group, Ltd. Innovation Initiatives
Intelligent Search for Answering Clinical Questions Coronado Group, Ltd. Innovation Initiatives Search The Way You Think Copyright 2009 Coronado, Ltd. All rights reserved. All other product names and logos
More informationPackage syuzhet. February 22, 2015
Type Package Package syuzhet February 22, 2015 Title Extracts Sentiment and Sentiment-Derived Plot Arcs from Text Version 0.2.0 Date 2015-01-20 Maintainer Matthew Jockers Extracts
More informationTIETS34 Seminar: Data Mining on Biometric identification
TIETS34 Seminar: Data Mining on Biometric identification Youming Zhang Computer Science, School of Information Sciences, 33014 University of Tampere, Finland Youming.Zhang@uta.fi Course Description Content
More informationClassification of Virtual Investing-Related Community Postings
Classification of Virtual Investing-Related Community Postings Balaji Rajagopalan * Oakland University rajagopa@oakland.edu Matthew Wimble Oakland University mwwimble@oakland.edu Prabhudev Konana * University
More informationShould HR Care About Big Data? E D COHEN, L E A R NING I NDUSTRY CONSULTA NT
Should HR Care About Big Data? E D COHEN, L E A R NING I NDUSTRY CONSULTA NT Tweeting? Please mention: #BigData (800) 263-6317 or (805) 690-5753 For More Info / To Register / To Access Archive: Search
More informationOutline of today s lecture
Outline of today s lecture Generative grammar Simple context free grammars Probabilistic CFGs Formalism power requirements Parsing Modelling syntactic structure of phrases and sentences. Why is it useful?
More informationThe biggest risk to your company is not being able to change fast enough Business Rules are the answer. Ron Ross
The Business Rules Approach 1 of 7 by David Wright The biggest risk to your company is not being able to change fast enough Business Rules are the answer. Ron Ross I am a great appreciator of Mr. Ross.
More informationPredicting stocks returns correlations based on unstructured data sources
Predicting stocks returns correlations based on unstructured data sources Mateusz Radzimski, José Luis Sánchez-Cervantes, José Luis López Cuadrado, Ángel García-Crespo Departamento de Informática Universidad
More informationMachine Learning using MapReduce
Machine Learning using MapReduce What is Machine Learning Machine learning is a subfield of artificial intelligence concerned with techniques that allow computers to improve their outputs based on previous
More informationDoctoral Consortium 2013 Dept. Lenguajes y Sistemas Informáticos UNED
Doctoral Consortium 2013 Dept. Lenguajes y Sistemas Informáticos UNED 17 19 June 2013 Monday 17 June Salón de Actos, Facultad de Psicología, UNED 15.00-16.30: Invited talk Eneko Agirre (Euskal Herriko
More informationDEMYSTIFYING BIG DATA. What it is, what it isn t, and what it can do for you.
DEMYSTIFYING BIG DATA What it is, what it isn t, and what it can do for you. JAMES LUCK BIO James Luck is a Data Scientist with AT&T Consulting. He has 25+ years of experience in data analytics, in addition
More informationMachine Learning CS 6830. Lecture 01. Razvan C. Bunescu School of Electrical Engineering and Computer Science bunescu@ohio.edu
Machine Learning CS 6830 Razvan C. Bunescu School of Electrical Engineering and Computer Science bunescu@ohio.edu What is Learning? Merriam-Webster: learn = to acquire knowledge, understanding, or skill
More informationSpeakout Pre-Intermediate
Speakout Pre-Intermediate Lead in: Review: Classroom language Spelling Parts of Speech Tenses and structures Question words Auxiliary verbs Vocabulary Unit 1 Life Language Question forms Past Simple A2
More informationChallenges of Cloud Scale Natural Language Processing
Challenges of Cloud Scale Natural Language Processing Mark Dredze Johns Hopkins University My Interests? Information Expressed in Human Language Machine Learning Natural Language Processing Intelligent
More informationSentiment analysis on tweets in a financial domain
Sentiment analysis on tweets in a financial domain Jasmina Smailović 1,2, Miha Grčar 1, Martin Žnidaršič 1 1 Dept of Knowledge Technologies, Jožef Stefan Institute, Ljubljana, Slovenia 2 Jožef Stefan International
More informationWhy Semantic Analysis is Better than Sentiment Analysis. A White Paper by T.R. Fitz-Gibbon, Chief Scientist, Networked Insights
Why Semantic Analysis is Better than Sentiment Analysis A White Paper by T.R. Fitz-Gibbon, Chief Scientist, Networked Insights Why semantic analysis is better than sentiment analysis I like it, I don t
More informationCORRALLING THE WILD, WILD WEST OF SOCIAL MEDIA INTELLIGENCE
CORRALLING THE WILD, WILD WEST OF SOCIAL MEDIA INTELLIGENCE Michael Diederich, Microsoft CMG Research & Insights Introduction The rise of social media platforms like Facebook and Twitter has created new
More informationBig Data and Open Data
Big Data and Open Data Bebo White SLAC National Accelerator Laboratory/ Stanford University!! bebo@slac.stanford.edu dekabytes hectobytes Big Data IS a buzzword! The Data Deluge From the beginning of
More informationEmployee Survey Analysis
Employee Survey Analysis Josh Froelich, Megaputer Intelligence Sergei Ananyan, Megaputer Intelligence www.megaputer.com Megaputer Intelligence, Inc. 120 West Seventh Street, Suite 310 Bloomington, IN 47404
More informationBI and the Unstructured Data Challenge
BI and the Unstructured Data Challenge Seth Grimes Alta Plana Corporation 301-270-0795 -- http://altaplana.com Washington DC chapter May 9, 2008 2 Introduction Seth Grimes Principal Consultant with Alta
More informationSENTIMENT ANALYSIS: TEXT PRE-PROCESSING, READER VIEWS AND CROSS DOMAINS EMMA HADDI BRUNEL UNIVERSITY LONDON
BRUNEL UNIVERSITY LONDON COLLEGE OF ENGINEERING, DESIGN AND PHYSICAL SCIENCES DEPARTMENT OF COMPUTER SCIENCE DOCTOR OF PHILOSOPHY DISSERTATION SENTIMENT ANALYSIS: TEXT PRE-PROCESSING, READER VIEWS AND
More informationACEDS Membership Benefits Training, Resources and Networking for the E-Discovery Community
ACEDS Membership Benefits Training, Resources and Networking for the E-Discovery Community! Exclusive News and Analysis! Weekly Web Seminars! Podcasts! On- Demand Training! Networking! Resources! Jobs
More informationData Warehousing and Data Mining
Data Warehousing and Data Mining Winter Semester 2010/2011 Free University of Bozen, Bolzano DW Lecturer: Johann Gamper gamper@inf.unibz.it DM Lecturer: Mouna Kacimi mouna.kacimi@unibz.it http://www.inf.unibz.it/dis/teaching/dwdm/index.html
More informationChapter 6 - Enhancing Business Intelligence Using Information Systems
Chapter 6 - Enhancing Business Intelligence Using Information Systems Managers need high-quality and timely information to support decision making Copyright 2014 Pearson Education, Inc. 1 Chapter 6 Learning
More informationTwitter sentiment vs. Stock price!
Twitter sentiment vs. Stock price! Background! On April 24 th 2013, the Twitter account belonging to Associated Press was hacked. Fake posts about the Whitehouse being bombed and the President being injured
More informationANALYTICS IN BIG DATA ERA
ANALYTICS IN BIG DATA ERA ANALYTICS TECHNOLOGY AND ARCHITECTURE TO MANAGE VELOCITY AND VARIETY, DISCOVER RELATIONSHIPS AND CLASSIFY HUGE AMOUNT OF DATA MAURIZIO SALUSTI SAS Copyr i g ht 2012, SAS Ins titut
More informationTechnology & Applications. Three Technology Must-Haves to Improve Sales Effectiveness and Boost Win Rates
Technology & Applications Three Technology Must-Haves to Improve Sales Effectiveness and Boost Win Rates Executive Summary To drive sales excellence, sales professionals need to monitor their objectives,
More informationThe multilayer sentiment analysis model based on Random forest Wei Liu1, Jie Zhang2
2nd International Conference on Advances in Mechanical Engineering and Industrial Informatics (AMEII 2016) The multilayer sentiment analysis model based on Random forest Wei Liu1, Jie Zhang2 1 School of
More informationData Mining Yelp Data - Predicting rating stars from review text
Data Mining Yelp Data - Predicting rating stars from review text Rakesh Chada Stony Brook University rchada@cs.stonybrook.edu Chetan Naik Stony Brook University cnaik@cs.stonybrook.edu ABSTRACT The majority
More informationSENTIMENT ANALYSIS: A STUDY ON PRODUCT FEATURES
University of Nebraska - Lincoln DigitalCommons@University of Nebraska - Lincoln Dissertations and Theses from the College of Business Administration Business Administration, College of 4-1-2012 SENTIMENT
More informationSurvey Results: Requirements and Use Cases for Linguistic Linked Data
Survey Results: Requirements and Use Cases for Linguistic Linked Data 1 Introduction This survey was conducted by the FP7 Project LIDER (http://www.lider-project.eu/) as input into the W3C Community Group
More informationSearch Engine Optimization:
Search Engine Optimization: Sure Fire SEO Strategies That Will Get You Ahead PEPPERGANG Digital Media With Spice Table of Contents Executive Summary......3 Earn Links Naturally............4. Badge Program................4
More informationText Analytics with Ambiverse. Text to Knowledge. www.ambiverse.com
Text Analytics with Ambiverse Text to Knowledge www.ambiverse.com Version 1.0, February 2016 WWW.AMBIVERSE.COM Contents 1 Ambiverse: Text to Knowledge............................... 5 1.1 Text is all Around
More informationMachine Learning: Overview
Machine Learning: Overview Why Learning? Learning is a core of property of being intelligent. Hence Machine learning is a core subarea of Artificial Intelligence. There is a need for programs to behave
More informationEffective Self-Training for Parsing
Effective Self-Training for Parsing David McClosky dmcc@cs.brown.edu Brown Laboratory for Linguistic Information Processing (BLLIP) Joint work with Eugene Charniak and Mark Johnson David McClosky - dmcc@cs.brown.edu
More informationData are everywhere. IBM projects that every day we generate 2.5 quintillion bytes of data. In relative terms, this means 90
FREE echapter C H A P T E R1 Big Data and Analytics Data are everywhere. IBM projects that every day we generate 2.5 quintillion bytes of data. In relative terms, this means 90 percent of the data in the
More informationWhy language is hard. And what Linguistics has to say about it. Natalia Silveira Participation code: eagles
Why language is hard And what Linguistics has to say about it Natalia Silveira Participation code: eagles Christopher Natalia Silveira Manning Language processing is so easy for humans that it is like
More informationReal World Application and Usage of IBM Advanced Analytics Technology
Real World Application and Usage of IBM Advanced Analytics Technology Anthony J. Young Pre-Sales Architect for IBM Advanced Analytics February 21, 2014 Welcome Anthony J. Young Lives in Austin, TX Focused
More informationSome Research Challenges for Big Data Analytics of Intelligent Security
Some Research Challenges for Big Data Analytics of Intelligent Security Yuh-Jong Hu hu at cs.nccu.edu.tw Emerging Network Technology (ENT) Lab. Department of Computer Science National Chengchi University,
More information2014/02/13 Sphinx Lunch
2014/02/13 Sphinx Lunch Best Student Paper Award @ 2013 IEEE Workshop on Automatic Speech Recognition and Understanding Dec. 9-12, 2013 Unsupervised Induction and Filling of Semantic Slot for Spoken Dialogue
More informationMovie Classification Using k-means and Hierarchical Clustering
Movie Classification Using k-means and Hierarchical Clustering An analysis of clustering algorithms on movie scripts Dharak Shah DA-IICT, Gandhinagar Gujarat, India dharak_shah@daiict.ac.in Saheb Motiani
More informationOnce you have clearly defined your ideal client, use these practical applications for your business web presence:
Step #1 Define Your Ideal Client Step #1 Define Your Ideal Client In today s online environment, having just a web site doesn t usually cut it. As a business owner, your ultimate goal should be to build
More informationPredictive Analytics Techniques: What to Use For Your Big Data. March 26, 2014 Fern Halper, PhD
Predictive Analytics Techniques: What to Use For Your Big Data March 26, 2014 Fern Halper, PhD Presenter Proven Performance Since 1995 TDWI helps business and IT professionals gain insight about data warehousing,
More informationPrediction of Stock Market Shift using Sentiment Analysis of Twitter Feeds, Clustering and Ranking
382 Prediction of Stock Market Shift using Sentiment Analysis of Twitter Feeds, Clustering and Ranking 1 Tejas Sathe, 2 Siddhartha Gupta, 3 Shreya Nair, 4 Sukhada Bhingarkar 1,2,3,4 Dept. of Computer Engineering
More informationMachine Learning Log File Analysis
Machine Learning Log File Analysis Research Proposal Kieran Matherson ID: 1154908 Supervisor: Richard Nelson 13 March, 2015 Abstract The need for analysis of systems log files is increasing as systems
More informationFine-grained German Sentiment Analysis on Social Media
Fine-grained German Sentiment Analysis on Social Media Saeedeh Momtazi Information Systems Hasso-Plattner-Institut Potsdam University, Germany Saeedeh.momtazi@hpi.uni-potsdam.de Abstract Expressing opinions
More informationThe Evolution, Uses, and Case Studies of Technology Assisted Review
FEBRUARY 4 6, 2014 / THE HILTON NEW YORK The Evolution, Uses, and Case Studies of Technology Assisted Review One Size Does Not Fit All #LTNY Meet Our Panelists The Honorable Dave Waxse U.S. Magistrate
More informationEHR CURATION FOR MEDICAL MINING
EHR CURATION FOR MEDICAL MINING Ernestina Menasalvas Medical Mining Tutorial@KDD 2015 Sydney, AUSTRALIA 2 Ernestina Menasalvas "EHR Curation for Medical Mining" 08/2015 Agenda Motivation the potential
More informationPUSH INTELLIGENCE. Bridging the Last Mile to Business Intelligence & Big Data. 2013 Copyright Metric Insights, Inc.
PUSH INTELLIGENCE Bridging the Last Mile to Business Intelligence & Big Data 2013 Copyright Metric Insights, Inc. INTRODUCTION... 3 CHALLENGES WITH BI... 4 The Dashboard Dilemma... 4 Architectural Limitations
More informationRobust Sentiment Detection on Twitter from Biased and Noisy Data
Robust Sentiment Detection on Twitter from Biased and Noisy Data Luciano Barbosa AT&T Labs - Research lbarbosa@research.att.com Junlan Feng AT&T Labs - Research junlan@research.att.com Abstract In this
More informationData Mining Part 5. Prediction
Data Mining Part 5. Prediction 5.1 Spring 2010 Instructor: Dr. Masoud Yaghini Outline Classification vs. Numeric Prediction Prediction Process Data Preparation Comparing Prediction Methods References Classification
More information01219211 Software Development Training Camp 1 (0-3) Prerequisite : 01204214 Program development skill enhancement camp, at least 48 person-hours.
(International Program) 01219141 Object-Oriented Modeling and Programming 3 (3-0) Object concepts, object-oriented design and analysis, object-oriented analysis relating to developing conceptual models
More informationReal-Time Analytics: Integrating Social Media Insights with Traditional Data
SAP Brief SAP Rapid Deployment s SAP HANA Sentiment Intelligence Rapid-Deployment Objectives Real-Time Analytics: Integrating Social Media Insights with Traditional Data Capturing customer sentiment from
More informationVeracity of data. New approaches are emerging to account for uncertainty in data at a giant scale. 2013 IBM Corporation
Veracity of data 1. The degree to which data is accurate, reliable, certain 2. An emerging platform for organizing, understanding and deriving value from big data Introduction Financial decisions require
More informationParticular Requirements on Opinion Mining for the Insurance Business
Particular Requirements on Opinion Mining for the Insurance Business Sven Rill, Johannes Drescher, Dirk Reinel, Jörg Scheidt, Florian Wogenstein Institute of Information Systems (iisys) University of Applied
More informationSpatio-Temporal Patterns of Passengers Interests at London Tube Stations
Spatio-Temporal Patterns of Passengers Interests at London Tube Stations Juntao Lai *1, Tao Cheng 1, Guy Lansley 2 1 SpaceTimeLab for Big Data Analytics, Department of Civil, Environmental &Geomatic Engineering,
More informationWhy are Organizations Interested?
SAS Text Analytics Mary-Elizabeth ( M-E ) Eddlestone SAS Customer Loyalty M-E.Eddlestone@sas.com +1 (607) 256-7929 Why are Organizations Interested? Text Analytics 2009: User Perspectives on Solutions
More informationSPECIFICATION BY EXAMPLE. Gojko Adzic. How successful teams deliver the right software. MANNING Shelter Island
SPECIFICATION BY EXAMPLE How successful teams deliver the right software Gojko Adzic MANNING Shelter Island Brief Contents 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 Preface xiii Acknowledgments xxii
More informationCombining Social Data and Semantic Content Analysis for L Aquila Social Urban Network
I-CiTies 2015 2015 CINI Annual Workshop on ICT for Smart Cities and Communities Palermo (Italy) - October 29-30, 2015 Combining Social Data and Semantic Content Analysis for L Aquila Social Urban Network
More informationData Mining and Knowledge Discovery in Databases (KDD) State of the Art. Prof. Dr. T. Nouri Computer Science Department FHNW Switzerland
Data Mining and Knowledge Discovery in Databases (KDD) State of the Art Prof. Dr. T. Nouri Computer Science Department FHNW Switzerland 1 Conference overview 1. Overview of KDD and data mining 2. Data
More information