Introduction to Text Mining and Semantics. Seth Grimes -- President, Alta Plana
|
|
|
- Gerard Pope
- 10 years ago
- Views:
Transcription
1 Introduction to Text Mining and Semantics Seth Grimes -- President, Alta Plana
2 New York Times October 9, 1958
3 Text expresses a vast, rich range of information, but encodes this information in a form that is difficult to decipher automatically. -- Marti A. Hearst, Untangling Text Data Mining, 1999
4 Document input and processing Information extraction Knowledge management Hans Peter Luhn, A Business Intelligence System, IBM Journal, October 1958
5 Statistical information derived from word frequency and distribution is used by the machine to compute a relative measure of significance. Hans Peter Luhn, The Automatic Creation of Literature Abstracts, IBM Journal, April 1958
6 This rather unsophisticated argument on significance avoids such linguistic implications as grammar and syntax... No attention is paid to the logical and semantic relationships the author has established. -- Hans Peter Luhn, 1958
7 Miranda: O wonder! How many goodly creatures are there here! How beauteous mankind is! O brave new world, That has such people in't! Prospero: 'Tis new to thee. The shipwreck in The Tempest, Act I, Scene 1, in an 1797 engraving based on a painting by George Romney.
8 New York Times, September 8, 1957 Anaphora / coreference: They
9 Kind = type, variety, not a sentiment. External reference Unfiltered duplicates
10 The broadcast, media, and entertainment industries garner about 4% of the world s revenues but already generate, manage, or otherwise oversee 50% of the digital universe. Approximately 70% of the digital universe is created by individuals. The Diverse and Exploding Digital Universe, (IDC, 2008)
11 Web sites, news & journal articles, images, video. Blogs, forum postings, and social media. , Contact-center notes and transcripts; recorded conversation. Surveys, feedback forms, warranty & insurance claims. Office documents, regulatory filings, reports, scientific papers. And every other sort of document imaginable. Is Search up to the job?
12 How are the quality, value & authority of search results? Hotel s opinion Guest s opinion about Priceline Who profits from search?
13 How can we do better? We have many of the tools in place -- from Web 2.0 technologies The Diverse and Exploding Digital Universe, (IDC, 2008)
14 Web 2.0 is the business revolution in the computer industry caused by the move to the Internet as a platform. -- Tim O Reilly, 2004 [A] move from personal websites to blogs and blog site aggregation, from publishing to participation, an ongoing and interactive process... to links based on tagging. -- Terry Flew, New Media: An Introduction, 2008 Web 2.0 is dynamic, personalized, interactive, collaborative.
15 How can we do better? We have many of the tools in place -- from Web 2.0 technologies to unstructured data search software and the Semantic Web -- to tame the digital universe. Done right, we can turn information growth into economic growth. The Diverse and Exploding Digital Universe, (IDC, 2008)
16 Text mining enables smarter search that better responds to user goals, e.g., answers
17 For even better findability: The Semantic Web is a web of data, in some ways like a global database. -- Tim Berners-Lee, 1998 Web 3.0 is Web the Semantic Web + semantic tools. Recurring themes: Semantically enriched content. Linked Data. Context sensitive. Location aware.
18
19 Text mining / analytics enables Web 3.0 and the Semantic Web. Automated content categorization and classification. Text augmentation: metadata generation, content tagging. Information extraction to databases. Exploratory analysis and visualization. Technical concepts: Linked Data Microformats, RDF, SPARQL OWL
20 I recently published a study report, Text Analytics 2009: User Perspectives on Solutions and Providers. I estimated a $350 million global market in 2008, up 40% from I relayed findings from a survey that asked
21 What are your primary applications where text comes into play? Brand / product / reputation management Competitive intelligence Voice of the Customer / Customer Experience Research (not listed) Customer service Content management or publishing Life sciences or clinical medicine Insurance, risk management, or fraud Financial services E-discovery Product/service design, quality assurance, Other Compliance Law enforcement 8% 7% 22% 19% 18% 17% 15% 15% 14% 13% 40% 37% 33% 33% 0% 10% 20% 30% 40% 50%
22 What textual information are you analyzing or do you plan to analyze? Current users responded: blogs and other social media (twitter, social-network sites, etc.) 62% news articles 55% on-line forums 41% and correspondence 38% customer/market surveys 35%
23 Do you need (or expect to need) to extract or analyze: Other 15% Other entities phone numbers, & street addresses 40% Metadata such as document author, publication date, title, headers, etc. Events, relationships, and/or facts Concepts, that is, abstract groups of entities Sentiment, opinions, attitudes, emotions Topics and themes Named entities people, companies, geographic locations, brands, ticker 53% 55% 58% 60% 65% 71% 0% 10% 20% 30% 40% 50% 60% 70% 80%
24 Please rate your overall experience your satisfaction with text mining.
Text Analytics Beginner s Guide. Extracting Meaning from Unstructured Data
Text Analytics Beginner s Guide Extracting Meaning from Unstructured Data Contents Text Analytics 3 Use Cases 7 Terms 9 Trends 14 Scenario 15 Resources 24 2 2013 Angoss Software Corporation. All rights
Voice. listen, understand and respond. enherent. wish, choice, or opinion. openly or formally expressed. May 2010. - Merriam Webster. www.enherent.
Voice wish, choice, or opinion openly or formally expressed - Merriam Webster listen, understand and respond May 2010 2010 Corp. All rights reserved. www..com Overwhelming Dialog Consumers are leading
Text Mining - Scope and Applications
Journal of Computer Science and Applications. ISSN 2231-1270 Volume 5, Number 2 (2013), pp. 51-55 International Research Publication House http://www.irphouse.com Text Mining - Scope and Applications Miss
Best practices for evaluating and selecting content analytics tools
Best practices for evaluating and selecting content analytics tools Sponsored by IBM Speaker: Seth Grimes, a Business Intelligence and Decision Systems expert Moderated by Jonathan Gourlay, Site and News
Auto-Classification for Document Archiving and Records Declaration
Auto-Classification for Document Archiving and Records Declaration Josemina Magdalen, Architect, IBM November 15, 2013 Agenda IBM / ECM/ Content Classification for Document Archiving and Records Management
TEXT ANALYTICS INTEGRATION
TEXT ANALYTICS INTEGRATION A TELECOMMUNICATIONS BEST PRACTICES CASE STUDY VISION COMMON ANALYTICAL ENVIRONMENT Structured Unstructured Analytical Mining Text Discovery Text Categorization Text Sentiment
IBM Content Analytics with Enterprise Search, Version 3.0
IBM Content Analytics with Enterprise Search, Version 3.0 Highlights Enables greater accuracy and control over information with sophisticated natural language processing capabilities to deliver the right
WHITEPAPER. Text Analytics Beginner s Guide
WHITEPAPER Text Analytics Beginner s Guide What is Text Analytics? Text Analytics describes a set of linguistic, statistical, and machine learning techniques that model and structure the information content
Why are Organizations Interested?
SAS Text Analytics Mary-Elizabeth ( M-E ) Eddlestone SAS Customer Loyalty [email protected] +1 (607) 256-7929 Why are Organizations Interested? Text Analytics 2009: User Perspectives on Solutions
Industry Impact of Big Data in the Cloud: An IBM Perspective
Industry Impact of Big Data in the Cloud: An IBM Perspective Inhi Cho Suh IBM Software Group, Information Management Vice President, Product Management and Strategy email: [email protected] twitter: @inhicho
How To Use Social Media To Improve Your Business
IBM Software Business Analytics Social Analytics Social Business Analytics Gaining business value from social media 2 Social Business Analytics Contents 2 Overview 3 Analytics as a competitive advantage
What do Big Data & HAVEn mean? Robert Lejnert HP Autonomy
What do Big Data & HAVEn mean? Robert Lejnert HP Autonomy Much higher Volumes. Processed with more Velocity. With much more Variety. Is Big Data so big? Big Data Smart Data Project HAVEn: Adaptive Intelligence
GRAPHICAL USER INTERFACE, ACCESS, SEARCH AND REPORTING
MEDIA MONITORING AND ANALYSIS GRAPHICAL USER INTERFACE, ACCESS, SEARCH AND REPORTING Searchers Reporting Delivery (Player Selection) DATA PROCESSING AND CONTENT REPOSITORY ADMINISTRATION AND MANAGEMENT
Beyond listening Driving better decisions with business intelligence from social sources
Beyond listening Driving better decisions with business intelligence from social sources From insight to action with IBM Social Media Analytics State of the Union Opinions prevail on the Internet Social
MLg. Big Data and Its Implication to Research Methodologies and Funding. Cornelia Caragea TARDIS 2014. November 7, 2014. Machine Learning Group
Big Data and Its Implication to Research Methodologies and Funding Cornelia Caragea TARDIS 2014 November 7, 2014 UNT Computer Science and Engineering Data Everywhere Lots of data is being collected and
IBM Social Media Analytics
IBM Analyze social media data to improve business outcomes Highlights Grow your business by understanding consumer sentiment and optimizing marketing campaigns. Make better decisions and strategies across
The Future of Business Analytics is Now! 2013 IBM Corporation
The Future of Business Analytics is Now! 1 The pressures on organizations are at a point where analytics has evolved from a business initiative to a BUSINESS IMPERATIVE More organization are using analytics
Voice of the Customer: How to Move Beyond Listening to Action Merging Text Analytics with Data Mining and Predictive Analytics
WHITEPAPER Voice of the Customer: How to Move Beyond Listening to Action Merging Text Analytics with Data Mining and Predictive Analytics Successful companies today both listen and understand what customers
Engage your customers
Business white paper Engage your customers HP Autonomy s Customer Experience Management market offering Table of contents 3 Introduction 3 The customer experience includes every interaction 3 Leveraging
How To Make Sense Of Data With Altilia
HOW TO MAKE SENSE OF BIG DATA TO BETTER DRIVE BUSINESS PROCESSES, IMPROVE DECISION-MAKING, AND SUCCESSFULLY COMPETE IN TODAY S MARKETS. ALTILIA turns Big Data into Smart Data and enables businesses to
Delivering Smart Answers!
Companion for SharePoint Topic Analyst Companion for SharePoint All Your Information Enterprise-ready Enrich SharePoint, your central place for document and workflow management, not only with an improved
Hexaware E-book on Predictive Analytics
Hexaware E-book on Predictive Analytics Business Intelligence & Analytics Actionable Intelligence Enabled Published on : Feb 7, 2012 Hexaware E-book on Predictive Analytics What is Data mining? Data mining,
Text Analytics for Competitive Analysis and Market Intelligence Aiaioo Labs - 2011
Text Analytics for Competitive Analysis and Market Intelligence Aiaioo Labs - 2011 Bangalore, India Title Text Analytics Introduction Entity Person Comparative Analysis Entity or Event Text Analytics Text
WHITE PAPER. Social media analytics in the insurance industry
WHITE PAPER Social media analytics in the insurance industry Introduction Insurance is a high involvement product, as it is an expense. Consumers obtain information about insurance from advertisements,
Big Data and Semantic Web in Manufacturing. Nitesh Khilwani, PhD Chief Engineer, Samsung Research Institute Noida, India
Big Data and Semantic Web in Manufacturing Nitesh Khilwani, PhD Chief Engineer, Samsung Research Institute Noida, India Outline Big data in Manufacturing Big data Analytics Semantic web technologies Case
IBM Customer Experience Suite and Predictive Analytics
IBM Customer Experience Suite and Predictive Analytics Introduction to the IBM Customer Experience Suite In order to help customers meet their exceptional web experience goals in the most efficient and
Customer Experience Management
Customer Experience Management 10 tips for the successful development and execution of Chris Bland Research Director SPA Future Thinking Introduction, sometimes referred to as Customer Feedback Programmes,
STAR WARS AND THE ART OF DATA SCIENCE
STAR WARS AND THE ART OF DATA SCIENCE MELODIE RUSH, SENIOR ANALYTICAL ENGINEER CUSTOMER LOYALTY Original Presentation Created And Presented By Mary Osborne, Business Visualization Manager At 2014 SAS Global
Multichannel analytics and discovery
Brochure Multichannel analytics and discovery Gain greater insight with Multichannel Discovery from HP Autonomy A better way to understand multichannel activity Autonomy ExploreCloud highlights Social
THE STATE OF Social Media Analytics. How Leading Marketers Are Using Social Media Analytics
THE STATE OF Social Media Analytics May 2016 Getting to Know You: How Leading Marketers Are Using Social Media Analytics» Marketers are expanding their use of advanced social media analytics and combining
IBM Social Media Analytics
IBM Social Media Analytics Analyze social media data to better understand your customers and markets Highlights Understand consumer sentiment and optimize marketing campaigns. Improve the customer experience
Overview, Goals, & Introductions
Improving the Retail Experience with Predictive Analytics www.spss.com/perspectives Overview, Goals, & Introductions Goal: To present the Retail Business Maturity Model Equip you with a plan of attack
Government Management Committee. P:\2013\Internal Services\I&T\gm13005I&T (AFS # 17768)
STAFF REPORT ACTION REQUIRED Social Sentiment Analysis Date: May 31, 2013 To: From: Wards: Reference Number: Government Management Committee Director, 311 Toronto Chief Information Officer All P:\2013\Internal
CAPTURING THE VALUE OF UNSTRUCTURED DATA: INTRODUCTION TO TEXT MINING
CAPTURING THE VALUE OF UNSTRUCTURED DATA: INTRODUCTION TO TEXT MINING Mary-Elizabeth ( M-E ) Eddlestone Principal Systems Engineer, Analytics SAS Customer Loyalty, SAS Institute, Inc. Is there valuable
North Highland Data and Analytics. Data Governance Considerations for Big Data Analytics
North Highland and Analytics Governance Considerations for Big Analytics Agenda Traditional BI/Analytics vs. Big Analytics Types of Requiring Governance Key Considerations Information Framework Organizational
Research Article 2015. International Journal of Emerging Research in Management &Technology ISSN: 2278-9359 (Volume-4, Issue-4) Abstract-
International Journal of Emerging Research in Management &Technology Research Article April 2015 Enterprising Social Network Using Google Analytics- A Review Nethravathi B S, H Venugopal, M Siddappa Dept.
Beyond Watson: The Business Implications of Big Data
Beyond Watson: The Business Implications of Big Data Shankar Venkataraman IBM Program Director, STSM, Big Data August 10, 2011 The World is Changing and Becoming More INSTRUMENTED INTERCONNECTED INTELLIGENT
Semantic Data Management. Xavier Lopez, Ph.D., Director, Spatial & Semantic Technologies
Semantic Data Management Xavier Lopez, Ph.D., Director, Spatial & Semantic Technologies 1 Enterprise Information Challenge Source: Oracle customer 2 Vision of Semantically Linked Data The Network of Collaborative
Optimizing government and insurance claims management with IBM Case Manager
Enterprise Content Management Optimizing government and insurance claims management with IBM Case Manager Apply advanced case management capabilities from IBM to help ensure successful outcomes Highlights
IBM G-Cloud - IBM Social Media Analytics Software as a Service
IBM G-Cloud - IBM Social Media Analytics Software as a Service Service Definition 1 1. Summary 1.1 Service Description IBM Social Media Analytics Software as a Service is a powerful Cloud-based tool for
Data Refinery with Big Data Aspects
International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 3, Number 7 (2013), pp. 655-662 International Research Publications House http://www. irphouse.com /ijict.htm Data
Big Data. Fast Forward. Putting data to productive use
Big Data Putting data to productive use Fast Forward What is big data, and why should you care? Get familiar with big data terminology, technologies, and techniques. Getting started with big data to realize
Provalis Research Text Analytics and the Victory Index
point Provalis Research Text Analytics and the Victory Index Fern Halper, Ph.D. Fellow Daniel Kirsch Senior Analyst Provalis Research Text Analytics and the Victory Index Unstructured data is everywhere
ORACLE SOCIAL ENGAGEMENT AND MONITORING CLOUD SERVICE
ORACLE SOCIAL ENGAGEMENT AND MONITORING CLOUD SERVICE KEY FEATURES Global social media, web, and news feed data Market-leading listening quality Automatic categorization Configurable dashboards, drill-down
MIRACLE at VideoCLEF 2008: Classification of Multilingual Speech Transcripts
MIRACLE at VideoCLEF 2008: Classification of Multilingual Speech Transcripts Julio Villena-Román 1,3, Sara Lana-Serrano 2,3 1 Universidad Carlos III de Madrid 2 Universidad Politécnica de Madrid 3 DAEDALUS
Search and Information Retrieval
Search and Information Retrieval Search on the Web 1 is a daily activity for many people throughout the world Search and communication are most popular uses of the computer Applications involving search
Solve Your Toughest Challenges with Data Mining
IBM Software Business Analytics IBM SPSS Modeler Solve Your Toughest Challenges with Data Mining Use predictive intelligence to make good decisions faster Solve Your Toughest Challenges with Data Mining
Discover How a 360-Degree View of the Customer Boosts Productivity and Profits. eguide
Discover How a 360-Degree View of the Customer Boosts Productivity and Profits eguide eguide Discover How a 360-Degree View of the Customer Boosts Productivity and Profits A guide on the benefits of using
The Business Value of Predictive Analytics
The Business Value of Predictive Analytics Alys Woodward Program Manager, European Business Analytics, Collaboration and Social Solutions, IDC London, UK 15 November 2011 Copyright IDC. Reproduction is
Capture intelligence that matters
Brochure Capture intelligence that matters HP Information Analytics Maximizing your ROI (return on information) Effective business leaders realize that their success hinges on how they address the four
A Capability Model for Business Analytics: Part 2 Assessing Analytic Capabilities
A Capability Model for Business Analytics: Part 2 Assessing Analytic Capabilities The first article of this series presented the capability model for business analytics that is illustrated in Figure One.
Text Mining and Analysis
Text Mining and Analysis Practical Methods, Examples, and Case Studies Using SAS Goutam Chakraborty, Murali Pagolu, Satish Garla From Text Mining and Analysis. Full book available for purchase here. Contents
Manjula Ambur NASA Langley Research Center April 2014
Manjula Ambur NASA Langley Research Center April 2014 Outline What is Big Data Vision and Roadmap Key Capabilities Impetus for Watson Technologies Content Analytics Use Potential use cases What is Big
Turning Big Data into a Big Opportunity
Customer-Centricity in a World of Data: Turning Big Data into a Big Opportunity Richard Maraschi Business Analytics Solutions Leader IBM Global Media & Entertainment Joe Wikert General Manager & Publisher
Solve your toughest challenges with data mining
IBM Software IBM SPSS Modeler Solve your toughest challenges with data mining Use predictive intelligence to make good decisions faster Solve your toughest challenges with data mining Imagine if you could
Using Data Analytics to Detect Fraud. Other Data Analysis Techniques
Using Data Analytics to Detect Fraud Other Data Analysis Techniques Qualitative Data Analysis Most data analysis techniques require the use of data in the form of numbers. Qualitative data analysis is
Loyalty. Social. Listening
Loyalty Social Listening Listen Understand Engage We integrate Social Listening data with existing research and other data to help our clients drive brand preference and customer loyalty Loyalty Social
Advanced Analytics. The Way Forward for Businesses. Dr. Sujatha R Upadhyaya
Advanced Analytics The Way Forward for Businesses Dr. Sujatha R Upadhyaya Nov 2009 Advanced Analytics Adding Value to Every Business In this tough and competitive market, businesses are fighting to gain
HOW TO DO A SMART DATA PROJECT
April 2014 Smart Data Strategies HOW TO DO A SMART DATA PROJECT Guideline www.altiliagroup.com Summary ALTILIA s approach to Smart Data PROJECTS 3 1. BUSINESS USE CASE DEFINITION 4 2. PROJECT PLANNING
Are You Ready for Big Data?
Are You Ready for Big Data? Jim Gallo National Director, Business Analytics April 10, 2013 Agenda What is Big Data? How do you leverage Big Data in your company? How do you prepare for a Big Data initiative?
IT services for analyses of various data samples
IT services for analyses of various data samples Ján Paralič, František Babič, Martin Sarnovský, Peter Butka, Cecília Havrilová, Miroslava Muchová, Michal Puheim, Martin Mikula, Gabriel Tutoky Technical
Taking A Proactive Approach To Loyalty & Retention
THE STATE OF Customer Analytics Taking A Proactive Approach To Loyalty & Retention By Kerry Doyle An Exclusive Research Report UBM TechWeb research conducted an online study of 339 marketing professionals
De la Business Intelligence aux Big Data. Marie- Aude AUFAURE Head of the Business Intelligence team Ecole Centrale Paris. 22/01/14 Séminaire Big Data
De la Business Intelligence aux Big Data Marie- Aude AUFAURE Head of the Business Intelligence team Ecole Centrale Paris 22/01/14 Séminaire Big Data 1 Agenda EvoluHon of Business Intelligence SemanHc Technologies
Cleaned Data. Recommendations
Call Center Data Analysis Megaputer Case Study in Text Mining Merete Hvalshagen www.megaputer.com Megaputer Intelligence, Inc. 120 West Seventh Street, Suite 10 Bloomington, IN 47404, USA +1 812-0-0110
Evaluating Multinational Communications Using Social Media Monitoring
Evaluating Multinational Communications Using Social Media Monitoring Marshall Sponder Webmetricsguru.com Socialmediaanalyticsbook.com On the Top Conference Davos February 17 th, 2011 International Social
Using a Multichannel Strategy to Deliver an Exceptional Customer Experience
Using a Multichannel Strategy to Deliver an Exceptional Customer Experience 10 things to consider when building a multichannel strategy to improve the customer experience Jesús Hoyos CRM industry analyst,
Survey Results: Requirements and Use Cases for Linguistic Linked Data
Survey Results: Requirements and Use Cases for Linguistic Linked Data 1 Introduction This survey was conducted by the FP7 Project LIDER (http://www.lider-project.eu/) as input into the W3C Community Group
Deploying Big Data to the Cloud: Roadmap for Success
Deploying Big Data to the Cloud: Roadmap for Success James Kobielus Chair, CSCC Big Data in the Cloud Working Group IBM Big Data Evangelist. IBM Data Magazine, Editor-in- Chief. IBM Senior Program Director,
Sentiment Analysis on Big Data
SPAN White Paper!? Sentiment Analysis on Big Data Machine Learning Approach Several sources on the web provide deep insight about people s opinions on the products and services of various companies. Social
KPMG Unlocks Hidden Value in Client Information with Smartlogic Semaphore
CASE STUDY KPMG Unlocks Hidden Value in Client Information with Smartlogic Semaphore Sponsored by: IDC David Schubmehl July 2014 IDC OPINION Dan Vesset Big data in all its forms and associated technologies,
Solve your toughest challenges with data mining
IBM Software Business Analytics IBM SPSS Modeler Solve your toughest challenges with data mining Use predictive intelligence to make good decisions faster 2 Solve your toughest challenges with data mining
Achieving customer loyalty with customer analytics
IBM Software Business Analytics Customer Analytics Achieving customer loyalty with customer analytics 2 Achieving customer loyalty with customer analytics Contents 2 Overview 3 Using satisfaction to drive
Social Media Implementations
SEM Experience Analytics Social Media Implementations SEM Experience Analytics delivers real sentiment, meaning and trends within social media for many of the world s leading consumer brand companies.
Software Engineering for Big Data. CS846 Paulo Alencar David R. Cheriton School of Computer Science University of Waterloo
Software Engineering for Big Data CS846 Paulo Alencar David R. Cheriton School of Computer Science University of Waterloo Big Data Big data technologies describe a new generation of technologies that aim
WHITE PAPER. CRM Evolved. Introducing the Era of Intelligent Engagement
WHITE PAPER CRM Evolved Introducing the Era of Intelligent Engagement November 2015 CRM Evolved Introduction Digital Transformation, a key focus of successful organizations, proves itself a business imperative,
Reinventing Business Intelligence through Big Data
Reinventing Business Intelligence through Big Data Dr. Flavio Villanustre VP, Technology and lead of the Open Source HPCC Systems initiative LexisNexis Risk Solutions Reed Elsevier LEXISNEXIS From RISK
Chapter 6. Foundations of Business Intelligence: Databases and Information Management
Chapter 6 Foundations of Business Intelligence: Databases and Information Management VIDEO CASES Case 1a: City of Dubuque Uses Cloud Computing and Sensors to Build a Smarter, Sustainable City Case 1b:
NICE MULTI-CHANNEL INTERACTION ANALYTICS
NICE MULTI-CHANNEL INTERACTION ANALYTICS Revealing Customer Intent in Contact Center Communications CUSTOMER INTERACTIONS: The LIVE Voice of the Customer Every day, customer service departments handle
