Sentiment Analysis and Topic Classification: Case study over Spanish tweets
|
|
|
- Sandra Neal
- 10 years ago
- Views:
Transcription
1 Sentiment Analysis and Topic Classification: Case study over Spanish tweets Fernando Batista, Ricardo Ribeiro Laboratório de Sistemas de Língua Falada, INESC- ID Lisboa R. Alves Redol, 9, Lisboa, Portugal ISCTE - - Instituto Universitário de Lisboa Av. Forças Armadas, Lisboa, Portugal {Fernando.Batista, Ricardo.Ribeiro}@inesc-id.pt Introduction By providing revolutionary means for people to communicate and interact, Social Networks take now part in the life of a large of millions of people all over the world. Each social network targets different audiences, offering a range of unique services that people find useful in the course of their lives. In what concerns Twitter, it provides a simple way of writing small messages, which people can use to express everything. Twitter can be accessed in numerous ways, ranging from computers to mobile phones or other mobile devices. That is particularly important because accessing and producing content becomes a trivial task, therefore assuming an important part of people's lives. One relevant aspect that differentiates Twitter from other communication means is its ability to rapidly propagate such content and make it available to specific communities, selected based on their interests. Twitter data is a powerful source of information for assessing and predicting large- scale facts. For example, [11] capture large- scale trends on consumer confidence and political opinion in tweets, strengthening the potential of such data as a supplement for traditional polling. In what concerns stock markets, [5] found that Twitter data can be used to significantly improve stock market predictions accuracy. The huge amount of data, constantly being produced, makes it impracticable to manually process such content. For that reason, it becomes urgent to apply automatic processing strategies that can handle, and take advantage, of such amount of data. However, processing Twitter is all but an easy task, not only because of specific phenomena that can be found in the data, but also because it may require to process a continuous stream of data, and possibly to store some of the data in a way that it can be accessed in the future. This paper addresses two well- known Natural Language Processing (NLP) tasks, sentiment analysis and topic classification, which have been performed over Spanish Twitter data, in the context of the "TASS workshop on Sentiment Analysis" [18], a satellite event of the SEPLN 2012 conference. The remainder of this paper presents an overview of the challenge proposed within the workshop, reports on our experience and presents some conclusions about the achieved results.
2 Related work Text Classification consists of assigning a class from a set of possible predefined classes to a given text. Sentiment Analysis and Topic Detection are two Text Classification tasks, commonly applied both to written and speech corpora, that may be tackled using similar strategies, despite being characterized by their specificities. Sentiment Analysis is often referred by other names (e.g. sentiment mining, opinion mining, etc.) and assigns an opinion, sentiment or emotion to a given portion of text. Topic detection assigns topics from a set of possible predefined topics to a given portion of text. Sentiment analysis can be performed at different complexity levels, where the most basic one consists just on deciding whether a portion of text contains a positive or a negative sentiment. However, it can be performed at more complex levels, like ranking the attitude into a set of more than two classes or, even further, it can be performed in a way that different complex attitude types can be determined, as well as finding the source and the target of such attitudes. Dealing with the huge amounts of data available on Twitter demand clever strategies. One interesting idea, explored by [1] consists of using emoticons, abundantly available on tweets, to automatically label the data and then use such data to train machine learning algorithms. The paper shows that machine learning algorithms trained with such approach achieve above 80% accuracy, when classifying messages as positive or negative. A similar idea was previously explored by [3] for movie reviews, by using star ratings as polarity signals in their training data. This latter paper analyses the performance of different classifiers on movie reviews, and presents a number of techniques that were used by many authors and served as baseline for posterior studies. As an example, they have adapted a technique, introduced by [14], for modeling the contextual effect of negation, adding the prefix NOT_ to every word between a negation word and the first punctuation mark following the negation word. Common approaches to sentiment analysis involve the use of sentiment lexicons of positive and negative words or expressions. The General Inquirer [13] was one of the first available sentiment lexicons freely available for research, which includes several categories of words, such as: positive vs. negative, strong vs. week. Two other examples include [10], an opinion lexicon containing about 7000 words, and the MPQA Subjectivity Cues Lexicon [16], where words are annotated not only as positive vs. negative, but also with intensity. Learning polarity lexicons is another research approach that can be especially useful for dealing with large corpora. The process starts with a seed set of words and the idea is to increasingly find words or phrases with similar polarity, in semi- supervised fashion [12]. The final lexicon contains much more words, possibly learning domain- specific information, and therefore is more prone to be robust. Work on Topic Detection has its origins in 1996 with the Topic Detection and Tracking (TDT) initiative sponsored by the US government. The main motivation for this initiative was the processing of the large amounts of information coming
3 from newswire and broadcast news. The main goal was to organize the information in terms of events and stories that discussed them. The concept of topic was defined as the set of stories about a particular event. Five tasks were defined: story segmentation, first story detection, cluster detection, tracking, and story link detection. The current impact and the amount information generated by social media led to a state of affairs similar to the one that fostered the pioneer work on TDT. Social media is now the context for research tasks like topic (cluster) detection [9, 6] or emerging topic (first story) detection [15]. In that sense, closer to our work are the approaches described by [4] and [9], where tweets are classified into previously defined sets of generic topics. In the former, a conventional bag- of- words (BOW) strategy is compared to a specific set of features (authorship and the presence of several types of twitter- related phenomena) using a Naive Bayes (NB) classifier to classify tweets into the following generic categories: News, Events, Opinions, Deals, and Private Messages. Findings show that authorship is a quite important feature. In the latter, two strategies, BOW and network- based classification, are explored to classify clusters of tweets into 18 general categories, like Sports, Politics, or Technology. In the BOW approach, the clusters of tweets are represented by TF- IDF vectors and NB, NB Multinomial, and Support Vector Machines (SVM) classifiers are used to perform classification. The network- based classification approach is based on the links between users and C5.0 decision tree, k- Nearest Neighbor, SVM, and Logistic Regression classifiers were used. Network- based classification was shown to achieve a better performance, but being link- based, it cannot be used for all situations. Data The data provided in the context of the TASS contest [18] consists of Spanish tweets written in Spanish by 200 well- known personalities and. The training data is an XML file containing about 7200 tweets, each one labeled with sentiment polarity and the corresponding topics. The goal consists in providing automatic sentiment and topic classification for the test data, which is also available in XML and consists of about unlabeled tweets. Each tweet in the labeled data is annotated in terms of polarity, using one of six possible values: NONE, N (negative), N+ (very negative), NEU (both negative and positive), P (positive), P+ (very positive). Moreover, each annotation is also marked as AGREEMENT or DISAGREEMENT, indicating whether all the annotators performed the annotation coherently. In what concerns topic detection, each tweet was annotated with one or more topics, from a list of 10 possible topics: política (politics), otros (others), entretenimiento (entertainment), economía (economics), música (music), fútbol (football), cine (movies), tecnología (technology), deportes (sports), and literatura (literature). It is also important to mention that, besides the tweets, an extra XML file is also available, containing information about each one of the users that authored at least one of the tweets in the data. In particular, the information includes the type of user, assuming one of three possible values periodista (journalist),
4 famoso (famous person), and politico (politician) which may provide valuable information for these tasks. Approaches and Results The number of participants was eight for the sentiment analysis task, but only six of them also performed the topic detection task. The most relevant strategies described by the participants cover: treatment of emoticons, processing negation, spelling error correction, part- of- speech tagging, use of named entities and adjectives, syntactic information, word sense disambiguation, polarity lexicons (for sentiment analysis), Twitter- LDA - Latent Dirichelet Allocation (for topic detection), information retrieval motivated strategies based on language divergence. In what concerns machine learning, the participants have reported the use of Logistic Regression, Multinomial Naive Bayes, and SVM (Support Vector Machines). Sentiment Analysis Topic Detection 65.3% 65.4% 63.4% 60.1% 57.0% 45.3% Table 1 Top 3 results (accuracy) Table 1 presents the best results achieved for each one of the tasks. Our team achieved the best result for Topic Detection and the second place for Sentiment Analysis. The team which achieved the best results for Sentiment Analysis, involved treating emoticons, negation, spelling errors, POS- tagging, syntactic analysis, and the used of sentiment lexicons. Our results for this task involved not only text features, but also the tweet's author name and the user type, also available in the distributed data. Apart from the provided data, our experiments also used Spanish Sentiment Lexicons ( a resource created at the University of North Texas [17]. From this resource, only the most robust part was used, known as fullstrengthlexicon, and containing 1346 words automatically labeled with sentiment polarity. For Topic Detection, our approach was the most successful and involved the use of text features (after removing punctuation marks) and the tweet's author name. Bibliography 1. A. Go, R. Bhayani, and L. Huang, Twitter Sentiment Classification using Distant Supervision, tech. rep., Stanford University, A. L. Berger, S. A. D. Pietra, and V. J. D. Pietra. A maximum entropy approach to natural language processing. Computational Linguistics, 22(1):39 71, B. Pang, L. Lee, and S. Vaithyanathan, Thumbs up? sentiment classification using machine learning techniques, in Proceedings of Empirical Methods in Natural Language Processing (EMNLP), pp , B. Sriram, D. Fuhry, E. Demir, H. Ferhatosmanoglu, and M. Demirbas, Short Text Classification in Twitter to Improve Information Filtering, in SIGIR 10:
5 Proceedings of the 33rd international ACM SIGIR conference on Research and Development in Information Retrieval, pp , ACM, Bollen, Johan, Huina Mao, and Xiao- Jun Zeng Twitter mood predicts the stock market. CoRR, abs/ C. Lin, Y. He, R. Everson, and S. Rüger, Weakly Supervised Joint Sentiment- Topic Detection from Text, IEEE Transactions On Knowledge And Data Engineering, vol. 24, no. 6, pp , F. Batista, R. Ribeiro, Sentiment Analysis and Topic Classification based on Binary Maximum Entropy Classifiers, Procesamiento de Lenguaje Natural, vol. 50, pp ISSN: , H. Daumé III. Notes on CG and LM- BFGS optimization of logistic regression K. Lee, D. Palsetia, R. Narayanan, M. Patwary, A. Agrawal, and A. Choudhary, Twitter trending topic classification, in International Conference on Data Mining Workshops (ICDMW), pp , IEEE, M. Hu and B. Liu, Mining and summarizing customer reviews, in Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 04, pp , ACM, O Connor, B., R. Balasubramanyan, B. R. Routledge, and N. A. Smith From tweets to polls: Linking text sentiment to public opinion time series. In Proceedings of the Fourth International AAAI Conference on Weblogs and Social Media (ICWSM 10). 12. P. D. Turney, Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Clas- sification of Reviews, in Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, pp , ACL, P. Stone, D. Dunphy, M. Smith, and D. Ogilvie, The General Inquirer: A Computer Approach to Content Analysis. MIT Press, S. Das and M. Chen, Yahoo! for Amazon: Extracting market sentiment from stock message boards, in Proceedings of the Asia Pacific Finance Association Annual Conference (APFA), S. P. Kasiviswanathan, P. Melville, A. Banerjee, and V. Sindhwani, Emerging Topic Detection using Dictionary Learning, in CIKM 11: Proceedings of the 20th ACM international conference on Information and Knowledge Management, pp , ACM, T. Wilson, J. Wiebe, and P. Hoffmann, Recognizing contextual polarity in phrase- level sentiment anal- ysis, in Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing (HLT/EMNLP), pp , ACL, V. Perez- Rosas, C. Banea, and R. Mihalcea. Learning sentiment lexicons in spanish. Proc. of the 8th International Conference on Language Resources and Evaluation (LREC 12), Istanbul, Turkey, May Villena- Román, J., J. García- Morera, C. Moreno Garcia, L. Ferrer- Ureña, S. Lana- Serrano, J. C. González- Cristobal, A. Westerski, E. Martínez- Cámara, M. A. García- Cumbreras, M. T. Martín- Valdivia, and Ureña- López L. A TASS- Workshop on Sentiment Analysis at SEPLN. Procesamiento de Lenguaje Natural, 50.
Sentiment analysis on tweets in a financial domain
Sentiment analysis on tweets in a financial domain Jasmina Smailović 1,2, Miha Grčar 1, Martin Žnidaršič 1 1 Dept of Knowledge Technologies, Jožef Stefan Institute, Ljubljana, Slovenia 2 Jožef Stefan International
Data Mining Yelp Data - Predicting rating stars from review text
Data Mining Yelp Data - Predicting rating stars from review text Rakesh Chada Stony Brook University [email protected] Chetan Naik Stony Brook University [email protected] ABSTRACT The majority
Sentiment Analysis. D. Skrepetos 1. University of Waterloo. NLP Presenation, 06/17/2015
Sentiment Analysis D. Skrepetos 1 1 Department of Computer Science University of Waterloo NLP Presenation, 06/17/2015 D. Skrepetos (University of Waterloo) Sentiment Analysis NLP Presenation, 06/17/2015
S-Sense: A Sentiment Analysis Framework for Social Media Sensing
S-Sense: A Sentiment Analysis Framework for Social Media Sensing Choochart Haruechaiyasak, Alisa Kongthon, Pornpimon Palingoon and Kanokorn Trakultaweekoon Speech and Audio Technology Laboratory (SPT)
Microblog Sentiment Analysis with Emoticon Space Model
Microblog Sentiment Analysis with Emoticon Space Model Fei Jiang, Yiqun Liu, Huanbo Luan, Min Zhang, and Shaoping Ma State Key Laboratory of Intelligent Technology and Systems, Tsinghua National Laboratory
Sentiment analysis: towards a tool for analysing real-time students feedback
Sentiment analysis: towards a tool for analysing real-time students feedback Nabeela Altrabsheh Email: [email protected] Mihaela Cocea Email: [email protected] Sanaz Fallahkhair Email:
Robust Sentiment Detection on Twitter from Biased and Noisy Data
Robust Sentiment Detection on Twitter from Biased and Noisy Data Luciano Barbosa AT&T Labs - Research [email protected] Junlan Feng AT&T Labs - Research [email protected] Abstract In this
Semantic Sentiment Analysis of Twitter
Semantic Sentiment Analysis of Twitter Hassan Saif, Yulan He & Harith Alani Knowledge Media Institute, The Open University, Milton Keynes, United Kingdom The 11 th International Semantic Web Conference
Emoticon Smoothed Language Models for Twitter Sentiment Analysis
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence Emoticon Smoothed Language Models for Twitter Sentiment Analysis Kun-Lin Liu, Wu-Jun Li, Minyi Guo Shanghai Key Laboratory of
Impact of Financial News Headline and Content to Market Sentiment
International Journal of Machine Learning and Computing, Vol. 4, No. 3, June 2014 Impact of Financial News Headline and Content to Market Sentiment Tan Li Im, Phang Wai San, Chin Kim On, Rayner Alfred,
Sentiment Analysis of Movie Reviews and Twitter Statuses. Introduction
Sentiment Analysis of Movie Reviews and Twitter Statuses Introduction Sentiment analysis is the task of identifying whether the opinion expressed in a text is positive or negative in general, or about
End-to-End Sentiment Analysis of Twitter Data
End-to-End Sentiment Analysis of Twitter Data Apoor v Agarwal 1 Jasneet Singh Sabharwal 2 (1) Columbia University, NY, U.S.A. (2) Guru Gobind Singh Indraprastha University, New Delhi, India [email protected],
Italian Journal of Accounting and Economia Aziendale. International Area. Year CXIV - 2014 - n. 1, 2 e 3
Italian Journal of Accounting and Economia Aziendale International Area Year CXIV - 2014 - n. 1, 2 e 3 Could we make better prediction of stock market indicators through Twitter sentiment analysis? ALEXANDER
Kea: Expression-level Sentiment Analysis from Twitter Data
Kea: Expression-level Sentiment Analysis from Twitter Data Ameeta Agrawal Computer Science and Engineering York University Toronto, Canada [email protected] Aijun An Computer Science and Engineering
Using Text and Data Mining Techniques to extract Stock Market Sentiment from Live News Streams
2012 International Conference on Computer Technology and Science (ICCTS 2012) IPCSIT vol. XX (2012) (2012) IACSIT Press, Singapore Using Text and Data Mining Techniques to extract Stock Market Sentiment
A Comparative Study on Sentiment Classification and Ranking on Product Reviews
A Comparative Study on Sentiment Classification and Ranking on Product Reviews C.EMELDA Research Scholar, PG and Research Department of Computer Science, Nehru Memorial College, Putthanampatti, Bharathidasan
MLg. Big Data and Its Implication to Research Methodologies and Funding. Cornelia Caragea TARDIS 2014. November 7, 2014. Machine Learning Group
Big Data and Its Implication to Research Methodologies and Funding Cornelia Caragea TARDIS 2014 November 7, 2014 UNT Computer Science and Engineering Data Everywhere Lots of data is being collected and
Sentiment Analysis: a case study. Giuseppe Castellucci [email protected]
Sentiment Analysis: a case study Giuseppe Castellucci [email protected] Web Mining & Retrieval a.a. 2013/2014 Outline Sentiment Analysis overview Brand Reputation Sentiment Analysis in Twitter
Sentiment analysis of Twitter microblogging posts. Jasmina Smailović Jožef Stefan Institute Department of Knowledge Technologies
Sentiment analysis of Twitter microblogging posts Jasmina Smailović Jožef Stefan Institute Department of Knowledge Technologies Introduction Popularity of microblogging services Twitter microblogging posts
Sentiment Analysis for Movie Reviews
Sentiment Analysis for Movie Reviews Ankit Goyal, [email protected] Amey Parulekar, [email protected] Introduction: Movie reviews are an important way to gauge the performance of a movie. While providing
RRSS - Rating Reviews Support System purpose built for movies recommendation
RRSS - Rating Reviews Support System purpose built for movies recommendation Grzegorz Dziczkowski 1,2 and Katarzyna Wegrzyn-Wolska 1 1 Ecole Superieur d Ingenieurs en Informatique et Genie des Telecommunicatiom
EFFICIENTLY PROVIDE SENTIMENT ANALYSIS DATA SETS USING EXPRESSIONS SUPPORT METHOD
EFFICIENTLY PROVIDE SENTIMENT ANALYSIS DATA SETS USING EXPRESSIONS SUPPORT METHOD 1 Josephine Nancy.C, 2 K Raja. 1 PG scholar,department of Computer Science, Tagore Institute of Engineering and Technology,
Sentiment analysis on news articles using Natural Language Processing and Machine Learning Approach.
Sentiment analysis on news articles using Natural Language Processing and Machine Learning Approach. Pranali Chilekar 1, Swati Ubale 2, Pragati Sonkambale 3, Reema Panarkar 4, Gopal Upadhye 5 1 2 3 4 5
CSE 598 Project Report: Comparison of Sentiment Aggregation Techniques
CSE 598 Project Report: Comparison of Sentiment Aggregation Techniques Chris MacLellan [email protected] May 3, 2012 Abstract Different methods for aggregating twitter sentiment data are proposed and three
Approaches for Sentiment Analysis on Twitter: A State-of-Art study
Approaches for Sentiment Analysis on Twitter: A State-of-Art study Harsh Thakkar and Dhiren Patel Department of Computer Engineering, National Institute of Technology, Surat-395007, India {harsh9t,dhiren29p}@gmail.com
Multilanguage sentiment-analysis of Twitter data on the example of Swiss politicians
Multilanguage sentiment-analysis of Twitter data on the example of Swiss politicians Lucas Brönnimann University of Applied Science Northwestern Switzerland, CH-5210 Windisch, Switzerland Email: [email protected]
WILL TWITTER MAKE YOU A BETTER INVESTOR? A LOOK AT SENTIMENT, USER REPUTATION AND THEIR EFFECT ON THE STOCK MARKET
WILL TWITTER MAKE YOU A BETTER INVESTOR? A LOOK AT SENTIMENT, USER REPUTATION AND THEIR EFFECT ON THE STOCK MARKET ABSTRACT Eric D. Brown Dakota State University [email protected] The use of social networks
Twitter sentiment vs. Stock price!
Twitter sentiment vs. Stock price! Background! On April 24 th 2013, the Twitter account belonging to Associated Press was hacked. Fake posts about the Whitehouse being bombed and the President being injured
The multilayer sentiment analysis model based on Random forest Wei Liu1, Jie Zhang2
2nd International Conference on Advances in Mechanical Engineering and Industrial Informatics (AMEII 2016) The multilayer sentiment analysis model based on Random forest Wei Liu1, Jie Zhang2 1 School of
FEATURE SELECTION AND CLASSIFICATION APPROACH FOR SENTIMENT ANALYSIS
FEATURE SELECTION AND CLASSIFICATION APPROACH FOR SENTIMENT ANALYSIS Gautami Tripathi 1 and Naganna S. 2 1 PG Scholar, School of Computing Science and Engineering, Galgotias University, Greater Noida,
Improving Twitter Sentiment Analysis with Topic-Based Mixture Modeling and Semi-Supervised Training
Improving Twitter Sentiment Analysis with Topic-Based Mixture Modeling and Semi-Supervised Training Bing Xiang * IBM Watson 1101 Kitchawan Rd Yorktown Heights, NY 10598, USA [email protected] Liang Zhou
Using Twitter as a source of information for stock market prediction
Using Twitter as a source of information for stock market prediction Ramon Xuriguera ([email protected]) Joint work with Marta Arias and Argimiro Arratia ERCIM 2011, 17-19 Dec. 2011, University of
VCU-TSA at Semeval-2016 Task 4: Sentiment Analysis in Twitter
VCU-TSA at Semeval-2016 Task 4: Sentiment Analysis in Twitter Gerard Briones and Kasun Amarasinghe and Bridget T. McInnes, PhD. Department of Computer Science Virginia Commonwealth University Richmond,
Text Opinion Mining to Analyze News for Stock Market Prediction
Int. J. Advance. Soft Comput. Appl., Vol. 6, No. 1, March 2014 ISSN 2074-8523; Copyright SCRG Publication, 2014 Text Opinion Mining to Analyze News for Stock Market Prediction Yoosin Kim 1, Seung Ryul
Sentiment Lexicons for Arabic Social Media
Sentiment Lexicons for Arabic Social Media Saif M. Mohammad 1, Mohammad Salameh 2, Svetlana Kiritchenko 1 1 National Research Council Canada, 2 University of Alberta [email protected], [email protected],
Machine Learning for natural language processing
Machine Learning for natural language processing Introduction Laura Kallmeyer Heinrich-Heine-Universität Düsseldorf Summer 2016 1 / 13 Introduction Goal of machine learning: Automatically learn how to
Using Social Media for Continuous Monitoring and Mining of Consumer Behaviour
Using Social Media for Continuous Monitoring and Mining of Consumer Behaviour Michail Salampasis 1, Giorgos Paltoglou 2, Anastasia Giahanou 1 1 Department of Informatics, Alexander Technological Educational
Efficient Techniques for Improved Data Classification and POS Tagging by Monitoring Extraction, Pruning and Updating of Unknown Foreign Words
, pp.290-295 http://dx.doi.org/10.14257/astl.2015.111.55 Efficient Techniques for Improved Data Classification and POS Tagging by Monitoring Extraction, Pruning and Updating of Unknown Foreign Words Irfan
TOOL OF THE INTELLIGENCE ECONOMIC: RECOGNITION FUNCTION OF REVIEWS CRITICS. Extraction and linguistic analysis of sentiments
TOOL OF THE INTELLIGENCE ECONOMIC: RECOGNITION FUNCTION OF REVIEWS CRITICS. Extraction and linguistic analysis of sentiments Grzegorz Dziczkowski, Katarzyna Wegrzyn-Wolska Ecole Superieur d Ingenieurs
Sentiment Analysis Tool using Machine Learning Algorithms
Sentiment Analysis Tool using Machine Learning Algorithms I.Hemalatha 1, Dr. G. P Saradhi Varma 2, Dr. A.Govardhan 3 1 Research Scholar JNT University Kakinada, Kakinada, A.P., INDIA 2 Professor & Head,
Text Mining for Sentiment Analysis of Twitter Data
Text Mining for Sentiment Analysis of Twitter Data Shruti Wakade, Chandra Shekar, Kathy J. Liszka and Chien-Chung Chan The University of Akron Department of Computer Science [email protected], [email protected]
Fine-grained German Sentiment Analysis on Social Media
Fine-grained German Sentiment Analysis on Social Media Saeedeh Momtazi Information Systems Hasso-Plattner-Institut Potsdam University, Germany [email protected] Abstract Expressing opinions
News Sentiment Analysis Using R to Predict Stock Market Trends
News Sentiment Analysis Using R to Predict Stock Market Trends Anurag Nagar and Michael Hahsler Computer Science Southern Methodist University Dallas, TX Topics Motivation Gathering News Creating News
A Sentiment Detection Engine for Internet Stock Message Boards
A Sentiment Detection Engine for Internet Stock Message Boards Christopher C. Chua Maria Milosavljevic James R. Curran School of Computer Science Capital Markets CRC Ltd School of Information and Engineering
Particular Requirements on Opinion Mining for the Insurance Business
Particular Requirements on Opinion Mining for the Insurance Business Sven Rill, Johannes Drescher, Dirk Reinel, Jörg Scheidt, Florian Wogenstein Institute of Information Systems (iisys) University of Applied
Keyphrase Extraction for Scholarly Big Data
Keyphrase Extraction for Scholarly Big Data Cornelia Caragea Computer Science and Engineering University of North Texas July 10, 2015 Scholarly Big Data Large number of scholarly documents on the Web PubMed
SENTIMENT ANALYSIS: A STUDY ON PRODUCT FEATURES
University of Nebraska - Lincoln DigitalCommons@University of Nebraska - Lincoln Dissertations and Theses from the College of Business Administration Business Administration, College of 4-1-2012 SENTIMENT
SENTIMENT EXTRACTION FROM NATURAL AUDIO STREAMS. Lakshmish Kaushik, Abhijeet Sangwan, John H. L. Hansen
SENTIMENT EXTRACTION FROM NATURAL AUDIO STREAMS Lakshmish Kaushik, Abhijeet Sangwan, John H. L. Hansen Center for Robust Speech Systems (CRSS), Eric Jonsson School of Engineering, The University of Texas
TASS - Workshop on Sentiment Analysis at SEPLN
Procesamiento del Lenguaje Natural, Revista nº 50 marzo de 2013, pp 37-44 recibido 28-11-12 revisado 14-02-13 aceptado 19-02-13 TASS - Workshop on Sentiment Analysis at SEPLN TASS - Taller de Análisis
Sentiment Classification. in a Nutshell. Cem Akkaya, Xiaonan Zhang
Sentiment Classification in a Nutshell Cem Akkaya, Xiaonan Zhang Outline Problem Definition Level of Classification Evaluation Mainstream Method Conclusion Problem Definition Sentiment is the overall emotion,
Twitter Stock Bot. John Matthew Fong The University of Texas at Austin [email protected]
Twitter Stock Bot John Matthew Fong The University of Texas at Austin [email protected] Hassaan Markhiani The University of Texas at Austin [email protected] Abstract The stock market is influenced
CS 229, Autumn 2011 Modeling the Stock Market Using Twitter Sentiment Analysis
CS 229, Autumn 2011 Modeling the Stock Market Using Twitter Sentiment Analysis Team members: Daniel Debbini, Philippe Estin, Maxime Goutagny Supervisor: Mihai Surdeanu (with John Bauer) 1 Introduction
Probabilistic topic models for sentiment analysis on the Web
University of Exeter Department of Computer Science Probabilistic topic models for sentiment analysis on the Web Chenghua Lin September 2011 Submitted by Chenghua Lin, to the the University of Exeter as
A Survey on Product Aspect Ranking Techniques
A Survey on Product Aspect Ranking Techniques Ancy. J. S, Nisha. J.R P.G. Scholar, Dept. of C.S.E., Marian Engineering College, Kerala University, Trivandrum, India. Asst. Professor, Dept. of C.S.E., Marian
Bagged Ensemble Classifiers for Sentiment Classification of Movie Reviews
www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume 3 Issue 2 February, 2014 Page No. 3951-3961 Bagged Ensemble Classifiers for Sentiment Classification of Movie
Sentiment Analysis of Twitter Data
Sentiment Analysis of Twitter Data Apoorv Agarwal Boyi Xie Ilia Vovsha Owen Rambow Rebecca Passonneau Department of Computer Science Columbia University New York, NY 10027 USA {apoorv@cs, xie@cs, iv2121@,
Positive or negative? Using blogs to assess vehicles features
Positive or negative? Using blogs to assess vehicles features Silvio S Ribeiro Jr. 1, Zilton Junior 1, Wagner Meira Jr. 1, Gisele L. Pappa 1 1 Departamento de Ciência da Computação Universidade Federal
Analysis of Tweets for Prediction of Indian Stock Markets
Analysis of Tweets for Prediction of Indian Stock Markets Phillip Tichaona Sumbureru Department of Computer Science and Engineering, JNTU College of Engineering Hyderabad, Kukatpally, Hyderabad-500 085,
Twitter Analytics for Insider Trading Fraud Detection
Twitter Analytics for Insider Trading Fraud Detection W-J Ketty Gann, John Day, Shujia Zhou Information Systems Northrop Grumman Corporation, Annapolis Junction, MD, USA {wan-ju.gann, john.day2, shujia.zhou}@ngc.com
How To Write A Summary Of A Review
PRODUCT REVIEW RANKING SUMMARIZATION N.P.Vadivukkarasi, Research Scholar, Department of Computer Science, Kongu Arts and Science College, Erode. Dr. B. Jayanthi M.C.A., M.Phil., Ph.D., Associate Professor,
An Introduction to Data Mining
An Introduction to Intel Beijing [email protected] January 17, 2014 Outline 1 DW Overview What is Notable Application of Conference, Software and Applications Major Process in 2 Major Tasks in Detail
Bridging CAQDAS with text mining: Text analyst s toolbox for Big Data: Science in the Media Project
Bridging CAQDAS with text mining: Text analyst s toolbox for Big Data: Science in the Media Project Ahmet Suerdem Istanbul Bilgi University; LSE Methodology Dept. Science in the media project is funded
Keywords social media, internet, data, sentiment analysis, opinion mining, business
Volume 5, Issue 8, August 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Real time Extraction
SENTIMENT ANALYSIS: TEXT PRE-PROCESSING, READER VIEWS AND CROSS DOMAINS EMMA HADDI BRUNEL UNIVERSITY LONDON
BRUNEL UNIVERSITY LONDON COLLEGE OF ENGINEERING, DESIGN AND PHYSICAL SCIENCES DEPARTMENT OF COMPUTER SCIENCE DOCTOR OF PHILOSOPHY DISSERTATION SENTIMENT ANALYSIS: TEXT PRE-PROCESSING, READER VIEWS AND
The Italian Hate Map:
I-CiTies 2015 2015 CINI Annual Workshop on ICT for Smart Cities and Communities Palermo (Italy) - October 29-30, 2015 The Italian Hate Map: semantic content analytics for social good (Università degli
Designing Ranking Systems for Consumer Reviews: The Impact of Review Subjectivity on Product Sales and Review Quality
Designing Ranking Systems for Consumer Reviews: The Impact of Review Subjectivity on Product Sales and Review Quality Anindya Ghose, Panagiotis G. Ipeirotis {aghose, panos}@stern.nyu.edu Department of
Twitter Sentiment Analysis of Movie Reviews using Machine Learning Techniques.
Twitter Sentiment Analysis of Movie Reviews using Machine Learning Techniques. Akshay Amolik, Niketan Jivane, Mahavir Bhandari, Dr.M.Venkatesan School of Computer Science and Engineering, VIT University,
Ming-Wei Chang. Machine learning and its applications to natural language processing, information retrieval and data mining.
Ming-Wei Chang 201 N Goodwin Ave, Department of Computer Science University of Illinois at Urbana-Champaign, Urbana, IL 61801 +1 (917) 345-6125 [email protected] http://flake.cs.uiuc.edu/~mchang21 Research
Towards SoMEST Combining Social Media Monitoring with Event Extraction and Timeline Analysis
Towards SoMEST Combining Social Media Monitoring with Event Extraction and Timeline Analysis Yue Dai, Ernest Arendarenko, Tuomo Kakkonen, Ding Liao School of Computing University of Eastern Finland {yvedai,
Defect Analytics in a High-End Server Manufacturing Environment
Proceedings of the 2015 Industrial and Systems Engineering Research Conference S. Cetinkaya and J. Ryan, eds. Defect Analytics in a High-End Server Manufacturing Environment Faisal Aqlan Industrial Engineering
CIRGIRDISCO at RepLab2014 Reputation Dimension Task: Using Wikipedia Graph Structure for Classifying the Reputation Dimension of a Tweet
CIRGIRDISCO at RepLab2014 Reputation Dimension Task: Using Wikipedia Graph Structure for Classifying the Reputation Dimension of a Tweet Muhammad Atif Qureshi 1,2, Arjumand Younus 1,2, Colm O Riordan 1,
An Insight Of Sentiment Analysis In The Financial News
Available online at www.globalilluminators.org GlobalIlluminators FULL PAPER PROCEEDING Multidisciplinary Studies Full Paper Proceeding ICMRP-2014, Vol. 1, 278-291 ISBN: 978-969-9948-08-4 ICMRP 2014 An
Identifying Focus, Techniques and Domain of Scientific Papers
Identifying Focus, Techniques and Domain of Scientific Papers Sonal Gupta Department of Computer Science Stanford University Stanford, CA 94305 [email protected] Christopher D. Manning Department of
Text Mining - Scope and Applications
Journal of Computer Science and Applications. ISSN 2231-1270 Volume 5, Number 2 (2013), pp. 51-55 International Research Publication House http://www.irphouse.com Text Mining - Scope and Applications Miss
Emotion Detection in Code-switching Texts via Bilingual and Sentimental Information
Emotion Detection in Code-switching Texts via Bilingual and Sentimental Information Zhongqing Wang, Sophia Yat Mei Lee, Shoushan Li, and Guodong Zhou Natural Language Processing Lab, Soochow University,
Bing Liu. Web Data Mining. Exploring Hyperlinks, Contents, and Usage Data. With 177 Figures. ~ Spring~r
Bing Liu Web Data Mining Exploring Hyperlinks, Contents, and Usage Data With 177 Figures ~ Spring~r Table of Contents 1. Introduction.. 1 1.1. What is the World Wide Web? 1 1.2. ABrief History of the Web
Forecasting stock markets with Twitter
Forecasting stock markets with Twitter Argimiro Arratia [email protected] Joint work with Marta Arias and Ramón Xuriguera To appear in: ACM Transactions on Intelligent Systems and Technology, 2013,
II. RELATED WORK. Sentiment Mining
Sentiment Mining Using Ensemble Classification Models Matthew Whitehead and Larry Yaeger Indiana University School of Informatics 901 E. 10th St. Bloomington, IN 47408 {mewhiteh, larryy}@indiana.edu Abstract
Can Twitter provide enough information for predicting the stock market?
Can Twitter provide enough information for predicting the stock market? Maria Dolores Priego Porcuna Introduction Nowadays a huge percentage of financial companies are investing a lot of money on Social
Semi-Supervised Learning for Blog Classification
Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence (2008) Semi-Supervised Learning for Blog Classification Daisuke Ikeda Department of Computational Intelligence and Systems Science,
Tweets Miner for Stock Market Analysis
Tweets Miner for Stock Market Analysis Bohdan Pavlyshenko Electronics department, Ivan Franko Lviv National University,Ukraine, Drahomanov Str. 50, Lviv, 79005, Ukraine, e-mail: [email protected]
Challenges of Cloud Scale Natural Language Processing
Challenges of Cloud Scale Natural Language Processing Mark Dredze Johns Hopkins University My Interests? Information Expressed in Human Language Machine Learning Natural Language Processing Intelligent
ONLINE RESUME PARSING SYSTEM USING TEXT ANALYTICS
ONLINE RESUME PARSING SYSTEM USING TEXT ANALYTICS Divyanshu Chandola 1, Aditya Garg 2, Ankit Maurya 3, Amit Kushwaha 4 1 Student, Department of Information Technology, ABES Engineering College, Uttar Pradesh,
Neuro-Fuzzy Classification Techniques for Sentiment Analysis using Intelligent Agents on Twitter Data
International Journal of Innovation and Scientific Research ISSN 2351-8014 Vol. 23 No. 2 May 2016, pp. 356-360 2015 Innovative Space of Scientific Research Journals http://www.ijisr.issr-journals.org/
Domain Classification of Technical Terms Using the Web
Systems and Computers in Japan, Vol. 38, No. 14, 2007 Translated from Denshi Joho Tsushin Gakkai Ronbunshi, Vol. J89-D, No. 11, November 2006, pp. 2470 2482 Domain Classification of Technical Terms Using
