Sentiment Analysis and Topic Classification: Case study over Spanish tweets
|
|
- Sandra Neal
- 8 years ago
- Views:
Transcription
1 Sentiment Analysis and Topic Classification: Case study over Spanish tweets Fernando Batista, Ricardo Ribeiro Laboratório de Sistemas de Língua Falada, INESC- ID Lisboa R. Alves Redol, 9, Lisboa, Portugal ISCTE - - Instituto Universitário de Lisboa Av. Forças Armadas, Lisboa, Portugal {Fernando.Batista, Ricardo.Ribeiro}@inesc-id.pt Introduction By providing revolutionary means for people to communicate and interact, Social Networks take now part in the life of a large of millions of people all over the world. Each social network targets different audiences, offering a range of unique services that people find useful in the course of their lives. In what concerns Twitter, it provides a simple way of writing small messages, which people can use to express everything. Twitter can be accessed in numerous ways, ranging from computers to mobile phones or other mobile devices. That is particularly important because accessing and producing content becomes a trivial task, therefore assuming an important part of people's lives. One relevant aspect that differentiates Twitter from other communication means is its ability to rapidly propagate such content and make it available to specific communities, selected based on their interests. Twitter data is a powerful source of information for assessing and predicting large- scale facts. For example, [11] capture large- scale trends on consumer confidence and political opinion in tweets, strengthening the potential of such data as a supplement for traditional polling. In what concerns stock markets, [5] found that Twitter data can be used to significantly improve stock market predictions accuracy. The huge amount of data, constantly being produced, makes it impracticable to manually process such content. For that reason, it becomes urgent to apply automatic processing strategies that can handle, and take advantage, of such amount of data. However, processing Twitter is all but an easy task, not only because of specific phenomena that can be found in the data, but also because it may require to process a continuous stream of data, and possibly to store some of the data in a way that it can be accessed in the future. This paper addresses two well- known Natural Language Processing (NLP) tasks, sentiment analysis and topic classification, which have been performed over Spanish Twitter data, in the context of the "TASS workshop on Sentiment Analysis" [18], a satellite event of the SEPLN 2012 conference. The remainder of this paper presents an overview of the challenge proposed within the workshop, reports on our experience and presents some conclusions about the achieved results.
2 Related work Text Classification consists of assigning a class from a set of possible predefined classes to a given text. Sentiment Analysis and Topic Detection are two Text Classification tasks, commonly applied both to written and speech corpora, that may be tackled using similar strategies, despite being characterized by their specificities. Sentiment Analysis is often referred by other names (e.g. sentiment mining, opinion mining, etc.) and assigns an opinion, sentiment or emotion to a given portion of text. Topic detection assigns topics from a set of possible predefined topics to a given portion of text. Sentiment analysis can be performed at different complexity levels, where the most basic one consists just on deciding whether a portion of text contains a positive or a negative sentiment. However, it can be performed at more complex levels, like ranking the attitude into a set of more than two classes or, even further, it can be performed in a way that different complex attitude types can be determined, as well as finding the source and the target of such attitudes. Dealing with the huge amounts of data available on Twitter demand clever strategies. One interesting idea, explored by [1] consists of using emoticons, abundantly available on tweets, to automatically label the data and then use such data to train machine learning algorithms. The paper shows that machine learning algorithms trained with such approach achieve above 80% accuracy, when classifying messages as positive or negative. A similar idea was previously explored by [3] for movie reviews, by using star ratings as polarity signals in their training data. This latter paper analyses the performance of different classifiers on movie reviews, and presents a number of techniques that were used by many authors and served as baseline for posterior studies. As an example, they have adapted a technique, introduced by [14], for modeling the contextual effect of negation, adding the prefix NOT_ to every word between a negation word and the first punctuation mark following the negation word. Common approaches to sentiment analysis involve the use of sentiment lexicons of positive and negative words or expressions. The General Inquirer [13] was one of the first available sentiment lexicons freely available for research, which includes several categories of words, such as: positive vs. negative, strong vs. week. Two other examples include [10], an opinion lexicon containing about 7000 words, and the MPQA Subjectivity Cues Lexicon [16], where words are annotated not only as positive vs. negative, but also with intensity. Learning polarity lexicons is another research approach that can be especially useful for dealing with large corpora. The process starts with a seed set of words and the idea is to increasingly find words or phrases with similar polarity, in semi- supervised fashion [12]. The final lexicon contains much more words, possibly learning domain- specific information, and therefore is more prone to be robust. Work on Topic Detection has its origins in 1996 with the Topic Detection and Tracking (TDT) initiative sponsored by the US government. The main motivation for this initiative was the processing of the large amounts of information coming
3 from newswire and broadcast news. The main goal was to organize the information in terms of events and stories that discussed them. The concept of topic was defined as the set of stories about a particular event. Five tasks were defined: story segmentation, first story detection, cluster detection, tracking, and story link detection. The current impact and the amount information generated by social media led to a state of affairs similar to the one that fostered the pioneer work on TDT. Social media is now the context for research tasks like topic (cluster) detection [9, 6] or emerging topic (first story) detection [15]. In that sense, closer to our work are the approaches described by [4] and [9], where tweets are classified into previously defined sets of generic topics. In the former, a conventional bag- of- words (BOW) strategy is compared to a specific set of features (authorship and the presence of several types of twitter- related phenomena) using a Naive Bayes (NB) classifier to classify tweets into the following generic categories: News, Events, Opinions, Deals, and Private Messages. Findings show that authorship is a quite important feature. In the latter, two strategies, BOW and network- based classification, are explored to classify clusters of tweets into 18 general categories, like Sports, Politics, or Technology. In the BOW approach, the clusters of tweets are represented by TF- IDF vectors and NB, NB Multinomial, and Support Vector Machines (SVM) classifiers are used to perform classification. The network- based classification approach is based on the links between users and C5.0 decision tree, k- Nearest Neighbor, SVM, and Logistic Regression classifiers were used. Network- based classification was shown to achieve a better performance, but being link- based, it cannot be used for all situations. Data The data provided in the context of the TASS contest [18] consists of Spanish tweets written in Spanish by 200 well- known personalities and. The training data is an XML file containing about 7200 tweets, each one labeled with sentiment polarity and the corresponding topics. The goal consists in providing automatic sentiment and topic classification for the test data, which is also available in XML and consists of about unlabeled tweets. Each tweet in the labeled data is annotated in terms of polarity, using one of six possible values: NONE, N (negative), N+ (very negative), NEU (both negative and positive), P (positive), P+ (very positive). Moreover, each annotation is also marked as AGREEMENT or DISAGREEMENT, indicating whether all the annotators performed the annotation coherently. In what concerns topic detection, each tweet was annotated with one or more topics, from a list of 10 possible topics: política (politics), otros (others), entretenimiento (entertainment), economía (economics), música (music), fútbol (football), cine (movies), tecnología (technology), deportes (sports), and literatura (literature). It is also important to mention that, besides the tweets, an extra XML file is also available, containing information about each one of the users that authored at least one of the tweets in the data. In particular, the information includes the type of user, assuming one of three possible values periodista (journalist),
4 famoso (famous person), and politico (politician) which may provide valuable information for these tasks. Approaches and Results The number of participants was eight for the sentiment analysis task, but only six of them also performed the topic detection task. The most relevant strategies described by the participants cover: treatment of emoticons, processing negation, spelling error correction, part- of- speech tagging, use of named entities and adjectives, syntactic information, word sense disambiguation, polarity lexicons (for sentiment analysis), Twitter- LDA - Latent Dirichelet Allocation (for topic detection), information retrieval motivated strategies based on language divergence. In what concerns machine learning, the participants have reported the use of Logistic Regression, Multinomial Naive Bayes, and SVM (Support Vector Machines). Sentiment Analysis Topic Detection 65.3% 65.4% 63.4% 60.1% 57.0% 45.3% Table 1 Top 3 results (accuracy) Table 1 presents the best results achieved for each one of the tasks. Our team achieved the best result for Topic Detection and the second place for Sentiment Analysis. The team which achieved the best results for Sentiment Analysis, involved treating emoticons, negation, spelling errors, POS- tagging, syntactic analysis, and the used of sentiment lexicons. Our results for this task involved not only text features, but also the tweet's author name and the user type, also available in the distributed data. Apart from the provided data, our experiments also used Spanish Sentiment Lexicons ( a resource created at the University of North Texas [17]. From this resource, only the most robust part was used, known as fullstrengthlexicon, and containing 1346 words automatically labeled with sentiment polarity. For Topic Detection, our approach was the most successful and involved the use of text features (after removing punctuation marks) and the tweet's author name. Bibliography 1. A. Go, R. Bhayani, and L. Huang, Twitter Sentiment Classification using Distant Supervision, tech. rep., Stanford University, A. L. Berger, S. A. D. Pietra, and V. J. D. Pietra. A maximum entropy approach to natural language processing. Computational Linguistics, 22(1):39 71, B. Pang, L. Lee, and S. Vaithyanathan, Thumbs up? sentiment classification using machine learning techniques, in Proceedings of Empirical Methods in Natural Language Processing (EMNLP), pp , B. Sriram, D. Fuhry, E. Demir, H. Ferhatosmanoglu, and M. Demirbas, Short Text Classification in Twitter to Improve Information Filtering, in SIGIR 10:
5 Proceedings of the 33rd international ACM SIGIR conference on Research and Development in Information Retrieval, pp , ACM, Bollen, Johan, Huina Mao, and Xiao- Jun Zeng Twitter mood predicts the stock market. CoRR, abs/ C. Lin, Y. He, R. Everson, and S. Rüger, Weakly Supervised Joint Sentiment- Topic Detection from Text, IEEE Transactions On Knowledge And Data Engineering, vol. 24, no. 6, pp , F. Batista, R. Ribeiro, Sentiment Analysis and Topic Classification based on Binary Maximum Entropy Classifiers, Procesamiento de Lenguaje Natural, vol. 50, pp ISSN: , H. Daumé III. Notes on CG and LM- BFGS optimization of logistic regression K. Lee, D. Palsetia, R. Narayanan, M. Patwary, A. Agrawal, and A. Choudhary, Twitter trending topic classification, in International Conference on Data Mining Workshops (ICDMW), pp , IEEE, M. Hu and B. Liu, Mining and summarizing customer reviews, in Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 04, pp , ACM, O Connor, B., R. Balasubramanyan, B. R. Routledge, and N. A. Smith From tweets to polls: Linking text sentiment to public opinion time series. In Proceedings of the Fourth International AAAI Conference on Weblogs and Social Media (ICWSM 10). 12. P. D. Turney, Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Clas- sification of Reviews, in Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, pp , ACL, P. Stone, D. Dunphy, M. Smith, and D. Ogilvie, The General Inquirer: A Computer Approach to Content Analysis. MIT Press, S. Das and M. Chen, Yahoo! for Amazon: Extracting market sentiment from stock message boards, in Proceedings of the Asia Pacific Finance Association Annual Conference (APFA), S. P. Kasiviswanathan, P. Melville, A. Banerjee, and V. Sindhwani, Emerging Topic Detection using Dictionary Learning, in CIKM 11: Proceedings of the 20th ACM international conference on Information and Knowledge Management, pp , ACM, T. Wilson, J. Wiebe, and P. Hoffmann, Recognizing contextual polarity in phrase- level sentiment anal- ysis, in Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing (HLT/EMNLP), pp , ACL, V. Perez- Rosas, C. Banea, and R. Mihalcea. Learning sentiment lexicons in spanish. Proc. of the 8th International Conference on Language Resources and Evaluation (LREC 12), Istanbul, Turkey, May Villena- Román, J., J. García- Morera, C. Moreno Garcia, L. Ferrer- Ureña, S. Lana- Serrano, J. C. González- Cristobal, A. Westerski, E. Martínez- Cámara, M. A. García- Cumbreras, M. T. Martín- Valdivia, and Ureña- López L. A TASS- Workshop on Sentiment Analysis at SEPLN. Procesamiento de Lenguaje Natural, 50.
Sentiment analysis on tweets in a financial domain
Sentiment analysis on tweets in a financial domain Jasmina Smailović 1,2, Miha Grčar 1, Martin Žnidaršič 1 1 Dept of Knowledge Technologies, Jožef Stefan Institute, Ljubljana, Slovenia 2 Jožef Stefan International
More informationData Mining Yelp Data - Predicting rating stars from review text
Data Mining Yelp Data - Predicting rating stars from review text Rakesh Chada Stony Brook University rchada@cs.stonybrook.edu Chetan Naik Stony Brook University cnaik@cs.stonybrook.edu ABSTRACT The majority
More informationSentiment Analysis. D. Skrepetos 1. University of Waterloo. NLP Presenation, 06/17/2015
Sentiment Analysis D. Skrepetos 1 1 Department of Computer Science University of Waterloo NLP Presenation, 06/17/2015 D. Skrepetos (University of Waterloo) Sentiment Analysis NLP Presenation, 06/17/2015
More informationS-Sense: A Sentiment Analysis Framework for Social Media Sensing
S-Sense: A Sentiment Analysis Framework for Social Media Sensing Choochart Haruechaiyasak, Alisa Kongthon, Pornpimon Palingoon and Kanokorn Trakultaweekoon Speech and Audio Technology Laboratory (SPT)
More informationMicroblog Sentiment Analysis with Emoticon Space Model
Microblog Sentiment Analysis with Emoticon Space Model Fei Jiang, Yiqun Liu, Huanbo Luan, Min Zhang, and Shaoping Ma State Key Laboratory of Intelligent Technology and Systems, Tsinghua National Laboratory
More informationSentiment analysis: towards a tool for analysing real-time students feedback
Sentiment analysis: towards a tool for analysing real-time students feedback Nabeela Altrabsheh Email: nabeela.altrabsheh@port.ac.uk Mihaela Cocea Email: mihaela.cocea@port.ac.uk Sanaz Fallahkhair Email:
More informationRobust Sentiment Detection on Twitter from Biased and Noisy Data
Robust Sentiment Detection on Twitter from Biased and Noisy Data Luciano Barbosa AT&T Labs - Research lbarbosa@research.att.com Junlan Feng AT&T Labs - Research junlan@research.att.com Abstract In this
More informationSemantic Sentiment Analysis of Twitter
Semantic Sentiment Analysis of Twitter Hassan Saif, Yulan He & Harith Alani Knowledge Media Institute, The Open University, Milton Keynes, United Kingdom The 11 th International Semantic Web Conference
More informationEmoticon Smoothed Language Models for Twitter Sentiment Analysis
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence Emoticon Smoothed Language Models for Twitter Sentiment Analysis Kun-Lin Liu, Wu-Jun Li, Minyi Guo Shanghai Key Laboratory of
More informationImpact of Financial News Headline and Content to Market Sentiment
International Journal of Machine Learning and Computing, Vol. 4, No. 3, June 2014 Impact of Financial News Headline and Content to Market Sentiment Tan Li Im, Phang Wai San, Chin Kim On, Rayner Alfred,
More informationSentiment Analysis of Movie Reviews and Twitter Statuses. Introduction
Sentiment Analysis of Movie Reviews and Twitter Statuses Introduction Sentiment analysis is the task of identifying whether the opinion expressed in a text is positive or negative in general, or about
More informationEnd-to-End Sentiment Analysis of Twitter Data
End-to-End Sentiment Analysis of Twitter Data Apoor v Agarwal 1 Jasneet Singh Sabharwal 2 (1) Columbia University, NY, U.S.A. (2) Guru Gobind Singh Indraprastha University, New Delhi, India apoorv@cs.columbia.edu,
More informationItalian Journal of Accounting and Economia Aziendale. International Area. Year CXIV - 2014 - n. 1, 2 e 3
Italian Journal of Accounting and Economia Aziendale International Area Year CXIV - 2014 - n. 1, 2 e 3 Could we make better prediction of stock market indicators through Twitter sentiment analysis? ALEXANDER
More informationKea: Expression-level Sentiment Analysis from Twitter Data
Kea: Expression-level Sentiment Analysis from Twitter Data Ameeta Agrawal Computer Science and Engineering York University Toronto, Canada ameeta@cse.yorku.ca Aijun An Computer Science and Engineering
More informationUsing Text and Data Mining Techniques to extract Stock Market Sentiment from Live News Streams
2012 International Conference on Computer Technology and Science (ICCTS 2012) IPCSIT vol. XX (2012) (2012) IACSIT Press, Singapore Using Text and Data Mining Techniques to extract Stock Market Sentiment
More informationA Comparative Study on Sentiment Classification and Ranking on Product Reviews
A Comparative Study on Sentiment Classification and Ranking on Product Reviews C.EMELDA Research Scholar, PG and Research Department of Computer Science, Nehru Memorial College, Putthanampatti, Bharathidasan
More informationMLg. Big Data and Its Implication to Research Methodologies and Funding. Cornelia Caragea TARDIS 2014. November 7, 2014. Machine Learning Group
Big Data and Its Implication to Research Methodologies and Funding Cornelia Caragea TARDIS 2014 November 7, 2014 UNT Computer Science and Engineering Data Everywhere Lots of data is being collected and
More informationSentiment Analysis: a case study. Giuseppe Castellucci castellucci@ing.uniroma2.it
Sentiment Analysis: a case study Giuseppe Castellucci castellucci@ing.uniroma2.it Web Mining & Retrieval a.a. 2013/2014 Outline Sentiment Analysis overview Brand Reputation Sentiment Analysis in Twitter
More informationSentiment analysis of Twitter microblogging posts. Jasmina Smailović Jožef Stefan Institute Department of Knowledge Technologies
Sentiment analysis of Twitter microblogging posts Jasmina Smailović Jožef Stefan Institute Department of Knowledge Technologies Introduction Popularity of microblogging services Twitter microblogging posts
More informationSentiment Analysis for Movie Reviews
Sentiment Analysis for Movie Reviews Ankit Goyal, a3goyal@ucsd.edu Amey Parulekar, aparulek@ucsd.edu Introduction: Movie reviews are an important way to gauge the performance of a movie. While providing
More informationRRSS - Rating Reviews Support System purpose built for movies recommendation
RRSS - Rating Reviews Support System purpose built for movies recommendation Grzegorz Dziczkowski 1,2 and Katarzyna Wegrzyn-Wolska 1 1 Ecole Superieur d Ingenieurs en Informatique et Genie des Telecommunicatiom
More informationEFFICIENTLY PROVIDE SENTIMENT ANALYSIS DATA SETS USING EXPRESSIONS SUPPORT METHOD
EFFICIENTLY PROVIDE SENTIMENT ANALYSIS DATA SETS USING EXPRESSIONS SUPPORT METHOD 1 Josephine Nancy.C, 2 K Raja. 1 PG scholar,department of Computer Science, Tagore Institute of Engineering and Technology,
More informationSentiment analysis on news articles using Natural Language Processing and Machine Learning Approach.
Sentiment analysis on news articles using Natural Language Processing and Machine Learning Approach. Pranali Chilekar 1, Swati Ubale 2, Pragati Sonkambale 3, Reema Panarkar 4, Gopal Upadhye 5 1 2 3 4 5
More informationCSE 598 Project Report: Comparison of Sentiment Aggregation Techniques
CSE 598 Project Report: Comparison of Sentiment Aggregation Techniques Chris MacLellan cjmaclel@asu.edu May 3, 2012 Abstract Different methods for aggregating twitter sentiment data are proposed and three
More informationApproaches for Sentiment Analysis on Twitter: A State-of-Art study
Approaches for Sentiment Analysis on Twitter: A State-of-Art study Harsh Thakkar and Dhiren Patel Department of Computer Engineering, National Institute of Technology, Surat-395007, India {harsh9t,dhiren29p}@gmail.com
More informationMultilanguage sentiment-analysis of Twitter data on the example of Swiss politicians
Multilanguage sentiment-analysis of Twitter data on the example of Swiss politicians Lucas Brönnimann University of Applied Science Northwestern Switzerland, CH-5210 Windisch, Switzerland Email: lucas.broennimann@students.fhnw.ch
More informationWILL TWITTER MAKE YOU A BETTER INVESTOR? A LOOK AT SENTIMENT, USER REPUTATION AND THEIR EFFECT ON THE STOCK MARKET
WILL TWITTER MAKE YOU A BETTER INVESTOR? A LOOK AT SENTIMENT, USER REPUTATION AND THEIR EFFECT ON THE STOCK MARKET ABSTRACT Eric D. Brown Dakota State University edbrown@dsu.edu The use of social networks
More informationTwitter sentiment vs. Stock price!
Twitter sentiment vs. Stock price! Background! On April 24 th 2013, the Twitter account belonging to Associated Press was hacked. Fake posts about the Whitehouse being bombed and the President being injured
More informationThe multilayer sentiment analysis model based on Random forest Wei Liu1, Jie Zhang2
2nd International Conference on Advances in Mechanical Engineering and Industrial Informatics (AMEII 2016) The multilayer sentiment analysis model based on Random forest Wei Liu1, Jie Zhang2 1 School of
More informationFEATURE SELECTION AND CLASSIFICATION APPROACH FOR SENTIMENT ANALYSIS
FEATURE SELECTION AND CLASSIFICATION APPROACH FOR SENTIMENT ANALYSIS Gautami Tripathi 1 and Naganna S. 2 1 PG Scholar, School of Computing Science and Engineering, Galgotias University, Greater Noida,
More informationImproving Twitter Sentiment Analysis with Topic-Based Mixture Modeling and Semi-Supervised Training
Improving Twitter Sentiment Analysis with Topic-Based Mixture Modeling and Semi-Supervised Training Bing Xiang * IBM Watson 1101 Kitchawan Rd Yorktown Heights, NY 10598, USA bingxia@us.ibm.com Liang Zhou
More informationUsing Twitter as a source of information for stock market prediction
Using Twitter as a source of information for stock market prediction Ramon Xuriguera (rxuriguera@lsi.upc.edu) Joint work with Marta Arias and Argimiro Arratia ERCIM 2011, 17-19 Dec. 2011, University of
More informationVCU-TSA at Semeval-2016 Task 4: Sentiment Analysis in Twitter
VCU-TSA at Semeval-2016 Task 4: Sentiment Analysis in Twitter Gerard Briones and Kasun Amarasinghe and Bridget T. McInnes, PhD. Department of Computer Science Virginia Commonwealth University Richmond,
More informationText Opinion Mining to Analyze News for Stock Market Prediction
Int. J. Advance. Soft Comput. Appl., Vol. 6, No. 1, March 2014 ISSN 2074-8523; Copyright SCRG Publication, 2014 Text Opinion Mining to Analyze News for Stock Market Prediction Yoosin Kim 1, Seung Ryul
More informationSentiment Lexicons for Arabic Social Media
Sentiment Lexicons for Arabic Social Media Saif M. Mohammad 1, Mohammad Salameh 2, Svetlana Kiritchenko 1 1 National Research Council Canada, 2 University of Alberta saif.mohammad@nrc-cnrc.gc.ca, msalameh@ualberta.ca,
More informationMachine Learning for natural language processing
Machine Learning for natural language processing Introduction Laura Kallmeyer Heinrich-Heine-Universität Düsseldorf Summer 2016 1 / 13 Introduction Goal of machine learning: Automatically learn how to
More informationUsing Social Media for Continuous Monitoring and Mining of Consumer Behaviour
Using Social Media for Continuous Monitoring and Mining of Consumer Behaviour Michail Salampasis 1, Giorgos Paltoglou 2, Anastasia Giahanou 1 1 Department of Informatics, Alexander Technological Educational
More informationEfficient Techniques for Improved Data Classification and POS Tagging by Monitoring Extraction, Pruning and Updating of Unknown Foreign Words
, pp.290-295 http://dx.doi.org/10.14257/astl.2015.111.55 Efficient Techniques for Improved Data Classification and POS Tagging by Monitoring Extraction, Pruning and Updating of Unknown Foreign Words Irfan
More informationQuantifying Political Legitimacy from Twitter
Quantifying Political Legitimacy from Twitter Haibin Liu and Dongwon Lee College of Information Sciences and Technology The Pennsylvania State University, University Park, PA {haibin,dongwon}@psu.edu Abstract.
More informationTOOL OF THE INTELLIGENCE ECONOMIC: RECOGNITION FUNCTION OF REVIEWS CRITICS. Extraction and linguistic analysis of sentiments
TOOL OF THE INTELLIGENCE ECONOMIC: RECOGNITION FUNCTION OF REVIEWS CRITICS. Extraction and linguistic analysis of sentiments Grzegorz Dziczkowski, Katarzyna Wegrzyn-Wolska Ecole Superieur d Ingenieurs
More informationExploiting Topic based Twitter Sentiment for Stock Prediction
Exploiting Topic based Twitter Sentiment for Stock Prediction Jianfeng Si * Arjun Mukherjee Bing Liu Qing Li * Huayi Li Xiaotie Deng * Department of Computer Science, City University of Hong Kong, Hong
More informationSentiment Analysis Tool using Machine Learning Algorithms
Sentiment Analysis Tool using Machine Learning Algorithms I.Hemalatha 1, Dr. G. P Saradhi Varma 2, Dr. A.Govardhan 3 1 Research Scholar JNT University Kakinada, Kakinada, A.P., INDIA 2 Professor & Head,
More informationText Mining for Sentiment Analysis of Twitter Data
Text Mining for Sentiment Analysis of Twitter Data Shruti Wakade, Chandra Shekar, Kathy J. Liszka and Chien-Chung Chan The University of Akron Department of Computer Science liszka@uakron.edu, chan@uakron.edu
More informationListening to the Crowd: Automated Analysis of Events via Aggregated Twitter Sentiment
Listening to the Crowd: Automated Analysis of Events via Aggregated Twitter Sentiment Yuheng Hu Fei Wang Subbarao Kambhampati Department of Computer Science, Arizona State University, Tempe, AZ 8581 IBM
More informationFine-grained German Sentiment Analysis on Social Media
Fine-grained German Sentiment Analysis on Social Media Saeedeh Momtazi Information Systems Hasso-Plattner-Institut Potsdam University, Germany Saeedeh.momtazi@hpi.uni-potsdam.de Abstract Expressing opinions
More informationNews Sentiment Analysis Using R to Predict Stock Market Trends
News Sentiment Analysis Using R to Predict Stock Market Trends Anurag Nagar and Michael Hahsler Computer Science Southern Methodist University Dallas, TX Topics Motivation Gathering News Creating News
More informationA Sentiment Detection Engine for Internet Stock Message Boards
A Sentiment Detection Engine for Internet Stock Message Boards Christopher C. Chua Maria Milosavljevic James R. Curran School of Computer Science Capital Markets CRC Ltd School of Information and Engineering
More informationParticular Requirements on Opinion Mining for the Insurance Business
Particular Requirements on Opinion Mining for the Insurance Business Sven Rill, Johannes Drescher, Dirk Reinel, Jörg Scheidt, Florian Wogenstein Institute of Information Systems (iisys) University of Applied
More informationKeyphrase Extraction for Scholarly Big Data
Keyphrase Extraction for Scholarly Big Data Cornelia Caragea Computer Science and Engineering University of North Texas July 10, 2015 Scholarly Big Data Large number of scholarly documents on the Web PubMed
More informationSENTIMENT ANALYSIS: A STUDY ON PRODUCT FEATURES
University of Nebraska - Lincoln DigitalCommons@University of Nebraska - Lincoln Dissertations and Theses from the College of Business Administration Business Administration, College of 4-1-2012 SENTIMENT
More informationSENTIMENT EXTRACTION FROM NATURAL AUDIO STREAMS. Lakshmish Kaushik, Abhijeet Sangwan, John H. L. Hansen
SENTIMENT EXTRACTION FROM NATURAL AUDIO STREAMS Lakshmish Kaushik, Abhijeet Sangwan, John H. L. Hansen Center for Robust Speech Systems (CRSS), Eric Jonsson School of Engineering, The University of Texas
More informationTASS - Workshop on Sentiment Analysis at SEPLN
Procesamiento del Lenguaje Natural, Revista nº 50 marzo de 2013, pp 37-44 recibido 28-11-12 revisado 14-02-13 aceptado 19-02-13 TASS - Workshop on Sentiment Analysis at SEPLN TASS - Taller de Análisis
More informationSentiment Classification. in a Nutshell. Cem Akkaya, Xiaonan Zhang
Sentiment Classification in a Nutshell Cem Akkaya, Xiaonan Zhang Outline Problem Definition Level of Classification Evaluation Mainstream Method Conclusion Problem Definition Sentiment is the overall emotion,
More informationTwitter Stock Bot. John Matthew Fong The University of Texas at Austin jmfong@cs.utexas.edu
Twitter Stock Bot John Matthew Fong The University of Texas at Austin jmfong@cs.utexas.edu Hassaan Markhiani The University of Texas at Austin hassaan@cs.utexas.edu Abstract The stock market is influenced
More informationCS 229, Autumn 2011 Modeling the Stock Market Using Twitter Sentiment Analysis
CS 229, Autumn 2011 Modeling the Stock Market Using Twitter Sentiment Analysis Team members: Daniel Debbini, Philippe Estin, Maxime Goutagny Supervisor: Mihai Surdeanu (with John Bauer) 1 Introduction
More informationProbabilistic topic models for sentiment analysis on the Web
University of Exeter Department of Computer Science Probabilistic topic models for sentiment analysis on the Web Chenghua Lin September 2011 Submitted by Chenghua Lin, to the the University of Exeter as
More informationA Survey on Product Aspect Ranking Techniques
A Survey on Product Aspect Ranking Techniques Ancy. J. S, Nisha. J.R P.G. Scholar, Dept. of C.S.E., Marian Engineering College, Kerala University, Trivandrum, India. Asst. Professor, Dept. of C.S.E., Marian
More informationBagged Ensemble Classifiers for Sentiment Classification of Movie Reviews
www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume 3 Issue 2 February, 2014 Page No. 3951-3961 Bagged Ensemble Classifiers for Sentiment Classification of Movie
More informationSentiment Analysis of Twitter Data
Sentiment Analysis of Twitter Data Apoorv Agarwal Boyi Xie Ilia Vovsha Owen Rambow Rebecca Passonneau Department of Computer Science Columbia University New York, NY 10027 USA {apoorv@cs, xie@cs, iv2121@,
More informationPositive or negative? Using blogs to assess vehicles features
Positive or negative? Using blogs to assess vehicles features Silvio S Ribeiro Jr. 1, Zilton Junior 1, Wagner Meira Jr. 1, Gisele L. Pappa 1 1 Departamento de Ciência da Computação Universidade Federal
More informationAnalysis of Tweets for Prediction of Indian Stock Markets
Analysis of Tweets for Prediction of Indian Stock Markets Phillip Tichaona Sumbureru Department of Computer Science and Engineering, JNTU College of Engineering Hyderabad, Kukatpally, Hyderabad-500 085,
More informationTwitter Analytics for Insider Trading Fraud Detection
Twitter Analytics for Insider Trading Fraud Detection W-J Ketty Gann, John Day, Shujia Zhou Information Systems Northrop Grumman Corporation, Annapolis Junction, MD, USA {wan-ju.gann, john.day2, shujia.zhou}@ngc.com
More informationHow To Write A Summary Of A Review
PRODUCT REVIEW RANKING SUMMARIZATION N.P.Vadivukkarasi, Research Scholar, Department of Computer Science, Kongu Arts and Science College, Erode. Dr. B. Jayanthi M.C.A., M.Phil., Ph.D., Associate Professor,
More informationAn Introduction to Data Mining
An Introduction to Intel Beijing wei.heng@intel.com January 17, 2014 Outline 1 DW Overview What is Notable Application of Conference, Software and Applications Major Process in 2 Major Tasks in Detail
More informationEuropean Parliament elections on Twitter
Analysis of Twitter feeds 6 June 2014 Outline The goal of our project is to investigate any possible relations between public support towards two Polish major political parties - Platforma Obywatelska
More informationConstructing Social Intentional Corpora to Predict Click-Through Rate for Search Advertising
Constructing Social Intentional Corpora to Predict Click-Through Rate for Search Advertising Yi-Ting Chen, Hung-Yu Kao Department of Computer Science and Information Engineering National Cheng Kung University
More informationDesigning Novel Review Ranking Systems: Predicting Usefulness and Impact of Reviews
Designing Novel Review Ranking Systems: Predicting Usefulness and Impact of Reviews Anindya Ghose aghose@stern.nyu.edu Panagiotis G. Ipeirotis panos@nyu.edu Department of Information, Operations, and Management
More informationSI485i : NLP. Set 6 Sentiment and Opinions
SI485i : NLP Set 6 Sentiment and Opinions It's about finding out what people think... Can be big business Someone who wants to buy a camera Looks for reviews online Someone who just bought a camera Writes
More informationBridging CAQDAS with text mining: Text analyst s toolbox for Big Data: Science in the Media Project
Bridging CAQDAS with text mining: Text analyst s toolbox for Big Data: Science in the Media Project Ahmet Suerdem Istanbul Bilgi University; LSE Methodology Dept. Science in the media project is funded
More informationKeywords social media, internet, data, sentiment analysis, opinion mining, business
Volume 5, Issue 8, August 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Real time Extraction
More informationThe Open University s repository of research publications and other research outputs
Open Research Online The Open University s repository of research publications and other research outputs Alleviating data sparsity for Twitter sentiment analysis Conference Item How to cite: Saif, Hassan;
More informationSENTIMENT ANALYSIS: TEXT PRE-PROCESSING, READER VIEWS AND CROSS DOMAINS EMMA HADDI BRUNEL UNIVERSITY LONDON
BRUNEL UNIVERSITY LONDON COLLEGE OF ENGINEERING, DESIGN AND PHYSICAL SCIENCES DEPARTMENT OF COMPUTER SCIENCE DOCTOR OF PHILOSOPHY DISSERTATION SENTIMENT ANALYSIS: TEXT PRE-PROCESSING, READER VIEWS AND
More informationThe Italian Hate Map:
I-CiTies 2015 2015 CINI Annual Workshop on ICT for Smart Cities and Communities Palermo (Italy) - October 29-30, 2015 The Italian Hate Map: semantic content analytics for social good (Università degli
More informationBenchmarking Twitter Sentiment Analysis Tools
Benchmarking Twitter Sentiment Analysis Tools Ahmed Abbasi, Ammar Hassan, Milan Dhar University of Virginia Charlottesville, Virginia, USA E-mail: abbasi@comm.virginia.edu, mah9tg@virginia.edu, msd4ah@virginia.edu
More informationDesigning Ranking Systems for Consumer Reviews: The Impact of Review Subjectivity on Product Sales and Review Quality
Designing Ranking Systems for Consumer Reviews: The Impact of Review Subjectivity on Product Sales and Review Quality Anindya Ghose, Panagiotis G. Ipeirotis {aghose, panos}@stern.nyu.edu Department of
More informationTwitter Sentiment Analysis of Movie Reviews using Machine Learning Techniques.
Twitter Sentiment Analysis of Movie Reviews using Machine Learning Techniques. Akshay Amolik, Niketan Jivane, Mahavir Bhandari, Dr.M.Venkatesan School of Computer Science and Engineering, VIT University,
More informationMing-Wei Chang. Machine learning and its applications to natural language processing, information retrieval and data mining.
Ming-Wei Chang 201 N Goodwin Ave, Department of Computer Science University of Illinois at Urbana-Champaign, Urbana, IL 61801 +1 (917) 345-6125 mchang21@uiuc.edu http://flake.cs.uiuc.edu/~mchang21 Research
More informationTowards SoMEST Combining Social Media Monitoring with Event Extraction and Timeline Analysis
Towards SoMEST Combining Social Media Monitoring with Event Extraction and Timeline Analysis Yue Dai, Ernest Arendarenko, Tuomo Kakkonen, Ding Liao School of Computing University of Eastern Finland {yvedai,
More informationSentiment Analysis Using Dependency Trees and Named-Entities
Proceedings of the Twenty-Seventh International Florida Artificial Intelligence Research Society Conference Sentiment Analysis Using Dependency Trees and Named-Entities Ugan Yasavur, Jorge Travieso, Christine
More informationDefect Analytics in a High-End Server Manufacturing Environment
Proceedings of the 2015 Industrial and Systems Engineering Research Conference S. Cetinkaya and J. Ryan, eds. Defect Analytics in a High-End Server Manufacturing Environment Faisal Aqlan Industrial Engineering
More informationMULTI AGENT BASED ASPECT RANKING REVIEW SUMMARIZATION FOR CUSTOMER GUIDANCE
MULTI AGENT BASED ASPECT RANKING REVIEW SUMMARIZATION FOR CUSTOMER GUIDANCE S. Sobitha Ahila 1 and K. L.Shunmuganathan 2 1 Department of Computer Science and Engineering, Easwari Engineering College, India
More informationCIRGIRDISCO at RepLab2014 Reputation Dimension Task: Using Wikipedia Graph Structure for Classifying the Reputation Dimension of a Tweet
CIRGIRDISCO at RepLab2014 Reputation Dimension Task: Using Wikipedia Graph Structure for Classifying the Reputation Dimension of a Tweet Muhammad Atif Qureshi 1,2, Arjumand Younus 1,2, Colm O Riordan 1,
More informationAn Insight Of Sentiment Analysis In The Financial News
Available online at www.globalilluminators.org GlobalIlluminators FULL PAPER PROCEEDING Multidisciplinary Studies Full Paper Proceeding ICMRP-2014, Vol. 1, 278-291 ISBN: 978-969-9948-08-4 ICMRP 2014 An
More informationHow To Learn From The Revolution
The Revolution Learning from : Text, Feelings and Machine Learning IT Management, CBS Supply Chain Leaders Forum 3 September 2015 The Revolution Learning from : Text, Feelings and Machine Learning Outline
More informationUsing an Emotion-based Model and Sentiment Analysis Techniques to Classify Polarity for Reputation
Using an Emotion-based Model and Sentiment Analysis Techniques to Classify Polarity for Reputation Jorge Carrillo de Albornoz, Irina Chugur, and Enrique Amigó Natural Language Processing and Information
More informationIdentifying Focus, Techniques and Domain of Scientific Papers
Identifying Focus, Techniques and Domain of Scientific Papers Sonal Gupta Department of Computer Science Stanford University Stanford, CA 94305 sonal@cs.stanford.edu Christopher D. Manning Department of
More informationIssues in Information Systems Volume 15, Issue II, pp. 350-358, 2014
AUTOMATED PLATFORM FOR AGGREGATION AND TOPICAL SENTIMENT ANALYSIS OF NEWS ARTICLES, BLOGS, AND OTHER ONLINE PUBLICATIONS Michael R. Grayson, Mercer University, michael.richard.grayson@live.mercer.edu Myungjae
More informationText Mining - Scope and Applications
Journal of Computer Science and Applications. ISSN 2231-1270 Volume 5, Number 2 (2013), pp. 51-55 International Research Publication House http://www.irphouse.com Text Mining - Scope and Applications Miss
More informationEmotion Detection in Code-switching Texts via Bilingual and Sentimental Information
Emotion Detection in Code-switching Texts via Bilingual and Sentimental Information Zhongqing Wang, Sophia Yat Mei Lee, Shoushan Li, and Guodong Zhou Natural Language Processing Lab, Soochow University,
More informationBing Liu. Web Data Mining. Exploring Hyperlinks, Contents, and Usage Data. With 177 Figures. ~ Spring~r
Bing Liu Web Data Mining Exploring Hyperlinks, Contents, and Usage Data With 177 Figures ~ Spring~r Table of Contents 1. Introduction.. 1 1.1. What is the World Wide Web? 1 1.2. ABrief History of the Web
More informationForecasting stock markets with Twitter
Forecasting stock markets with Twitter Argimiro Arratia argimiro@lsi.upc.edu Joint work with Marta Arias and Ramón Xuriguera To appear in: ACM Transactions on Intelligent Systems and Technology, 2013,
More informationII. RELATED WORK. Sentiment Mining
Sentiment Mining Using Ensemble Classification Models Matthew Whitehead and Larry Yaeger Indiana University School of Informatics 901 E. 10th St. Bloomington, IN 47408 {mewhiteh, larryy}@indiana.edu Abstract
More informationCan Twitter provide enough information for predicting the stock market?
Can Twitter provide enough information for predicting the stock market? Maria Dolores Priego Porcuna Introduction Nowadays a huge percentage of financial companies are investing a lot of money on Social
More informationSemi-Supervised Learning for Blog Classification
Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence (2008) Semi-Supervised Learning for Blog Classification Daisuke Ikeda Department of Computational Intelligence and Systems Science,
More informationCan Twitter Predict Royal Baby's Name?
Summary Can Twitter Predict Royal Baby's Name? Bohdan Pavlyshenko Ivan Franko Lviv National University,Ukraine, b.pavlyshenko@gmail.com In this paper, we analyze the existence of possible correlation between
More informationTweets Miner for Stock Market Analysis
Tweets Miner for Stock Market Analysis Bohdan Pavlyshenko Electronics department, Ivan Franko Lviv National University,Ukraine, Drahomanov Str. 50, Lviv, 79005, Ukraine, e-mail: b.pavlyshenko@gmail.com
More informationChallenges of Cloud Scale Natural Language Processing
Challenges of Cloud Scale Natural Language Processing Mark Dredze Johns Hopkins University My Interests? Information Expressed in Human Language Machine Learning Natural Language Processing Intelligent
More informationONLINE RESUME PARSING SYSTEM USING TEXT ANALYTICS
ONLINE RESUME PARSING SYSTEM USING TEXT ANALYTICS Divyanshu Chandola 1, Aditya Garg 2, Ankit Maurya 3, Amit Kushwaha 4 1 Student, Department of Information Technology, ABES Engineering College, Uttar Pradesh,
More informationNeuro-Fuzzy Classification Techniques for Sentiment Analysis using Intelligent Agents on Twitter Data
International Journal of Innovation and Scientific Research ISSN 2351-8014 Vol. 23 No. 2 May 2016, pp. 356-360 2015 Innovative Space of Scientific Research Journals http://www.ijisr.issr-journals.org/
More informationDomain Classification of Technical Terms Using the Web
Systems and Computers in Japan, Vol. 38, No. 14, 2007 Translated from Denshi Joho Tsushin Gakkai Ronbunshi, Vol. J89-D, No. 11, November 2006, pp. 2470 2482 Domain Classification of Technical Terms Using
More information