Classification of Student's E-Learning Experiences in Social Media via Text Mining
IOSR Journal of Computer Engineering (IOSR-JCE), e-ISSN: , p-ISSN: , Volume 17, Issue 3, Ver. VI (May-Jun. 2015), PP

Classification of Student's E-Learning Experiences in Social Media via Text Mining

Ms. Priyanka Patel 1, Ms. Khushali Mistry 2
1 (Department of CSE, PIET, Vadodara, India)
2 (Department of CSE, PIET, Vadodara, India)

Abstract: In today's world, social media is used by every individual to express feelings, opinions, experiences and emotions. Data mining can be applied to these emotions as expressed in posts, comments and likes, collectively called social media data. In the existing system, only the prominent themes with a relatively large number of tweets in the data are identified. A variety of other issues remain hidden in the "Others" theme, and several of them may be of great interest to education researchers and practitioners. The fact that the most relevant data found on engineering students' learning experiences involve complaints, issues and problems does not mean that there are no positive sides to students' learning experiences. It may instead indicate that social media serve as a good venue for students to vent negative emotions and seek social support. In the proposed work, a new label, "Good Things", is introduced, and the common keywords for this label are selected using their probability. In this way, the proposed system identifies and classifies the e-learning problems faced by students, together with their good and positive comments, in order to improve education quality. A Naive Bayes multilabel classifier is used to classify experiences by computing, for each category, the probability of the words in a tweet, the probability of each label, and how many users fall under each label. Finally, the tweets with the new label are compared with the rest of the tweets under the existing labels.

Keywords: Classification, social media, multilabel, learning experiences, text mining.

I. Introduction

In today's world we use websites for social networking, education, marketing, entertainment, business, shopping and many other things that make life easier. Nowadays, the popularity of social media is growing rapidly among individuals of all kinds. Youngsters are the most common users, and most of them are students. Students comment, share, like and post their feelings on social media such as Twitter, Facebook and YouTube. They feel free to discuss and share their experiences there in an informal and casual manner, without paying attention to spelling or accurate grammar. Social media data are therefore unstructured.

Social media provide a great deal of useful knowledge and information about students' emotions, feelings, experiences and struggles in their studies outside the classroom. Following students' traces on social media therefore offers an interesting perspective for educational researchers and practitioners who want to understand student learning experiences outside the classroom. This understanding can uncover many experiences that were not raised or considered during classroom discussion. It also provides useful data for supporting students in decision making and for enhancing education quality, training and placement, retention and achievement.

The amount of social media data provides opportunities to understand students' experiences, but there are methodological difficulties in using social media data for educational purposes. In classroom studies, surveys, reviews, group discussions, interviews and counselling were used to learn each student's point of view. The innovative concept of using social media data focuses on extracting the information and knowledge required for educational purposes by understanding students' experiences.
Social media data such as students' comments, posts and feelings are used to understand students' learning experiences, with the following research goals: 1) to find and classify students' problems in their learning; 2) to track students' good or bad experiences. Mining social media data such as students' emotions makes it possible to group students according to their experiences and to identify the problems that must be solved to improve education quality. Several text mining techniques are used to mine textual patterns across social media. Social websites such as Facebook, Twitter and YouTube give many opportunities to establish communication between people, leading to collaborative learning and the circulation of important knowledge through comments, posts, emotions and likes.
Text mining is a knowledge discovery technique that provides computational intelligence. It draws on multidisciplinary fields such as information retrieval, text analysis, natural language processing, and information classification based on logical and non-trivial patterns from very large data sets. Users do not care about spelling mistakes and grammar rules. Two approaches, classification and clustering, have been used extensively to analyse the unstructured text available on large-scale systems such as social media.

1.1 Problem Definition

Educational researchers have been using traditional methods such as surveys, interviews, focus groups and classroom activities to collect data related to students' learning experiences. These methods scale poorly. When asked about their experiences, students need to reflect on what they were thinking and doing sometime in the past, which may have become obscured over time. No research was found that directly mines and analyses student-posted content from uncontrolled spaces on the social web, taking students' problems into account, with the clear goal of understanding students' learning experiences. The existing work has not measured student academic performance to identify students' problems and classify them accurately for enhancing e-learning experiences.

1.2 Motivation

Students are often shy or afraid of raising their problems in the classroom, and social media helps them post whatever they feel at that moment. Schools and departments have been struggling with student recruitment and retention issues. Graduates play an important role in the nation's future workforce, which directly affects the nation's economic growth and global competency.
Using students' learning experiences to enhance e-learning is an innovative way to improve the training or teaching style, so that students can be corrected when required without continuous counselling or surveying. Based on an understanding of the issues and problems in students' lives, policymakers and educators can make more informed decisions on proper interventions and services that can help students overcome barriers in learning [1]. Once students have been classified, they can be trained accordingly and education quality can be improved. Collecting students' learning experiences from social media also saves the time needed to collect the data manually.

1.3 Objective

1) To separate out meaningful information from the label Others.
2) To classify students based on the content they share on social media.
3) To understand the issues and problems students encounter in their learning experiences.
4) To classify the new label Good Things for students, combined with their emotional aspects and performance grade.
5) To improve the probability of the labels and keywords for the newly introduced label.
6) To inform educational administrators, practitioners and other relevant decision makers so that they gain further understanding of students' college experiences.

II. Literature Survey

2.1 Social Media

With the growth of Internet communication technologies, the World Wide Web has become an extremely important platform for users to interact with each other. Through this platform, users can easily share and spread information and ideas to anyone, anywhere. Along with people's interactions with each other through social networking websites, more and more data and diverse information have accumulated. Among all the information on these websites, some has been considered negative because it can be used to attack and to malign.
Twitter is an exceptionally popular microblogging site where users search for timely and social information such as breaking news, posts about celebrities, and trending topics. Users send short text messages called tweets, which are limited to 140 characters and can be viewed by the user's followers. A follower is a person who chooses to have another user's tweets posted on their own timeline. Twitter has been used as a channel for real-time information dissemination, and it has been used in brand campaigns, in elections, and as a news medium.

2.2 Naive Bayes Classification

The Naive Bayes classifier is based on Bayes' theorem and is particularly well suited to cases where the dimensionality of the inputs is high. It often outperforms more sophisticated classification methods, yet it is simple to operate. Maximum likelihood is used to estimate all the parameters of the model, and estimating them requires only a small amount of training data. It works well and efficiently in supervised learning. Bayesian analysis uses the following quantities:

Prior probability: a belief based on previous experience. It is the ratio of the number of objects of one class to the total number of objects.
Likelihood: used to decide to which class a new object belongs.
Posterior probability: the final classification is made by combining both sources of information, the prior and the likelihood, to form a posterior probability using Bayes' rule.

Posterior probability of X belonging to a class ∝ prior probability of the class × likelihood of X given the class.

The theorem above guides us in solving the conditional probability of contrary and independent events. The estimated probability for an opinion can be positive, negative or neutral. The method produces good results. The conclusion is that Naive Bayes performs well with high-dimensional, dependent features and regularly outperforms methods such as neural networks and decision trees. Nevertheless, standard maximum likelihood parameter learning for the Naive Bayes classifier tends to be suboptimal.

2.3 Text Mining

Data mining on social media data uncovers many hidden features of the social web, i.e. Twitter, Facebook and YouTube. Mining files that contain text is known as Intelligent Text Analysis, Knowledge Discovery in Text, or Text Data Mining, and it can be used to mine social media data. Social media data are mostly in an unstructured format, and retrieving information from them is complex because of the massive volume. Specific processing methods and algorithms are therefore required to extract useful information from social web data.
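The prior, likelihood and posterior relation described in Section 2.2 can be illustrated with a minimal Python sketch. The class names, the tweet counts and the likelihood values below are hypothetical numbers chosen only for illustration; they are not results from the paper.

```python
def posterior(priors, likelihoods):
    """Combine class priors with the likelihoods of one observation via Bayes' rule."""
    # Unnormalised posterior: prior * likelihood for each class.
    unnormalised = {c: priors[c] * likelihoods[c] for c in priors}
    # The evidence term normalises the values so that they sum to 1.
    evidence = sum(unnormalised.values())
    return {c: value / evidence for c, value in unnormalised.items()}


# Hypothetical priors: 60 of 90 training tweets are "problem", 30 are "good".
priors = {"problem": 60 / 90, "good": 30 / 90}
# Hypothetical likelihoods of observing the word "happy" in each class.
likelihoods = {"problem": 0.05, "good": 0.40}

post = posterior(priors, likelihoods)
predicted = max(post, key=post.get)
print(post, predicted)
```

Although "problem" has the larger prior in this toy setting, the much larger likelihood of the observed word under "good" dominates the posterior, which is the combination of the two sources of information that the section describes.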
2.4 Comparisons between Multiclass and Multilabel Classification

Table 2.1 Comparison between Multiclass and Multilabel classification

PURPOSE
  Multiclass: A classification task with more than two classes, but not at the same time.
  Multilabel: A classification task with more than two classes at the same time.
PROPERTIES
  Multiclass: It assumes that each sample is assigned to one and only one label.
  Multilabel: It predicts properties of a data point.
NATURE
  Multiclass: The classes are mutually exclusive.
  Multilabel: The labels are not mutually exclusive.
EXAMPLE
  Multiclass: A fruit can be either an apple or a mango, but not both at the same time.
  Multilabel: A text can belong to several categories at the same time.
NOTATION
  Multiclass: Examples: D = {x_1, ..., x_n}. Labels: L = {L_1, ..., L_m}. Each example is associated with one label: (x, l), l ∈ L.
  Multilabel: Examples: D = {x_1, ..., x_n}. Labels: L = {L_1, ..., L_m}. Each example is associated with a subset of labels: (x, S), S ⊆ L.

2.5 Study of Papers

1. Title: Understanding Customers Using Facebook Pages: Data Mining Users Feedback Using Text Analysis
   Authors: Hsin-Ying Wu, Kuan-Liang Liu and Charles Trappey
   Description: Aimed at start-up companies. The following steps are carried out in this paper: key phrase extraction and summarization; clustering algorithm: K-means; clustering quality evaluation: cohesion and separation.
   Publication and year: IEEE

2. Title: Mining Social Media Data for Understanding Students Learning Experiences
   Authors: Xin Chen, Mihaela Vorvoreanu, and Krishna Madhavan
   Description: Developed a workflow to integrate both qualitative analysis and large-scale data mining techniques. Used a multi-label classification algorithm to classify tweets reflecting students' problems.
   Publication and year: IEEE, 2013

3. Title: Content Matters: A study of hate groups detection based on social networks analysis and web mining
   Authors: I-Hsien Ting, Shyue-Liang Wang, Hsing-Miao Chi and Jyun-Sing Wu
   Description: Two groups: hate group and like group. Steps: Facebook data extraction; data pre-processing; social network structure feature extraction; content feature extraction by TF-IDF; group vectors.
   Publication and year: IEEE

4. Title: Recognizing User Identity in Twitter Social Networks via Text Mining
   Authors: Sara Keretna, Ahmad Hossny and Doug Creighton
   Description: They followed these steps: input: unlabeled tweets; feature extractor: a feature vector is extracted; classifier: predicts the identity of the user.
   Publication and year: IEEE

5. Title: Twitter Trending Topic Classification
   Authors: Kathy Lee, Diana Palsetia, Ramanathan Narayanan, Md. Mostofa Ali Patwary, Ankit Agrawal, and Alok Choudhary
   Description: They classify Twitter trending topics into 18 general categories such as sports, politics and technology. They experiment with 2 approaches for topic classification: (i) the well-known Bag-of-Words approach for text classification; (ii) network-based classification.
   Publication and year: IEEE, 2011

III. Proposed Work

3.1 Overview of Proposed Work

The proposed work focuses on the emotions students express on social sites, by which their posts can be classified into good and bad experiences. It is only a first step towards revealing actionable insights from student-generated content on social media in order to improve education quality. The proposed system focuses on engineering students' Twitter posts in order to understand the issues and problems in their educational experiences. First, a qualitative analysis was conducted on a sample of tweets related to engineering students' college life. The proposed system finds that engineering students encounter problems such as lack of social engagement, heavy study load, and sleep deprivation.
Based on these outcomes, the proposed system implements a multi-label classification algorithm to identify and organize tweets that reflect students' problems. The proposed system uses supervised learning to classify the tweets. First, it employs a well-known text classification technique called Naive Bayes (NB) [10]. In NB, a document is modelled by the presence and absence of particular words. A variation of NB is multinomial Naive Bayes (NBM), which considers the frequency of words and can be expressed as follows:

$$P(c \mid d) \propto P(c) \prod_{1 \le k \le n_d} P(t_k \mid c)$$

where P(c | d) is the probability of a document d being in class c, P(c) is the prior probability of a document occurring in class c, P(t_k | c) is the conditional probability of term t_k occurring in a document of class c, and n_d is the number of terms in d. A document d in this case is a trending-topic definition or the tweets associated with a tag such as #hashtag.

The proposed system consists of a sequence of steps for making sense of social media data for educational purposes, which overcomes the major limitations of both manual qualitative analysis and large-scale computational analysis of user-generated textual content. Because there were no pre-defined categories for the data, the proposed system first needed to explore what students were saying in their tweets.
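The multinomial Naive Bayes score described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names `train_nbm`, `log_score` and `classify` are hypothetical, the four training documents are invented, add-one (Laplace) smoothing is an assumption that the paper does not mention, and the sketch picks a single best class rather than a set of labels.

```python
import math
from collections import Counter, defaultdict


def train_nbm(docs):
    """docs: list of (tokens, label). Returns priors, per-class term counts, vocabulary."""
    class_docs = Counter(label for _, label in docs)
    term_counts = defaultdict(Counter)
    vocab = set()
    for tokens, label in docs:
        term_counts[label].update(tokens)
        vocab.update(tokens)
    priors = {c: n / len(docs) for c, n in class_docs.items()}
    return priors, term_counts, vocab


def log_score(tokens, c, priors, term_counts, vocab):
    """log P(c) + sum_k log P(t_k | c), with add-one smoothing (an assumption)."""
    total = sum(term_counts[c].values())
    score = math.log(priors[c])
    for t in tokens:
        if t not in vocab:
            continue  # terms never seen in training are ignored
        score += math.log((term_counts[c][t] + 1) / (total + len(vocab)))
    return score


def classify(tokens, priors, term_counts, vocab):
    """Return the class with the highest log-score."""
    return max(priors, key=lambda c: log_score(tokens, c, priors, term_counts, vocab))


# Invented toy training set; the labels reuse category names from the paper.
docs = [
    (["exam", "homework", "study"], "HeavyStudyLoad"),
    (["homework", "hour", "lab"], "HeavyStudyLoad"),
    (["sleep", "night", "coffee"], "SleepProblems"),
    (["happy", "proud", "fun"], "GoodThing"),
]
priors, term_counts, vocab = train_nbm(docs)
print(classify(["homework", "exam"], priors, term_counts, vocab))
print(classify(["proud", "happy"], priors, term_counts, vocab))
```

Working in log space avoids numerical underflow when many small probabilities are multiplied. A multi-label variant could instead train one binary classifier per label and keep every label whose score passes a threshold.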
Fig. 3.1 Architecture Diagram

As a consequence, the proposed system first conducted an inductive content analysis on the engineering Problems dataset. Inductive content analysis is a popular qualitative research method for manually analysing text content. The proposed system extends the understanding of students' experiences to the social and emotional aspects, based on their informal online conversations. These are important components of the learning experience that are much less emphasized and understood than academic performance. The existing system has the following five prominent themes, obtained by feature extraction:

Table 3.1 List of categories with their keywords [1]

CATEGORY: MOST PROBABLE KEYWORDS THAT OCCUR IN TWEETS

Heavy Study Load: hour, homework, exam, day, class, work, negtoken, problem, study, week, toomuch, all, lab, still, out, time, page, library, spend, today, long, school, due, engineer, already, disgusting

Lack of Social Engagement: negtoken, Friday, homework, out, study, work, weekend, life, class, engineer, exam, drink, break, Saturday, people, social, lab, spend, tonight, watch, game, miss, party, sunny, beautiful

Negative Emotion: hate, f***, shit, exam, negtoken, week, class, hell, engineer, suck, study, hour, homework, time, equate, FML, lab, sad, bad, day, feel, tire, damn, death, hard

Sleep Problems: sleep, hour, night, negtoken, bed, allnight, exam, homework, nap, coffee, time, study, more, work, class, dream, ladyengineer, late, week, day, long, morning, wake, awake, nosleep

Diversity Issues: girl, class, only, negtoken, guy, engineer, Asia, professor, speak, English, female, hot, kid, more, toomuch, walk, people, teach, understand, chick, China, foreign, out, white, black

In the proposed system, we used the six existing prominent themes or labels together with a new theme or label: GOODTHING. The most probable features extracted under the new label from analysing the tweets are as follows:
Table 3.2 Details of new label with keywords

CATEGORY: MOST PROBABLE KEYWORDS THAT OCCUR IN TWEETS

Others: All remaining words are considered to be in this category.

NOTE: Some useful words that belong under the new label GOODTHING were separated out by analysing the extracted tweets, and these form the keywords for the new label.

GoodThing: Advantage, proud, best, free, Congrats, creativity, skill, good, challenge, free, happy, early, like, love, relax, easy, fun, excite, progress, thank, god, lucky

3.2 Flow Chart of Proposed Flow

The flow chart contains the following steps, listed in the order in which they were extracted from the figure:

1) Twitter data extraction: select user
2) Collect particular user data
3) Stop word removal
4) Collect new label GOODTHING to classify with new keywords
5) Stemming
6) Pre-processed data
7) Multilabel classification
8) Each word probability in tweet
9) Calculate probability and non-probability for new label
10) Probability of tweet category
11) Probability of tweet under each label
12) Tweet with highest-probability label
13) Detection of learning experience shared by tweet

Figure 3.2 Flow chart of proposed work
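The pre-processing and labelling steps of the flow chart can be sketched as follows. This is a simplified illustration under stated assumptions rather than the system described in the paper: the stop-word list is a small invented sample, `stem` is a toy suffix stripper (a real system would more likely use an established stemmer such as Porter's), the keyword sets are short excerpts from Tables 3.1 and 3.2, and the score (the fraction of a tweet's tokens that hit a label's keyword list) is a stand-in for the Naive Bayes probabilities, not the paper's formula. All function names are hypothetical.

```python
import re

# Small invented stop-word sample; a real list would be much longer.
STOP_WORDS = {"a", "an", "the", "i", "is", "am", "my", "to", "of", "and", "for", "in", "so"}

# Short excerpts of the keyword lists in Tables 3.1 and 3.2.
LABEL_KEYWORDS = {
    "HeavyStudyLoad": {"hour", "homework", "exam", "study", "lab"},
    "SleepProblems": {"sleep", "night", "nap", "coffee", "awake"},
    "GoodThing": {"proud", "best", "happy", "love", "fun", "thank"},
}


def stem(word):
    """Toy suffix stripper, for illustration only."""
    for suffix in ("ing", "ed", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def preprocess(tweet):
    """Lowercase, tokenize, drop stop words, then stem."""
    tokens = re.findall(r"[a-z]+", tweet.lower())
    return [stem(t) for t in tokens if t not in STOP_WORDS]


def label_scores(tokens):
    """Fraction of tokens that appear in each label's keyword list."""
    if not tokens:
        return {label: 0.0 for label in LABEL_KEYWORDS}
    return {
        label: sum(t in keywords for t in tokens) / len(tokens)
        for label, keywords in LABEL_KEYWORDS.items()
    }


def assign_labels(tokens, threshold=0.0):
    """Multi-label decision: keep every label above the threshold, else 'Others'."""
    scores = label_scores(tokens)
    labels = [label for label, score in scores.items() if score > threshold]
    return labels or ["Others"]


tokens = preprocess("So happy and proud of my exams!")
scores = label_scores(tokens)
print(tokens, assign_labels(tokens), max(scores, key=scores.get))
```

In this toy run one tweet receives both HeavyStudyLoad and GoodThing, which is the multi-label behaviour the paper relies on, while a tweet that matches no keyword list falls back to Others.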
IV. Experimental Results

This section describes several experimental evaluations on student tweets extracted from Twitter with import.io into a dedicated dataset.

Existing system result

1st train data: number of tweets: 90, number of labels: 6, keywords: 215

Fig 4.1 Probability of keywords falling under each label (existing system)

Fig 4.2 Resulting category probability of keywords and labels (existing system)
Proposed system result

Fig 4.3 Improved probability of Goodthing from Others

Fig 4.4 1st train data: improved category probability of keywords and labels

Having evaluated the results of the proposed approach against the existing approach, we now present a comparative study of both systems.
Table 5.6 Comparison table of the proposed system (considering the new label Goodthing) and the existing system (Others)

Dataset: Real-time Twitter data, Purdue University only (implemented). 1st train data: 90 tweets, 215 keywords, real dataset. 2nd train data: 200 tweets, 215 keywords, real dataset.

Parameters: Probability of keywords. Probability of tweets under five labels. Probability of labels (category and non-category).

Labels:
  Proposed system: seven labels: HeavyStudyLoad, Lack of Social Engagement, Negative Emotion, SleepProblems, Diversity Issues, Others, Goodthing.
  Existing system: only five labels are considered, with the Others label as an extra case to catch unwanted tweets.

Extractor: import.io extractor (collects a large number of tweets monthly), Purdue University.

V. Conclusion

This research can be advantageous to researchers in learning analytics, educational data mining, and learning technologies. It provides a workflow for analysing social media data for educational purposes that overcomes the major limitations of both manual qualitative analysis and large-scale computational analysis of user-generated textual content. From the study of the various labels for classifying students' learning experiences, and of the ways students can fall under the different categories according to the analysed results, it is clear that before the proposed work many useful tweets were placed under Others and were not considered. These real tweets were collected and analysed, and the probability of their falling under Goodthing was computed. Using Naive Bayes probability rules, we classified the tweets to find the probability of the keywords and the probability of tweets under each label. The papers studied show that knowing only the sentiment of the tweets students share does not provide much actionable knowledge on relevant interventions and services for students.
In sentiment analysis, tweets can be classified as positive, negative or neutral, but it cannot be determined to which category they belong. The proposed system therefore uses a multi-label classification model, which allows a tweet to fall into several categories at the same time. The proposed system successfully improved the probability and the category probability of two labels: Others and Goodthing. We conclude that students post not only their bad experiences but also their good experiences on social media. We also used a new extractor, import.io, which helps in building the database and dataset and can be used as a web crawler.

References

[1]. Xin Chen, Mihaela Vorvoreanu, and Krishna Madhavan, Mining Social Media Data for Understanding Students Learning Experiences, IEEE Transactions on Learning Technologies, manuscript id 1, pp. 1-14,
[2]. Ajay Kumar Pal, Saurabh Pal, Classification Model of Prediction for Placement of Students, in I.J. Modern Education and Computer Science, pp , November
[3]. Hsin-Ying Wu, Kuan-Liang Liu and Charles Trappey, Understanding Customers Using Facebook Pages: Data Mining Users Feedback Using Text Analysis, IEEE 18th International Conference on Computer Supported Cooperative Work in Design, 2014, pp
[4]. I-Hsien Ting, Shyue-Liang Wang, Hsing-Miao Chi and Jyun-Sing Wu, Content Matters: A study of hate groups detection based on social networks analysis and web mining, IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, 2013, pp
[5]. Sara Keretna, Ahmad Hossny and Doug Creighton, Recognising User Identity in Twitter Social Networks via Text Mining, IEEE International Conference on Systems, Man, and Cybernetics, 2013, pp
[6]. Kathy Lee, Diana Palsetia, Ramanathan Narayanan, Md. Mostofa Ali Patwary, Ankit Agrawal, and Alok Choudhary, Twitter Trending Topic Classification, 11th IEEE International Conference on Data Mining Workshops, 2011, pp
[7]. N. Anitha, B. Anitha, S. Pradeepa, Sentiment Classification Approaches: A Review, in International Journal of Innovations in Engineering and Technology, vol. 3, Issue 1, pp , October
[8]. Nitin Pawar, Prof. S. K. Sonkar, An Approach Towards E-Learning Using SVM Classification Technique and Ranking Technique in Microblog Supported Classroom: A Survey, in International Journal of Emerging Technology and Advance Engineering, Volume 3, Issue 7, July
[9]. A. K. Santra, S. Jayasudha, Classification of Web Log Data to Identify Interested Users Using Naïve Bayesian Classification, in International Journal of Computer Science Issues, Vol. 9, Issue 1, pp , January
[10]. C. D. Manning, P. Raghavan, and H. Schütze, Introduction to Information Retrieval. New York, NY, USA: Cambridge University Press,
Social Media Mining. Data Mining Essentials
Introduction Data production rate has been increased dramatically (Big Data) and we are able store much more data than before E.g., purchase data, social media data, mobile phone data Businesses and customers
ISSN: 2321-7782 (Online) Volume 2, Issue 10, October 2014 International Journal of Advance Research in Computer Science and Management Studies
ISSN: 2321-7782 (Online) Volume 2, Issue 10, October 2014 International Journal of Advance Research in Computer Science and Management Studies Research Article / Survey Paper / Case Study Available online
Prediction of Heart Disease Using Naïve Bayes Algorithm
Prediction of Heart Disease Using Naïve Bayes Algorithm R.Karthiyayini 1, S.Chithaara 2 Assistant Professor, Department of computer Applications, Anna University, BIT campus, Tiruchirapalli, Tamilnadu,
Sentiment analysis of Twitter microblogging posts. Jasmina Smailović Jožef Stefan Institute Department of Knowledge Technologies
Sentiment analysis of Twitter microblogging posts Jasmina Smailović Jožef Stefan Institute Department of Knowledge Technologies Introduction Popularity of microblogging services Twitter microblogging posts
How To Solve The Kd Cup 2010 Challenge
A Lightweight Solution to the Educational Data Mining Challenge Kun Liu Yan Xing Faculty of Automation Guangdong University of Technology Guangzhou, 510090, China [email protected] [email protected]
Keywords social media, internet, data, sentiment analysis, opinion mining, business
Volume 5, Issue 8, August 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Real time Extraction
International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014
RESEARCH ARTICLE OPEN ACCESS A Survey of Data Mining: Concepts with Applications and its Future Scope Dr. Zubair Khan 1, Ashish Kumar 2, Sunny Kumar 3 M.Tech Research Scholar 2. Department of Computer
VCU-TSA at Semeval-2016 Task 4: Sentiment Analysis in Twitter
VCU-TSA at Semeval-2016 Task 4: Sentiment Analysis in Twitter Gerard Briones and Kasun Amarasinghe and Bridget T. McInnes, PhD. Department of Computer Science Virginia Commonwealth University Richmond,
Financial Trading System using Combination of Textual and Numerical Data
Financial Trading System using Combination of Textual and Numerical Data Shital N. Dange Computer Science Department, Walchand Institute of Rajesh V. Argiddi Assistant Prof. Computer Science Department,
Database Marketing, Business Intelligence and Knowledge Discovery
Database Marketing, Business Intelligence and Knowledge Discovery Note: Using material from Tan / Steinbach / Kumar (2005) Introduction to Data Mining,, Addison Wesley; and Cios / Pedrycz / Swiniarski
Role of Social Networking in Marketing using Data Mining
Role of Social Networking in Marketing using Data Mining Mrs. Saroj Junghare Astt. Professor, Department of Computer Science and Application St. Aloysius College, Jabalpur, Madhya Pradesh, India Abstract:
A Comparative Study on Sentiment Classification and Ranking on Product Reviews
A Comparative Study on Sentiment Classification and Ranking on Product Reviews C.EMELDA Research Scholar, PG and Research Department of Computer Science, Nehru Memorial College, Putthanampatti, Bharathidasan
Sentiment Analysis on Big Data
SPAN White Paper!? Sentiment Analysis on Big Data Machine Learning Approach Several sources on the web provide deep insight about people s opinions on the products and services of various companies. Social
Performance Analysis of Naive Bayes and J48 Classification Algorithm for Data Classification
Performance Analysis of Naive Bayes and J48 Classification Algorithm for Data Classification Tina R. Patil, Mrs. S. S. Sherekar Sant Gadgebaba Amravati University, Amravati [email protected], [email protected]
Data Warehousing and Data Mining in Business Applications
133 Data Warehousing and Data Mining in Business Applications Eesha Goel CSE Deptt. GZS-PTU Campus, Bathinda. Abstract Information technology is now required in all aspect of our lives that helps in business
Spatio-Temporal Patterns of Passengers Interests at London Tube Stations
Spatio-Temporal Patterns of Passengers Interests at London Tube Stations Juntao Lai *1, Tao Cheng 1, Guy Lansley 2 1 SpaceTimeLab for Big Data Analytics, Department of Civil, Environmental &Geomatic Engineering,
Analysis of Tweets for Prediction of Indian Stock Markets
Analysis of Tweets for Prediction of Indian Stock Markets Phillip Tichaona Sumbureru Department of Computer Science and Engineering, JNTU College of Engineering Hyderabad, Kukatpally, Hyderabad-500 085,
Football Match Winner Prediction
Football Match Winner Prediction Kushal Gevaria 1, Harshal Sanghavi 2, Saurabh Vaidya 3, Prof. Khushali Deulkar 4 Department of Computer Engineering, Dwarkadas J. Sanghvi College of Engineering, Mumbai,
A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS
A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS Mrs. Jyoti Nawade 1, Dr. Balaji D 2, Mr. Pravin Nawade 3 1 Lecturer, JSPM S Bhivrabai Sawant Polytechnic, Pune (India) 2 Assistant
Impelling Heart Attack Prediction System using Data Mining and Artificial Neural Network
General Article International Journal of Current Engineering and Technology E-ISSN 2277 4106, P-ISSN 2347-5161 2014 INPRESSCO, All Rights Reserved Available at http://inpressco.com/category/ijcet Impelling
Applying Data Mining Techniques to Social Media Data for Analyzing the Student s Learning Experience
Applying Data Mining Techniques to Social Media Data for Analyzing the Student s Learning Experience GRADUATE PROJECT REPORT Submitted to the Faculty of The School of Engineering & Computing Sciences Texas
Blog Post Extraction Using Title Finding
Blog Post Extraction Using Title Finding Linhai Song 1, 2, Xueqi Cheng 1, Yan Guo 1, Bo Wu 1, 2, Yu Wang 1, 2 1 Institute of Computing Technology, Chinese Academy of Sciences, Beijing 2 Graduate School
Research on Sentiment Classification of Chinese Micro Blog Based on
Research on Sentiment Classification of Chinese Micro Blog Based on Machine Learning School of Economics and Management, Shenyang Ligong University, Shenyang, 110159, China E-mail: [email protected] Abstract
BIG DATA IN HEALTHCARE THE NEXT FRONTIER
BIG DATA IN HEALTHCARE THE NEXT FRONTIER Divyaa Krishna Sonnad 1, Dr. Jharna Majumdar 2 2 Dean R&D, Prof. and Head, 1,2 Dept of CSE (PG), Nitte Meenakshi Institute of Technology Abstract: The world of
Forecasting stock markets with Twitter
Forecasting stock markets with Twitter Argimiro Arratia [email protected] Joint work with Marta Arias and Ramón Xuriguera To appear in: ACM Transactions on Intelligent Systems and Technology, 2013,
IMAV: An Intelligent Multi-Agent Model Based on Cloud Computing for Resource Virtualization
2011 International Conference on Information and Electronics Engineering IPCSIT vol.6 (2011) (2011) IACSIT Press, Singapore IMAV: An Intelligent Multi-Agent Model Based on Cloud Computing for Resource
Twitter Sentiment Analysis of Movie Reviews using Machine Learning Techniques.
Twitter Sentiment Analysis of Movie Reviews using Machine Learning Techniques. Akshay Amolik, Niketan Jivane, Mahavir Bhandari, Dr.M.Venkatesan School of Computer Science and Engineering, VIT University,
Using Data Mining for Mobile Communication Clustering and Characterization
Using Data Mining for Mobile Communication Clustering and Characterization A. Bascacov *, C. Cernazanu ** and M. Marcu ** * Lasting Software, Timisoara, Romania ** Politehnica University of Timisoara/Computer
DATA MINING TECHNIQUES AND APPLICATIONS
DATA MINING TECHNIQUES AND APPLICATIONS Mrs. Bharati M. Ramageri, Lecturer Modern Institute of Information Technology and Research, Department of Computer Application, Yamunanagar, Nigdi Pune, Maharashtra,
CLASSIFYING NETWORK TRAFFIC IN THE BIG DATA ERA
CLASSIFYING NETWORK TRAFFIC IN THE BIG DATA ERA Professor Yang Xiang Network Security and Computing Laboratory (NSCLab) School of Information Technology Deakin University, Melbourne, Australia http://anss.org.au/nsclab
IT services for analyses of various data samples
IT services for analyses of various data samples Ján Paralič, František Babič, Martin Sarnovský, Peter Butka, Cecília Havrilová, Miroslava Muchová, Michal Puheim, Martin Mikula, Gabriel Tutoky Technical
Customer Classification And Prediction Based On Data Mining Technique
Customer Classification And Prediction Based On Data Mining Technique Ms. Neethu Baby 1, Mrs. Priyanka L.T 2 1 M.E CSE, Sri Shakthi Institute of Engineering and Technology, Coimbatore 2 Assistant Professor
DATA MINING TECHNIQUES SUPPORT TO KNOWLEGDE OF BUSINESS INTELLIGENT SYSTEM
INTERNATIONAL JOURNAL OF RESEARCH IN COMPUTER APPLICATIONS AND ROBOTICS ISSN 2320-7345 DATA MINING TECHNIQUES SUPPORT TO KNOWLEGDE OF BUSINESS INTELLIGENT SYSTEM M. Mayilvaganan 1, S. Aparna 2 1 Associate
DIGITAL MARKETING STRATEGY. Setting up, Raising Awareness For and Monitoring Social Media
DIGITAL MARKETING STRATEGY Setting up, Raising Awareness For and Monitoring Social Media INTRODUCTION Social media first emerged as a personal means of communication; enabling new connections,
Using News Articles to Predict Stock Price Movements
Using News Articles to Predict Stock Price Movements Győző Gidófalvi Department of Computer Science and Engineering University of California, San Diego La Jolla, CA 9237 [email protected] 21, June 15,
Comparison of K-means and Backpropagation Data Mining Algorithms
Comparison of K-means and Backpropagation Data Mining Algorithms Nitu Mathuriya, Dr. Ashish Bansal Abstract Data mining has got more and more mature as a field of basic research in computer science and
Sentiment analysis: towards a tool for analysing real-time students feedback
Sentiment analysis: towards a tool for analysing real-time students feedback Nabeela Altrabsheh Email: [email protected] Mihaela Cocea Email: [email protected] Sanaz Fallahkhair Email:
International Journal of Computer Science Trends and Technology (IJCST) Volume 3 Issue 3, May-June 2015
RESEARCH ARTICLE OPEN ACCESS Data Mining Technology for Efficient Network Security Management Ankit Naik [1], S.W. Ahmad [2] Student [1], Assistant Professor [2] Department of Computer Science and Engineering
COURSE RECOMMENDER SYSTEM IN E-LEARNING
International Journal of Computer Science and Communication Vol. 3, No. 1, January-June 2012, pp. 159-164 COURSE RECOMMENDER SYSTEM IN E-LEARNING Sunita B Aher 1, Lobo L.M.R.J. 2 1 M.E. (CSE)-II, Walchand
Inner Classification of Clusters for Online News
Inner Classification of Clusters for Online News Harmandeep Kaur 1, Sheenam Malhotra 2 1 (Computer Science and Engineering Department, Shri Guru Granth Sahib World University Fatehgarh Sahib) 2 (Assistant
Sentiment analysis on news articles using Natural Language Processing and Machine Learning Approach.
Sentiment analysis on news articles using Natural Language Processing and Machine Learning Approach. Pranali Chilekar 1, Swati Ubale 2, Pragati Sonkambale 3, Reema Panarkar 4, Gopal Upadhye 5 1 2 3 4 5
Keywords Big Data; OODBMS; RDBMS; hadoop; EDM; learning analytics, data abundance.
Volume 4, Issue 11, November 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Analytics
Data Mining in Web Search Engine Optimization and User Assisted Rank Results
Data Mining in Web Search Engine Optimization and User Assisted Rank Results Minky Jindal Institute of Technology and Management Gurgaon 122017, Haryana, India Nisha kharb Institute of Technology and Management
Horizontal Aggregations in SQL to Prepare Data Sets for Data Mining Analysis
IOSR Journal of Computer Engineering (IOSRJCE) ISSN: 2278-0661, ISBN: 2278-8727 Volume 6, Issue 5 (Nov. - Dec. 2012), PP 36-41 Horizontal Aggregations in SQL to Prepare Data Sets for Data Mining Analysis
Predicting the Risk of Heart Attacks using Neural Network and Decision Tree
Predicting the Risk of Heart Attacks using Neural Network and Decision Tree S.Florence 1, N.G.Bhuvaneswari Amma 2, G.Annapoorani 3, K.Malathi 4 PG Scholar, Indian Institute of Information Technology, Srirangam,
Random forest algorithm in big data environment
Random forest algorithm in big data environment Yingchun Liu * School of Economics and Management, Beihang University, Beijing 100191, China Received 1 September 2014, www.cmnt.lv Abstract Random forest
How To Use Neural Networks In Data Mining
International Journal of Electronics and Computer Science Engineering 1449 Available Online at www.ijecse.org ISSN- 2277-1956 Neural Networks in Data Mining Priyanka Gaur Department of Information and
OPINION MINING IN PRODUCT REVIEW SYSTEM USING BIG DATA TECHNOLOGY HADOOP
OPINION MINING IN PRODUCT REVIEW SYSTEM USING BIG DATA TECHNOLOGY HADOOP 1 KALYANKUMAR B WADDAR, 2 K SRINIVASA 1 P G Student, S.I.T Tumkur, 2 Assistant Professor S.I.T Tumkur Abstract- Product Review System
Data Mining Solutions for the Business Environment
Database Systems Journal vol. IV, no. 4/2013 21 Data Mining Solutions for the Business Environment Ruxandra PETRE University of Economic Studies, Bucharest, Romania [email protected] Over
IJCSES Vol.7 No.4 October 2013 pp.165-168 Serials Publications BEHAVIOR PERDITION VIA MINING SOCIAL DIMENSIONS
IJCSES Vol.7 No.4 October 2013 pp.165-168 Serials Publications BEHAVIOR PERDITION VIA MINING SOCIAL DIMENSIONS V.Sudhakar 1 and G. Draksha 2 Abstract:- Collective behavior refers to the behaviors of individuals
Facilitating Business Process Discovery using Email Analysis
Facilitating Business Process Discovery using Email Analysis Matin Mavaddat [email protected] Stewart Green Stewart.Green Ian Beeson Ian.Beeson Jin Sa Jin.Sa Abstract Extracting business process
Identifying At-Risk Students Using Machine Learning Techniques: A Case Study with IS 100
Identifying At-Risk Students Using Machine Learning Techniques: A Case Study with IS 100 Erkan Er Abstract In this paper, a model for predicting students' performance levels is proposed which employs three
Data Mining Algorithms Part 1. Dejan Sarka
Data Mining Algorithms Part 1 Dejan Sarka Join the conversation on Twitter: @DevWeek #DW2015 Instructor Bio Dejan Sarka ([email protected]) 30 years of experience SQL Server MVP, MCT, 13 books 7+ courses
An Overview of Knowledge Discovery Database and Data mining Techniques
An Overview of Knowledge Discovery Database and Data mining Techniques Priyadharsini.C 1, Dr. Antony Selvadoss Thanamani 2 M.Phil, Department of Computer Science, NGM College, Pollachi, Coimbatore, Tamilnadu,
Measure Social Media like a Pro: Social Media Analytics Uncovered SOCIAL MEDIA LIKE SHARE. Powered by
Measure Social Media like a Pro: Social Media Analytics Uncovered Social media analytics were a big deal in 2013, but this year they are set to be even more crucial.
A MACHINE LEARNING APPROACH TO FILTER UNWANTED MESSAGES FROM ONLINE SOCIAL NETWORKS
A MACHINE LEARNING APPROACH TO FILTER UNWANTED MESSAGES FROM ONLINE SOCIAL NETWORKS Charanma.P 1, P. Ganesh Kumar 2, 1 PG Scholar, 2 Assistant Professor,Department of Information Technology, Anna University
Exploring Big Data in Social Networks
Exploring Big Data in Social Networks [email protected] ([email protected]) INWEB National Science and Technology Institute for Web Federal University of Minas Gerais - UFMG May 2013 Some thoughts about
DIY Social Sentiment Analysis in 3 Steps
DIY Social Sentiment Analysis in 3 Steps Feb 26, 2015 About NetElixir Mission: To Help Digital Marketers Succeed Online. Incorporated: 2005. Global Offices: Princeton (HQ). London. Hyderabad. Team: 75+
KATE GLEASON COLLEGE OF ENGINEERING. John D. Hromi Center for Quality and Applied Statistics
ROCHESTER INSTITUTE OF TECHNOLOGY COURSE OUTLINE FORM KATE GLEASON COLLEGE OF ENGINEERING John D. Hromi Center for Quality and Applied Statistics NEW (or REVISED) COURSE (KGCOE- CQAS- 747- Principles of
An Introduction to Data Mining. Big Data World. Related Fields and Disciplines. What is Data Mining? 2/12/2015
An Introduction to Data Mining for Wind Power Management Spring 2015 Big Data World Every minute: Google receives over 4 million search queries Facebook users share almost 2.5 million pieces of content
Internet Worm Classification and Detection using Data Mining Techniques
IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 17, Issue 3, Ver. 1 (May Jun. 2015), PP 76-81 www.iosrjournals.org Internet Worm Classification and Detection
Alexander Nikov. 5. Database Systems and Managing Data Resources. Learning Objectives. RR Donnelley Tries to Master Its Data
INFO 1500 Introduction to IT Fundamentals 5. Database Systems and Managing Data Resources Learning Objectives 1. Describe how the problems of managing data resources in a traditional file environment are
Mobile Phone APP Software Browsing Behavior using Clustering Analysis
Proceedings of the 2014 International Conference on Industrial Engineering and Operations Management Bali, Indonesia, January 7 9, 2014 Mobile Phone APP Software Browsing Behavior using Clustering Analysis
Data Mining Algorithms and Techniques Research in CRM Systems
Data Mining Algorithms and Techniques Research in CRM Systems ADELA TUDOR, ADELA BARA, IULIANA BOTHA The Bucharest Academy of Economic Studies Bucharest ROMANIA {Adela_Lungu}@yahoo.com {Bara.Adela, Iuliana.Botha}@ie.ase.ro
An Empirical Study of Application of Data Mining Techniques in Library System
An Empirical Study of Application of Data Mining Techniques in Library System Veepu Uppal Department of Computer Science and Engineering, Manav Rachna College of Engineering, Faridabad, India Gunjan Chindwani
Towards SoMEST Combining Social Media Monitoring with Event Extraction and Timeline Analysis
Towards SoMEST Combining Social Media Monitoring with Event Extraction and Timeline Analysis Yue Dai, Ernest Arendarenko, Tuomo Kakkonen, Ding Liao School of Computing University of Eastern Finland {yvedai,
The multilayer sentiment analysis model based on Random forest Wei Liu1, Jie Zhang2
2nd International Conference on Advances in Mechanical Engineering and Industrial Informatics (AMEII 2016) The multilayer sentiment analysis model based on Random forest Wei Liu1, Jie Zhang2 1 School of
An Introduction to Data Mining
An Introduction to Intel Beijing [email protected] January 17, 2014 Outline 1 DW Overview What is Notable Application of Conference, Software and Applications Major Process in 2 Major Tasks in Detail
Search and Information Retrieval
Search and Information Retrieval Search on the Web 1 is a daily activity for many people throughout the world Search and communication are most popular uses of the computer Applications involving search
NEURAL NETWORKS IN DATA MINING
NEURAL NETWORKS IN DATA MINING 1 DR. YASHPAL SINGH, 2 ALOK SINGH CHAUHAN 1 Reader, Bundelkhand Institute of Engineering & Technology, Jhansi, India 2 Lecturer, United Institute of Management, Allahabad,
Sentiment analysis on tweets in a financial domain
Sentiment analysis on tweets in a financial domain Jasmina Smailović 1,2, Miha Grčar 1, Martin Žnidaršič 1 1 Dept of Knowledge Technologies, Jožef Stefan Institute, Ljubljana, Slovenia 2 Jožef Stefan International
How To Write A Summary Of A Review
PRODUCT REVIEW RANKING SUMMARIZATION N.P.Vadivukkarasi, Research Scholar, Department of Computer Science, Kongu Arts and Science College, Erode. Dr. B. Jayanthi M.C.A., M.Phil., Ph.D., Associate Professor,
Keywords Data Mining, Knowledge Discovery, Direct Marketing, Classification Techniques, Customer Relationship Management
Volume 4, Issue 6, June 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com A Simplified Data
Keywords : Data Warehouse, Data Warehouse Testing, Lifecycle based Testing
Volume 4, Issue 12, December 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com A Lifecycle
Introduction to Data Mining
Introduction to Data Mining Jay Urbain Credits: Nazli Goharian & David Grossman @ IIT Outline Introduction Data Pre-processing Data Mining Algorithms Naïve Bayes Decision Tree Neural Network Association
Data Mining & Data Stream Mining Open Source Tools
Data Mining & Data Stream Mining Open Source Tools Darshana Parikh, Priyanka Tirkha Student M.Tech, Dept. of CSE, Sri Balaji College Of Engg. & Tech, Jaipur, Rajasthan, India Assistant Professor, Dept.
A Review of Anomaly Detection Techniques in Network Intrusion Detection System
A Review of Anomaly Detection Techniques in Network Intrusion Detection System Dr.D.V.S.S.Subrahmanyam Professor, Dept. of CSE, Sreyas Institute of Engineering & Technology, Hyderabad, India ABSTRACT:In
Towards applying Data Mining Techniques for Talent Mangement
2009 International Conference on Computer Engineering and Applications IPCSIT vol.2 (2011) (2011) IACSIT Press, Singapore Towards applying Data Mining Techniques for Talent Mangement Hamidah Jantan 1,
Data Mining Approach For Subscription-Fraud. Detection in Telecommunication Sector
Contemporary Engineering Sciences, Vol. 7, 2014, no. 11, 515-522 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/ces.2014.4431 Data Mining Approach For Subscription-Fraud Detection in Telecommunication
DATA MINING AND REPORTING IN HEALTHCARE
DATA MINING AND REPORTING IN HEALTHCARE Divya Gandhi 1, Pooja Asher 2, Harshada Chaudhari 3 1,2,3 Department of Information Technology, Sardar Patel Institute of Technology, Mumbai,(India) ABSTRACT The
BOOSTING - A METHOD FOR IMPROVING THE ACCURACY OF PREDICTIVE MODEL
The Fifth International Conference on e-learning (elearning-2014), 22-23 September 2014, Belgrade, Serbia BOOSTING - A METHOD FOR IMPROVING THE ACCURACY OF PREDICTIVE MODEL SNJEŽANA MILINKOVIĆ University
The Framework of Network Public Opinion Monitoring and Analyzing System Based on Semantic Content Identification
The Framework of Network Public Opinion Monitoring and Analyzing System Based on Semantic Content Identification Cheng Xian-Yi1, Zhu Ling-ling,Zhu Qian,Wang Jin The Framework of Network Public Opinion
A FUZZY BASED APPROACH TO TEXT MINING AND DOCUMENT CLUSTERING
A FUZZY BASED APPROACH TO TEXT MINING AND DOCUMENT CLUSTERING Sumit Goswami 1 and Mayank Singh Shishodia 2 1 Indian Institute of Technology-Kharagpur, Kharagpur, India [email protected] 2 School of Computer
Efficient Techniques for Improved Data Classification and POS Tagging by Monitoring Extraction, Pruning and Updating of Unknown Foreign Words
pp.290-295 http://dx.doi.org/10.14257/astl.2015.111.55 Efficient Techniques for Improved Data Classification and POS Tagging by Monitoring Extraction, Pruning and Updating of Unknown Foreign Words Irfan
Integrated Data Mining and Knowledge Discovery Techniques in ERP
Integrated Data Mining and Knowledge Discovery Techniques in ERP I Gandhimathi Amirthalingam, II Rabia Shaheen, III Mohammad Kousar, IV Syeda Meraj Bilfaqih I,III,IV Dept. of Computer Science, King Khalid
Sentiment Analysis. D. Skrepetos 1. University of Waterloo. NLP Presenation, 06/17/2015
Sentiment Analysis D. Skrepetos 1 1 Department of Computer Science University of Waterloo NLP Presenation, 06/17/2015 D. Skrepetos (University of Waterloo) Sentiment Analysis NLP Presenation, 06/17/2015
Social Media Implementations
SEM Experience Analytics Social Media Implementations SEM Experience Analytics delivers real sentiment, meaning and trends within social media for many of the world's leading consumer brand companies.
Beyond listening Driving better decisions with business intelligence from social sources
Beyond listening Driving better decisions with business intelligence from social sources From insight to action with IBM Social Media Analytics State of the Union Opinions prevail on the Internet Social
Text Opinion Mining to Analyze News for Stock Market Prediction
Int. J. Advance. Soft Comput. Appl., Vol. 6, No. 1, March 2014 ISSN 2074-8523; Copyright SCRG Publication, 2014 Text Opinion Mining to Analyze News for Stock Market Prediction Yoosin Kim 1, Seung Ryul
IDENTIFICATION OF SOFTWARE EROSION USING LOGISTIC REGRESSION
http:// IDENTIFICATION OF SOFTWARE EROSION USING LOGISTIC REGRESSION Harinder Kaur 1, Raveen Bajwa 2 1 PG Student., CSE., Baba Banda Singh Bahadur Engg. College, Fatehgarh Sahib, (India) 2 Asstt. Prof.,
AUTO CLAIM FRAUD DETECTION USING MULTI CLASSIFIER SYSTEM
AUTO CLAIM FRAUD DETECTION USING MULTI CLASSIFIER SYSTEM ABSTRACT Luis Alexandre Rodrigues and Nizam Omar Department of Electrical Engineering, Mackenzie Presbiterian University, Brazil, São Paulo [email protected],[email protected]
Text Analytics Beginner's Guide. Extracting Meaning from Unstructured Data
Text Analytics Beginner's Guide Extracting Meaning from Unstructured Data Contents Text Analytics 3 Use Cases 7 Terms 9 Trends 14 Scenario 15 Resources 24 2 2013 Angoss Software Corporation. All rights
Program Planning Guide Business Administration, Associate in Applied Science General Business Administration Track (A25120)
Program Planning Guide Business Administration, Associate in Applied Science General Business Administration Track (A25120) Program Length: 5 semesters Career Pathway Options: Associate in Applied Science
Data Science and Business Analytics Certificate Data Science and Business Intelligence Certificate
Data Science and Business Analytics Certificate Data Science and Business Intelligence Certificate Description The Helzberg School of Management has launched two graduate-level certificates: one in Data
Bisecting K-Means for Clustering Web Log data
Bisecting K-Means for Clustering Web Log data Ruchika R. Patil Department of Computer Technology YCCE Nagpur, India Amreen Khan Department of Computer Technology YCCE Nagpur, India ABSTRACT Web usage mining
Email Spam Detection Using Customized SimHash Function
International Journal of Research Studies in Computer Science and Engineering (IJRSCSE) Volume 1, Issue 8, December 2014, PP 35-40 ISSN 2349-4840 (Print) & ISSN 2349-4859 (Online) www.arcjournals.org Email
ASSOCIATION RULE MINING ON WEB LOGS FOR EXTRACTING INTERESTING PATTERNS THROUGH WEKA TOOL
International Journal Of Advanced Technology In Engineering And Science Www.Ijates.Com Volume No 03, Special Issue No. 01, February 2015 ISSN (Online): 2348 7550 ASSOCIATION RULE MINING ON WEB LOGS FOR
