Mining Text Data for Useful Information in Higher Education John Zilvinskis Indiana University

Size: px
Start display at page:

Download "Mining Text Data for Useful Information in Higher Education John Zilvinskis Indiana University"

Transcription

1 Mining Text Data for Useful Information in Higher Education John Zilvinskis Indiana University

2 Institutional Researchers Credo We have not succeeded in answering all our problems indeed we sometimes feel we have not completely answered any of them. The answers we have found have only served to raise a whole set of new questions. In some ways we feel that we are as confused as ever, but we think we are confused on a higher level and about more important things. Earl C. Kelley, Professor of Secondary Education at Wayne University, 1951

3 Presentation Overview 1. Describe basic concepts of text mining 2. Invite presentation attendees to ask questions and discuss application of this technology 3. List the differences in text mining software 4. Apply this technique to two real life examples 5. Provide implications and considerations

4 Raise your hand if You have a general understanding of text mining Keep your hand up if You have or someone you know has participated in a text mining project You have played a significant role in at least one project that used text mining You have written code for or worked on several text mining projects

5 Learning Outcomes As a result of attending this session, participants will be able to: List fundamental methodologies for organizing text data. Describe how one could integrate mined text in student learning and performance analytics. Compare the differences between text mining software packages. Use text mining methods to refine survey questions.

6 Big Data & Data Mining Big Data (Laney) volume (amount of data) velocity (speed of data) variety (range of data types and sources) Data Mining - Applying algorithms to big data to generate new information

7 Analytics Predictive, Automated, Scale, Real time Data mining to create actionable intelligence (Campbell, DeBlois, & Oblinger, 2007, p. 42) Learning v. Student Analytics

8 Text Mining The need to turn text into numbers so powerful algorithms can be applied to large document databases (Miner, Delen, Elder, Fast, Hill, & Nisbet, 2012, p. 30) Text analytics volume (amount of data) velocity (speed of data) variety (range of data types and sources)

9 Citation Practical Text Mining and Statistical Analysis for Non-structured Text Data Applications Miner, Delen, Elder, Fast, Hill, & Nisbet, 2012

10 Text Mining Processes Define project and identify data Process data: Establish a corpus, Pre-process data, Extract knowledge Develop models Evaluate results Disseminate results

11 Extract Knowledge Classification Clustering Association Trend analysis

12 Why Not Qualitative Research? Requires extensive resources Data must be processed in a timely fashion Might not be practical with big data Information must integrate with other data

13 What Kind of Text Can We Mine? For What Purpose Should We Mine? Perhaps attendees could share what type of textbased datasets are available to them or which ones they would like to have access to. This may help IR staff recognize what text they have access to and can analyze in addition to learning how they may conduct such analyses. AIR Program Reviewer

14 How Can We Mine Text in IR? Kind of Data Application essays Written assignments CMS postings Student blogs Course evaluations Surveys E-portfolios Early alert, course drop text For What Purpose Acceptance, enrollment Likelihood of passing Participation Change in student major Faculty success Open-ended questions Student success Student performance

15 Software Freeware RapidMiner Easy user interface, inverse document frequencies, some aspects for purchase Weka/KEA R Applicable to machine learning, some resources Computer science heavy, many online resources Commercial Software Modeler Premium (SPSS, IBM), strong user interface, other analytics tools, easy to use and comprehensive dictionary Enterprise Miner (SAS), moderate user interface, comprehensive data manipulation, and integrated clustering function

16 Classifying Open Ended Responses National Survey of Student Engagement Experimental item set leadership Formal leadership core item 1,482 of 4,836 students listed other Classified 830 (56%) entries

17 Classifying Open Ended Responses Position n % of other Tutoring % Teaching Assistant % Research Assistant % Secretary % Treasurer % Mentor % Member % Editor %

18 Classifying Open Ended Responses Position Did Not Complete Formal Leadership Completed Formal Leadership Original Option n % n % Resident Assistant % % Diversity Advocate % % Judicial Officer % % President % % Write-In Other n % n % Tutoring % % Teaching Assistant % % Treasurer % % Editor % %

19 Clustering E-Portfolio Submissions City University of New York (CUNY) Guttman High touch, block scheduling, learning communities, summer bridge Bill and Melinda Gates grant 163 student e-portfolio introductions

20 Clustering E-Portfolio Submissions Concept Custered Terms Family family, york, high school, college, child Learning class, teacher, art, math, subject Everyday know, day, love, life College participation high school, school, attend, guttman Gamming game, movie, favorite, watch, video Making friends shy, person, friend, know, quiet Recreation art, basketball, play, sport, travel Society social, worker, work, believe, help Technology technology, information, art, health, mind Business guttman, business, manhattan, administration, graduate

21 Regression of Academic Preparation and Clustered Text Related to Credit Hours Independent Variable β Sig. SATV SATM WritProf Age Connection to family R

22 Implications Process of automation Considering text source Weight of sentiment

23 Considerations Theoretical v. A-theoretical Ethical considerations Creepy treehouse Use of language

24 Thank You

Mining Text Data for Useful Information in Higher Education. John Zilvinskis. Indiana University

Mining Text Data for Useful Information in Higher Education. John Zilvinskis. Indiana University Running head: MINING TEXT DATA 1 Mining Text Data for Useful Information in Higher Education John Zilvinskis Indiana University MINING TEXT DATA 2 Abstract Text mining presents an efficient means to access

More information

Dawn Broschard, EdD Senior Research Analyst Office of Retention and Graduation Success dbroscha@fiu.edu

Dawn Broschard, EdD Senior Research Analyst Office of Retention and Graduation Success dbroscha@fiu.edu Using Decision Trees to Analyze Students at Risk of Dropping Out in Their First Year of College Based on Data Gathered Prior to Attending Their First Semester Dawn Broschard, EdD Senior Research Analyst

More information

Practical Text Mining and Statistical Analysis for Non-structured Text Data Applications

Practical Text Mining and Statistical Analysis for Non-structured Text Data Applications Practical Text Mining and Statistical Analysis for Non-structured Text Data Applications Gary Miner Dursun Delen John Elder Charlottesville, VA, USA Andrew Fast Charlottesville, VA, USA Thomas Hill Robert

More information

Predictive Analytics Certificate Program

Predictive Analytics Certificate Program Information Technologies Programs Predictive Analytics Certificate Program Accelerate Your Career Offered in partnership with: University of California, Irvine Extension s professional certificate and

More information

Hexaware E-book on Predictive Analytics

Hexaware E-book on Predictive Analytics Hexaware E-book on Predictive Analytics Business Intelligence & Analytics Actionable Intelligence Enabled Published on : Feb 7, 2012 Hexaware E-book on Predictive Analytics What is Data mining? Data mining,

More information

2010 Data Miner Survey Highlights

2010 Data Miner Survey Highlights Predictive Analytics World Washington, DC October 2010 2010 Data Miner Survey Highlights The Views of 735 Data Miners Karl Rexer, PhD President Rexer Analytics www.rexeranalytics.com 2010 Data Miner Survey:

More information

Why is Internal Audit so Hard?

Why is Internal Audit so Hard? Why is Internal Audit so Hard? 2 2014 Why is Internal Audit so Hard? 3 2014 Why is Internal Audit so Hard? Waste Abuse Fraud 4 2014 Waves of Change 1 st Wave Personal Computers Electronic Spreadsheets

More information

Decision Support Optimization through Predictive Analytics - Leuven Statistical Day 2010

Decision Support Optimization through Predictive Analytics - Leuven Statistical Day 2010 Decision Support Optimization through Predictive Analytics - Leuven Statistical Day 2010 Ernst van Waning Senior Sales Engineer May 28, 2010 Agenda SPSS, an IBM Company SPSS Statistics User-driven product

More information

An intelligent tool for expediting and automating data mining steps. Ourania Hatzi, Nikolaos Zorbas, Mara Nikolaidou and Dimosthenis Anagnostopoulos

An intelligent tool for expediting and automating data mining steps. Ourania Hatzi, Nikolaos Zorbas, Mara Nikolaidou and Dimosthenis Anagnostopoulos An intelligent tool for expediting and automating data mining steps Ourania Hatzi, Nikolaos Zorbas, Mara Nikolaidou and Dimosthenis Anagnostopoulos Outline Data Mining, current tools An intelligent tool

More information

Game Changers. Edited by diana G. oblinger

Game Changers. Edited by diana G. oblinger Game Changers E d u c ati o n and I n f o r m ati o n Te c h n o l o g ie s Edited by diana G. oblinger Game Changers: Education and Information Technologies 2012 EDUCAUSE This book is released under a

More information

Application of Predictive Model for Elementary Students with Special Needs in New Era University

Application of Predictive Model for Elementary Students with Special Needs in New Era University Application of Predictive Model for Elementary Students with Special Needs in New Era University Jannelle ds. Ligao, Calvin Jon A. Lingat, Kristine Nicole P. Chiu, Cym Quiambao, Laurice Anne A. Iglesia

More information

SAS JOINT DATA MINING CERTIFICATION AT BRYANT UNIVERSITY

SAS JOINT DATA MINING CERTIFICATION AT BRYANT UNIVERSITY SAS JOINT DATA MINING CERTIFICATION AT BRYANT UNIVERSITY Billie Anderson Bryant University, 1150 Douglas Pike, Smithfield, RI 02917 Phone: (401) 232-6089, e-mail: banderson@bryant.edu Phyllis Schumacher

More information

Table of Contents. Chapter No. 1 Introduction 1. iii. xiv. xviii. xix. Page No.

Table of Contents. Chapter No. 1 Introduction 1. iii. xiv. xviii. xix. Page No. Table of Contents Title Declaration by the Candidate Certificate of Supervisor Acknowledgement Abstract List of Figures List of Tables List of Abbreviations Chapter Chapter No. 1 Introduction 1 ii iii

More information

W. Heath Rushing Adsurgo LLC. Harness the Power of Text Analytics: Unstructured Data Analysis for Healthcare. Session H-1 JTCC: October 23, 2015

W. Heath Rushing Adsurgo LLC. Harness the Power of Text Analytics: Unstructured Data Analysis for Healthcare. Session H-1 JTCC: October 23, 2015 W. Heath Rushing Adsurgo LLC Harness the Power of Text Analytics: Unstructured Data Analysis for Healthcare Session H-1 JTCC: October 23, 2015 Outline Demonstration: Recent article on cnn.com Introduction

More information

Benchmarking of different classes of models used for credit scoring

Benchmarking of different classes of models used for credit scoring Benchmarking of different classes of models used for credit scoring We use this competition as an opportunity to compare the performance of different classes of predictive models. In particular we want

More information

An Introduction to Health Informatics for a Global Information Based Society

An Introduction to Health Informatics for a Global Information Based Society An Introduction to Health Informatics for a Global Information Based Society A Course proposal for 2010 Healthcare Industry Skills Innovation Award Sponsored by the IBM Academic Initiative submitted by

More information

Information Management course

Information Management course Università degli Studi di Milano Master Degree in Computer Science Information Management course Teacher: Alberto Ceselli Lecture 01 : 06/10/2015 Practical informations: Teacher: Alberto Ceselli (alberto.ceselli@unimi.it)

More information

Data Mining and Business Intelligence CIT-6-DMB. http://blackboard.lsbu.ac.uk. Faculty of Business 2011/2012. Level 6

Data Mining and Business Intelligence CIT-6-DMB. http://blackboard.lsbu.ac.uk. Faculty of Business 2011/2012. Level 6 Data Mining and Business Intelligence CIT-6-DMB http://blackboard.lsbu.ac.uk Faculty of Business 2011/2012 Level 6 Table of Contents 1. Module Details... 3 2. Short Description... 3 3. Aims of the Module...

More information

IST565 M001 Yu Spring 2015 Syllabus Data Mining

IST565 M001 Yu Spring 2015 Syllabus Data Mining IST565 M001 Yu Spring 2015 Syllabus Data Mining Draft updated 10/28/2014 Instructor: Professor Bei Yu Classroom: Hinds 117 Email: byu.teaching@gmail.com Class time: 3:45-5:05 Wednesdays Office: Hinds 320

More information

The Text Analytics Market(s)

The Text Analytics Market(s) The Text Analytics Market(s) Competitive landscape and trends by Curt A. Monash, Ph.D. President, Monash Research Editor, Text Technologies contact@monash.com http://www.monash.com http://www.texttechnologies.com

More information

Sunnie Chung. Cleveland State University

Sunnie Chung. Cleveland State University Sunnie Chung Cleveland State University Data Scientist Big Data Processing Data Mining 2 INTERSECT of Computer Scientists and Statisticians with Knowledge of Data Mining AND Big data Processing Skills:

More information

IT services for analyses of various data samples

IT services for analyses of various data samples IT services for analyses of various data samples Ján Paralič, František Babič, Martin Sarnovský, Peter Butka, Cecília Havrilová, Miroslava Muchová, Michal Puheim, Martin Mikula, Gabriel Tutoky Technical

More information

Master Specialization in Knowledge Engineering

Master Specialization in Knowledge Engineering Master Specialization in Knowledge Engineering Pavel Kordík, Ph.D. Department of Computer Science Faculty of Information Technology Czech Technical University in Prague Prague, Czech Republic http://www.fit.cvut.cz/en

More information

7-29-2013. Composition Studies. Graduate Certificate Program. Online Certificate Program in English. Indiana University East Department of English

7-29-2013. Composition Studies. Graduate Certificate Program. Online Certificate Program in English. Indiana University East Department of English 7-29-2013 Composition Studies Graduate Certificate Program Online Certificate Program in English Indiana University East Department of English Composition Studies Graduate Certificate Program Online Graduate

More information

IBM SPSS Modeler Premium

IBM SPSS Modeler Premium IBM SPSS Modeler Premium Improve model accuracy with structured and unstructured data, entity analytics and social network analysis Highlights Solve business problems faster with analytical techniques

More information

IT and CRM A basic CRM model Data source & gathering system Database system Data warehouse Information delivery system Information users

IT and CRM A basic CRM model Data source & gathering system Database system Data warehouse Information delivery system Information users 1 IT and CRM A basic CRM model Data source & gathering Database Data warehouse Information delivery Information users 2 IT and CRM Markets have always recognized the importance of gathering detailed data

More information

An interdisciplinary model for analytics education

An interdisciplinary model for analytics education An interdisciplinary model for analytics education Raffaella Settimi, PhD School of Computing, DePaul University Drew Conway s Data Science Venn Diagram http://drewconway.com/zia/2013/3/26/the-data-science-venn-diagram

More information

A GENERAL TAXONOMY FOR VISUALIZATION OF PREDICTIVE SOCIAL MEDIA ANALYTICS

A GENERAL TAXONOMY FOR VISUALIZATION OF PREDICTIVE SOCIAL MEDIA ANALYTICS A GENERAL TAXONOMY FOR VISUALIZATION OF PREDICTIVE SOCIAL MEDIA ANALYTICS Stacey Franklin Jones, D.Sc. ProTech Global Solutions Annapolis, MD Abstract The use of Social Media as a resource to characterize

More information

Sentiment analysis on tweets in a financial domain

Sentiment analysis on tweets in a financial domain Sentiment analysis on tweets in a financial domain Jasmina Smailović 1,2, Miha Grčar 1, Martin Žnidaršič 1 1 Dept of Knowledge Technologies, Jožef Stefan Institute, Ljubljana, Slovenia 2 Jožef Stefan International

More information

Social Media Implementations

Social Media Implementations SEM Experience Analytics Social Media Implementations SEM Experience Analytics delivers real sentiment, meaning and trends within social media for many of the world s leading consumer brand companies.

More information

Analytics: An exploration of the nomenclature in the student experience.

Analytics: An exploration of the nomenclature in the student experience. Analytics: An exploration of the nomenclature in the student experience. Rhonda Leece Student Administration and Services University of New England Abstract The student journey through higher education,

More information

Big Data: A Closer Look

Big Data: A Closer Look Big Data: A Closer Look What is Big Data? Increasingly popular search query http://www.google.com/trends/explore#q=big%20data What is Big Data? the V s Characteristics of Big Data Volume http://www.businessweek.com/articles/2014-03-06/interactive-graphichow-big-is-big-data

More information

Some vendors have a big presence in a particular industry; some are geared toward data scientists, others toward business users.

Some vendors have a big presence in a particular industry; some are geared toward data scientists, others toward business users. Bonus Chapter Ten Major Predictive Analytics Vendors In This Chapter Angoss FICO IBM RapidMiner Revolution Analytics Salford Systems SAP SAS StatSoft, Inc. TIBCO This chapter highlights ten of the major

More information

Real World Application and Usage of IBM Advanced Analytics Technology

Real World Application and Usage of IBM Advanced Analytics Technology Real World Application and Usage of IBM Advanced Analytics Technology Anthony J. Young Pre-Sales Architect for IBM Advanced Analytics February 21, 2014 Welcome Anthony J. Young Lives in Austin, TX Focused

More information

Best Practices in Data Mining. Executive Summary

Best Practices in Data Mining. Executive Summary Executive Summary Prepared by: Database & Marketing Technology Council Authors: Richard Boire, Paul Tyndall, Greg Carriere, Rob Champion Released: August 2003 Executive Summary Canadian marketers have

More information

Data Mining Solutions for the Business Environment

Data Mining Solutions for the Business Environment Database Systems Journal vol. IV, no. 4/2013 21 Data Mining Solutions for the Business Environment Ruxandra PETRE University of Economic Studies, Bucharest, Romania ruxandra_stefania.petre@yahoo.com Over

More information

Introduction to Data Mining

Introduction to Data Mining Introduction to Data Mining Jay Urbain Credits: Nazli Goharian & David Grossman @ IIT Outline Introduction Data Pre-processing Data Mining Algorithms Naïve Bayes Decision Tree Neural Network Association

More information

Welcome. Data Mining: Updates in Technologies. Xindong Wu. Colorado School of Mines Golden, Colorado 80401, USA

Welcome. Data Mining: Updates in Technologies. Xindong Wu. Colorado School of Mines Golden, Colorado 80401, USA Welcome Xindong Wu Data Mining: Updates in Technologies Dept of Math and Computer Science Colorado School of Mines Golden, Colorado 80401, USA Email: xwu@ mines.edu Home Page: http://kais.mines.edu/~xwu/

More information

Big Trouble. Does Big Data spell. for Lawyers? Presented to Colorado Bar Association, Communications & Technology Law Section Denver, Colorado

Big Trouble. Does Big Data spell. for Lawyers? Presented to Colorado Bar Association, Communications & Technology Law Section Denver, Colorado Does Big Data spell Big Trouble for Lawyers? Paul Karlzen Director HR Information & Analytics April 1, 2015 Presented to Colorado Bar Association, Communications & Technology Law Section Denver, Colorado

More information

Index Contents Page No. Introduction . Data Mining & Knowledge Discovery

Index Contents Page No. Introduction . Data Mining & Knowledge Discovery Index Contents Page No. 1. Introduction 1 1.1 Related Research 2 1.2 Objective of Research Work 3 1.3 Why Data Mining is Important 3 1.4 Research Methodology 4 1.5 Research Hypothesis 4 1.6 Scope 5 2.

More information

IBM Predictive Analytics Solutions for Education

IBM Predictive Analytics Solutions for Education IBM Software White Paper Business Analytics IBM Predictive Analytics Solutions for Education Empower your institution to make the right decision every time 2 IBM Predictive Analytics Solutions for Education

More information

Course Descriptions: Undergraduate/Graduate Certificate Program in Data Visualization and Analysis

Course Descriptions: Undergraduate/Graduate Certificate Program in Data Visualization and Analysis 9/3/2013 Course Descriptions: Undergraduate/Graduate Certificate Program in Data Visualization and Analysis Seton Hall University, South Orange, New Jersey http://www.shu.edu/go/dava Visualization and

More information

Data Science Certificate Program

Data Science Certificate Program Information Technologies Programs Data Science Certificate Program Accelerate Your Career extension.uci.edu/datascience Offered in partnership with University of California, Irvine Extension s professional

More information

Master of Science in Computer Science Information Systems

Master of Science in Computer Science Information Systems Master of Science in Computer Science Information Systems 1. General Admission Requirements. Admission to Graduate Studies (see graduate admission requirements). 2. Program Admission. In addition to meeting

More information

Predictive Analytics & Predictive Modeling December 2 3, 2014. Catherine Snyder Supervisor US Dealer Audit, Audit Services General Motors Company

Predictive Analytics & Predictive Modeling December 2 3, 2014. Catherine Snyder Supervisor US Dealer Audit, Audit Services General Motors Company Predictive Analytics & Predictive Modeling December 2 3, 2014 Catherine Snyder Supervisor US Dealer Audit, Audit Services General Motors Company AGENDA Overview of General Motors Company Predictive Analytics

More information

Keywords Big Data; OODBMS; RDBMS; hadoop; EDM; learning analytics, data abundance.

Keywords Big Data; OODBMS; RDBMS; hadoop; EDM; learning analytics, data abundance. Volume 4, Issue 11, November 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Analytics

More information

IBM SPSS Direct Marketing

IBM SPSS Direct Marketing IBM Software IBM SPSS Statistics 19 IBM SPSS Direct Marketing Understand your customers and improve marketing campaigns Highlights With IBM SPSS Direct Marketing, you can: Understand your customers in

More information

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014 RESEARCH ARTICLE OPEN ACCESS A Survey of Data Mining: Concepts with Applications and its Future Scope Dr. Zubair Khan 1, Ashish Kumar 2, Sunny Kumar 3 M.Tech Research Scholar 2. Department of Computer

More information

SEO Consulting Services By Cromosys. [Strategy & Plan]

SEO Consulting Services By Cromosys. [Strategy & Plan] SEO Consulting Services By Cromosys [Strategy & Plan] CROMOSYS SEO SERVICES WITH DETAILED STRATEGY & PLAN Cromosys offers SEO Services in terms of expertise and unique services. Cromosys will help you

More information

Predictive analytics. The rise and value of predictive analytics in enterprise decision making

Predictive analytics. The rise and value of predictive analytics in enterprise decision making WHITE PAPER Predictive analytics The rise and value of predictive analytics in enterprise decision making Give me a long enough lever and a place to stand, and I can move the Earth. Archimedes, 250 B.C.

More information

testo dello schema Secondo livello Terzo livello Quarto livello Quinto livello

testo dello schema Secondo livello Terzo livello Quarto livello Quinto livello Extracting Knowledge from Biomedical Data through Logic Learning Machines and Rulex Marco Muselli Institute of Electronics, Computer and Telecommunication Engineering National Research Council of Italy,

More information

Current state of learning analytics and educational data mining

Current state of learning analytics and educational data mining Current state of learning analytics and educational data mining George Siemens Ryan S.J.d. Baker August 2013 Poll #1 How far along is your institution in using LA/ EDM at institutional level? We re thinking

More information

The Big Data Revolution And How to Extract Value from Big Data

The Big Data Revolution And How to Extract Value from Big Data data analysis data mining quality improvement web-based analytics Business White Paper The Big Data Revolution And How to Extract Value from Big Data Dr. Thomas Hill The Big Data Revolution, by Dr. Thomas

More information

M15_BERE8380_12_SE_C15.7.qxd 2/21/11 3:59 PM Page 1. 15.7 Analytics and Data Mining 1

M15_BERE8380_12_SE_C15.7.qxd 2/21/11 3:59 PM Page 1. 15.7 Analytics and Data Mining 1 M15_BERE8380_12_SE_C15.7.qxd 2/21/11 3:59 PM Page 1 15.7 Analytics and Data Mining 15.7 Analytics and Data Mining 1 Section 1.5 noted that advances in computing processing during the past 40 years have

More information

What is Data Mining? Data Mining (Knowledge discovery in database) Data mining: Basic steps. Mining tasks. Classification: YES, NO

What is Data Mining? Data Mining (Knowledge discovery in database) Data mining: Basic steps. Mining tasks. Classification: YES, NO What is Data Mining? Data Mining (Knowledge discovery in database) Data Mining: "The non trivial extraction of implicit, previously unknown, and potentially useful information from data" William J Frawley,

More information

COMP9321 Web Application Engineering

COMP9321 Web Application Engineering COMP9321 Web Application Engineering Semester 2, 2015 Dr. Amin Beheshti Service Oriented Computing Group, CSE, UNSW Australia Week 11 (Part II) http://webapps.cse.unsw.edu.au/webcms2/course/index.php?cid=2411

More information

Introduction Predictive Analytics Tools: Weka

Introduction Predictive Analytics Tools: Weka Introduction Predictive Analytics Tools: Weka Predictive Analytics Center of Excellence San Diego Supercomputer Center University of California, San Diego Tools Landscape Considerations Scale User Interface

More information

Computer-Based Text- and Data Analysis Technologies and Applications. Mark Cieliebak 9.6.2015

Computer-Based Text- and Data Analysis Technologies and Applications. Mark Cieliebak 9.6.2015 Computer-Based Text- and Data Analysis Technologies and Applications Mark Cieliebak 9.6.2015 Data Scientist analyze Data Library use 2 About Me Mark Cieliebak + Software Engineer & Data Scientist + PhD

More information

Data Science and Business Analytics Certificate Data Science and Business Intelligence Certificate

Data Science and Business Analytics Certificate Data Science and Business Intelligence Certificate Data Science and Business Analytics Certificate Data Science and Business Intelligence Certificate Description The Helzberg School of Management has launched two graduate-level certificates: one in Data

More information

Get the most value from your surveys with text analysis

Get the most value from your surveys with text analysis PASW Text Analytics for Surveys 3.0 Specifications Get the most value from your surveys with text analysis The words people use to answer a question tell you a lot about what they think and feel. That

More information

Building Analytics and Big Data Capabilities Tom Davenport CDB Annual Conference May 23, 2012

Building Analytics and Big Data Capabilities Tom Davenport CDB Annual Conference May 23, 2012 Building Analytics and Big Data Capabilities Tom Davenport CDB Annual Conference May 23, 2012 A Bright Idea Informatics/Analytics on Small and Big Data It works for: Old companies (GE, P&G, Marriott, Bank

More information

Programme Specification Postgraduate Programmes

Programme Specification Postgraduate Programmes Programme Specification Postgraduate Programmes Awarding Body/Institution Teaching Institution University of London Goldsmiths, University of London Name of Final Award and Programme Title MSc Data Science

More information

BOOSTING - A METHOD FOR IMPROVING THE ACCURACY OF PREDICTIVE MODEL

BOOSTING - A METHOD FOR IMPROVING THE ACCURACY OF PREDICTIVE MODEL The Fifth International Conference on e-learning (elearning-2014), 22-23 September 2014, Belgrade, Serbia BOOSTING - A METHOD FOR IMPROVING THE ACCURACY OF PREDICTIVE MODEL SNJEŽANA MILINKOVIĆ University

More information

Data Analysis. Management Information Systems 13

Data Analysis. Management Information Systems 13 Data Analysis Management Information Systems 13 166137-01+02 Management Information Systems Spring 2014 Sync Sangwon Lee, Ph. D D. of Information & Electronic Commerce WONKWANG University Prof. Dr. SSL

More information

What is Data Science? Data, Databases, and the Extraction of Knowledge Renée T., @becomingdatasci, November 2014

What is Data Science? Data, Databases, and the Extraction of Knowledge Renée T., @becomingdatasci, November 2014 What is Data Science? { Data, Databases, and the Extraction of Knowledge Renée T., @becomingdatasci, November 2014 Let s start with: What is Data? http://upload.wikimedia.org/wikipedia/commons/f/f0/darpa

More information

from ideas to outcomes

from ideas to outcomes from ideas to outcomes SSTRIDE-ING FOR DIVERSITY A COMPREHENSIVE APPROACH Watch Video IN THE BEGINNING THERE WAS the P I M S Funded by a National Institutes of Health grant, PIMS was designed to address

More information

Perm State University Master in Finance & Information Technology (MiFIT)

Perm State University Master in Finance & Information Technology (MiFIT) Perm State University Master in Finance & Information Technology (MiFIT) Page 1 Introduction Rapid changes in financial landscape demand finance professionals to double-quick develop practical solutions

More information

Solve Your Toughest Challenges with Data Mining

Solve Your Toughest Challenges with Data Mining IBM Software Business Analytics IBM SPSS Modeler Solve Your Toughest Challenges with Data Mining Use predictive intelligence to make good decisions faster Solve Your Toughest Challenges with Data Mining

More information

Machine Learning and Data Mining. Fundamentals, robotics, recognition

Machine Learning and Data Mining. Fundamentals, robotics, recognition Machine Learning and Data Mining Fundamentals, robotics, recognition Machine Learning, Data Mining, Knowledge Discovery in Data Bases Their mutual relations Data Mining, Knowledge Discovery in Databases,

More information

PREDICTING STUDENT RETENTION & SUCCESS IN ONLINE PROGRAMS. William Bloemer & Karen Swan UNIVERSITY OF ILLINOIS SPRINGFIELD

PREDICTING STUDENT RETENTION & SUCCESS IN ONLINE PROGRAMS. William Bloemer & Karen Swan UNIVERSITY OF ILLINOIS SPRINGFIELD PREDICTING STUDENT RETENTION & SUCCESS IN ONLINE PROGRAMS William Bloemer & Karen Swan UNIVERSITY OF ILLINOIS SPRINGFIELD The rarely articulated implication of all of this data floating around is that

More information

Maximizing Return and Minimizing Cost with the Decision Management Systems

Maximizing Return and Minimizing Cost with the Decision Management Systems KDD 2012: Beijing 18 th ACM SIGKDD Conference on Knowledge Discovery and Data Mining Rich Holada, Vice President, IBM SPSS Predictive Analytics Maximizing Return and Minimizing Cost with the Decision Management

More information

Solve your toughest challenges with data mining

Solve your toughest challenges with data mining IBM Software IBM SPSS Modeler Solve your toughest challenges with data mining Use predictive intelligence to make good decisions faster Solve your toughest challenges with data mining Imagine if you could

More information

Getting Started with Oracle Data Miner 11g R2. Brendan Tierney

Getting Started with Oracle Data Miner 11g R2. Brendan Tierney Getting Started with Oracle Data Miner 11g R2 Brendan Tierney Scene Setting This is not about DB log mining This is an introduction to ODM And how ODM can be included in OBIEE (next presentation) Domain

More information

Direct-to-Company Feedback Implementations

Direct-to-Company Feedback Implementations SEM Experience Analytics Direct-to-Company Feedback Implementations SEM Experience Analytics Listening System for Direct-to-Company Feedback Implementations SEM Experience Analytics delivers real sentiment,

More information

In this presentation, you will be introduced to data mining and the relationship with meaningful use.

In this presentation, you will be introduced to data mining and the relationship with meaningful use. In this presentation, you will be introduced to data mining and the relationship with meaningful use. Data mining refers to the art and science of intelligent data analysis. It is the application of machine

More information

SEI Case Stud y June 2013

SEI Case Stud y June 2013 Degree Compass Course Recommendation System SEI Case Stud y June 2013 Institution: Austin Peay State University, a four-year public, master s university with more than 10,000 students that offers programs

More information

Automated vs. manual methods of coding and analysing free text survey responses

Automated vs. manual methods of coding and analysing free text survey responses Automated vs. manual methods of coding and analysing free text survey responses Dr Kathy Seymour, Director, Seymour Research Ltd Free text data General Specific Please use the space below to provide any

More information

Computer Assisted Language Learning (CALL): Room for CompLing? Scott, Stella, Stacia

Computer Assisted Language Learning (CALL): Room for CompLing? Scott, Stella, Stacia Computer Assisted Language Learning (CALL): Room for CompLing? Scott, Stella, Stacia Outline I What is CALL? (scott) II Popular language learning sites (stella) Livemocha.com (stacia) III IV Specific sites

More information

Table of Contents. June 2010

Table of Contents. June 2010 June 2010 From: StatSoft Analytics White Papers To: Internal release Re: Performance comparison of STATISTICA Version 9 on multi-core 64-bit machines with current 64-bit releases of SAS (Version 9.2) and

More information

Big and Smart Data for efficient decisions: How to share with decision makers the practices of Big Data Analytics?

Big and Smart Data for efficient decisions: How to share with decision makers the practices of Big Data Analytics? Big and Smart Data for efficient decisions: How to share with decision makers the practices of Big Data Analytics? Ali FOULADKAR (ali.fouladkar@upmf-grenoble.fr) PhD candidate, Grenoble University (UPMF),

More information

DEPARTMENT OF INFORMATION AND LIBRARY SCIENCE

DEPARTMENT OF INFORMATION AND LIBRARY SCIENCE COLLEGE OF LIBERAL ARTS 67 DEPARTMENT OF INFORMATION AND LIBRARY SCIENCE Degrees Offered: B.A., M.A. Chair: Lin, Sinn-cheng ( 林 信 成 ) The Department The Department of Information and Library Science offers

More information

An Introduction to WEKA. As presented by PACE

An Introduction to WEKA. As presented by PACE An Introduction to WEKA As presented by PACE Download and Install WEKA Website: http://www.cs.waikato.ac.nz/~ml/weka/index.html 2 Content Intro and background Exploring WEKA Data Preparation Creating Models/

More information

Prof. Timothy Shea Charlton College of Business Southcoast E-Commerce Conference 2015

Prof. Timothy Shea Charlton College of Business Southcoast E-Commerce Conference 2015 Prof. Timothy Shea Charlton College of Business Southcoast E-Commerce Conference 2015 Web Analytics (a little) Text Analytics CEC: Customer Experience Management Final Video The measurement, collection,

More information

WHARTON COUNTY JUNIOR COLLEGE

WHARTON COUNTY JUNIOR COLLEGE Developmental Education Program Survey Institution Summary Report WHARTON COUNTY JUNIOR COLLEGE Organization of D.E. Programs Is the Developmental Education Program Centralized? Does this Institution have

More information

Big Data and Analytics: Challenges and Opportunities

Big Data and Analytics: Challenges and Opportunities Big Data and Analytics: Challenges and Opportunities Dr. Amin Beheshti Lecturer and Senior Research Associate University of New South Wales, Australia (Service Oriented Computing Group, CSE) Talk: Sharif

More information

DATA MINING - SELECTED TOPICS

DATA MINING - SELECTED TOPICS DATA MINING - SELECTED TOPICS Peter Brezany Institute for Software Science University of Vienna E-mail : brezany@par.univie.ac.at 1 MINING SPATIAL DATABASES 2 Spatial Database Systems SDBSs offer spatial

More information

Forensic & Investigative Accounting (FIA) Section American Accounting Association Mission, Objectives and 2013-2015 Strategy.

Forensic & Investigative Accounting (FIA) Section American Accounting Association Mission, Objectives and 2013-2015 Strategy. Forensic & Investigative Accounting (FIA) Section American Accounting Association Mission, Objectives and 2013-2015 Strategy Mission The mission of the Forensic & Investigative (FIA) Section of the American

More information

Name: Srinivasan Govindaraj Title: Big Data Predictive Analytics

Name: Srinivasan Govindaraj Title: Big Data Predictive Analytics Name: Srinivasan Govindaraj Title: Big Data Predictive Analytics Please note the following IBM s statements regarding its plans, directions, and intent are subject to change or withdrawal without notice

More information

White Paper. Data Mining for Business

White Paper. Data Mining for Business White Paper Data Mining for Business January 2010 Contents 1. INTRODUCTION... 3 2. WHY IS DATA MINING IMPORTANT?... 3 FUNDAMENTALS... 3 Example 1...3 Example 2...3 3. OPERATIONAL CONSIDERATIONS... 4 ORGANISATIONAL

More information

Use of Data Mining Techniques to Improve the Effectiveness of Sales and Marketing

Use of Data Mining Techniques to Improve the Effectiveness of Sales and Marketing Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 4, April 2015,

More information

PREDICTIVE ANALYTICS DEMYSTIFIED

PREDICTIVE ANALYTICS DEMYSTIFIED PREDICTIVE ANALYTICS DEMYSTIFIED 12.12.2014 Agenda Introduction Who we are! What is Predictive Analytics? Who needs Predictive Analytics? How to build Predictive Models? Demonstration: IBM SPSS Success

More information

Virtual Site Event. Predictive Analytics: What Managers Need to Know. Presented by: Paul Arnest, MS, MBA, PMP February 11, 2015

Virtual Site Event. Predictive Analytics: What Managers Need to Know. Presented by: Paul Arnest, MS, MBA, PMP February 11, 2015 Virtual Site Event Predictive Analytics: What Managers Need to Know Presented by: Paul Arnest, MS, MBA, PMP February 11, 2015 1 Ground Rules Virtual Site Ground Rules PMI Code of Conduct applies for this

More information

Improve Model Accuracy with Unstructured Data

Improve Model Accuracy with Unstructured Data IBM SPSS Modeler Premium Improve Model Accuracy with Unstructured Data Highlights Easily access, prepare and integrate structured data and text, Web and survey data Support the entire data mining process

More information

Massive Cloud Auditing using Data Mining on Hadoop

Massive Cloud Auditing using Data Mining on Hadoop Massive Cloud Auditing using Data Mining on Hadoop Prof. Sachin Shetty CyberBAT Team, AFRL/RIGD AFRL VFRP Tennessee State University Outline Massive Cloud Auditing Traffic Characterization Distributed

More information

Bachelor of Bachelor of Computer Science

Bachelor of Bachelor of Computer Science Bachelor of Bachelor of Computer Science Detailed Course Requirements The 2016 Monash University Handbook will be available from October 2015. This document contains interim 2016 course requirements information.

More information

COURSE SYLLABUS. Instructor Information:

COURSE SYLLABUS. Instructor Information: COURSE SYLLABUS Term: Fall 2015 Course: Econ 160 A: Economic Theory and Personal Finance Instructor Information: Instructor Name Dr. Melvin Randolph Office Number: Student Success Center Phone Number:

More information

Data Mining Applications in Higher Education

Data Mining Applications in Higher Education Executive report Data Mining Applications in Higher Education Jing Luan, PhD Chief Planning and Research Officer, Cabrillo College Founder, Knowledge Discovery Laboratories Table of contents Introduction..............................................................2

More information