Developing Data-Driven Predictive Models of Student Success Kresge Data Mining Project Phase One Report

Size: px
Start display at page:

Download "Developing Data-Driven Predictive Models of Student Success Kresge Data Mining Project Phase One Report"

Transcription

1 1 Developing Data-Driven Predictive Models of Student Success Kresge Data Mining Project Phase One Report University of Maryland, University College November 20, 2012

2 2 Executive Summary This report documents progress on the Kresge Data Mining Grant: Developing Data- Driven Predictive Models of Student Success. The grant was awarded to University of Maryland, University College (UMUC) in collaboration with two community college partners, Prince George's Community College (PGCC) and Montgomery College (MC). The purpose of the grant is to build an integrated database in order to examine student progress across multiple institutions using data mining techniques and statistical models to identify factors that predict online student success. To date, this project has three major accomplishments: 1. UMUC and the two partner community colleges, PGCC and MC, have a data sharing partnership that supports academic research and student success. 2. UMUC designed, developed, and implemented a multi-institutional database of over 250,000 students (called the Kresge Data Mart) that contains the academic history on transfer students from these two colleges integrated with data on UMUC student online behavior and academic performance. 3. Researchers used data mining and predictive modeling to investigate relationships between variables to predict student success. Initial research yielded the following results: 1. Researchers reviewed the extant literature on online education and educational data mining and identified key variables that have been found to predict successful course completion and student re-enrollment and retention. 2. Examination of the student data in the Kresge Data Mart using four classification data mining algorithms and survival analysis indicated that factors in the students classroom behaviors and previous academic background, such as number of schools attended, transfer credits, and transfer grade point average were strong predictors of student re-enrollment in the following semester.

3 3 3. Regression models utilizing predictors revealed that total transfer credit, transfer GPA, activity in course conferencing, and activity prior to the start of a course, contribute strongly to course success. 4. Predictive models verified that course success is a strong predictor of reenrollment. These data mining and statistical modeling identify variables associated with online student success for students transferring from PGCC and MC. These findings guide next research steps, which include validating data mining and predictive models on an expanded dataset, building profiles of successful online students, and developing meaningful interventions to improve success of students transferring from our partner institutions.

4 4 Introduction The purpose of this report is to document progress on the Kresge Data Mining Grant: Developing Data-Driven Predictive Models of Student Success. The research is being conducted by the Evaluation and Assessment research team in the Office of Institutional Research at University of Maryland University College (UMUC). This report is presented in six sections: 1) Section I contains the grant overview, objectives, milestones and a financial update. 2) Section II reviews the research design, general methodologies, and research questions. 3) Section III contains the literature review and research foundations. 4) Section IV reviews the data developed for this project. 5) Section V describes initial results from the analyses of research questions. 6) Section VI describes next steps to stay on track with both project and research goals. Section I: Grant overview The grant was awarded to UMUC, in collaboration with two community college partners, to build a multi-institutional database to examine student progress across multiple institutions. The purpose of this grant is to use data mining techniques to develop data driven predictive models and to identify factors that influence the success of online students. Embedded in the project is the expectation that findings from the research will be integrated into the business processes of each of the participating institutions for the purpose of evaluating the impact on student success. Ongoing evaluation is a key aspect of this agile research project. Over the course of this three-year project, student transcript data from Montgomery College and Prince George s Community College will be integrated with performance and classroom behavior data from UMUC. The research in data mining and predictive modeling will contribute to a process to identify relationships between the student s community college academic history and their performance at the four-year institution.

5 5 Grant partnership UMUC is a four-year public university that offers online degree programs to a diverse population of working adults. As part of this grant, UMUC established partnerships with two Maryland community colleges that also serve large and diverse student populations. Montgomery College (MC), established in 1946, currently enrolls over 35,000 students annually. MC is a diverse institution that serves a large international student population. Prince George's Community College (PGCC) enrolls more than 40,000 students and is also very diverse, serving students from approximately 125 different countries. Both institutions serve the metro-d.c. area, but are different in that PGCC serves more students who are considered low-income. Both institutions have endorsed the goals of this project and are committed to working with UMUC to find ways to better serve transfer students. A memorandum of Understanding (MOU) between UMUC and each partner institution was negotiated and signed in order to clarify the security and use of data used for this research. This MOU allows the research to move forward while protecting the privacy of students (See Appendix A). Objectives and milestones Specific objectives and milestones are presented below for each stage of the research project. These objectives and milestones have been modified slightly from the original grant documents, but are consistent with the grant requirements. Table 1 UMUC Kresge data mining grant objectives and milestones status as of September 1, 2012 Objectives Milestones Status Year One Develop a Identify project requirements. Complete project action Develop a project action and collaboration plan Complete plan with the partnering agencies. Hire a full-time researcher and half time Complete programmer. Build an evaluation framework. Complete Data Work with the different units at UMUC and Complete

6 6 Objectives Milestones Status Preparation partnering institutions to collect data. Clean and transform data. Complete Prepare a data universe (integrated database In process system) on transfer students in the population. Understand variables; define student Complete characteristics and retention data; develop data dictionary. Evaluation of Conduct ongoing project evaluation. In process project Year Two Build the Analyze data and identify factors that predict In process analytic models success/failure. Build the data mining and analytic techniques and In process integrate into the predictive models. Build student profiles based on the models. In process Assess the validity of the models Discuss results with advisory board and obtain feedback. Discuss results with Project Partners and obtain feedback. Evaluate the models in predicting student's chances for success and make recommendations for adjustments to the models Evaluation of Conduct ongoing project evaluation. project Design pilot study Year Three Conduct pilot study Evaluate the pilot Design intervention study. Define requirements for the pilot. Develop implementation plan. For the resulting predictive models, identify points of integration in the UMUC business processes. Partner institutions with transferring students will evaluate their business processes to identify points of integration of modeling techniques for improvement of student retention. Deploy interventions and evaluation systems through a pilot at the partnering institutions and at UMUC. Conduct student retention and analysis of student satisfaction. Evaluate the effectiveness of the pilot and assess transferring student s chances of success and retention with the model interventions.

7 7 Objectives Milestones Status Discuss results with Project Partners and develop recommendations. Plan project revision based on evaluation data. Dissemination of the data mining model and resources Incorporate feedback from the education community to improve the models Comprehensive evaluation of the project Report on the process and results Develop website and repository for resources. Present findings at national conferences on higher education to describe research purpose and results. Organize UMUC conference to engage national discussion around research in data mining and predictive modeling for the purpose of improving student success. Evaluate the attendance, interest and requests from the meetings. Develop and disseminate finding. Use feedback to identify and integrate into enhancement plan. Conduct ongoing project evaluation and develop an evaluation report Solidify data-sharing partnerships with community colleges, documenting roles, processes, and measurable outcomes of student success. Develop an integrated research paper outlining research purpose, research design, processes and products. Financials The Kresge Foundation awarded UMUC $1.2 million to build an integrated database, explore data mining techniques, build predictive models of student success, define intervention strategies to increase student success, deploy those strategies, and continuously monitor and evaluate the strategies and student outcomes. To date, UMUC has expended approximately 41% of the total grant. The expenses encompass a hardware purchase to house the data mining database, expenditures for data collection at the partner institutions, and dedicated salaries for a data mining specialist and a graduate assistant. Additional staff resources are provided in kind by UMUC. The remaining

8 8 grant (approximately $703,333) will be used to support external evaluation and conference development. The financial statement for this grant is in Appendix B. Section II: Research Design The research design for this study was generated from the grant s goal to develop data driven predictive models of online student success using data mining techniques. The design covers exploratory research questions that have been addressed so far. Additional questions are expected to be developed after review of the initial results. This study employs an exploratory multi-method research design. Exploratory research is useful in this context because it allows researchers to investigate relationships between variables and build predictive models that could be replicated and tested. A multi-method research design allows researchers to be flexible in their choice of methodology and to answer research questions using exploratory analyses. With this research design in mind, this study addresses the following exploratory research questions: Question 1: What relationships exist among variables in the extant dataset? Question 2: Which variables contribute to the prediction of online course success? Question 3: Which variables contribute to the prediction of retention in an online environment? Section III: Literature Review This literature review examines online student success and educational data mining literature as it pertains to student success. This section contains three parts: study definitions, online student success literature and educational data mining literature. Definitions Data mining in education research is a broad term that encompasses both data management and statistical techniques. Data management refers to how data are prepared for the

9 9 application of data mining techniques. These techniques are used to explore variables that may predict student progression related to online course success and retention. In order to maintain language consistency, terms used throughout the study are defined below. Students: stateside, undergraduate, first-bachelor s seeking students who transferred from either PGCC or MC. This is also commonly known as the cohort for the study or the population of interest. Course success: obtaining a final grade of A, B, or C in any course. Re-enrollment: enrolling the immediate next semester following the semester in question. Retention: re-enrolling at the institution within 12 months after initial enrollment or in a rolling window of three semesters following the semester of interest. Degree completion: attaining a bachelor s degree after enrolling as a degree seeking student. Student success: a broad term used to indicate course success, re-enrollment, retention, or degree completion. These definitions were determined by reviewing a number of sources including: 1) a review of data mining and online retention literature; 2) institutional publications, such as reports to the Middle States Commission on Higher Education and in-house studies on retention and course success and 3) UMUC, MC and PGCC undergraduate course catalogs. Additional terminology and definitions related to data mining and research methodology are discussed later in this report. Online student success literature Current literature on student success focuses primarily on student outcomes such as course success, course withdrawal and retention. For example, student and course level variables such as student characteristics, previous course work, grades, and time spent in course discussions and activities may be useful in measuring course success (Aragon & Johnson, 2008;

10 10 Morris & Finnegan, 2009; Morris, Finnegan & Lee 2009; Park & Choi, 2009). Courselevel variables acquired from student login data may have predictive value in measuring course withdrawal (Willging & Johnson, 2008; Nistor & Neubauer, 2010). Student, course, program and institution level variables such as student characteristics, number of transfer credits, final grade in any given course, experience in online environments, and course load may have utility measuring re-enrollment and retention (Aragon & Johnson, 2008; Morris & Finnegan, 2009; Boston, Diaz, Gibson, Ice, Richardson & Swan, 2011). Although these studies showcase a variety of findings related to student success, the majority of studies in retention in online learning environments use traditional statistical or qualitative methods. Park and Choi (2009) point out that expansion of methods such as data mining might have utility when student, course, program, and institutional level variables are well defined and institutionally meaningful. Literature related to educational data mining focusses on exploratory research. Educational data mining literature Data mining is a method of discovering new and potentially useful information from large amounts of data (Baker and Yacef, 2009; Luan, 2001). Educational data mining is a subset of the field of data mining that draws on a wide variety of literatures such as statistics, psychometrics, and computational modeling to examine relationships that may predict student outcomes (Romano and Ventura, 2007; Baker and Yacef, 2009). In educational data mining, data mining algorithms are used to create and improve models of student behavior in order to improve student learning (Luan, 2002). Data mining methods are most helpful for finding patterns already present in data, not necessarily in testing hypotheses (Luan, 2001). Baker and Yucef (2009) suggest that higher education research should use a variety of algorithms, such as classification, clustering or association algorithms in determining relationships between variables. Although many definitions of these techniques exist in data mining literature, Han and Kamber (2001) offer the following definitions. Classification is the process of finding a set of models or functions that describe and distinguish data classes or concepts to predict a class of objects whose class label is unknown. Clustering analyzes data objects that are related to similar outcomes without

11 11 consulting a class label. Association is the discovery of rules showing attribute value conditions that occur frequently together in a given set of data (Han & Kamber, pp.23-24). Recent research suggests that these data mining algorithms can be used to examine variables related to student success. Yu, DiGangi, Jannach-Pennell, Lo, and Kaprolet (2010) used a classification algorithm to explore potential predictors related to student retention in a traditional, undergraduate institution. In this study, the authors used a decision tree to explore demographic, academic performance, and enrollment variables as they related to student retention. This study revealed a predictable relationship between earned hours and retention, but also found that at this institution, retention was closely related to state of residence (in-state/out of state) and living location (on campus/off campus). The authors speculate that this finding points to the potential utility of online courses in improving retention for out-of-state or offcampus students. Despite these recent developments in exploring variables related to student success in traditional higher education settings, research using data mining techniques to uncover relationships among variables in online courses is limited in scope. This study is designed to fill this gap in the extant literature by utilizing data on online students who attended multiple institutions. Section IV: Project Data The Kresge project has several goals. One is for UMUC to develop a data sharing partnership with key community colleges. Another is to build an integrated multi-institutional data mart that describes the academic history of transfer students who have attended community colleges prior to enrolling at UMUC. This data mart will be used for data mining and for the development of statistical models that identify factors related to student success in an online environment. To that end, there are several successes in the area of data integration that have been achieved over the past year. First, a meaningful and significant partnership was established between UMUC, MC and PGCC. Second, data were collected from source systems and stored in secure areas for processing. Third, the vision and design for the Kresge Data Mart (KDM) has

12 12 been developed. Fourth, the student population of interest was identified and verified. Fifth, datasets were created for the initial modeling and data mining. The most significant success of this grant so far is the establishment of the data sharing collaboration between the community colleges and UMUC. The partnership agreements have been forged through a series of meetings and an MOU that guides the research and data sharing activities. Research questions have been developed to benefit the students through actionable research that will impact the business processes at each institution. Measurable outcomes have been set and ongoing evaluation is being conducted and shared with partner institutions. The data used in this effort will be collected from all three institutions and stored in a data mart called the Kresge Data Mart (KDM), which is housed on a server at UMUC. This section will describe the overall vision for the development of the infrastructure and the current status of the project. Student Data A core set of data is collected on each student from each institution. This information includes demographics, enrollment, courses, grades, and recent classroom activity in online courses. Each is described below. Demographic data. Demographic data include fields such as address, phone number, age, gender, and race/ethnicity. While some of this information may change over time, for the purpose of this study, the demographic data are considered static. Enrollment data. Enrollment data are typically captured on students who have officially registered. Enrollment data includes course registration, grades, program of study or major, and student status. This information changes each term or semester. Transfer data. Data about a student s academic history at previous institutions are also stored. This information includes number of transfer credits, courses transferred, grades associated with each transfer course, and prior degrees earned. There are two sources of transfer data. One is the community college transcript data that contains all course information on each student. These data are collected and integrated into the KDM. The other source is transcript files which may not include all information on the student s prior courses for only courses that were transferred to UMUC and equated to UMUC courses. The UMUC transcript data are not as

13 13 complete as the community college transcript data, but serve as the basis for courses and credits that are applied to the student s degree. Classroom behavior data. Webtycho, UMUC s proprietary learning management system, stores counts of student activity in various modules within each online class. Data are stored for each session, which is defined as the time between each student login and logoff. Identifying the population of interest The students to be included in the KDM are defined as students who attended UMUC between 2005 and Students who attended at least one community college prior to UMUC enrollment were identified and analyzed as a separate subpopulation. The data will eventually include those who attended a community college of interest, but did not transfer to UMUC. These two subgroups and the larger population of students will serve as the foundation of data stored in the KDM and the backbone for the research. The process of building the populations has multiple steps. The first step was to identify two partner institutions who would share data on students who transferred to UMUC. The second step will be to identify other institutions that are willing to do the same. The third step will be to identify students who transferred from key partner institutions, but who did not transfer to UMUC. This report focuses on the first stage of building an integrated database. The population of interest is transfer students. Two community colleges, Montgomery College and Prince George s Community College, that transfer a large number of students to UMUC were identified. UMUC formed a data sharing partnership with each institution to identify factors related to student success, define academic interventions, implement interventions, and examine their impact on student success. UMUC identified 257,903 students who attended UMUC between 2005 and Of these students, MC used varying matching strategies to confirm enrollment for 21,444 students, and PGCC used a different set of matching strategies to confirm enrollment for 11,046 students. Additionally, the National Student Clearinghouse (NSC) identified 12,776 students who attended MC and 8,697 students who attended PGCC. Furthermore, UMUC analyzed transcript information in its student information system and found even fewer matches. Thus, the Kresge

14 14 Data Mart includes student data on all 257,903 students, as well as information on the matched students matched by the community colleges. The KDM will serve as a critical resource for all predictive analytics project as well as research projects related to student success and integrated data management. One key finding is that the data from the community colleges yielded much more robust and accurate information about students academic histories at previous institution than UMUC s internal data resources. Data integration Institutions store student demographic, enrollment and classroom behavior data in a variety of information systems. For the purpose of this project, data from two UMUC student applications, PeopleSoft and WebTycho, are included in this database. PeopleSoft is a student information system that stores current and historical data about the student and their enrollment history. MC and PGCC have similar systems for storing student data. WebTycho is a proprietary learning management system that records online classroom for every user (student or faculty) during an online session. Figure 1 Description of the Integrated Kresge Data Mart

15 15 Each institution extracted data for each student in the matched population from the source student information systems and learning management systems. The process for merging data across institutions requires standardizing the data and developing a data dictionary. Data from multiple systems are combined in stages. At the first stage, data from each partner institution were collected and cleaned. Definitions of each data element were gathered, standardized, and documented. At the second stage, data from the UMUC s student information system and the online classroom were merged and developed in the UMUC data warehouse. The next stage integrated the community college data with the UMUC data in the Kresge Data Mart (KDM), as shown in the diagram above. The KDM combines information on the students history at prior institutions, their current course performance information, and their online course behavior, as well as common outcome measures identified in the literature. The final stage of the KDM will include information from MC and PGCC on students who transfer elsewhere, as well as students who have transferred from other community colleges to UMUC. This is the data mart that will serve as the base of the final research models and intervention analysis. Current status. The community college data have been collected and a coding structure has been developed. The code continues to evolve, as the data are still being processed. As a result, preliminary analyses presented in this report are based on datasets that are derived from the raw data collected and stored during the data integration process. Data storage. In order to carry out data mining activities, storage was required for the various sources of data and for the various stages of the data integration process. To meet the requirements of building an integrated data mart, UMUC purchased the Oracle Exadata hardware which houses the KDM, as well as the UMUC data warehouse. The data for the partner institutions are stored separately from the data warehouse for the purpose of security and confidentiality. The data sources on the Oracle Exadata hardware, which is in part supported by the Kresge Foundation, have also contributed to other analytical and predictive analytics projects related to predictive analytics. The data integration among various sources adds value to all projects related to student success.

16 16 Data preparation for initial analyses Researchers used sample datasets to develop data mining algorithms and predictive models. The datasets were designed specifically for the analysis of the exploratory research questions. The data mining algorithms and predictive models were created to explore the initial research questions are described in each section below along with initial findings. Data security Secure File Transfer Protocol (SFTP) is the method that is currently being used to transfer data between partner institutions. The storage area, which contains both community college and UMUC data, is secured by the database administrators providing accounts only to staff who are working on the development of the KDM. Section V: Initial findings Research Question 1: Examining variables and relationships This analysis uses data mining techniques in order to examine relationships amongst variables in the extant dataset. The question answered in this analysis is: What relationships exist amongst variables in the extant dataset? Dataset. The dataset used for this analysis contains 2,643 new, stateside, first bachelor s degree seeking UMUC students enrolled in 15 gateway courses that students in the Spring 2011 took to begin their program of study. Since each student can be enrolled in more than one course, this dataset contains 4,331 enrollment records and is comprised of 394 unique sections of these gateway courses. The variables in the dataset include transfer history, enrollment data for the students first semester at UMUC, and online classroom behavior defined as WebTycho actions in the classroom prior to and after the start day of classes (See Table 2).

17 17 Variable Name Emplid Subject Course Number Total Transfer Credits Number of Schools Attended Previously Time Since Last Institution Transferred in at Least One English Course Transferred in at Least One Math Course Earned Associates Degree Course Grouping Length of Course in Weeks Semester Load Re-enrolled Entering Class Opening Course Content Opening Class Roster Table 2 Dataset variables and definitions Variable definitions Unique ID for each new student Four-letter code for each course, i.e.- ACCT, MATH, or PSYCH Subject and course number together create a class A sum of credits/hours transferred from other institutions A count of the number of institutions in the new student s transfer history The number of months since previous institution Flag to indicate if the student had at least one class in the transfer history where the subject was ENGL, EWRT, or WRTG Flag to indicate if the student had at least one class in the transfer history where the subject was MATH, STAT, or ENGR The degree earned by the new student (if any) at a previous institution Developmental (EDCP100), General (MATH 009 and 012), Business and Professional, Communication, Science and Social Sciences, Technology Potential values are 8, 10, 12, and 14 Total credits attempted in Spring 2011 by the student A Yes/No flag indicating whether or not the student enrolled in at least one course the Summer of 2011, excluding the courses enrolled in Spring 2011 Number of times the new student logged into an area of the online course in WebTycho; counts were created for each of the first four weeks of the class as well as the number of times prior to the first day of class Number of times the student opened the Course Content which is an area of the online course; counts were created for each of the first four weeks of the class as well as the number of times prior to the first day of class Number of times the student opened the Class Roster area of the course; counts were created for each of the first four weeks of the class as well as the number of times prior to the first day of class

18 18 Variable Name Using the Chat Feature Opening a Study Group Conference Opening Reserved Reading Opening the Webliography Starting the Conferencing Feature Reading a Conference Note Writing a Conference Note Adding an Attachment to a Conference Note Variable definitions Number of times the student used the Chat Feature in WebTycho; counts were created for each of the first four weeks of the class as well as the number of times prior to the first day of class Number of times the student went to the conference postings in a study group; counts were created for each of the first four weeks of the class as well as the number of times prior to the first day of class Number of times the student opened the Reserved Reading area of the online course; counts were created for each of the first four weeks of the class as well as the number of times prior to the first day of class Number of times the student opened the Webliography area of the online course; counts were created for each of the first four weeks of the class as well as the number of times prior to the first day of class Number of times the student opened the Conferences area of the online course; counts were created for each of the first four weeks of the class as well as the number of times prior to the first day of class Number of times the student opened a note for reading in the Conferences area of the course; counts were created for each of the first four weeks of the class as well as the number of times prior to the first day of class Number of times the student wrote or edited a note in the Conferences area of the course; counts were created for each of the first four weeks of the class as well as the number of times prior to the first day of class Number of times the student attached a file to a note in the Conferences area of the course; counts were created for each of the first four weeks of the class as well as the number of times prior to the first day of class Notes: class roster is a numbered listing of everyone registered for the class; chat feature is a place where classmates can participate in synchronous discussion and view shared dialogue; a conference is an area where students participate in asynchronous group discussion, much like an internet forum; within each conference a student can post a reply, which is also called a note ; reserved readings is a place where faculty may post read-only documents or links to external website for students; webliography is a shared list of web sites relevant to the class. Method. A data mining program, IBM/SPSS Modeler, was used to develop models that explored the dataset for relationships between variables. Four classification algorithms were used: C&R Tree, CHAID, QUEST, and C5.0. Each of the four models has its own advantages and disadvantages; however, two models, C5.0 and CHAID, proved to be the most valuable in the identification of relationships between specific variables. Cross-model data mining techniques were also used to produce a ranking of variable importance.

19 19 Findings. All algorithms combined were used to rank variables across all models. These models generated a ranking for variables related to re-enrollment across data mining models (see Table 3). Table 3 Variable ranking derived from data mining models Ranking Very High Variables 1) Frequency of student activity in the online classroom up to the fourth week of class 2) Entering the online classroom prior to the first day of class 3) Participation in any conference feature 4) Opening and reviewing the Course Content module within the classroom 5) Opening and reviewing Reserved Reading within the classroom High Middle Low Number of Schools Previously Attended Students semester Load Time Since Last Institution Transferred in at Least One English Course Transferred in at Least One Math Course Negligible All Other Variables Note: Very high, high, middle, low, and negligible are statistical categories that describe the strength of the predictive power of the variables included in the re-enrollment model. This ranking indicates that classroom behavior is of very high importance to re-enrollment. In addition, the variables number of schools previously attended and semester load are somewhat important, and other variables are either of low importance or negligible importance. The C5.0 model indicated that the variable Entering Class Week 4, followed by Entering Class Prior to First Day, Number of Schools Previously Attended, Writing a Conference Note Prior to First Day, and Semester Load were most related to re-enrollment (see Figure 2).

20 20 Figure 2 C5.0 model with top five variables related to re-enrollment The CHAID algorithm also indicated that the variable Entering Class Week 4 was most related to re-enrollment (see Figure X). Figure 3 CHAID Model with top 10 variables related to re-enrollment

21 21 Implications. Data mining techniques indicate that student classroom behaviors are of very high importance to re-enrollment. Previous academic background, such as number of schools attended and transfer credits, may also be important. These models suggest variables related to the student behaviors prior to the first day of class and previous academic work merit further examination in statistical predictive models. Research Question 2: Course success The purpose of this analysis is to construct a logistic regression model with a set of covariates and predictor variables that predict course success. This model intends to answer the research question: Which variables, if any, predict student online course success? The intent of this preliminary model is to predict course success from online student behavior in the first four weeks of the course and student s academic background information. 1 The ultimate goal of analysis is to maximize the prediction of online course success. Dataset. The dataset used for this analysis contains 4,558 new, undergraduate, first bachelor-degree seeking students enrolled in 15 UMUC online gateway courses in Spring This dataset also contains transfer data on students from partner institutions, Montgomery College (MC) and Prince George's Community College (PGCC). Table 4 provides descriptive statistics for all students, passing or not passing, in each gateway course for this dataset. 1 Subsequent models may include: growth models, latent class analysis and structural equation models in order to capture a different type of data/information within one system of predictive model. 2 Sessions included in this dataset are Online Session 1 (OL1), OL2, OL3, OL4, and OL5.

22 22. Table 4 Descriptive statistics for key covariates Passed a course Not Passed Passed All Course N % N % N ACCT % 63 41% 154 BMGT % % 229 CCJS % 74 50% 149 CMIS % % 275 EDCP % % 418 EDCP % 38 42% 91 GVPT % 59 56% 105 HIST % 88 61% 145 IFSM % % 620 LIBS % % 1980 MATH % 5 31% 16 MATH % 49 46% 106 PSYC % 77 44% 174 WRTG % 62 65% 96 All % % 4558 Method. For this study, a logistic regression model was developed that utilized available independent variables (covariates and predictors) to predict the desired dependent outcome, course success. A logistic regression model generates the probability of success (in logits) as it is linearly related to the set of independent variables. In this study, a pseudo- R 2 value 3, or value of prediction accuracy, was used for both model diagnosis and selection. Preliminary methods focused on the identification of covariates (e.g. student demographic information and the record of previous academic work) and predictors based on the students coursework behavior. Identification of key covariates. Exploratory factor analysis (EFA) was used to identify key covariates. EFA is an exploratory or data mining method that uncovers underlying constructs, or common structures, among a set of related variables. Previous data mining 3 Pseudo or generalized R squared is a generalization of the coefficient of determination, R 2, to more general regression model such as logistic regression (Nagelkerke,1991).

23 23 analysis 4 indicated that the following covariates in student coursework data were strongly associated with the course success: 1) total number of transfer credits (Xfer_total); 2) amount of time since students attended the last institution (Xfer_time); 3); semester course load (LOAD) and 4) GPA from transferred credits (GPA_C). Table 5 shows the rule to categorize each covariate for use in the logistic regression model. Table 5 Rules for key covariates in student coursework data Attribute Variable Name Rule Total number of transfer credits Xfer_total 0= 0 credit 1= 1-29 credits 2= credits Amount of time since a student attended the last institution Xfer_time 3= more than 60 credits 0= didn t attend 1= 1-24 months 2= over 24 months Semester course load LOAD 1 = 1-3 credits 2 = over 3 credits GPA from transferred credits GPA_C 0 = no information 1 = less than = 2.5 to less than = 3 to less than = over 3.5 Additional covariates related to student behavior prior to the official class starting day (called Week 0) were also found to be associated with course success. Table 6 displays summarized rules for each student behavior covariate in Week 0. 4 See section on cluster analysis for detailed explanation of this process.

24 24 Table 6 Rules for LMS student behaviors in Week 0 Behavior Variable Name Rule Entering a class Opnclss_cat0 0 = Did not enter a class 1= Entered a class at least once Opened a class roster Opnclssrstr_cat0 0 = Did not check the roster 1 = Check the roster at least once Opened a class content Opncrscntnt_cat0 0 = Did not open a class content 1 = Open a class content at least once Created a conference note Createconfnote_cat0 0 = Did not create a conference note 1= Created a conference note at least once Created a response note Createrespnote_cat0 0 = Did not create response note 1 = Created a response note at least once Summary of students week0 behavior prior to the first day TimeZeroTotal Sum of five variables above Using these rules, a logistic regression model was used to evaluate the effect of the identified covariates (in terms of pseudo- R 2 ) on course success for use in the final model (see Table 6). These findings include: 1. Total number of transferred credits is the best covariate predictor of course success (having pseudo R 2 value around.12) It was the best predictor of success amongst the time invariant covariates, or covariates that have consistent value over all models in the analysis using different points in time. 6 The covariate number of schools previously attended (Xfer_NBR) exhibited the linear dependency with a total number of transferred credits thus to remove from the analysis.

25 25 2. GPA from transferred credits is the second best covariate predictor of course success (pseudo R 2 around.11). 3. Semester course load contributes less to course success than other covariates. Analysis of combined covariates (student behaviors and previous coursework behavior) indicates that total number of transfer credits (Xfer_total), amount of time since a student attended the last institution (Xfer_time), GPA from transferred credits (GPA_C), and summary of student s behavior prior to first day (TimeZeroTotal) accounts for approximately 15-20% of the variance based on the pseudo R 2 value (see Table 7). Table 7 Summary of pseudo R 2 estimates Covariates R 2 estimates xfer_total 0.12 xfer_time 0.06 Load 0.01 GPA_C 0.11 Opnclss_cat Opnclssrstr_cat Opncrscntnt_cat Createconfnote_cat Createrespnote_cat TimeZeroTotal 0.07 xfer_total xfer_time LOAD TimeZeroTotal GPA_C 0.19 Building the predictive model. In order to accurately predict course success, predictor variables that measure student online course behavior in Weeks 1-4 were integrated into the logistic regression model. Exploratory analysis based on the data mining techniques and the availability of data indicated the importance of five predictor variables related to online course behavior: 1) entering a class; 2) opening class content; 3) creating a conference note; 4) creating

26 26 a response note and 5) reading a conference note. In order to account for high variability in student online course behaviors predictors were categorized using the following rules (see Table 8): Table 8 Rules for student online course taking behavior beyond Week 0 Behavior Variable Name Rule Entering a class Opnclss_catN 0 = Did not enter a class 1= Entered a class at least once Opened a class content Opncrscntnt_catN2 0 = Did not open a class content more than twice 1 = Open a class content at least twice Created a conference note Createconfnote_catN 0 = Did not create a conference note 1= Created a conference note at least once Created a response note Createrespnote_catN 0 = Did not create response note 1 = Created a response note at least once Read a conference note readconfnote_catn2 0 = Did not open a class content more than twice 1 = Created a response note at least twice In this case, each predictor variable represents a weekly total of distinctive actions per student per class in Weeks 1-4 of an online gateway course (see Table 9):

27 27 Table 9 Weekly total of actions per student beyond Week 0 Variable N Mean SD Percentile OPNCLSS_WEEK OPNCLSS_WEEK OPNCLSS_WEEK OPNCLSS_WEEK OPNCLSSRSTR_WEEK OPNCLSSRSTR_WEEK OPNCLSSRSTR_WEEK OPNCLSSRSTR_WEEK OPNCRSCNTNT_WEEK OPNCRSCNTNT_WEEK OPNCRSCNTNT_WEEK OPNCRSCNTNT_WEEK READCONFNOTE_WEEK READCONFNOTE_WEEK READCONFNOTE_WEEK READCONFNOTE_WEEK UPDTCONFNOTE_WEEK UPDTCONFNOTE_WEEK UPDTCONFNOTE_WEEK UPDTCONFNOTE_WEEK CREATECONFNOTE_WEEK CREATECONFNOTE_WEEK CREATECONFNOTE_WEEK CREATECONFNOTE_WEEK CREATERESPNOTE_WEEK CREATERESPNOTE_WEEK CREATERESPNOTE_WEEK CREATERESPNOTE_WEEK This categorization accomplished three purposes: 1) removed the effect of extreme values of highly variable actions; 2) facilitated a percentile level count of each behavior and 3) examined

28 28 the effect of each cut point 7. Categorization of predictor variables also emphasizes the importance of consistent engagement in a classroom activity, which is indicated by the categorization of 1 as a predictor variable for student action from Week 1 to Week 4 and 0 as an variable indicating no action. 8 Evaluating the predictive model. The covariates and predictors included in the final logistic regression model represent the best model fit (as represented by pseudo R 2 ) to evaluate student course success. In addition, this model allows for determination of the relative contribution of each covariate and predictor at each point in time, which will aid in identifying at-risk students. For example, the power of the model in predicting course success becomes extremely high (close to 90%) and pseudo R 2 exceeding.60 using 8 weeks of data. However, since interventions for increased course success are not useful at the end of the semester, findings focus on the performance of predictors over the first four weeks, particularly the first two weeks of behaviors in the online class. Findings. Initial analysis using the final model indicates the relative contribution of covariates and predictors from Week 1 to Week 4 (see Table 10). Table 10 Relative contribution of Covariates and Predictors Week Covariates xfer_total 15% 18% 16% 17% xfer_time 2% 2% 2% 2% Load 5% 7% 6% 7% TimeZeroTotal 11% 10% 9% 7% GPA_C 7% 9% 8% 9% Predictors createrespnote_total 10% 3% 6% 2% readconfnote_total2 34% 32% 23% 21% createconfnote_total 15% 7% 14% 10% opnclss_total 0% 13% 16% 26% opncrscntnt_total2 0% 0% 0% 1% 7 Cut point refers to numeric values or rules to categorize a continuous value into categories. 8 In other words, the predictor variable represents the sum of student actions across N weeks (0-4) as the final predictor variable in the model (e.g. <action>_total).

29 29 Examination of the relative contribution of covariates indicates that total transfer credit (xfer_total) exhibited the highest level of contribution to the predictive model (15%-18%) followed by Week 0 behavior (TimeZeroTotal), transfer GPA (GPA_C), course load (LOAD) and number of times a student transferred (xfer_time), respectively. Examination of the relative contribution of predictors indicates that the variable reading conference note (readconfnote_total2) has the highest influence in the predictive model (21%-34%). Note that open class (opnclss_total) had no impact on student success in Week 1, but that impact increased Weeks 2-4. The predictive power of the model increases gradually over the first four weeks of the course, from 79% correct prediction of course success at Week One 1, to 82% correct prediction of course success at Week 4. The pseudo R 2 value increased from approximately 19% using the covariate-only model (see Table 4 for value) to 50% in Week 1 predictive model (see Table 11). Table 11 Predictive power of the model by week Outcome/Predicted R2 estimates Count Percent Success Week P/P F/F F/P P/F P/P F/F F/P P/F % 37% 10% 11% 79% % 38% 9% 11% 81% % 39% 9% 10% 81% % 38% 9% 9% 82% Note that the pseudo R 2 value increased from 50% at Week 1 to 59% at Week 4. Group level analysis. Predictive analysis by sub-group based on sessions or courses was conducted using the model. Although findings from this group-level analysis is less informative due to limited sample size, the predictive model maintained a moderate to high level of fit based on pseudo R-squared value (e.g. above mid.30s). 9 The group-level analysis demonstrated that 9 OL5 session had a fair model fit but exhibited low prediction rate. The sample size of OL5 session was too small to draw a conclusion.

30 30 the predictive model maintains high level of model fit and prediction at over 75% in the majority of sub-groups (see Table 12) 10. Table 12 Predictive power of the model by week and session Week Session Stats OL1 R Success OL2 R Success OL3 R Success OL4 R Success OL5 R Success OL123 R Success LIBS150 R Success No LIBS 150 R Success Core LIBS R This sample included all students by session level and course category (general education, and remedial course), but excluded students in LIBS 150. LIBS 150 is a general education course; it is shorter and more orientation/informative course rather than a regular educational course, and it has very high passing rate of 75.3%.

31 31 Success Core No LIBS R Success EDCP100 R Success IFSM201 R Success EDCP100 IFSM201 R Success Implications. Preliminary analysis of variables related to course success identified a set of critical covariates and predictors. Four of five predictors derived from online student behavior show a strong contribution to student success. The final model using the full dataset will examine the effect of these predictors as well as evaluating the impact of other behavioral measures such as accessing course modules. Research Question 3: Student retention The purpose of this preliminary analysis is to construct a logistic regression model with a set of covariates and predictor variables and determine the impact on student retention. This model intends to answer the research question: Which variables, if any, predict retention? Dataset. The analysis utilized the same data set for the prediction of course success with the addition of retention status from Summer 2011, Fall 2011, and Spring The retention rate was 66% (N= 3,015) in this sample. Table 13 Percentage students retained and not retained Retained Not Retained N % N % % %

32 32 Method. As with prediction of course success, this model uses logistic regression to predict retention. Preliminary methods focused on the evaluation of covariates, identified in the previous analysis, predictors based on the students coursework behavior, and course success information (i.e. passed or not passed). Evaluation of covariates. A total number of credit transferred (xfer_total) had the highest predictive power followed by GPA from the transfer credits (GPA_C). The magnitude of covariates pseudo R 2 is lower on the prediction of retention than the prediction of course success. The total covariates yield a pseudo R 2 of Table 14 shows pseudo R 2 estimates from five covariates. Table 14 Retention pseudo R 2 estimates of covariates Covariates R 2 estimates xfer_total 0.08 xfer_time 0.02 GPA_C 0.07 Load 0.01 TimeZeroTotal 0.03 Cov Total 0.14 Evaluation of predictors. For this model, predictors identified for the course success model were used and combined with a new set of predictors: 1) number of courses passed (N_Pass); 2) number of courses not passed (N_NotPass); 3) number of course withdrawals (W) or failure due to non-attendance (FN) (N_WFN); 4) passing LIBS 150 (passed_libs150); 5) passing a General Education course (passed_gened); 6) passing a remedial course (passed_remed) and 7) term GPA (PGA_Term). In this model, student behavior indicators did not predict retention as well as they predicted course success (see Table 15).

33 33 Table 15 Prediction success of combined student behavior indicators by week Retention Course Success Week Low High Low High Findings. When included in the prediction model for retention, covariates predicting course success generated a higher a pseudo R 2 value than student behavior predictors (see Table 16). Table 16 R-squared estimates for retention predictors and covariates Predictors and covariates R 2 estimates passed_libs passed_gened 0.15 passed_remed 0.15 N_Pass 0.19 N_NotPass 0.17 N_WFN 0.16 GPA_Term 0.24 COVS passed_abc passed_gened passed_remed N_Pass 0.35 N_NotPass N_WFN GPA_Term ***COVS= xfer_total xfer_time LOAD GPA_C GPA and number of passing courses had the highest pseudo R 2 estimates among seven indicators. The full model including covariates and passing indicators had a pseudo R 2 of Implications. The effect of covariate and student behavior variables made less of a contribution to this model than the prediction of course success. The final model using the course

34 34 passing indicators 11 indicated a stronger contribution to the prediction of retention. These preliminary results indicate the importance of course success as the primary factor in increasing retention, which means that interventions aimed at improving course success may have a strong indirect effect on increasing retention. Additional exploratory analysis: Student withdrawal patterns The purpose of this additional exploratory analysis was to conduct a survival analysis that examined student course withdrawal patterns. The goal of this study is to increase the understanding of withdrawal patterns so that it can be effectively incorporated in models predicting the retention. Dataset. The dataset for this study contains 19,190 undergraduate UMUC students in OL1 (Online Session 1) in Fall 2011 in 278 distinct courses. This dataset contained student course information data and WebTycho data. Summary statistics of five student demographic and academic behavior variables are listed in Table 17. Table 17 Demographics for OL1 students Variable Frequency Percent Cumulative Percent Gender Unknown Female Male Age Group Above at or below New/Returning Status NEW RET For additional information on course withdrawal related to student behavior variables is explained in the subsequent section. 12 Age 32 is used as a cut point because it is the median age of UMUC students.

35 35 The majority of the students in OL1 of FAl l2011 were female (56.7%). 56.2% of the students were at or below the UMUC median student age of 32 years old. 17.5% of the students were new to UMUC and 82.5% were returning students. Method. An exploratory survival analysis was carried out using a Kaplan-Meier estimator. Survival analysis is a statistical technique that can be used to model time-to-event data. In this case, this analysis examines the time it took for a student to withdraw from a particular course (in weeks and days) reflected as a time-to-event. 13 Survival analysis generates a table which indicates a hazard (or withdrawal) rate during the semester. Additional univariate survival analyses were also conducted to identify retention patterns among age, gender, and student status. Survival analysis. Table 18 shows the number of student withdrawals, cumulative proportion of withdrawals, and withdrawal rate. Week Number of students Table 18 Withdrawal rate in OL1 Number of Student Withdrawals Cumulative Proportion of Withdrawal Withdrawal rate Student rate of withdrawal is highest in Week 1 (N= 404 or.31%). The withdrawal rate gradually drops until Week 5, which is the week before the academic withdrawal deadline. The overall trend and withdrawal pattern is that, in general, student withdrawal rates decrease after Week 1 of the semester. 13 This analysis only examines students who received a grade of W (withdrawal).

36 36 Univariate survival analysis. This univariate survival analysis examined three covariates: gender, age, and student status. Age, which was defined as below or above 32 years old, did not prove to be statistically significant (p =.60) in terms of correlation with withdrawal patterns. Gender, male and female, approached significance (p =.06) in terms of correlation with withdrawal patterns. Student status (new or returning) showed statistical significance in terms of correlation with withdrawal (p <.001): new students withdraw at a higher rate than returning students. Figure 3 shows that the patterns of new and returning student withdrawal are different. Figure 4 Withdrawal rate for new and returning students by day

37 37 Implications. Students withdraw at a higher rate in Week 1 compared to any other week in the course session, with the exception of Week 5, which is the academic withdrawal deadline. Student status, new or returning, may significantly affect student withdrawal rate. New students withdraw at a higher rate than returning students. These findings suggest that interventions targeting new students with interventions in Week 1 may be appropriate. Section VI: Implications and next steps In summary, data mining and statistical modeling can identify variables that relate to student success. Data mining techniques identified the importance of academic background and student course behaviors as variables related to student success. Predictive models of course success and retention using variables gleaned from data mining showed similar outcomes. Specifically, online student course behavior, such as opening and reading conference notes in the first four weeks of class, were correlated with course success. Previous academic background, such as total number of transfer credits and transfer GPA, contributed both to course success and retention. Course success, instead of student behavior, was a key predictor of retention. Student course behaviors showed that students withdraw at a higher rate in Week 1 over any other week in the course session, with the exception of Week 5, and that student status may significantly affect student withdrawal rate. These findings will guide our next research steps, which will finalize and validate predictive models in order to build student profiles. These steps may include, but are not limited to: running predictive models with a full dataset; validating models; sub-group analysis examining community college students in terms of course success, re-enrollment and retention; identifying course sequences that indicate re-enrollment and retention; integrating withdrawal information into predictive models. These additional steps will aid in the building of student profiles that can be used for predicting student success and designing effective interventions.

38 38 References Aragon, S.R. & Johnson, E.S. (2008). Factors influencing completion and non-completion in online community college courses. American Journal of Online Education, 22(3), Boston, W., Diaz, S.R., Gibson, A.M., Ice, P., Richardson J., & Swan, K. (2011). An exploration of the relationship between indicators of the community of inquiry framework and retention in online programs. Journal of Asynchronous Learning Networks, 13(3), Baker, R.S. & Yacef, K. (2009). The state of educational data mining: A review and future visions. Journal of Educational Data Mining, 1(1), Finnegan, C., Morris, L.V., and Lee, K. (2009). Differences by course discipline on student behavior, persistence, and achievement in online courses of undergraduate general education. Journal of College Student Retention, 10(1), Herzog, S. (2006). Estimating student retention and degree completion time: Decision trees and neural networks vis a vis regression. New Directions for Institutional Research, 131, Ho Yu, C., DiGangi, S., Jannasch-Pennell, A., Lo, W., & Kaprolet, C. (2007, February). A datamining approach to differentiate predictors of retention. Paper presented at the Educause Southwest Conference, Phoenix, AZ. Luan, J. & Zhao, C-M. (2006). Data mining: Going beyond traditional statistics. New Directions for Institutional Research, 131, Luan, J. (2001). Data mining as driven by knowledge management and higher education: Persistence clustering and prediction. Paper presented at the SPSS Public Conference, University of California at San Francisco. Luan, J, (2002). Data mining and its applications in higher education. New Directions for Institutional Research, 113, Morris, L.V. & Finnegan, C.L. (2009). Best practices in predicting and encouraging student persistence and achievement online. Journal of College Student Retention, 10(1), Nistor, N., & Neubauer. K (2010). From participation to dropout: Quantitative participation patterns in online university courses. Computers in Education, 55, Park, J-H., & Choi. H-J. (2009). Factors influencing adult learners decision to drop out or persist in online learning. Educational Technology and Society, 12(4), Romero, C., Ventura, S., Espejo, P.G. and Hervas, C. (2008, June). Data mining algorithms to classify students. Paper presented at the1st International Conference on

39 39 Educational Data Mining, Montreal, Canada. Willging, P.A. & Johnson, S.D. (2009). Factors influencing adult student decisions to dropout of online courses. Journal of Asynchronous Learning Networks, 13 (3),

40 40 Appendix A Memoranda of Understanding

41 41

42 42

43 43

44 44

45 45

46 46

47 47

48 48

49 49

50 50

51 51 Appendix B Financial Statement

52 52

Developing Data- Driven Predictive Models of Student Success. Kresge Data Mining Project Phase Two Report

Developing Data- Driven Predictive Models of Student Success. Kresge Data Mining Project Phase Two Report Developing Data- Driven Predictive Models of Student Success Kresge Data Mining Project Phase Two Report University of Maryland University College Report Table of Contents Executive Summary... 1 Introduction...

More information

Community College Transfer Students Persistence at University

Community College Transfer Students Persistence at University Community College Transfer Students Persistence at University Alexandra List Denise Nadasen University of Maryland University College Presented at Northeast Association for Institutional Research, November

More information

PREDICTIVE ANALYTICS FOR STUDENT SUCCESS: Developing Data-Driven Predictive Models of Student Success

PREDICTIVE ANALYTICS FOR STUDENT SUCCESS: Developing Data-Driven Predictive Models of Student Success 1 PREDICTIVE ANALYTICS FOR STUDENT SUCCESS: Developing Data-Driven Predictive Models of Student Success Final Report University of Maryland University College January 6, 2015 A Research Project Funded

More information

Cross Institutional Collaborations: Building Partnerships for Student Success

Cross Institutional Collaborations: Building Partnerships for Student Success Cross Institutional Collaborations: Building Partnerships for Student Success Denise Nadasen Alexandra List University of Maryland University College Presentation at Northeast Association for Institutional

More information

Predictive Analytics for Student Success

Predictive Analytics for Student Success Predictive Analytics for Student Success A Data Sharing Initiative Between Two Community Colleges and an Online University An Executive Report Presented by University of Maryland University College Funded

More information

Interventions to Promote Community College Transfer Student Success at a Four-year, Online University

Interventions to Promote Community College Transfer Student Success at a Four-year, Online University Interventions to Promote Community College Transfer Student Success at a Four-year, Online University AACRAO Technology and Transfer Conference Fort Lauderdale, Florida Denise Nadasen, AVP Institutional

More information

Florida Study of Career and Technical Education

Florida Study of Career and Technical Education Florida Study of Career and Technical Education Final Report Louis Jacobson, Ph.D. Christine Mokher, Ph.D. 2014 IRM-2014-U-008790 Approved for Distribution Unlimited This document represents the best opinion

More information

Data Mining Applications in Higher Education

Data Mining Applications in Higher Education Executive report Data Mining Applications in Higher Education Jing Luan, PhD Chief Planning and Research Officer, Cabrillo College Founder, Knowledge Discovery Laboratories Table of contents Introduction..............................................................2

More information

Strategies for Promoting Gatekeeper Course Success Among Students Needing Remediation: Research Report for the Virginia Community College System

Strategies for Promoting Gatekeeper Course Success Among Students Needing Remediation: Research Report for the Virginia Community College System Strategies for Promoting Gatekeeper Course Success Among Students Needing Remediation: Research Report for the Virginia Community College System Josipa Roksa Davis Jenkins Shanna Smith Jaggars Matthew

More information

Using Data to Identify At-risk Students and Develop Retention Strategies

Using Data to Identify At-risk Students and Develop Retention Strategies UNIVERSITY LEADERSHIP COUNCIL Using Data to Identify At-risk Students and Develop Retention Strategies Custom Research Brief Research Associate Bryan Beaudoin Research Manager Priya Kumar June 2012 2 of

More information

MINING BIG DATA TO SOLVE THE RETENTION

MINING BIG DATA TO SOLVE THE RETENTION MINING BIG DATA TO SOLVE THE RETENTION AND GRADUATION PUZZLE Patrice Lancey, Ph.D. Uday Nair, M.S, MBA Rachel Straney, M.S. Association for Institutional Research 2014 Forum May 30, 2014 University of

More information

Enhancing Education Quality Assurance Using Data Mining. Case Study: Arab International University Systems.

Enhancing Education Quality Assurance Using Data Mining. Case Study: Arab International University Systems. Enhancing Education Quality Assurance Using Data Mining Case Study: Arab International University Systems. Faek Diko Arab International University Damascus, Syria f-diko@aeu.ac.sy Zaidoun Alzoabi Arab

More information

The Urban Transfer Research Network (UTRN) is a collaborative multiinstitutional

The Urban Transfer Research Network (UTRN) is a collaborative multiinstitutional Project Overview The Urban Transfer Research Network (UTRN) is a collaborative multiinstitutional research project focused on low-income and minority students who begin their postsecondary careers at community

More information

WKU Freshmen Performance in Foundational Courses: Implications for Retention and Graduation Rates

WKU Freshmen Performance in Foundational Courses: Implications for Retention and Graduation Rates Research Report June 7, 2011 WKU Freshmen Performance in Foundational Courses: Implications for Retention and Graduation Rates ABSTRACT In the study of higher education, few topics receive as much attention

More information

PERFORMANCE FUNDING IMPROVEMENT PLAN 2014-2015 / KEY AREAS OF FOCUS

PERFORMANCE FUNDING IMPROVEMENT PLAN 2014-2015 / KEY AREAS OF FOCUS PERFORMANCE FUNDING IMPROVEMENT PLAN 2014-2015 / KEY AREAS OF FOCUS OVERVIEW The core of the University of West Florida s mission is a commitment to ensuring student success. As outlined in the 2012-2017

More information

Co-Curricular Activities and Academic Performance -A Study of the Student Leadership Initiative Programs. Office of Institutional Research

Co-Curricular Activities and Academic Performance -A Study of the Student Leadership Initiative Programs. Office of Institutional Research Co-Curricular Activities and Academic Performance -A Study of the Student Leadership Initiative Programs Office of Institutional Research July 2014 Introduction The Leadership Initiative (LI) is a certificate

More information

Student Success Courses and Educational Outcomes at Virginia Community Colleges

Student Success Courses and Educational Outcomes at Virginia Community Colleges Student Success Courses and Educational Outcomes at Virginia Community Colleges Sung-Woo Cho and Melinda Mechur Karp February 2012 CCRC Working Paper No. 40 Address correspondence to: Sung-Woo Cho Quantitative

More information

Course Catalog. www.airweb.org/academy

Course Catalog. www.airweb.org/academy www.airweb.org/academy Course Catalog 2015 Hosted by the Association for Institutional Research, Data and Decisions Academy courses provide self-paced, online professional development for institutional

More information

Effectiveness of online teaching of Accounting at University level

Effectiveness of online teaching of Accounting at University level Effectiveness of online teaching of Accounting at University level Abstract: In recent years, online education has opened new educational environments and brought new opportunities and significant challenges

More information

Northern Illinois University Office of Assessment Services

Northern Illinois University Office of Assessment Services Northern Illinois University Office of Assessment Services 2011 University Writing Project Report Analysis of College-Level Writing Ability/Skills Fall 2011 Distributed Fall 2011 For Additional Information

More information

February 2003 Report No. 03-17

February 2003 Report No. 03-17 February 2003 Report No. 03-17 Bright Futures Contributes to Improved College Preparation, Affordability, and Enrollment at a glance Since the Bright Futures program was created in 1997, Florida s high

More information

Complete College America Common College Completion Metrics Technical Guide

Complete College America Common College Completion Metrics Technical Guide Complete College America Common College Completion Metrics Technical Guide April, 2014 All major changes from previous years are highlighted and have a * indicator that can be searched for. April 2, 2014:

More information

MCPS Graduates Earning College Degrees in STEM-Related Fields

MCPS Graduates Earning College Degrees in STEM-Related Fields Graduates Earning College Degrees in -Related Fields September 2012 Natalie Wolanin & Julie Wade OFFICE OF SHARED ACCOUNTABILITY Adrian B. Talley, Associate Superintendent 850 Hungerford Drive Rockville,

More information

Relating the ACT Indicator Understanding Complex Texts to College Course Grades

Relating the ACT Indicator Understanding Complex Texts to College Course Grades ACT Research & Policy Technical Brief 2016 Relating the ACT Indicator Understanding Complex Texts to College Course Grades Jeff Allen, PhD; Brad Bolender; Yu Fang, PhD; Dongmei Li, PhD; and Tony Thompson,

More information

DATA MINING TECHNOLOGY. Keywords: data mining, data warehouse, knowledge discovery, OLAP, OLAM.

DATA MINING TECHNOLOGY. Keywords: data mining, data warehouse, knowledge discovery, OLAP, OLAM. DATA MINING TECHNOLOGY Georgiana Marin 1 Abstract In terms of data processing, classical statistical models are restrictive; it requires hypotheses, the knowledge and experience of specialists, equations,

More information

What is Predictive Analytics?

What is Predictive Analytics? What is Predictive Analytics? Firstly, Analytics is the use of data, statistical analysis, and explanatory and predictive models to gain insights and act on complex issues. EDUCAUSE Center for Applied

More information

Application of Predictive Analytics to Higher Degree Research Course Completion Times

Application of Predictive Analytics to Higher Degree Research Course Completion Times Application of Predictive Analytics to Higher Degree Research Course Completion Times Application of Decision Theory to PhD Course Completions (2006 2013) Rachna 1 I Dhand, Senior Strategic Information

More information

The University of Arizona

The University of Arizona The University of Arizona FORMAT AND GUIDELINES FOR GRADUATE CERTIFICATE APPROVAL Directions: 1. Provide information regarding the proposed graduate certificate in the format requested on the attached

More information

Data Mining Application in Higher Learning Institutions

Data Mining Application in Higher Learning Institutions Informatics in Education, 2008, Vol. 7, No. 1, 31 54 31 2008 Institute of Mathematics and Informatics, Vilnius Data Mining Application in Higher Learning Institutions Naeimeh DELAVARI, Somnuk PHON-AMNUAISUK

More information

Master of Science in Healthcare Informatics and Analytics Program Overview

Master of Science in Healthcare Informatics and Analytics Program Overview Master of Science in Healthcare Informatics and Analytics Program Overview The program is a 60 credit, 100 week course of study that is designed to graduate students who: Understand and can apply the appropriate

More information

GRADUATE STUDENT SATISFACTION WITH AN ONLINE DISCRETE MATHEMATICS COURSE *

GRADUATE STUDENT SATISFACTION WITH AN ONLINE DISCRETE MATHEMATICS COURSE * GRADUATE STUDENT SATISFACTION WITH AN ONLINE DISCRETE MATHEMATICS COURSE * Amber Settle DePaul University 243 S. Wabash Avenue Chicago, IL 60604 (312) 362-5324 asettle@cti.depaul.edu Chad Settle University

More information

Environmental and Water Resources Engineering Program

Environmental and Water Resources Engineering Program Environmental and Water Resources Engineering Program Requirements for the Doctor of Philosophy Degree in Civil Engineering, Environmental and Water Resources Engineering Emphasis July, 2011 This document

More information

Industrial and Systems Engineering Master of Science Program Data Analytics and Optimization

Industrial and Systems Engineering Master of Science Program Data Analytics and Optimization Industrial and Systems Engineering Master of Science Program Data Analytics and Optimization Department of Integrated Systems Engineering The Ohio State University (Expected Duration: Semesters) Our society

More information

Dawn Broschard, EdD Senior Research Analyst Office of Retention and Graduation Success dbroscha@fiu.edu

Dawn Broschard, EdD Senior Research Analyst Office of Retention and Graduation Success dbroscha@fiu.edu Using Decision Trees to Analyze Students at Risk of Dropping Out in Their First Year of College Based on Data Gathered Prior to Attending Their First Semester Dawn Broschard, EdD Senior Research Analyst

More information

Report on the Scaling of the 2014 NSW Higher School Certificate. NSW Vice-Chancellors Committee Technical Committee on Scaling

Report on the Scaling of the 2014 NSW Higher School Certificate. NSW Vice-Chancellors Committee Technical Committee on Scaling Report on the Scaling of the 2014 NSW Higher School Certificate NSW Vice-Chancellors Committee Technical Committee on Scaling Contents Preface Acknowledgements Definitions iii iv v 1 The Higher School

More information

An Empirical Study of Application of Data Mining Techniques in Library System

An Empirical Study of Application of Data Mining Techniques in Library System An Empirical Study of Application of Data Mining Techniques in Library System Veepu Uppal Department of Computer Science and Engineering, Manav Rachna College of Engineering, Faridabad, India Gunjan Chindwani

More information

How To Study The Academic Performance Of An Mba

How To Study The Academic Performance Of An Mba Proceedings of the Annual Meeting of the American Statistical Association, August 5-9, 2001 WORK EXPERIENCE: DETERMINANT OF MBA ACADEMIC SUCCESS? Andrew Braunstein, Iona College Hagan School of Business,

More information

Excess Units in Pursuit of the Bachelor s Degree

Excess Units in Pursuit of the Bachelor s Degree Excess in Pursuit of the Bachelor s -- An Analysis on Native Freshmen Graduated during 2009-2010 Office of Institutional Research Sacramento State July-September, 2011 This report intends to examine the

More information

A Basic Guide to Modeling Techniques for All Direct Marketing Challenges

A Basic Guide to Modeling Techniques for All Direct Marketing Challenges A Basic Guide to Modeling Techniques for All Direct Marketing Challenges Allison Cornia Database Marketing Manager Microsoft Corporation C. Olivia Rud Executive Vice President Data Square, LLC Overview

More information

Course Descriptions: Undergraduate/Graduate Certificate Program in Data Visualization and Analysis

Course Descriptions: Undergraduate/Graduate Certificate Program in Data Visualization and Analysis 9/3/2013 Course Descriptions: Undergraduate/Graduate Certificate Program in Data Visualization and Analysis Seton Hall University, South Orange, New Jersey http://www.shu.edu/go/dava Visualization and

More information

In this presentation, you will be introduced to data mining and the relationship with meaningful use.

In this presentation, you will be introduced to data mining and the relationship with meaningful use. In this presentation, you will be introduced to data mining and the relationship with meaningful use. Data mining refers to the art and science of intelligent data analysis. It is the application of machine

More information

not possible or was possible at a high cost for collecting the data.

not possible or was possible at a high cost for collecting the data. Data Mining and Knowledge Discovery Generating knowledge from data Knowledge Discovery Data Mining White Paper Organizations collect a vast amount of data in the process of carrying out their day-to-day

More information

Predictive Models for Student Success

Predictive Models for Student Success Predictive Models for Student Success 5/21/212 Joe DeHart Des Moines Area Community College May 212 Purpose Des Moines Area Community College (DMACC) is currently implementing various changes to improve

More information

WHEN LEARNING ANALYTICS MEET BIG DATA. The Predictive Analytics Reporting (PAR) Framework November 8, 2012

WHEN LEARNING ANALYTICS MEET BIG DATA. The Predictive Analytics Reporting (PAR) Framework November 8, 2012 WHEN LEARNING ANALYTICS MEET BIG DATA The Predictive Analytics Reporting (PAR) Framework November 8, 2012 THE PAR FRAMEWORK Moderator: Ellen Wagner, WICHE Cooperative for Educational Technologies (WCET)

More information

Predicting Successful Completion of the Nursing Program: An Analysis of Prerequisites and Demographic Variables

Predicting Successful Completion of the Nursing Program: An Analysis of Prerequisites and Demographic Variables Predicting Successful Completion of the Nursing Program: An Analysis of Prerequisites and Demographic Variables Introduction In the summer of 2002, a research study commissioned by the Center for Student

More information

The Outcomes For CTE Students in Wisconsin

The Outcomes For CTE Students in Wisconsin Promoting Rigorous Career and Technical Education Programs of Study Quantitative Outcomes Study: Baseline Data Collection Report Prepared under contract to Division of Academic and Technical Education

More information

PREDICTING STUDENT RETENTION & SUCCESS IN ONLINE PROGRAMS. William Bloemer & Karen Swan UNIVERSITY OF ILLINOIS SPRINGFIELD

PREDICTING STUDENT RETENTION & SUCCESS IN ONLINE PROGRAMS. William Bloemer & Karen Swan UNIVERSITY OF ILLINOIS SPRINGFIELD PREDICTING STUDENT RETENTION & SUCCESS IN ONLINE PROGRAMS William Bloemer & Karen Swan UNIVERSITY OF ILLINOIS SPRINGFIELD The rarely articulated implication of all of this data floating around is that

More information

This presentation was made at the California Association for Institutional Research Conference on November 19, 2010.

This presentation was made at the California Association for Institutional Research Conference on November 19, 2010. This presentation was made at the California Association for Institutional Research Conference on November 19, 2010. 1 This presentation was made at the California Association for Institutional Research

More information

WORKING PAPER EXPLORING RELATIONSHIPS BETWEEN STUDENT ENGAGEMENT AND STUDENT OUTCOMES IN COMMUNITY COLLEGES: REPORT ON VALIDATION RESEARCH

WORKING PAPER EXPLORING RELATIONSHIPS BETWEEN STUDENT ENGAGEMENT AND STUDENT OUTCOMES IN COMMUNITY COLLEGES: REPORT ON VALIDATION RESEARCH WORKING PAPER EXPLORING RELATIONSHIPS BETWEEN STUDENT ENGAGEMENT AND STUDENT OUTCOMES IN COMMUNITY COLLEGES: REPORT ON VALIDATION RESEARCH By Kay M. McClenney, Ph.D. C. Nathan Marti, Ph.D. The Community

More information

State Grants and the Impact on Student Outcomes

State Grants and the Impact on Student Outcomes Page 1 of 9 TOPIC: PREPARED BY: FINANCIAL AID ALLOCATION DISCUSSION CELINA DURAN, FINANCIAL AID DIRECTOR I. SUMMARY At its September meeting, the Commission agreed to revisit the financial allocation method

More information

The Influence of a Summer Bridge Program on College Adjustment and Success: The Importance of Early Intervention and Creating a Sense of Community

The Influence of a Summer Bridge Program on College Adjustment and Success: The Importance of Early Intervention and Creating a Sense of Community The Influence of a Summer Bridge Program on College Adjustment and Success: The Importance of Early Intervention and Creating a Sense of Community Michele J. Hansen, Ph.D., Director of Assessment, University

More information

Use Data Mining Techniques to Assist Institutions in Achieving Enrollment Goals: A Case Study

Use Data Mining Techniques to Assist Institutions in Achieving Enrollment Goals: A Case Study Use Data Mining Techniques to Assist Institutions in Achieving Enrollment Goals: A Case Study Tongshan Chang The University of California Office of the President CAIR Conference in Pasadena 11/13/2008

More information

Texas A&M University-Kingsville. Responses to H.B. 1172 Timely Graduation Report Requirements November 2007

Texas A&M University-Kingsville. Responses to H.B. 1172 Timely Graduation Report Requirements November 2007 Texas A&M University-Kingsville Responses to H.B. 1172 Timely Graduation Report Requirements November 2007 This report has been prepared in compliance with Section 1, H.B. 1172 of the Texas Education Code.

More information

A COMPARISON OF POST-SECONDARY OUTCOMES FOR TECH PREP AND NON-TECH PREP STUDENTS AT SINCLAIR COMMUNITY COLLEGE. October, 2007

A COMPARISON OF POST-SECONDARY OUTCOMES FOR TECH PREP AND NON-TECH PREP STUDENTS AT SINCLAIR COMMUNITY COLLEGE. October, 2007 A COMPARISON OF POST-SECONDARY OUTCOMES FOR TECH PREP AND NON-TECH PREP STUDENTS AT SINCLAIR COMMUNITY COLLEGE October, 2007 Office of Research, Analytics and Reporting Sinclair Community College, 444

More information

Description of the program

Description of the program Study program Faculty Cycle Business Informatics Contemporary Sciences and Technologies Undergraduate ECTS 180 Offered in Tetovo and Skopje Description of the program The three-year curricula (Bachelor

More information

SAS JOINT DATA MINING CERTIFICATION AT BRYANT UNIVERSITY

SAS JOINT DATA MINING CERTIFICATION AT BRYANT UNIVERSITY SAS JOINT DATA MINING CERTIFICATION AT BRYANT UNIVERSITY Billie Anderson Bryant University, 1150 Douglas Pike, Smithfield, RI 02917 Phone: (401) 232-6089, e-mail: banderson@bryant.edu Phyllis Schumacher

More information

The Bottom Line on Student Retention: Data-Driven Approaches that Work

The Bottom Line on Student Retention: Data-Driven Approaches that Work The Bottom Line on Student Retention: Data-Driven Approaches that Work Presented by: Mary Piccioli & Silvio Vazquez NACCAP, June 7, 2010 Overview of Presentation Undergraduate Retention: Context and Overview

More information

How To Use Data Mining For Knowledge Management In Technology Enhanced Learning

How To Use Data Mining For Knowledge Management In Technology Enhanced Learning Proceedings of the 6th WSEAS International Conference on Applications of Electrical Engineering, Istanbul, Turkey, May 27-29, 2007 115 Data Mining for Knowledge Management in Technology Enhanced Learning

More information

Student Engagement and Student Outcomes: Key Findings from CCSSE Validation Research

Student Engagement and Student Outcomes: Key Findings from CCSSE Validation Research Student Engagement and Student Outcomes: Key Findings from CCSSE Validation Research Kay McClenney, C. Nathan Marti, and Courtney Adkins The findings from 20 years of research on undergraduate education

More information

Student Manual. Ph.D. in International Business Administration. A. R. Sanchez, Jr. School of Business

Student Manual. Ph.D. in International Business Administration. A. R. Sanchez, Jr. School of Business Student Manual Ph.D. in International Business Administration A. R. Sanchez, Jr. School of Business Texas A&M International University February 17, 2004 Table of contents 1. Vision 3 2. Mission.. 3 3.

More information

A Comparison of Training & Scoring in Distributed & Regional Contexts Writing

A Comparison of Training & Scoring in Distributed & Regional Contexts Writing A Comparison of Training & Scoring in Distributed & Regional Contexts Writing Edward W. Wolfe Staci Matthews Daisy Vickers Pearson July 2009 Abstract This study examined the influence of rater training

More information

Doctor of Educational Technology

Doctor of Educational Technology Overview The Doctor of degree (D.E.T.) prepares candidates to conduct rigorous evaluations of educational technology environments and to design solutions that enhance contemporary schools at all levels

More information

POC PHASE DATA. 6 institutions

POC PHASE DATA. 6 institutions POC PHASE DATA 6 institutions 1 calendar year (2010) All ONLINE Bachelors and below courses All DevEd regardless of format Two principle outcome measures investigated Did the student SUCCEED in their current

More information

02.26 Indiana University Post-Master s Degree Certificate Program in Institutional Research

02.26 Indiana University Post-Master s Degree Certificate Program in Institutional Research 02.26 Indiana University Post-Master s Degree Certificate Program in Institutional Research Introduction The increasingly complicated and challenging tasks of institutional research (IR) demand that IR

More information

TOWARD A DISTRIBUTED DATA MINING SYSTEM FOR TOURISM INDUSTRY

TOWARD A DISTRIBUTED DATA MINING SYSTEM FOR TOURISM INDUSTRY TOWARD A DISTRIBUTED DATA MINING SYSTEM FOR TOURISM INDUSTRY Danubianu Mirela Stefan cel Mare University of Suceava Faculty of Electrical Engineering andcomputer Science 13 Universitatii Street, Suceava

More information

Bringing Equity and Quality Learning Together: Institutional Priorities for Tracking and Advancing Underserved Students Success

Bringing Equity and Quality Learning Together: Institutional Priorities for Tracking and Advancing Underserved Students Success Bringing Equity and Quality Learning Together: Institutional Priorities for Tracking and Advancing Underserved Students Success Key Findings from a Survey and In-Depth Interviews among Administrators at

More information

Local outlier detection in data forensics: data mining approach to flag unusual schools

Local outlier detection in data forensics: data mining approach to flag unusual schools Local outlier detection in data forensics: data mining approach to flag unusual schools Mayuko Simon Data Recognition Corporation Paper presented at the 2012 Conference on Statistical Detection of Potential

More information

The Data Mining Process

The Data Mining Process Sequence for Determining Necessary Data. Wrong: Catalog everything you have, and decide what data is important. Right: Work backward from the solution, define the problem explicitly, and map out the data

More information

Silvermine House Steenberg Office Park, Tokai 7945 Cape Town, South Africa Telephone: +27 21 702 4666 www.spss-sa.com

Silvermine House Steenberg Office Park, Tokai 7945 Cape Town, South Africa Telephone: +27 21 702 4666 www.spss-sa.com SPSS-SA Silvermine House Steenberg Office Park, Tokai 7945 Cape Town, South Africa Telephone: +27 21 702 4666 www.spss-sa.com SPSS-SA Training Brochure 2009 TABLE OF CONTENTS 1 SPSS TRAINING COURSES FOCUSING

More information

COLLEGE TRANSITION COLLABORATIVE

COLLEGE TRANSITION COLLABORATIVE COLLEGE TRANSITION COLLABORATIVE At a glance The CTC vision Our goal is to promote college persistence and achievement, especially among students from disadvantaged backgrounds, using highly scalable mindset

More information

Complete College Georgia Plan 2012-13

Complete College Georgia Plan 2012-13 Part I: Goals and Data Analysis Complete College Georgia Plan 2012-13 Complete College Georgia furthers the mission 1 of Bainbridge College, particularly our emphasis upon accessible and excellent education.

More information

Proposal for Combined BSME/MSE Program in Department of Civil and Mechanical Engineering at IPFW

Proposal for Combined BSME/MSE Program in Department of Civil and Mechanical Engineering at IPFW Proposal for Combined BSME/MSE Program in Department of Civil and Mechanical Engineering at IPFW 1. Names of the multiple-degree program - Combined Bachelor of Science in Mechanical Engineering and Master

More information

COURSE RECOMMENDER SYSTEM IN E-LEARNING

COURSE RECOMMENDER SYSTEM IN E-LEARNING International Journal of Computer Science and Communication Vol. 3, No. 1, January-June 2012, pp. 159-164 COURSE RECOMMENDER SYSTEM IN E-LEARNING Sunita B Aher 1, Lobo L.M.R.J. 2 1 M.E. (CSE)-II, Walchand

More information

IBM SPSS Direct Marketing 19

IBM SPSS Direct Marketing 19 IBM SPSS Direct Marketing 19 Note: Before using this information and the product it supports, read the general information under Notices on p. 105. This document contains proprietary information of SPSS

More information

APRIL 2010 HIGHER EDUCATION. Advancing by Degrees A Framework for Increasing College Completion

APRIL 2010 HIGHER EDUCATION. Advancing by Degrees A Framework for Increasing College Completion APRIL 2010 HIGHER EDUCATION Advancing by Degrees A Framework for Increasing College Completion Higher education leaders need to understand what really drives student success. Tracking six-year graduation

More information

Guide to Using Results

Guide to Using Results Guide to Using Results Contact Information For assistance, call ACT Engage Customer Service at 319.337.1893, 8:30 a.m. 5:00 p.m., central time, Monday through Friday, or email engage@act.org. Resources

More information

The relationship of CPA exam delay after graduation to institutional CPA exam pass rates

The relationship of CPA exam delay after graduation to institutional CPA exam pass rates The relationship of CPA exam delay after graduation to institutional CPA exam pass rates ABSTRACT John D. Morgan Winona State University This study investigates the relationship between average delay after

More information

Web Mining as a Tool for Understanding Online Learning

Web Mining as a Tool for Understanding Online Learning Web Mining as a Tool for Understanding Online Learning Jiye Ai University of Missouri Columbia Columbia, MO USA jadb3@mizzou.edu James Laffey University of Missouri Columbia Columbia, MO USA LaffeyJ@missouri.edu

More information

An Overview of Data Mining: Predictive Modeling for IR in the 21 st Century

An Overview of Data Mining: Predictive Modeling for IR in the 21 st Century An Overview of Data Mining: Predictive Modeling for IR in the 21 st Century Nora Galambos, PhD Senior Data Scientist Office of Institutional Research, Planning & Effectiveness Stony Brook University AIRPO

More information

Determining Student Performance in Online Database Courses

Determining Student Performance in Online Database Courses in Online Database Courses George Garman The Metropolitan State College of Denver ABSTRACT Using data collected from 2002 to 2011, this paper analyzes the determinants of scores earned by students in a

More information

Online and Hybrid Course Enrollment and Performance in Washington State Community and Technical Colleges

Online and Hybrid Course Enrollment and Performance in Washington State Community and Technical Colleges Online and Hybrid Course Enrollment and Performance in Washington State Community and Technical Colleges Di Xu and Shanna Smith Jaggars March 2011 CCRC Working Paper No. 31 Address correspondence to: Shanna

More information

Attrition in Online and Campus Degree Programs

Attrition in Online and Campus Degree Programs Attrition in Online and Campus Degree Programs Belinda Patterson East Carolina University pattersonb@ecu.edu Cheryl McFadden East Carolina University mcfaddench@ecu.edu Abstract The purpose of this study

More information

The Impact of Pell Grants on Academic Outcomes for Low-Income California Community College Students

The Impact of Pell Grants on Academic Outcomes for Low-Income California Community College Students Research Brief The Impact of Pell Grants on Academic Outcomes for Low-Income California Community College Students By Jennie H. Woo This is the fifth in a series of MPR Research Briefs published on the

More information

ANGEL Change Management Plan (CMP)

ANGEL Change Management Plan (CMP) ANGEL Change Management Plan (CMP) Date: September 11, 2007 Page 1 of 24 Document Revision History Version Number Date Description 1.0 2/5/2007 Intermediate (includes proposed Implementation Initiative)

More information

Distance Education Program

Distance Education Program Distance Education Program Master Plan Distance Education Committee 2013 This document serves as a guide for supporting initiatives that have been identified and developed through our Technology Master

More information

Basics of Longitudinal Cohort Analysis

Basics of Longitudinal Cohort Analysis Achieving the Dream TM Community Colleges Count Basics of Longitudinal Cohort Analysis Principles and Practices of Student Success Rick Voorhees and John Lee Longitudinal cohort analysis is a powerful

More information

THE IMPORTANCE OF TEACHING PRESENCE IN ONLINE AND HYBRID CLASSROOMS

THE IMPORTANCE OF TEACHING PRESENCE IN ONLINE AND HYBRID CLASSROOMS Allied Academies International Conference page 7 THE IMPORTANCE OF TEACHING PRESENCE IN ONLINE AND HYBRID CLASSROOMS Richard Bush, Lawrence Technological University Patricia Castelli, Lawrence Technological

More information

DEAL OR NO DEAL: ACADEMIC ADMINISTRATORS PROMOTE STUDENT SUCCESS

DEAL OR NO DEAL: ACADEMIC ADMINISTRATORS PROMOTE STUDENT SUCCESS DEAL OR NO DEAL: ACADEMIC ADMINISTRATORS PROMOTE STUDENT SUCCESS Marcia A. Gellin, Ed.D., R.N. and Mary Beard, MS.Ed. Asst. Academic Deans for Liberal Arts Erie Community College Buffalo, NY Brief Biography

More information

Identifying At-Risk Students Using Machine Learning Techniques: A Case Study with IS 100

Identifying At-Risk Students Using Machine Learning Techniques: A Case Study with IS 100 Identifying At-Risk Students Using Machine Learning Techniques: A Case Study with IS 100 Erkan Er Abstract In this paper, a model for predicting students performance levels is proposed which employs three

More information

Course Syllabus. Purposes of Course:

Course Syllabus. Purposes of Course: Course Syllabus Eco 5385.701 Predictive Analytics for Economists Summer 2014 TTh 6:00 8:50 pm and Sat. 12:00 2:50 pm First Day of Class: Tuesday, June 3 Last Day of Class: Tuesday, July 1 251 Maguire Building

More information

CHAPTER III METHOD. Introduction

CHAPTER III METHOD. Introduction CHAPTER III METHOD Introduction The purpose of this chapter is to describe the research process used in the study including the selection of study participants, the data collection method and procedures,

More information

Participation and pass rates for college preparatory transition courses in Kentucky

Participation and pass rates for college preparatory transition courses in Kentucky U.S. Department of Education March 2014 Participation and pass rates for college preparatory transition courses in Kentucky Christine Mokher CNA Key findings This study of Kentucky students who take college

More information

How To Know The Roi Of Cesp Workload Automation Software

How To Know The Roi Of Cesp Workload Automation Software WHITE PAPER CA ESP Workload Automation Software: Measuring Business Impact and ROI Sponsored by: CA Tim Grieser Eric Hatcher September 2009 Randy Perry Global Headquarters: 5 Speen Street Framingham, MA

More information

Student Feedback on Online Summer Courses

Student Feedback on Online Summer Courses Student Feedback on Online Summer Courses October 8, 2015 Santa Clara University Office of Assessment Report Introduction In the summer of 2015, approximately 700 undergraduate students were enrolled in

More information

Presenters: Pat Estes, Assessment Analyst, PEstes@edgewood.edu Liang Hou, Research Intern, LHou@edgewood.edu

Presenters: Pat Estes, Assessment Analyst, PEstes@edgewood.edu Liang Hou, Research Intern, LHou@edgewood.edu Presenters: Pat Estes, Assessment Analyst, PEstes@edgewood.edu Liang Hou, Research Intern, LHou@edgewood.edu Edgewood College Office of Institutional Assessment and Research This research was conducted

More information

EARLY VS. LATE ENROLLERS: DOES ENROLLMENT PROCRASTINATION AFFECT ACADEMIC SUCCESS? 2007-08

EARLY VS. LATE ENROLLERS: DOES ENROLLMENT PROCRASTINATION AFFECT ACADEMIC SUCCESS? 2007-08 EARLY VS. LATE ENROLLERS: DOES ENROLLMENT PROCRASTINATION AFFECT ACADEMIC SUCCESS? 2007-08 PURPOSE Matthew Wetstein, Alyssa Nguyen & Brianna Hays The purpose of the present study was to identify specific

More information

STANDARD #2. Strategic Planning

STANDARD #2. Strategic Planning As part of our ACBSP accreditation requirements, we routinely provide reliable information to the public on our performance, including student achievement such as assessment results. Portions of our report

More information

issue brief September 2013

issue brief September 2013 issue brief September 2013 DEVELOPING ALTERNATIVE PLACEMENT CRITERIA FOR ENGLISH COURSES AT CITY COLLEGE OF SAN FRANCISCO Sebastian Castrechini Recognizing the need to improve postsecondary access and

More information

Running head: CREDIT RECOVERY 1. Report Submitted to the Academic Partnerships

Running head: CREDIT RECOVERY 1. Report Submitted to the Academic Partnerships Running head: CREDIT RECOVERY 1 Credit Recovery: Improving Student Graduation and Learning Experiences in an Online Master s Degree Program Yi (Leaf) Zhang Ernest Johnson University of Texas at Arlington

More information

Data Mining Part 5. Prediction

Data Mining Part 5. Prediction Data Mining Part 5. Prediction 5.7 Spring 2010 Instructor: Dr. Masoud Yaghini Outline Introduction Linear Regression Other Regression Models References Introduction Introduction Numerical prediction is

More information