320 likes | 599 Views
Student Retention Prediction using data mining tools and Banner Data. Admir Djulovic Dennis Wilson Eastern Washington University Business Intelligence. Session Rules of etiquette. Please turn off you cell phone/pager
E N D
Student Retention Predictionusing data mining tools and Banner Data Admir Djulovic Dennis Wilson Eastern Washington University Business Intelligence Coeur d’Alene, Idaho
Session Rules of etiquette Please turn off you cell phone/pager If you must leave the session early, please do so as discreetly as possible Please avoid side conversation during the session Thank you for your cooperation! Coeur d’Alene, Idaho
Introduction • Focus: Why first time freshmen students are leaving in the first year? • Benefits of attending this session • You will learn how we use Banner and Data Mining tools to identify students at risk • Learn about factors that influence student retention • We will share our results and findings Coeur d’Alene, Idaho
Agenda Why first time freshmen students are leaving in the first year? Retention Data Mining Model Creation Results and Findings Future Work Questions Coeur d’Alene, Idaho
why STUDENTS ARE LEAVING IN THE FIRST YEAR? Coeur d’Alene, Idaho
why STUDENTS ARE LEAVING IN THE FIRST YEAR? • What are the factors that cause student to leave the university? • Pre-enrollment Information (i.e. SATandACT test scores) • Poor academic performance • Financial hardship • We want to determine data driven factors that influence student retention Coeur d’Alene, Idaho
Retention Data Mining Model Creation The model uses existing student and financial data in Banner to give us a prediction of how many first time freshmen students will or will not return the following Fall term Coeur d’Alene, Idaho
Retention Data Mining Model Creation • Determine what student attributes would provide the greatest benefit with these constrained • Pre-enrollment information • Financial Information • Housing Information • Financial Aid Information • Determine what Data Mining Predictive algorithms to use Coeur d’Alene, Idaho
Student attributes used to build the model • Special Attributes • ID – unique record identifier • RETAINEDNXTYR (Known Outcome/Target variable): Student retained next year (0: No, 1: Yes) • Pre-Enrollment Attributes • Age • Gender • SAT Scores in Reading, Math and Writing • Previous GPA (typically high school GPA) • Term Related Attributes • Account Balance • Cumulative GPA • Successive term GPA • Living on or off campus • Financial aid received or not Coeur d’Alene, Idaho
Student attributes used to build the model • Table 1: Normalized Weights of Independent Variables Using Relief Statistical Method • (All weights above 0.5 are deemed important in determining student retention.) Coeur d’Alene, Idaho
Student attributes used to build the model • Table 2: Normalized Weights of Independent Variables Using Information Gain Statistical Method • (All weights above 0.5 are deemed important in determining student retention.) Coeur d’Alene, Idaho
Student attributes used to build the model • Table 3: Normalized Weights of Independent Variables Using Chi Squared Statistics Method • (All weights above 0.5 are deemed important in determining student retention.) Coeur d’Alene, Idaho
Data Used Coeur d’Alene, Idaho • First time full time freshmen – Fall cohort (Could be applied to any population) • Cohort groups of data • Fall 2006 – 2011 Freshmen to train the model • Fall 2013 Freshmen to test model
Algorithm Selection Coeur d’Alene, Idaho • The following predictive algorithms have been used in many research paper
Training the Model Using Historical Data Coeur d’Alene, Idaho • Historical Data: • From 2006 through 2012 • Test Data: • 2013 Academic Year
Model(s) Training and Testing Phase Coeur d’Alene, Idaho
Model(s) Accuracy Coeur d’Alene, Idaho
Model(s) Accuracy Cont. Coeur d’Alene, Idaho
Applying the Model(s) USING the NEW dataset Coeur d’Alene, Idaho
Applying Models using New Dataset Coeur d’Alene, Idaho Academic Year 2013-2014
Results and findings Coeur d’Alene, Idaho
results and findings Winter Balance vs RETAINEDNXTYR (0:No; 1:Yes) Coeur d’Alene, Idaho
results and findings Winter Living on Campus vs RETAINEDNXTYR (0:No; 1:Yes) Coeur d’Alene, Idaho
results and findings Winter Received Financial Aid vs RETAINEDNXTYR (0:No; 1:Yes) Coeur d’Alene, Idaho
How Could This Retention Model Help? • Provide early warning of students at risk • Lists can be provided to different offices for student outreach • Improve student retention • Use it to forecast future student retention Coeur d’Alene, Idaho
Examples • Not returning due to the low GPA • (0:No; 1:Yes) Coeur d’Alene, Idaho
Examples Cont. • Not returning due to the high balance • (0:No; 1:Yes) Coeur d’Alene, Idaho
Future work Coeur d’Alene, Idaho
Future work • Attributes for future consideration • Student Attendants List • Student Credit Hours • Repeat Class Indicator • Types of Financial Aid • Major • College • Residency • Other Attributes Coeur d’Alene, Idaho
Session Summary • We have demonstrated how Banner data and data mining tools are used to identify students at risk • We have demonstrated how predictive models are created and how they work • Factors that contribute to a student’s dropping out • Data mining Algorithms used • Demonstrate how retention models can be used as a early warning system to identify students at risk Coeur d’Alene, Idaho
Questions & answers Coeur d’Alene, Idaho
Thank You! Admir Djulovic, Dennis Wilson Coeur d’Alene, Idaho