The SEMMA data mining process was developed by SAS. Its name is an acronym for the five steps in the process: Sample, Explore, Modify, Model, and Assess.
An example of a university using the SAS Enterprise Miner approach is the University of Central Florida's Division of Graduate Studies, which has used data mining as a tool in graduate admissions. After an initial analysis, the division selected 23 predictor variables (specific graduate program, academic level, gender, ethnic group, etc.) and one response variable (whether or not the student enrolled). It used a logistic regression model, which is appropriate for predicting a binary response (enroll/not enroll). Half of the data was used to build the model, and the remaining half was used to test its fit. Each predictor was given a weight according to the strength of its relationship to the response variable. The findings indicated that the model was valid, and the resulting model was used to predict enrollment for the fall 2007 semester.
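The workflow described above (fit a logistic regression on half of the data, then check its fit on the other half) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: it does not use SAS Enterprise Miner, it is not the university's actual model, the applicant data is randomly generated, and the two predictors (`in_state`, `funded`) and all function names are hypothetical stand-ins for the 23 real predictors.

```python
import math
import random


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def predict_proba(w, xi):
    # w[0] is the intercept; w[1:] are the per-predictor weights.
    return sigmoid(w[0] + sum(wj * xj for wj, xj in zip(w[1:], xi)))


def fit_logistic(X, y, lr=0.5, epochs=1000):
    """Fit weights by batch gradient descent on the average log loss."""
    w = [0.0] * (len(X[0]) + 1)
    n = len(X)
    for _ in range(epochs):
        grad = [0.0] * len(w)
        for xi, yi in zip(X, y):
            err = predict_proba(w, xi) - yi
            grad[0] += err
            for j, xj in enumerate(xi):
                grad[j + 1] += err * xj
        w = [wj - lr * gj / n for wj, gj in zip(w, grad)]
    return w


def accuracy(w, X, y):
    # Share of cases where the 0.5-threshold prediction matches the outcome.
    hits = sum((predict_proba(w, xi) >= 0.5) == (yi == 1) for xi, yi in zip(X, y))
    return hits / len(y)


def make_synthetic_applicants(n, seed=0):
    # Synthetic data only: both predictors and the "true" effects are made up.
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n):
        in_state = rng.choice([0, 1])  # hypothetical predictor
        funded = rng.choice([0, 1])    # hypothetical predictor
        true_logit = -1.0 + 1.5 * in_state + 2.0 * funded
        enrolled = 1 if rng.random() < sigmoid(true_logit) else 0
        X.append([in_state, funded])
        y.append(enrolled)
    return X, y


X, y = make_synthetic_applicants(1000)

# Half of the data builds the model; the remaining half tests its fit.
half = len(X) // 2
X_train, y_train = X[:half], y[:half]
X_test, y_test = X[half:], y[half:]

weights = fit_logistic(X_train, y_train)
test_accuracy = accuracy(weights, X_test, y_test)

print("intercept and predictor weights:", weights)
print("hold-out accuracy:", test_accuracy)
```

The fitted `weights` play the role of the per-predictor weights mentioned above: a larger absolute weight means a stronger estimated relationship between that predictor and the enroll/not enroll response. Accuracy on the held-out half is one simple check of fit; other measures could be used instead.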