| Nome: | Descrição: | Tamanho: | Formato: | |
|---|---|---|---|---|
| 1.73 MB | Adobe PDF |
Autores
Resumo(s)
In the data-driven educational context, this study focuses on supporting the admissions decision-making for all post-graduate programs offered by NOVA IMS. Employing the CRISP-DM methodology and a diverse dataset, including structured and unstructured candidate data, student performance records, and external university rankings, the research creates two distinct models. The first model, based on classifier techniques, is designed to predict the candidate acceptance. The second model, as a complementary model, applies regression techniques to predict the candidateās performance. The regression model aims to validate the selection of top candidates at the conclusion of the admission process by providing a list of candidates predicted to perform well. Both models are designed to take advantage of the available historical university data and enhance the manual and time-consuming admission process performed every year by the university's selection committee. Three text mining techniques (TF-IDF, Word2Vec and Doc2Vec) were applied to the unstructured data, which included the candidateās motivation statement, as well as textual information about their past courses and work experiences. The modelling algorithms evaluated in this study for both models were Artificial Neural Network, Decision Tree, Random Forest, and Support Vector Machine. The Random Forest Classifier delivered superior predictions for the first model, whereas the Support Vector Regression performed better for the second model. Finally, both models produced satisfactory results, successfully achieving the studyās objective.
Descrição
Dissertation presented as the partial requirement for obtaining a Master's degree in Information Management, specialization in Knowledge Management and Business Intelligence
Palavras-chave
Educational Data Mining Machine Learning Admission prediction Performance prediction Decision support SDG 4 - Quality education
