| Nome: | Descrição: | Tamanho: | Formato: | |
|---|---|---|---|---|
| 608.97 KB | Adobe PDF |
Autores
Orientador(es)
Resumo(s)
The consulting firm seeks better insights into the satisfaction of their employees. In this way, the goal of the internship was to conduct a Sentiment Analysis study of the internal surveys answered by employees. To achieve this goal each written response was manually labeled as positive, negative or neutral and different approaches, such as lexicon-based or machine learning-based, were tested and evaluated to find the best solution in terms of performance. One issue that needed to be taken into consideration was the high imbalance between classes, large majority of the data was positive and very little was neutral or negative. This meant that extra attention needed to be paid to these cases. In terms of accuracy the best model was Random Forest, although a method using Naïve Bayes from scratch performed quite well too. It was concluded that future iterations of the project would benefit from having an aspect level classification, which could inform exactly what are the sentiments toward specific aspects contained in the text.
Descrição
Internship Report presented as the partial requirement for obtaining a Master's degree in Data Science and Advanced Analytics, specialization in Business Analytics
Palavras-chave
Sentiment Analysis Human Resources Natural Language Processing Multiclass Classification Supervised Learning
