Utilize este identificador para referenciar este registo: http://hdl.handle.net/10362/10169
Título: Towards improving WEBSOM with multi-word expressions
Autor: Alves, Stefan Eduard Raposo
Orientador: Marques, Nuno Cavalheiro
Silva, Joaquim
Palavras-chave: Self-Organising Maps (SOM)
Text mining
Relevant expressions
Data de Defesa: 2013
Editora: Faculdade de Ciências e Tecnologia
Resumo: Large quantities of free-text documents are usually rich in information and covers several topics. However, since their dimension is very large, searching and filtering data is an exhaustive task. A large text collection covers a set of topics where each topic is affiliated to a group of documents. This thesis presents a method for building a document map about the core contents covered in the collection. WEBSOM is an approach that combines document encoding methods and Self-Organising Maps (SOM) to generate a document map. However, this methodology has a weakness in the document encoding method because it uses single words to characterise documents. Single words tend to be ambiguous and semantically vague, so some documents can be incorrectly related. This thesis proposes a new document encoding method to improve the WEBSOM approach by using multi word expressions (MWEs) to describe documents. Previous research and ongoing experiments encourage us to use MWEs to characterise documents because these are semantically more accurate than single words and more descriptive.
Descrição: Dissertação para obtenção do Grau de Mestre em Engenharia Informática
URI: http://hdl.handle.net/10362/10169
Aparece nas colecções:FCT: DI - Dissertações de Mestrado

Ficheiros deste registo:
Ficheiro Descrição TamanhoFormato 
Alves_2013.pdf3,35 MBAdobe PDFVer/Abrir

FacebookTwitterDeliciousLinkedInDiggGoogle BookmarksMySpace
Formato BibTex MendeleyEndnote 

Todos os registos no repositório estão protegidos por leis de copyright, com todos os direitos reservados.