Utilize este identificador para referenciar este registo: http://hdl.handle.net/10362/171103
Título: Data extraction with a state-of-the-art nip model trained on synthetically generated German rental contracts
Autor: Trut, Dominik
Orientador: Han, Qiwei
Palavras-chave: Machine learning
Natural language programming
Nlp
Named entity recognition
Ner
Synthetic data generation
Data scarcity
Proptech
Real estate 4.0
Spacy
Data de Defesa: 23-Jan-2023
Resumo: The flow of information in the real estate market is rapidly accelerating, and real estate investment companies are actively seeking automations to streamline transactions to minimize missed investment opportunities. As a complementary product feature to zoolo, a Property Technology start-up based in Germany, an extraction model was deployed to automatically derive information from scanned leases. Given the scarcity of data due to legal constraints, synthetic leases were constructed to train a fine-tuned spaCy v3.4 model. Thus, this paper reveals that three algorithmically generated synthetic lease paragraphs are suitable to provide a basis for training and applying the spaCy NLP model using Named Entity Recognition and token classification models
URI: http://hdl.handle.net/10362/171103
Designação: A Directed Research Internship, presented as part of the requirements for the Award of a Master’s degree in Business Analytics from the Nova School of Business and Economics
Aparece nas colecções:NSBE: Nova SBE - MA Dissertations



FacebookTwitterDeliciousLinkedInDiggGoogle BookmarksMySpace
Formato BibTex MendeleyEndnote 

Todos os registos no repositório estão protegidos por leis de copyright, com todos os direitos reservados.