Utilize este identificador para referenciar este registo:
http://hdl.handle.net/10362/171103| Título: | Data extraction with a state-of-the-art nip model trained on synthetically generated German rental contracts |
| Autor: | Trut, Dominik |
| Orientador: | Han, Qiwei |
| Palavras-chave: | Machine learning Natural language programming Nlp Named entity recognition Ner Synthetic data generation Data scarcity Proptech Real estate 4.0 Spacy |
| Data de Defesa: | 23-Jan-2023 |
| Resumo: | The flow of information in the real estate market is rapidly accelerating, and real estate investment companies are actively seeking automations to streamline transactions to minimize missed investment opportunities. As a complementary product feature to zoolo, a Property Technology start-up based in Germany, an extraction model was deployed to automatically derive information from scanned leases. Given the scarcity of data due to legal constraints, synthetic leases were constructed to train a fine-tuned spaCy v3.4 model. Thus, this paper reveals that three algorithmically generated synthetic lease paragraphs are suitable to provide a basis for training and applying the spaCy NLP model using Named Entity Recognition and token classification models |
| URI: | http://hdl.handle.net/10362/171103 |
| Designação: | A Directed Research Internship, presented as part of the requirements for the Award of a Master’s degree in Business Analytics from the Nova School of Business and Economics |
| Aparece nas colecções: | NSBE: Nova SBE - MA Dissertations |
Ficheiros deste registo:
| Ficheiro | Descrição | Tamanho | Formato | |
|---|---|---|---|---|
| 30568_Dominik_Trut_2022_23_Fall_46274_Dominik_Trut_261147_2131943021_2_.pdf | 1,64 MB | Adobe PDF | Ver/Abrir Acesso Restrito. Solicitar cópia ao autor! |
Todos os registos no repositório estão protegidos por leis de copyright, com todos os direitos reservados.











