Utilize este identificador para referenciar este registo: http://hdl.handle.net/10362/171103
Registo completo
Campo DCValorIdioma
dc.contributor.advisorHan, Qiwei-
dc.contributor.authorTrut, Dominik-
dc.date.accessioned2024-09-03T13:49:53Z-
dc.date.issued2023-01-23-
dc.date.submitted2022-12-16-
dc.identifier.urihttp://hdl.handle.net/10362/171103-
dc.description.abstractThe flow of information in the real estate market is rapidly accelerating, and real estate investment companies are actively seeking automations to streamline transactions to minimize missed investment opportunities. As a complementary product feature to zoolo, a Property Technology start-up based in Germany, an extraction model was deployed to automatically derive information from scanned leases. Given the scarcity of data due to legal constraints, synthetic leases were constructed to train a fine-tuned spaCy v3.4 model. Thus, this paper reveals that three algorithmically generated synthetic lease paragraphs are suitable to provide a basis for training and applying the spaCy NLP model using Named Entity Recognition and token classification modelspt_PT
dc.language.isoengpt_PT
dc.rightsembargoedAccesspt_PT
dc.subjectMachine learningpt_PT
dc.subjectNatural language programmingpt_PT
dc.subjectNlppt_PT
dc.subjectNamed entity recognitionpt_PT
dc.subjectNerpt_PT
dc.subjectSynthetic data generationpt_PT
dc.subjectData scarcitypt_PT
dc.subjectProptechpt_PT
dc.subjectReal estate 4.0pt_PT
dc.subjectSpacypt_PT
dc.titleData extraction with a state-of-the-art nip model trained on synthetically generated German rental contractspt_PT
dc.typemasterThesispt_PT
thesis.degree.nameA Directed Research Internship, presented as part of the requirements for the Award of a Master’s degree in Business Analytics from the Nova School of Business and Economicspt_PT
dc.date.embargo2027-12-16-
dc.identifier.tid203315936pt_PT
dc.subject.fosDomínio/Área Científica::Ciências Sociais::Economia e Gestãopt_PT
Aparece nas colecções:NSBE: Nova SBE - MA Dissertations



FacebookTwitterDeliciousLinkedInDiggGoogle BookmarksMySpace
Formato BibTex MendeleyEndnote 

Todos os registos no repositório estão protegidos por leis de copyright, com todos os direitos reservados.