Logo do repositório
 
A carregar...
Miniatura
Publicação

Understanding Residential Address Patterns in Urban and Rural Areas

Utilize este identificador para referenciar este registo.

Orientador(es)

Resumo(s)

Address data quality has a direct impact on demographic and other spatial analyses, since it may lead to uncertainty and potential bias. Most of the existing studies measure address quality through matching with reference databases, which can be an expensive and time-consuming process. To bridge this gap, we propose a multiclass classification algorithm to evaluate the syntactic quality of residential addresses from a large database without using external databases. Namely, we adopt a multi-objective optimization approach, based on the NSGA-II algorithm and two modified k-NN algorithms. The objective is to find the address components as well as the optimal number of neighboring examples that help explain which class (good, incorrect or incomplete and anomalous) the quality of an address belongs to, by type of region (urban, medium urban, and rural). The presented results indicate that the proposed approach outperforms the best baseline algorithms on multiclass classification, while also providing descriptive information on the most relevant features and median local neighborhood of each instance. With this study, we further extend previous research in the field of address pattern extraction, by explicitly differentiating urban and rural areas as well as invalid and anomalous addresses.

Descrição

Cruz, P., Vanneschi, L., & Painho, M. (2025). Understanding Residential Address Patterns in Urban and Rural Areas: A Machine Learning Approach. Transactions in GIS, 29(1), 1-17. Article e70003. https://doi.org/10.1111/tgis.70003 --- This work was supported by national funds through FCT (Fundação para a Ciência e a Tecnologia), under the project—UIDB/04152/2020—Centro de Investigação em Gestão de Informação (MagIC)/NOVA IMS.

Palavras-chave

address validation census data quality machine learning multiclass classification statistical operations General Earth and Planetary Sciences SDG 9 - Industry, Innovation, and Infrastructure SDG 11 - Sustainable Cities and Communities

Contexto Educativo

Citação

Projetos de investigação

Unidades organizacionais

Fascículo