Logo do repositório
 
A carregar...
Logótipo do projeto
Projeto de investigação

Sem título

Autores

Publicações

The Morais Dictionary
Publication . Salgado, Ana Castro; Romary, Laurent; Costa, Rute; Tasovac, Toma; Khan, Anas Fahad; Ramos, Margarida; Almeida, Bruno; Carvalho, Sara; Khemakhem , Mohamed; Silva, Raquel; Lehečka, Boris; Centro de Linguística da UNL (CLUNL); Departamento de Linguística (DL); Edinburgh University Press
This article outlines essential best practices for retro-digitized dictionary projects, using the ongoing MORDigital project (DOI 10.54499/PTDC/LLT-LIN/6841/2020) as a case study. The MORDigital project focuses on digitally transforming the historically significant Portuguese Morais dictionary’s first three editions (1789, 1813, 1823). While the primary objective is to create faithful digital versions of these renowned dictionaries, MORDigital stands out by going beyond the mere adoption of established best practices. Instead, it reflects on the choices made throughout the process, providing insights into the decision-making process. The key topics emphasized include (1) the establishment of a robust data model; (2) the refinement of metadata; (3) the implementation of consistent identifiers; and (4) the enhancement of encoding techniques; additionally exploring the issue of structuring domain labelling. The article aims to contribute to the ongoing discourse on best practices in retro-digitized dictionary projects and their implications for data preservation and knowledge organization.
MORDigital
Publication . Costa, Rute; Salgado, Ana; Khan, Anas; Carvalho, Sara; Romary, Laurent; Almeida, Bruno; Ramos, Margarida; Khemakhem , Mohamed; Silva, Raquel; Tasovac, Toma; Departamento de Linguística (DL); Centro de Linguística da UNL (CLUNL)
MORDigital is a newly funded Portuguese lexicographic project that aims to produce high-quality and searchable digital versions of the first three editions (1789; 1813; 1823) of the Diccionario da Lingua Portugueza by António de Morais Silva, preserving and making accessible this important work of European heritage. This paper will describe the current state of the art, the project, its objectives and the methodology proposed, the latter of which is based on a rigorous linguistic analysis and will also include steps necessary for the ontologisation of knowledge contained in and relating to the text. A section will be dedicated to the various investigation domains of the project description. The output of the project will be made available via a dedicated platform.
Modelling usage information in a legacy dictionary
Publication . Almeida, Bruno; Costa, Rute; Salgado, Ana; Ramos, Margarida; Romary, Laurent; Centro de Linguística da UNL (CLUNL); Departamento de Linguística (DL)
This paper describes ongoing work in the modelling of usage information in the context of the MORDigital project. The latter is based on the encoding and publication as linked data of Diccionario da Lingua Portugueza, a Portuguese legacy dictionary authored by António de Morais Silva, whose first edition was published in 1789. In this paper, we will focus on the modelling of domain labels in Ontolex-Lemon, based on a previous encoding of the dictionary’s entries in TEI Lex-0. This approach should be reusable for other projects involving the linked data publication of legacy dictionaries.
Applying terminological methods to lexicographic work
Publication . Salgado, Ana; Costa, Rute; Tasovac, Toma; Centro de Linguística da UNL (CLUNL); Departamento de Linguística (DL)
Applying terminological methods to lexicography helps lexicographers deal with the terms occurring in general language dictionaries, especially when it comes to writing the definitions of concepts belonging to special fields. In the context of the lexicographic work of the Dicionário da Língua Portuguesa, an updated digital version of the last Academia das Ciências de Lisboa’ dictionary published in 2001, we have assumed that terminology – in its dual dimension, both linguistic and conceptual – and lexicography are complementary in their methodological approaches. Both disciplines deal with lexical items, which can be lexical units or terms. In this paper, we apply terminological methods to improve the treatment of terms in general language dictionaries and to write definitions as a form of achieving more precision and accuracy, and also to specify the domains to which they belong. Additionally, we highlight the consistent modelling of lexicographic components, namely the hierarchy of domain labels, as they are term identification markers instead of a flat list of domains. The need to create and make available structured, organised and interoperable lexicographic resources has led us to follow a path in which the application of standards and best practices of treating and representing specialised lexicographic content are fundamental requirements.
Interlinking Lexicographic Data in the MORDigital Project
Publication . Khan, Fahad; Salgado, Ana; Costa, Rute; Carvalho, Sara; Romary, Laurent; Almeida, Bruno; Ramos, Margarida; Khemakhem , Mohamed; Silva, Raquel; Tasovac, Toma; Centro de Linguística da UNL (CLUNL); Departamento de Linguística (DL)

Unidades organizacionais

Descrição

Palavras-chave

Contribuidores

Financiadores

Entidade financiadora

Fundação para a Ciência e a Tecnologia

Programa de financiamento

3599-PPCDT

Número da atribuição

PTDC/LLT-LIN/6841/2020

ID