Logo do repositório
 
Publicação

Designing the ELEXIS Parallel Sense-Annotated Dataset in 10 European Languages

dc.contributor.authorMartelli, Federico
dc.contributor.authorNavigli, Roberto
dc.contributor.authorKrek, Simon
dc.contributor.authorKallas, Jelena
dc.contributor.authorGantar, Polona
dc.contributor.authorKoeva, Svetla
dc.contributor.authorNimb, Sanni
dc.contributor.authorSandford Pedersen, Bolette
dc.contributor.authorOlsen, Sussi
dc.contributor.authorLangemets, Margit
dc.contributor.authorKoppel, Kristina
dc.contributor.authorÜksik, Tiiu
dc.contributor.authorDobrovoljc, Kaja
dc.contributor.authorUreña-Ruiz, Rafael-J.
dc.contributor.authorSancho-Sánchez, José-Luis
dc.contributor.authorLipp, Veronika
dc.contributor.authorVáradi, Tamás
dc.contributor.authorGyőrffy, András
dc.contributor.authorLászló, Simon
dc.contributor.authorQuochi, Valeria
dc.contributor.authorMonachini, Monica
dc.contributor.authorFrontini, Francesca
dc.contributor.authorTiberius, Carole
dc.contributor.authorTempelaars, Rob
dc.contributor.authorCosta, Rute
dc.contributor.authorSalgado, Ana
dc.contributor.authorČibej, Jaka
dc.contributor.authorMunda, Tina
dc.contributor.institutionDepartamento de Linguística (DL)
dc.contributor.institutionCentro de Linguística da UNL (CLUNL)
dc.contributor.pblLexical Computing CZ s.r.o.
dc.date.accessioned2026-01-14T09:00:31Z
dc.date.available2026-01-14T09:00:31Z
dc.date.issued2021
dc.descriptionUIDB/03213/2020 UIDP/03213/2020
dc.description.abstractOver the course of the last few years, lexicography has witnessed the burgeoning of increasingly reliable automatic approaches supporting the creation of lexicographic resources such as dictionaries, lexical knowledge bases and annotated datasets. In fact, recent achievements in the field of Natural Language Processing and particularly in Word Sense Disambiguation have widely demonstrated their effectiveness not only for the creation of lexicographic resources, but also for enabling a deeper analysis of lexical-semantic data both within and across languages. Nevertheless, we argue that the potential derived from the connections between the two fields is far from exhausted. In this work, we address a serious limitation affecting both lexicography and Word Sense Disambiguation, i.e. the lack of high-quality sense-annotated data and describe our efforts aimed at constructing a novel entirely manually annotated parallel dataset in 10 European languages. For the purposes of the present paper, we concentrate on the annotation of morpho-syntactic features. Finally, unlike many of the currently available sense-annotated datasets, we will annotate semantically by using senses derived from high-quality lexicographic repositories.en
dc.description.versionpublishersversion
dc.description.versionpublished
dc.format.extent18
dc.format.extent571425
dc.identifier.issn2533-5626
dc.identifier.otherPURE: 36001760
dc.identifier.otherPURE UUID: 16bf96f3-94c3-49b0-b38d-bc4b3fae286e
dc.identifier.otherScopus: 85137076090
dc.identifier.urihttp://hdl.handle.net/10362/198725
dc.identifier.urlhttps://www.scopus.com/pages/publications/85137076090
dc.identifier.urlhttps://elex.link/elex2021/proceedings-download/
dc.language.isoeng
dc.peerreviewedyes
dc.subjectDigital lexicography
dc.subjectNatural Language Processing
dc.subjectComputational Linguistics
dc.subjectCorpus Linguistics
dc.subjectWord Sense Disambiguation
dc.titleDesigning the ELEXIS Parallel Sense-Annotated Dataset in 10 European Languagesen
dc.typejournal article
degois.publication.firstPage377
degois.publication.issue2021
degois.publication.lastPage395
degois.publication.titleProceedings of Electronic Lexicography in the 21st Century Conference
dspace.entity.typePublication
rcaap.rightsopenAccess

Ficheiros

Principais
A mostrar 1 - 1 de 1
A carregar...
Miniatura
Nome:
eLex_2021_22_pp377_395.pdf
Tamanho:
558.03 KB
Formato:
Adobe Portable Document Format