Cruz, FranciscoCastelli, Mauro2023-11-022023-11-022023-10-012610-9182PURE: 63539763PURE UUID: c547c8b0-44cb-4f1b-9f67-c94a0d4f076cScopus: 85174944049ORCID: /0000-0002-8793-1451/work/151388751http://hdl.handle.net/10362/159492Cruz, F., & Castelli, M. (2023). Learning Curves Prediction for a Transformers-based Model. Emerging Science Journal, 7(5), 1491-1500. https://doi.org/10.28991/ESJ-2023-07-05-03One of the main challenges when training or fine-tuning a machine learning model concerns the number of observations necessary to achieve satisfactory performance. While, in general, more training observations result in a better-performing model, collecting more data can be time-consuming, expensive, or even impossible. For this reason, investigating the relationship between the dataset's size and the performance of a machine learning model is fundamental to deciding, with a certain likelihood, the minimum number of observations that are necessary to ensure a satisfactory-performing model is obtained as a result of the training process. The learning curve represents the relationship between the dataset’s size and the performance of the model and is especially useful when choosing a model for a specific task or planning the annotation work of a dataset. Thus, the purpose of this paper is to find the functions that best fit the learning curves of a Transformers-based model (LayoutLM) when fine-tuned to extract information from invoices. Two new datasets of invoices are made available for such a task. Combined with a third dataset already available online, 22 sub-datasets are defined, and their learning curves are plotted based on cross-validation results. The functions are fit using a non-linear least squares technique. The results show that both a biasymptotic and a Morgan-Mercer-Flodin function fit the learning curves extremely well. Also, an empirical relation is presented to predict the learning curve from a single parameter that may be easily obtained in the early stage of the annotation process.10822335engDataset SizeDocument Data ExtractionFine-TuningLearning CurvesTransformersGeneralLearning Curves Prediction for a Transformers-based Modeljournal article10.28991/ESJ-2023-07-05-03https://www.scopus.com/pages/publications/85174944049https://zenodo.org/doi/10.5281/zenodo.6371709