Pedro Ortiz Suarez
Pedro Ortiz Suarez
Accueil
Publications
Présentations
Projets
Contactez moi
CV
Clair
Sombre
Automatique
Français
Français
Deutsch
English
Español
7
A Data-driven Approach to Natural Language Processing for Contemporary and Historical French
We determine that the importance of the pre-training dataset size was largely overestimated, as we are able to repeatedly show that language models can be pre-trained with corpora of a modest size.
Pedro Ortiz Suarez
PDF
Citation
Theses
TEL
Citation
×