Pedro Ortiz Suarez
Pedro Ortiz Suarez
Accueil
Publications
Présentations
Projets
Contactez moi
CV
Clair
Sombre
Automatique
Français
Français
Deutsch
English
Español
Benoît Sagot
Récents
Le projet FREEM : ressources, outils et enjeux pour l’étude du français d’Ancien Régime
BERTrade: Using Contextual Embeddings to Parse Old French
From FreEM to D'AlemBERT: a Large Corpus and a Language Model for Early Modern French
Towards a Cleaner Document-Oriented Multilingual Crawled Corpus
Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets
Ungoliant: An Optimized Pipeline for the Generation of a Very Large-Scale Multilingual Web Corpus
A Monolingual Approach to Contextualized Word Embeddings for Mid-Resource Languages
A Monolingual Approach to Contextualized Word Embeddings for Mid-Resource Languages
CamemBERT: a Tasty French Language Model
Building a User-Generated Content North-African Arabizi Treebank: Tackling Hell
Les modèles de langue contextuels Camembert pour le Français : impact de la taille et de l'hétérogénéité des données d'entrainement
Establishing a New State-of-the-Art for French Named Entity Recognition
French Contextualized Word-Embeddings with a sip of CaBeRnet: a New French Balanced Reference Corpus
Asynchronous Pipeline for Processing Huge Corpora on Medium to Low Resource Infrastructures
Asynchronous Pipeline for Processing Huge Corpora on Medium to Low Resource Infrastructures
Preparing the Dictionnaire Universel for Automatic Enrichment
Citation
×