CNRS Researcher (Chargé de recherche)
Laboratoire d'Informatique de Grenoble (LIG), Université Grenoble-Alpes
I am a CNRS researcher working in Getalp team (part of Laboratoire d’Informatique de Grenoble). Before that, I was a research scientist at Naver Labs Europe (2019) and a research associate at the University of Edinburgh (2018) in Shay Cohen’s research group. I obtained my PhD in 2017 at Paris Diderot University, which is now part of Université Paris Cité. Specifically, I worked in the Laboratoire de Linguistique Formelle (LLF), under the supervision of Benoît Crabbé and on the topic of natural language parsing.
My research focuses on natural language and speech processing. Some of my current research interests: syntactic/semantic parsing, structured prediction, end-to-end speech parsing, multitask learning.
Contact: first.last@univ-grenoble-alpes.fr
PhD supervision
Publications
Tools and data
News July 2023: I have been awarded an ANR grant to work on syntactic parsing of spoken French.
Available positions:
2024:
Should Cross-Lingual AMR Parsing go Meta? An Empirical Assessment of Meta-Learning and Joint Learning AMR Parsing.
Jeongwoo Kang, Maximin Coavoux, Cédric Lopez, Didier Schwab.
Findings of EMNLP 2024
[pdf] [bib] [preprint] [Jeongwoo’s code and data] [git]
Growing Trees on Sounds: Assessing Strategies for End-to-End Dependency Parsing of Speech.
Adrien Pupier, Maximin Coavoux, Jérôme Goulian, Benjamin Lecouteux
ACL 2024
[pdf] [bib] [preprint] [Adrien’s code]
(in French) Méta-apprentissage pour l’analyse AMR translingue.
Jeongwoo Kang, Maximin Coavoux, Cédric Lopez, Didier Schwab
TALN 2024
[pdf] [bib] [Jeongwoo’s code and data]
(in French) Une approche par graphe pour l’analyse syntaxique en dépendances de bout en bout de la parole.
Adrien Pupier, Maximin Coavoux, Benjamin Lecouteux, Jérôme Goulian
TALN 2024
[pdf] [bib]
What has LeBenchmark Learnt about French Syntax?
Zdravko Dugonjić, Adrien Pupier, Benjamin Lecouteux, Maximin Coavoux
LREC-COLING 2024
[pdf] [bib] [preprint]
Limitations of Human Identification of Automatically Generated Text
Nadège Alavoine, Maximin Coavoux, Emmanuelle Esperança-Rodier, Romane Gallienne, Carlos Gonzalez Gallardo, Jérôme Goulian, Jose G. Moreno, Aurélie Névéol, Didier Schwab, Vincent Segonne, Johanna Simoens
LREC-COLING 2024
[pdf] [bib]
Jargon: A Suite of Language Models and Evaluation Tasks for French Specialized Domains
Vincent Segonne, Aidan Mannion, Laura Alonzo-Canul, Alexandre Audibert, Xingyu Liu, Cécile Macaire, Adrien Pupier, Yongxin Zhou, Mathilde Aguiar, Felix Herron, Magali Norré, Massih-Reza Amini, Pierrette Bouillon, Iris Eshkol-Taravella, Emmanuelle Esperança-Rodier, Thomas François, Lorraine Goeuriot, Jérôme Goulian, Mathieu Lafourcade, Benjamin Lecouteux, François Portet, Fabien Ringeval, Vincent Vandeghinste, Maximin Coavoux, Marco Dinarelli and Didier Schwab
LREC-COLING 2024
[pdf] [bib]
Vlexique 2.0: A rich lexicon of French verbal inflection with form-level frequencies
Sacha Beniamine, Maximin Coavoux, Olivier Bonami
IMM 2024 International Morphology Meeting
[pdf] [bib] [data] [data visualisation] [morphological tagger]
LeBenchmark 2.0: a Standardized, Replicable and Enhanced Framework for Self-supervised Representations of French Speech.
Titouan Parcollet, Ha Nguyen, Solène Evain, Marcely Zanon Boito, Adrien Pupier, Salima Mdhaffar, Hang Le, Sina Alisamir, Natalia Tomashenko, Marco Dinarelli, Shucong Zhang, Alexandre Allauzen, Maximin Coavoux, Yannick Esteve, Mickael Rouvier, Jérôme Goulian, Benjamin Lecouteux, Francois Portet, Solange Rossato, Fabien Ringeval, Didier Schwab, Laurent Besacier.
Computer Speech & Language
[pdf] [bib] [preprint] [models]
2023:
Pretrained Language Models v. Court Ruling Predictions: A Case Study on a Small Dataset of French Court of Appeal Rulings.
Olivia Vaudaux, Caroline Bazzoli, Maximin Coavoux, Géraldine Vial, Étienne Vergès.
NLLP 2023
[pdf] [bib]
Neural correlates of object-extracted relative clause processing across English and Chinese.
Donald Dunagan, Miloš Stanojević, Maximin Coavoux, Shulin Zhang, Shohini Bhattasali, Jixing Li, Jonathan Brennan, John Hale.
Neurobiology of Language. 2023.
[link]
(In French) Analyse sémantique AMR pour le français par transfert translingue.
Jeongwoo Kang, Maximin Coavoux, Cédric Lopez, Didier Schwab.
TALN 2023.
[pdf] [bib] [Jeongwoo’s code]
On Detecting Policy-Related Political Ads: An Exploratory Analysis of Meta Ads in 2022 French Election.
Vera Sosnovik, Romaissa Kessi, Maximin Coavoux, Oana Goga.
TheWebConf2023.
[pdf] [bib] [preprint]
BERT Is Not The Count: Learning to Match Mathematical Statements with Proofs.
Weixian Li, Yftah Ziser, Maximin Coavoux and Shay B. Cohen.
EACL 2023.
[pdf] [bib] [preprint] [old preprint version]
2022:
(In French) Apprentissage profond pour l’estimation du quotient ouvert à partir du signal électroglottographique.
Minh-Châu Nguyên, Maximin Coavoux, Solange Rossato.
LIFT 2022.
[pdf] [bib]
(In French) Extraction de Phrases Préfabriquées des Interactions à partir d’un corpus arboré du français parlé : une étude exploratoire.
Marie-Sophie Pausé, Agnès Tutin, Olivier Kraif, Maximin Coavoux.
CMLF 2022.
[pdf] [bib]
End-to-End Dependency Parsing of Spoken French.
Adrien Pupier, Maximin Coavoux, Benjamin Lecouteux, Jérôme Goulian.
Interspeech 2022.
[pdf] [bib] [Adrien’s code]
2021:
BERT-Proof Syntactic Structures: Investigating Errors in Discontinuous Constituency Parsing.
Maximin Coavoux.
Findings of ACL 2021.
[pdf] [bib] [test suite] [parser’s code]
(In French) Contribution d’informations syntaxiques aux capacités de généralisation compositionelle des modèles seq2seq convolutifs.
Diana Nicoleta Popa, William N. Havard, Maximin Coavoux, Eric Gaussier and Laurent Besacier.
TALN 2021 (short).
[pdf] [bib]
Self-Supervised and Controlled Multi-Document Opinion Summarization.
Hady Elsahar, Maximin Coavoux, Matthias Gallé, Jos Rozen.
EACL 2021 (long).
[pdf] [bib] [preprint]
2020:
(In French) FlauBERT : des modèles de langue contextualisés pré-entraînés pour le français.
Hang Le, Loïc Vial, Jibril Frej, Vincent Segonne, Maximin Coavoux, Benjamin Lecouteux, Alexandre Allauzen, Benoît Crabbé, Laurent Besacier, Didier Schwab.
TALN 2020 (short).
[pdf] [bib]
(In French) Qu’apporte BERT à l’analyse syntaxique en constituants discontinus ? Une suite de tests pour évaluer les prédictions de structures syntaxiques discontinues en anglais.
Maximin Coavoux
TALN 2020 (short).
[pdf] [bib]
FlauBERT: Unsupervised Language Model Pre-training for French.
Hang Le, Loïc Vial, Jibril Frej, Vincent Segonne, Maximin Coavoux, Benjamin Lecouteux, Alexandre Allauzen, Benoît Crabbé, Laurent Besacier, Didier Schwab.
LREC 2020.
[pdf][bib] [pre-print] [code for FlauBERT] [code for constituency parsing experiments]
2019:
Unsupervised Aspect-Based Multi-Document Abstractive Summarization.
Maximin Coavoux, Hady Elsahar, Matthias Gallé.
NewSum 2019 workshop (short).
[pdf] [bib]
Discontinuous Constituency Parsing with a Stack-free Transition System and a Dynamic Oracle.
Maximin Coavoux, Shay B. Cohen.
NAACL 2019 (long).
[pdf] [bib] [code] [slides]
Unlexicalized Transition-based Discontinuous Constituency Parsing.
Maximin Coavoux, Benoît Crabbé, Shay B. Cohen.
TACL 2019.
[pdf] [html] [bib] [code]
2018:
2017:
2016:
2015:
Thesis: