Lexical databases

PolylexFLE is a database that primarily includes French verbal multiword expressions, partially annotated with CEFR levels (A1 to C2). The level annotation is carried out both manually, based on Beacco’s reference vocabularies, and automatically, taking into account parameters such as the frequencies of expressions in a CEFR-aligned corpus. This resource is based on the Lexique-Grammaire tables. The database was developed by Amalia Todirascu, Thomas François, and Marion Cargill.


FLELex

A lexicon for French as a Foreign Language that provides normalized frequencies of lemmas for each CEFR level. This resource (directed by Thomas François) is a result of collaboration between the CENTAL laboratory at l’UCLouvain, the LPL laboratory at Aix-Marseille University, and the EarlyTracks company.


EmoBase

EmoBase is a multilingual database developed within the Emolex project, which structures and compares the lexicon of emotions across five languages. Based on corpus analysis, the project provides tools for acquiring emotional collocations and analyzing multilingual corpora. It has resulted in over 80 publications and an international conference in 2013. The project was directed by the team from the LIDILEM laboratory in Grenoble, in partnership with the Universities of Cologne and Osnabrück.