Workshop on Multiword Expressions & Electronic Lexicons at COLING2020 (POSTPONED to DECEMBER 2020)
Date |
---|
NEW DATES: 8. – 13.12.2020 – Main Conference |
NEW DATES: 12. – 13.12.2020 – Workshop and Tutorials |
NEW DATE: 13.12.2020 – Joint Workshop on Multiword Expressions and Electronic Lexicons (MWE-LEX 2020) |
Location |
---|
virtual |
Joint Workshop on Multiword Expressions and Electronic Lexicons (MWE-LEX 2020)
The joint MWE-LEX workshop addresses two domains – multiword expressions and (electronic) lexicons – with partly overlapping communities and research interests, but divergent practices and terminologies.
Multiword expressions (MWEs) are word combinations, such as by and large, hot dog, pay a visit or pull one’s leg, which exhibit lexical, syntactic, semantic, pragmatic or statistical idiosyncrasies. MWEs encompass closely related linguistic objects: idioms, compounds, light-verb constructions, rhetorical figures, institutionalised phrases and collocations. Because of their unpredictable behavior, notably their non-compositional semantics, MWEs pose problems in linguistic modelling (e.g. treebank annotation, grammar engineering), NLP pipelines (notably when orchestrated with parsing), and end-user applications (e.g. information extraction). Modelling and processing of MWEs has been the topic of the MWE workshop, organised over the past years by the MWE section of SIGLEX.
Because MWE-hood is a largely lexical phenomenon, appropriately built electronic MWE lexicons turn out to be quite important for NLP. Their conception opens up, among others, the issues of lemmatization and of standardised representation of morphological, syntactic and semantic properties of MWEs. Large standardised multilingual, possibly interconnected, NLP-oriented MWE lexicons prove indispensable for NLP tasks such as MWE identification, due to its critical sensitivity to unseen data. But the development of such lexicons is challenging and calls for tools which would leverage, on the one hand, MWEs encoded in pre-existing NLP-unaware lexicons and, on the other hand, automatic MWE discovery in large non-annotated corpora.
Special Track: PARSEME Shared Task on Semi-Supervised verbal MWE Identification
MWE-LEX 2020 will host edition 1.2 of the PARSEME shared task on semi-supervised identification of verbal MWEs. This is a follow-up of editions 1.0 (2017), and 1.1 (2018). Edition 1.2 will feature
(a) improved and extended corpora annotated with MWEs,
(b) complementary unannotated corpora for unsupervised MWE discovery, and
(c) new evaluation metrics focusing on unseen MWEs.
The aim is to foster the development of unsupervised methods for MWE lexicon induction, which in turn can be used for identification. Authors may submit system description papers to a special track. Details are available on the shared task page.
For the 2020 edition of the PARSEME shared task, 7 teams submitted 9 system results to the shared task: 2 to the closed track and 7 to the open track.
Out of these results, 4 cover all 14 languages for which data were made available by the organizers: