Joint Workshop on Multiword Expressions and WordNet (MWE-WN 2019)

Workshop at ACL 2019 (Florence, Italy), August 2nd, 2019

Organized and sponsored by the Special Interest Group on the Lexicon (SIGLEX) of the Association for Computational Linguistics (ACL). Endorsed by the Global Wordnet Association (GWA). This joint event is the 15th edition of the Workshop on Multiword Expressions (MWE)

Last updated: August 21, 2019

NEWS:

[August 19]: The MWE-WN proceedings are available on the ACL Anthology.
[June 17]: The workshop program is online.
[May 24]: The list of accepted papers has been published.
[Now 29]: The workshop proposal has been accepted and will be co-located with ACL 2019 in Florence.

Description

As a joint event, this workshop proposal addresses two domains – multiword expressions and WordNet – with partly overlapping communities and research interests, but relatively divergent practices and terminologies.

Multiword expressions (MWEs) are word combinations, such as all of a sudden, a hot dog, to pay a visit or to pull one's leg, which exhibit lexical, syntactic, semantic, pragmatic and/or statistical idiosyncrasies. MWEs encompass closely related linguistic objects such as idioms, compounds, light verb constructions, rhetorical figures, institutionalised phrases or collocations. Modelling and computational aspects of MWEs have been covered by the Multiword Expression Workshop, organised over the past years by the MWE section of SIGLEX. Because of their unpredictable behavior, and most prominently their non-compositional semantics, MWEs pose special problems in linguistic modelling (e.g. treebank annotation and grammar engineering), in NLP pipelines (e.g. when their orchestration with parsing is concerned), and in end-use applications (e.g. information extraction or machine translation).

From its very beginning, WordNet has included MWEs, and linked their meanings into a shared network: talk, blab, sing, spill the beans, let the cat out of the bag, tattle, peach, babble, babble out, blab out “divulge confidential information or secrets”. Indeed, over 50% of entries in the Princeton WordNet of English are MWEs and most other projects have a similarly high percentage. However, MWEs are generally encoded as a string, with no internal information about syntactic structure or compositionality. Many suggestions for richer encodings have been made but not yet widely adopted, partly because of the cost of adding richer data to already large lexicons.

For the above reasons, the MWE and WN communities propose to put forward a joint event, which should allow better convergences and scientific innovation. We will call for papers focusing on research related (but not limited) to the following topics.

Joint topics on MWEs and Wordnets

Encoding MWEs in wordnets - how can we take advantage of the existing rich structure of wordnets?
Encoding MWEs in wordnets - consequences for a lexical-semantic organization of MWEs
Linking wordnets with existing MWE lexicons
Word sense disambiguation for single-word and multiword expressions
Cross-wordnet and cross-language comparisons of MWEs
MWEs in sense-annotated corpora
Semantic relations in wordnets related to MWEs

MWE-specific topics

Computationally-applicable theoretical studies on MWEs and constructions in psycholinguistics, corpus linguistics and formal grammars
MWE and construction annotation in corpora and treebanks
MWE and construction representation in manually/automatically constructed lexical resources
Processing of MWEs and constructions in syntactic and semantic frameworks (e.g. CCG, CxG, HPSG, LFG, TAG, UD, etc.), and in end-user applications (e.g. information extraction, machine translation and summarization)
Original discovery and identification methods for MWEs and constructions
MWEs and constructions in language acquisition and in non-standard language (e.g. tweets, forums, spontaneous speech)
Evaluation of annotation and processing techniques for MWEs and constructions
Retrospective comparative analyses from the PARSEME shared tasks on automatic identification of MWEs

Note that, with the intention to also perpetuate previous converging effects with the Construction Grammar community (see the LAW-MWE-CxG 2018 workshop), we extend the traditional MWE scope to grammatical constructions.

Links to the PARSEME Shared Task on Automatic Verbal MWE Identification

The two previous editions of the MWE workshop (in 2017 and in 2018) featured the PARSEME Shared Task on Automatic Identification of Verbal Multiword Expressions. There is no edition planned for 2019, however we wish the MWE-WN 2019 workshop to play an important role in preparing a future edition (probably in 2020). Namely, we call for papers including retrospective and/or comparatives analyses of the results of previous shared task editions, as well as recommendations for corpus annotation enhancements.

Submission modalities

There are two tracks:

Regular research track, where the submissions must be substantially original.
Dissemination track, which welcomes recent previously published work (or work accepted for publication), dedicated explicitly both to MWEs and WordNet. This will be presented to encourage discussion, but only the abstract will appear in the proceedings.

The regular research track submissions should follow one of the 2 formats:

Long papers (8 content pages + references): Long papers should report on solid and finished research including new experimental results, resources and/or techniques.
Short papers (4 content pages + references): Short papers should report on small experiments, focused contributions, ongoing research, negative results and/or philosophical discussion.

The decisions as to oral or poster presentations of the selected papers will be taken by the PC chairs. No distinction between papers presented orally and as posters is made in the workshop proceedings. There is no limit on the number of reference pages. Authors will be granted an extra page for the final version of their papers. The submission is double-blind, as understood by the ACL 2019 submission policy. The reported research should be substantially original. Papers available as preprints can also be submitted provided that they fulfil the conditions defined by the ACL Policies for Submission, Review and Citation. For both types of submissions in this track, the ACL 2019 templates should be used.

The dissemination track submissions are not anonymous, and they should not exceed one page, including the authors' names and affiliations, the mention of the original venue, the link to the original paper and a short explanation why the paper is relevant to MWEs and Wordnets workshop. If the original paper is not publicly available, it should also be submitted in a separate .pdf file but it does not have to follow the ACL 2019 template.

All papers should be submitted via the following START space.

The MWE-WN Workshop follows the ACL 2019 multiple submission policy.

Please choose the appropriate track (research/dissemination) and for research papers the submission modality (long/short).

Important dates

We follow the ACL 2019 workshop schedule:

All deadlines are at 23:59 UTC-12 (anywhere in the world).

May 1, 2019	Paper Submission due (Deadline extended)
May 24, 2019	Notification of Acceptance
June 3, 2019	Camera-ready papers due
August 2, 2019	MWE-WN 2019 Workshop

Workshop Organizers and Program Committee Chairs

Verginica Barbu Mititelu, Romanian Academy Research Institute for Artificial Intelligence (Romania)
Francis Bond, Nanyang Technological University (Singapore)
Jelena Mitrović, University of Passau (Germany)
Carla Parra Escartín, Unbabel, Lisbon (Portugal)
Agata Savary, University of Tours (France)

Contact

For any inquiries regarding the workshop please send an email to mwewn2019@gmail.com

Anti-harassment policy

The workshop supports the ACL anti-harassment policy.

SIGLEX-MWE (archive) Workshops: MWE-WN 2019 (ACL)