Joint Workshop on Multiword Expressions and WordNet (MWE-WN 2019)
Workshop at ACL 2019 (Florence, Italy), August 2nd, 2019
Organized and sponsored by the Special Interest Group on the Lexicon (SIGLEX) of the Association for Computational Linguistics (ACL). Endorsed by the Global Wordnet Association (GWA). This joint event is the 15th edition of the Workshop on Multiword Expressions (MWE)
Last updated: August 21, 2019
- [August 19]: The MWE-WN proceedings are available on the ACL Anthology.
- [June 17]: The workshop program is online.
- [May 24]: The list of accepted papers has been published.
- [Now 29]: The workshop proposal has been accepted and will be co-located with ACL 2019 in Florence.
Description
As a joint event, this workshop proposal addresses two domains – multiword expressions and WordNet – with partly overlapping communities and research interests, but relatively divergent practices and terminologies.
Multiword expressions (MWEs) are word combinations, such as all of a sudden, a hot dog, to pay a visit or to pull one's leg, which exhibit lexical, syntactic, semantic, pragmatic and/or statistical idiosyncrasies. MWEs encompass closely related linguistic objects such as idioms, compounds, light verb constructions, rhetorical figures, institutionalised phrases or collocations. Modelling and computational aspects of MWEs have been covered by the Multiword Expression Workshop, organised over the past years by the MWE section of SIGLEX. Because of their unpredictable behavior, and most prominently their non-compositional semantics, MWEs pose special problems in linguistic modelling (e.g. treebank annotation and grammar engineering), in NLP pipelines (e.g. when their orchestration with parsing is concerned), and in end-use applications (e.g. information extraction or machine translation).
From its very beginning, WordNet has included MWEs, and linked their meanings into a shared network: talk, blab, sing, spill the beans, let the cat out of the bag, tattle, peach, babble, babble out, blab out “divulge confidential information or secrets”. Indeed, over 50% of entries in the Princeton WordNet of English are MWEs and most other projects have a similarly high percentage. However, MWEs are generally encoded as a string, with no internal information about syntactic structure or compositionality. Many suggestions for richer encodings have been made but not yet widely adopted, partly because of the cost of adding richer data to already large lexicons.
For the above reasons, the MWE and WN communities propose to put forward a joint event, which should allow better convergences and scientific innovation. We will call for papers focusing on research related (but not limited) to the following topics.
Joint topics on MWEs and Wordnets
- Encoding MWEs in wordnets - how can we take advantage of the existing rich structure of wordnets?
- Encoding MWEs in wordnets - consequences for a lexical-semantic organization of MWEs
- Linking wordnets with existing MWE lexicons
- Word sense disambiguation for single-word and multiword expressions
- Cross-wordnet and cross-language comparisons of MWEs
- MWEs in sense-annotated corpora
- Semantic relations in wordnets related to MWEs
MWE-specific topics
- Computationally-applicable theoretical studies on MWEs and constructions in psycholinguistics, corpus linguistics and formal grammars
- MWE and construction annotation in corpora and treebanks
- MWE and construction representation in manually/automatically constructed lexical resources
- Processing of MWEs and constructions in syntactic and semantic frameworks (e.g. CCG, CxG, HPSG, LFG, TAG, UD, etc.), and in end-user applications (e.g. information extraction, machine translation and summarization)
- Original discovery and identification methods for MWEs and constructions
- MWEs and constructions in language acquisition and in non-standard language (e.g. tweets, forums, spontaneous speech)
- Evaluation of annotation and processing techniques for MWEs and constructions
- Retrospective comparative analyses from the PARSEME shared tasks on automatic identification of MWEs
Note that, with the intention to also perpetuate previous converging effects with the Construction Grammar community (see the LAW-MWE-CxG 2018 workshop), we extend the traditional MWE scope to grammatical constructions.
Links to the PARSEME Shared Task on Automatic Verbal MWE Identification
The two previous editions of the MWE workshop (in 2017 and in 2018) featured the PARSEME Shared Task on Automatic Identification of Verbal Multiword Expressions. There is no edition planned for 2019, however we wish the MWE-WN 2019 workshop to play an important role in preparing a future edition (probably in 2020). Namely, we call for papers including retrospective and/or comparatives analyses of the results of previous shared task editions, as well as recommendations for corpus annotation enhancements.
Submission modalities
There are two tracks:
- Regular research track, where the submissions must be substantially original.
- Dissemination track, which welcomes recent previously published work (or work accepted for publication), dedicated explicitly both to MWEs and WordNet. This will be presented to encourage discussion, but only the abstract will appear in the proceedings.
The regular research track submissions should follow one of the 2 formats:
- Long papers (8 content pages + references): Long papers should report on solid and finished research including new experimental results, resources and/or techniques.
- Short papers (4 content pages + references): Short papers should report on small experiments, focused contributions, ongoing research, negative results and/or philosophical discussion.
The decisions as to oral or poster presentations of the selected papers will be taken by the PC chairs. No distinction between papers presented orally and as posters is made in the workshop proceedings. There is no limit on the number of reference pages. Authors will be granted an extra page for the final version of their papers. The submission is double-blind, as understood by the ACL 2019 submission policy. The reported research should be substantially original. Papers available as preprints can also be submitted provided that they fulfil the conditions defined by the ACL Policies for Submission, Review and Citation. For both types of submissions in this track, the ACL 2019 templates should be used.
The dissemination track submissions are not anonymous, and they should not exceed one page, including the authors' names and affiliations, the mention of the original venue, the link to the original paper and a short explanation why the paper is relevant to MWEs and Wordnets workshop. If the original paper is not publicly available, it should also be submitted in a separate .pdf file but it does not have to follow the ACL 2019 template.
All papers should be submitted via the following START space.
The MWE-WN Workshop follows the ACL 2019 multiple submission policy.
Please choose the appropriate track (research/dissemination) and for research papers the submission modality (long/short).
Important dates
We follow the ACL 2019 workshop schedule:
All deadlines are at 23:59 UTC-12 (anywhere in the world).
May 1, 2019 | Paper Submission due (Deadline extended) |
May 24, 2019 | Notification of Acceptance |
June 3, 2019 | Camera-ready papers due |
August 2, 2019 | MWE-WN 2019 Workshop |
Workshop Organizers and Program Committee Chairs
- Verginica Barbu Mititelu, Romanian Academy Research Institute for Artificial Intelligence (Romania)
- Francis Bond, Nanyang Technological University (Singapore)
- Jelena Mitrović, University of Passau (Germany)
- Carla Parra Escartín, Unbabel, Lisbon (Portugal)
- Agata Savary, University of Tours (France)
Contact
For any inquiries regarding the workshop please send an email to mwewn2019@gmail.com
Anti-harassment policy
The workshop supports the ACL anti-harassment policy.