Collocations, multiword expressions and corpus-based discourse analysis
Cooccurrence patterns – such as collocations, multiword expressions, valency and distributional semantics – play a central role not only in corpus linguistics but also in the study of public discourse and political propaganda. We refine the underlying analytical techniques and develop new interactive methods for multi-modal corpus-based discourse analysis.
Research activities
- Exploring the “Fukushima Effect” (FAU Emerging Fields Initiative)
- Evaluation of distributional semantic models
Project funding
- Exploring the “Fukushima Effect”: Attitudes and opinions towards nuclear power and renewable energy and the emergence of a transnational algorithmic public sphere
(01/2017 – 12/2018)
Key publications
- Lapesa, Gabriella and Evert, Stefan (2014). A large scale evaluation of distributional semantic models: Parameters, interactions and model selection. Transactions of the Association for Computational Linguistics 2, 531–545.
☞ further information
- Evert, Stefan; Uhrig, Peter; Bartsch, Sabine; Proisl, Thomas (2017). E-VIEW-alation – a large-scale evaluation study of association measures for collocation identification. In Electronic lexicography in the 21st century. Proceedings of the eLex 2017 conference, pages 531–549, Leiden, The Netherlands.
☞ interactive online viewer
- Uhrig, Peter and Proisl, Thomas (2012). Less hay, more needles – using dependency-annotated corpora to provide lexicographers with more accurate lists of collocation candidates. Lexicographica 28(1), 141–180.
- Evert, Stefan (2008). Corpora and collocations. In A. Lüdeling and M. Kytö (eds.), Corpus Linguistics. An International Handbook, chapter 58, pages 1212–1248. Mouton de Gruyter, Berlin, New York.
Events
- CogALex-V Shared Task (2016) on the corpus-based identification of semantic relations
- Introductory course on Distributional Semantics (ESSLLI 2018, ESSLLI 2016, NAACL-HLT 2010, ESSLLI 2009)