Olga Majewska

I am a Research Associate in the Language Technology Lab (LTL) at the University of Cambridge. I am currently working on the MultiConvAI project with Anna Korhonen and Ivan Vulić, focusing on extending conversational AI to diverse, under-resourced languages.

Contact

Email: om304 [at] cam.ac.uk

Publications

BioVerbNet: a large semantic-syntactic classification of verbs in biomedicine
Olga Majewska, Charlotte Collins, Simon Baker, Jari Björne, Susan Windisch Brown, Anna Korhonen and Martha Palmer. Journal of Biomedical Semantics 12, 12. 2021.
[pdf][data]

Verb Knowledge Injection for Multilingual Event Processing
Olga Majewska, Ivan Vulić, Goran Glavaš, Edoardo M. Ponti, and Anna Korhonen. Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021). 2021.
[pdf]

Crossing the Conversational Chasm: A Primer on Multilingual Task-Oriented Dialogue Systems
Evgeniia Razumovskaia, Goran Glavaš, Olga Majewska, Anna Korhonen, and Ivan Vulić. 2021. arXiv preprint arXiv:2104.08570.
[pdf][github]

Semantic Dataset Construction from Human Clustering and Spatial Arrangement
Olga Majewska, Diana McCarthy, Jasper van den Bosch, Nikolaus Kriegeskorte, Ivan Vulić, and Anna Korhonen. Computational Linguistics 2021.
[pdf]

Manual Clustering and Spatial Arrangement of Verbs for Multilingual Evaluation and Typology Analysis
Olga Majewska, Ivan Vulić, Diana McCarthy, and Anna Korhonen. Proceedings of the 28th International Conference on Computational Linguistics (COLING) 2020.
[pdf][data]

Common Sense or World Knowledge? Investigating Adapter-Based Knowledge Injection into Pretrained Transformers
Anne Lauscher, Olga Majewska, Leonardo FR Ribeiro, Iryna Gurevych, Nikolai Rozanov, Goran Glavaš. Proceedings of Deep Learning Inside Out (DeeLIO): The First Workshop on Knowledge Extraction and Integration for Deep Learning Architectures (collocated with EMNLP) 2020.
[pdf]

XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning
Edoardo M. Ponti, Goran Glavaš, Olga Majewska, Qianchu Liu, Ivan Vulić, and Anna Korhonen. Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP) 2020.
[pdf][data]

Spatial Multi-Arrangement for Clustering and Multi-way Similarity Dataset Construction
Olga Majewska, Diana McCarthy, Jasper van den Bosch, Nikolaus Kriegeskorte, Ivan Vulić, and Anna Korhonen. Proceedings of the 12th International Conference on Language Resources and Evaluation (LREC) 2020.
[pdf][data]

Multi-SimLex: A Large-Scale Evaluation of Multilingual and Cross-Lingual Lexical Semantic Similarity
Ivan Vulić, Simon Baker, Edoardo Maria Ponti, Ulla Petti, Ira Leviant, Kelly Wing, Olga Majewska, Eden Bar, Matt Malone, Thierry Poibeau, Roi Reichart, Anna Korhonen. arXiv e-prints March 2020.
[pdf][data]

A Neural Classification Method for Supporting the Creation of BioVerbNet
Billy Chiu, Olga Majewska*, Sampo Pyysalo, Laura Wey, Ulla Stenius, Anna Korhonen, and Martha Palmer. Journal of Biomedical Semantics 2019.
*co-first author
[pdf]

Acquiring Verb Classes Through Bottom-Up Semantic Verb Clustering
Olga Majewska, Diana McCarthy, Ivan Vulić, and Anna Korhonen. Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC) 2018.
[pdf] [data]

Investigating the cross-lingual translatability of VerbNet-style classification
Olga Majewska, Ivan Vulić, Diana McCarthy, Yan Huang, Akira Murakami, Veronika Laippala, and Anna Korhonen. Language Resources and Evaluation 2017. DOI 10.1007/s10579-017-9403-x
[pdf]

Education

PhD in Computational Linguistics, Language Technology Lab (LTL), University of Cambridge (2016-2021)
MPhil in Theoretical and Applied Linguistics, University of Cambridge (2015-2016)
BA in Modern Languages (Italian) and Linguistics, University of Oxford (2011-2015)