Papers
All presentation videos that I made can also be found on Youtube and BiliBili.
2024
- Miriam Winkler, Virginija Juozapaityte, Rob van der Goot and Barbara Plank. Slot and Intent Detection Resources for Bavarian and Lithuanian: Assessing Translations vs Natural Queries to Digital Assistants. in LREC-COLING [paper | poster | bib]
- Elisa Bassignana, Viggo Unmack Gascou, Frida Nøhr Laustsen, Gustav Kristensen, Marie Haahr Petersen, Rob van der Goot and Barbara Plank. How to Encode Domain Information in Relation Classification. in LREC-COLING [paper | video | poster | bib]
- Maria Barrett, Max Müller-Eberstein, Elisa Bassignana, Amalie Brogaard Pauli, Mike Zhang and Rob van der Goot. Can Humans Identify Domains? in LREC-COLING [paper | video | poster | bib]
- Rob van der Goot, Zoey Liu and Max Müller-Eberstein. Enough Is Enough! a Case Study on the Effect of Data Size for Evaluation Using Universal Dependencies. in LREC-COLING [paper | poster | video\ | bib]
- Rob van der Goot. Where are we Still Split on Tokenization? in EACL findings [paper | code | poster | video | bib]
- Axel Sorensen, Siyao Peng, Barbara Plank, and Rob Van Der Goot. EEVEE: An Easy Annotation Tool for Natural Language Processing. in LAW workshop [paper | demo | code | video | poster | bib]
- Mike Zhang, Rob van der Goot, Min-Yen Kan, and Barbara Plank. NNOSE: Nearest Neighbor Occupational Skill Extraction. in EACL [paper | video | code | bib]
- Mike Zhang, Rob van der Goot, and Barbara Plank. Entity Linking in the Job Market Domain. in EACL findings [paper | video | code | bib]
- Elena Senger, Mike Zhang, Rob van der Goot, and Barbara Plank. Deep Learning-based Computational Job Market Analysis: A Survey on Skill Extraction and Classification from Job Postings. In NLP4HR [paper | video | bib]
- Charlie Campanella and Rob van der Goot. Big City Bias: Evaluating the Impact of Metropolitan Size on Computational Job Market Abilities of Language Models. In NLP4HR [paper | code | bib]
2023
- Max Müller-Eberstein, Rob van der Goot, Barbara Plank, and Ivan Titov. Subspace Chronicles: How Linguistic Information Emerges, Shifts and Interacts during Language Model Training. In EMNLP Findings. [paper | video | code\ | bib]
- Robert Litschko, Max Müller-Eberstein, Rob van der Goot, Leon Weber, Barbara Plank. Establishing Trustworthiness: Rethinking Tasks and Model Evaluation. In EMNLP. [paper | video | bib]
- Elisa Bassignana, Filip Ginter, Sampo Pyysalo, Rob van der Goot, and Barbara Plank. Silver Syntax Pre-training for Cross-Domain Relation Extraction. In ACL Findings. [paper | video | poster | code | bib]
- Mike Zhang, Rob van der Goot, and Barbara Plank. ESCOXLM-R: Multilingual Taxonomy-driven Pre-training for the Job Market Domain. In ACL. [paper | video | code | bib]
- Lina Skerath, Paulina Toborek, Anita Zielińska, Maria Barrett, and Rob van der Goot. Native Language Prediction from Gaze: a Reproducibility Study In ACL SRW. [paper | poster | code | bib]
- Rob van der Goot. MaChAmp at SemEval-2023 Tasks 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, and 12: On the Effectiveness of Intermediate Training on an Uncurated Collection of Datasets. In SemEval
[paper | video | poster | code | tex | bib] - Noëmi Aepli, Çagrı Çöltekin, Rob van der Goot, Tommi Jauhiainen, Mourhaf Kazzaz, Nikola Ljubešic, Kai North, Barbara Plank, Yves Scherrer, and Marcos Zampieri. Findings of the VarDial Evaluation Campaign 2023 In VarDial
[paper | video | bib | website] - Kia Kirstein Hansen and Rob van der Goot. Cross-Domain Evaluation of POS Taggers: From Wall Street Journal to Fandom Wiki.
[paper | data ] - Kia Kirstein Hansen, Maria Barrett, Max Müller-Eberstein, Cathrine Damgaard, Trine Naja Eriksen, and Rob van der Goot. DanTok: Domain Beats Language for Danish Social Media POS Tagging. In NoDaLiDa
[paper | poster | code | bib] - Elisa Bassignana, Filip Ginter, Sampo Pyysalo, Rob van der Goot, and Barbara Plank. Multi-CrossRE A Multi-lingual Multi-Domain Dataset for Relation Extraction. In NoDaLiDa
[paper | poster | data | bib ]
2022
- Max Müller-Eberstein, Rob van der Goot, and Barbara Plank. Spectral Probing. in EMNLP
[paper| video | poster | code | bib] - Dennis Ulmer, Elisa Bassignana, Max Müller-Eberstein, Daniel Varab, Mike Zhang, Rob van der Goot, Christian Hardmeier, and Barbara Plank. Experimental Standards for Deep Learning in Natural Language Processing Research. In Findings of EMNLP
[paper | video | code | bib] - Tanja Samardžić, Ximena Gutierrez-Vasques, Rob van der Goot, Max Müller-Eberstein, and Olga Pelloni. On Language Spaces, Scales and Cross-Lingual Transfer of UD Parsers. In CoNLL
[paper | video | code | bib] - Marcus Vielsted, Nikolaj Wallenius and Rob van der Goot. Increasing Robustness for Cross-domain Dialogue Act Classification on Social Media Data. In WNUT.
[paper | video | slides | poster | code & data | tex | bib] 🏆Best paper award🏆 - Sajawel Ahmed, Rob van der Goot, Misbahur Rehman, Carl Kruse, Ömer Özsoy, Alexander Mehler, and Gemma Roig. Tafsir Dataset: A Novel Multi-Task Benchmark for Named Entity Recognition and Topic Modeling in Classical Arabic Literature. In COLING.
[paper | bib ] - Rob van der Goot MaChAmp at SemEval-2022 tasks 2, 3, 4, 6, 10, 11, and 12: Multi-task Multi-lingual Learning for a Pre-selected Set of Semantic Datasets. In SemEval.
[paper | video | poster | slides | code | bib | tex] - Max Müller-Eberstein, Rob van der Goot and Barbara Plank Sort by Structure: Language Model Ranking as Dependency Probing. In NAACL.
[paper | poster | video | bib | code] - Rob van der Goot, Max Müller-Eberstein and Barbara Plank Frustratingly Easy Performance Improvements for Low-resource Setups: A Tale on BERT and Segment Embeddings. In LREC.
[paper | poster | video | code | bib | tex] - Max Müller-Eberstein, Rob van der Goot and Barbara Plank Probing for Labeled Dependency Trees. In ACL.
[paper | video | bib | code]
2021
- Rob van der Goot and Miryam de Lhoneux. 2021. Parsing with Pretrained Language Models, Multiple Datasets, and Dataset Embeddings. In Treebanks and Linguistic Theories.
[paper | poster | code | bib | tex] - Max Müller-Eberstein, Rob van der Goot and Barbara Plank How Universal is Genre in Universal Dependencies? In Treebanks and Linguistic Theories.
[paper | code | bib ] - Rob van der Goot. 2021. CL-MoNoise: Cross-lingual Lexical Normalization. In WNUT.
[paper | addendum | poster | code | bib | tex] - Rob van der Goot, Barbara Plank, Alan Ramponi, Tommaso Caselli, Nikola Ljubešić, Timothy Baldwin, Özlem Çetinoglu, Benjamin Muller, Talha Çolakoğlu, Arkaitz Zubiaga, Iñaki San Vicente Roncal, Wladimir Sidorenko and Rahmad Mahendra. 2021. MultiLexNorm: A Shared Task on Multilingual Lexical Normalization. In WNUT.
[paper | slides | data | bib | tex] - Rob van der Goot. 2021. We Need To Talk About train-dev-test Splits. In EMNLP.
[paper | video | addendum | slides | poster | code | bib | tex] - Maximilian Müller-Eberstein, Rob van der Goot and Barbara Plank. 2021. Genre as Weak Supervision for Cross-lingual Dependency Parsing. In EMNLP.
paper | video | slides | poster | code | bib | tex] - Dana-Maria Iliescu, Rasmus Grand, Sara Qirko and Rob van der Goot. 2021. Much Gracias: Semi-supervised Code-switch Detection for Spanish-English: How far can we get? In CALCS.
[paper | slides | video | teaser | code | bib | tex] - Rob van der Goot, Ibrahim Sharaf, Aizhan Imankulova, Ahmet Üstün, Marija Stepanović, Alan Ramponi, Siti Oryza Khairunnisa, Mamoru Komachi and Barbara Plank. 2021. From Masked Language Modeling to Translation: Non-English Auxiliary Tasks Improve Zero-shot Spoken Language Understanding. In NAACL.
[paper | poster | slides | video | code | bib | tex] - Rob van der Goot, Ahmet Üstün and Barbara Plank. 2021. On the Effectiveness of Dataset Embeddings in Mono-lingual, Multi-lingual and Zero-shot Conditions. In Adapt-NLP.
[paper | poster | code | bib | tex] - Anouck Braggaar and Rob van der Goot. 2021. Challenges in Annotating and Parsing Spoken, Code-switched, Frisian-Dutch Data. In Adapt-NLP.
[paper | poster | code | bib | tex] - Rob van der Goot, Ahmet Üstün, Alan Ramponi, Ibrahim Sharaf and Barbara Plank. 2021. Massive Choice, Ample Tasks (MaChAmp):A Toolkit for Multi-task Learning in NLP. In EACL.
[paper | poster | slides | code | video | website | bib | tex] 🏆Outstanding paper award!🏆 - Rob van der Goot and Özlem Çetinoğlu. 2021. Lexical Normalization for Code-switched Data and its Effect on POS-tagging. In EACL.
[paper | poster | slides | video | code | data | bib | tex]
2020
- Rob van der Goot, Marija Stepanovic, Alan Ramponi, Ibrahim Sharaf, Ahmet Üstün, Aizhan Imankulova, Siti Oryza Khairunnisa, Mamoru Komachi and Barbara Plank. 2020. Cross-lingual Multi-task Transfer for Zero-shot Task-oriented Dialog. In RESOURCEFUL 2020.
[paper | slides | bib | tex] - Anouck Braggaar and Rob van der Goot. Creating a Universal Dependencies Treebank of Spoken Frisian-Dutch Code-switched Data. In RESOURCEFUL 2020.
[paper | slides | bib | tex] - Anders Giovanni Møller, Rob van der Goot and Barbara Plank. 2020. NLP North at WNUT-2020 Task 2: Pre-training versus Ensembling for Detection of Informative COVID-19 English Tweets. In WNUT.
[paper | code | poster | bib | tex] - Barbara Plank, Kristian Nørgaard Jensen and Rob van der Goot. 2020. DaN+: Danish Nested Named Entities and Lexical Normalization. In COLING.
[paper | code | poster | bib | tex] - Alan Ramponi, Rob van der Goot, Rosario Lombardo and Barbara Plank. 2020. Biomedical Event Extraction as Sequence Labeling. In EMNLP.
[paper | code | slides | video | bib | tex] - Rob van der Goot, Alan Ramponi, Tommaso Caselli, Michele Cafagna and Lorenzo De Mattei. 2020. Norm It! Lexical Normalization for Italian and Its Downstream Effects for Dependency Parsing. In LREC.
[paper | code | bib | tex] - Kelly Dekker and Rob van der Goot. 2020. Synthetic Data for English Lexical Normalization: How Close Can We Get to Manually Annotated Data? In LREC.
[paper | code | bib | tex] - Malvina Nissim, Rik van Noord and Rob van der Goot. 2020. Fair is Better than Sensational:Man is to Doctor as Woman is to Doctor. Computational Linguistics Journal.
[paper | poster | code | bib | tex]
2019
- Rob van der Goot. 2019. An In-depth Analysis of the Effect of Lexical Normalization on the Dependency Parsing of Social Media. In WNUT.
[paper | poster | code | bib | tex] - Rob van der Goot. 2019. MoNoise: A Multi-lingual and Easy-to-use Lexical Normalization Tool. ACL 2019.
[paper | poster | code | bib | tex] - Ahmet Üstün, Rob van der Goot, Gosse Bouma and Gertjan van Noord. 2019. Multi-Team: A Multi-attention, Multi-decoder Approach to Morphological Analysis. In SIGMORPHON.
[paper | poster | code | bib | tex] - Aria Nourbakhsh, Frida Vermeer, Gijs Wiltvank and Rob van der Goot. 2019. sthruggle at SemEval-2019 Task 5: An Ensemble Approach to HateSpeech Detection. In SemEval.
[paper | bib | tex] - Rob van der Goot. 2019. Normalization and Parsing Algorithms for Uncertain Input. PhD Thesis.
[thesis | slides | code | errata | bib | tex]
2018
- Rob van der Goot and Gertjan van Noord. 2018. Modeling Input Uncertainty in A Neural Network Dependency Parser. In EMNLP.
[paper | poster | code | appendix | bib | tex] - Rob van der Goot, Nikola Ljubešić, Ian Matroos, Malvina Nissim and Barbara Plank. 2018. Bleaching Text: Abstract Features for Cross-lingual Gender Prediction. In ACL.
[paper | slides | code | bib | tex] - Rob van der Goot, Rik van Noord and Gertjan van Noord. 2018. A Taxonomy for In-depth Evaluation of Normalization for User Generated Content. In LREC.
[paper | poster | slides | data | bib | tex]
2017
- Malvina Nissim, Lasha Abzianidze, Kilian Evang, Rob van der Goot, Hessel Haagsma, Barbara Plank and Martijn Wieling. 2017. Sharing is Caring: The Future of Shared Tasks. In Computational Linguistics journal.
[paper | data | bib | tex] - Rob van der Goot and Gertjan van Noord. 2017. MoNoise: Modeling Noise Using a Modular Normalization System. In CLIN Journal
[paper | slides | code | bib | tex] - E.Tjong Kim Sang, M. Bollmann, R. Boschker, F. Casacuberta, F. Dietz, S. Dipper, M. Domingo, R. van der Goot, M. van Koppen, N. Ljubešić, R. Östling, F. Petran, E. Pettersson, Y. Scherrer, M. Schraagen, L. Sevens, J. Tiedemann, T. Vanallemeersch and K. Zervanou. 2017. The CLIN27 Shared Task: Translating Historical Text to Contemporary Language for Improving Automatic Linguistic Annotation. In CLIN Journal
[paper | bib] - Rob van der Goot, Barbara Plank and Malvina Nissim. 2017. To Normalize, or Not to Normalize: The Impact of Normalization on Part-of-Speech Tagging. In WNUT.
[paper | slides | code | code (updated) | bib | tex] - Rob van der Goot and Gertjan van Noord. 2017. Parser Adaptation for Social Media by Integrating Normalization. In ACL.
[paper | poster | slides | code | bib | tex]
2016
- Joachim Daiber and Rob van der Goot. 2016. The Denoised Web Treebank: Evaluating Dependency Parsing under Noisy Input Conditions. In LREC.
[paper | poster | data | code | bib | tex] - Rob van der Goot. 2016. Normalizing Social Media Texts by Combining Word Embeddings and Edit Distances in a Random Forest Regressor. In NormSoMe.
[paper | slides | code | bib | tex]
2015
- Rob van der Goot and Gertjan van Noord. 2015. ROB: Using Semantic Meaning to Recognize Paraphrases. In SemEval.
[paper | poster | code | bib | tex]
2014
- Johannes Bjerva, Johan Bos, Rob van der Goot and Malvina Nissim. 2014. The Meaning Factory: Formal Semantics for Recognizing Textual Entailment and Determining Semantic Similarity. In SemEval.
[paper | code | Master Thesis | bib | tex ]