
Master Theses
Research Master Human Language Technology
- Malihehassadat (Farnaz) Bani Fatemi, 2025. Grammaticality and LLMs: Evaluating the Potential of BabyLMs for Grammatical Error Detection in NLP (pdf)
- Ariana Britez, 2025. Exploring Ensemble Strategies for Misogynous and Sexist Meme Detection (pdf)
- Szabolcs Pál, 2025. Investigation of Scalable Audio-based Speaker Identification in the context of Communicative Robots (pdf)
- Yee Man Ng, 2024. A Comparative Study of Open-Source and Closed-Source Large Language Models for Native Language Identification (pdf)
- Sidi Wang, 2024. LLMs as annotators for machine translation quality estimation (pdf)
- Marije Brandsma, 2023. Decoding Populism: Analyzing Lexical Choice and Linguistic Simplicity in Tweets (pdf)
- Bas Diender, 2023. Random Seed Influence on Language Model Generalizabilit (pdf)
- Mekselina Doğanç, 2023. Automatic Generation of Personalized Counter Narratives Based on User Profile (pdf)
- Mojca Kloos, 2023. Mitigating Gender Bias with Deep Reinforcement Learning (pdf)
- Agnieszka Kluska, 2023. Adapting Microportrait Extraction for Queer Stereotype Identification in Polish Online News (pdf)
- Vasiliki Kyrmanidi, 2023. Exploring the Impact of Structured Dialogue Representation on Neural Dialogue Response Generation (pdf)
- Dorien Renting, 2023. Multi-task fine-tuning for hate speech detection (pdf)
- Rorick Terlou, 2023. Increasing Readability with Disfluency Removal in Automatic Dutch Transcriptions (pdf)
- Marcel Feteke, 2022. Cross-lingual Transfer Using Stacked Language Adapters (pdf)
- Eliza Hobo, 2022. Simply accessible: Contextualized Lexical Simplification for Accessibility of Dutch Texts (pdf)
- Sanne Hoeken, 2022. Using Language Models for Analyzing Semantic Variation between Dutch Social Communities (pdf)
- Adrielli Lopez Rego, 2022. Matching Ontologies in the Education Domain with Semantic Similarity (pdf)
- Yilmaz Polat, 2022. The Hallucinatory World of Automatic Text Generation (pdf)
- Alessandra Polimeno, 2022. Diversifying News Recommendation Systems by Detecting Fragmentation in News Story Chains (pdf)
- Charlotte Pouw, 2022. Cross-lingual Transfer of Correlations between Linguistic Complexity and Human Reading Behaviour (pdf)
- Vivian Claes, 2021. ECBERT: Applying BERT to European Central Bank Communication to Predict Market Response (pdf)
- Søren K. Fomsgaard, 2021. In the eye of the storm with style – Investigating style features in the language of QAnon on Twitter (pdf)
- Sophie Neutel, 2021. Towards automatic ontology alignment using BERT (pdf)
- Nathan van der Molen-Pater, 2021. Information Usage in Coreference Resolution (pdf)
- András Aponyi, 2020. Estimating Translation Quality Using Distributed Representations of Words and Sentences (pdf)
- Klaudia Bartosiak, 2020. Towards Formalizing Eligibility Criteria of Clinical Trials: Biomedical Entity Linking (pdf)
- Suzana Bašić, 2020. Color as a Discriminative Property for Establishing Object Identity in Human-Robot Communication (pdf)
- Lauren Green, 2020. Semi-supervised Classification of Occupations using Pseudo-Labelling and Information Extraction (pdf)
- Ngan Nguyen, 2020. Clickbait anatomy: Identifying clickbait with machine learning (pdf)
- Jonathan Schaller, 2020. Cross-domain evaluation of a question-answering classifier (pdf)
- Lisa Vasileva, 2020. Machine Translation Detection for Neural Machine Translation Scenario (pdf)
- Karen Goes, 2019. Exploring text mining techniques to structure a digitised catalogue (pdf)
- Liza King, 2018. Modals and Measles: Computational linguistic investigations into modal use in the vaccination debate (pdf)
- Benedetta Torsi, 2018. Detecting claims in a cross-register corpus (pdf)
- Pia Sommerauer, 2017. From old to new racism? Investigating known dangers in distributional semantic approaches to conceptual change (pdf)
- Chantal van Son, 2015. Towards a Dutch frame-semantic parser (pdf)
- Femke Klaver, 2014. Authorship attribution of forum posts (pdf)
Master Language and AI (formerly Text Mining)
- Shutao Chen, 2025. From 9 to 17 Categories: Weakly Supervised Sentence-Level ICF Classification in Dutch Rehabilitation Notes with GPT-4 Labeling and MedRoBERTa Fine-Tuning (pdf)
- Xin Chen, 2025. Generating Follow-up Questions in Health Conversations Using Fine-tuned Language Models (pdf)
- Elisabetta Dentico, 2025. Grammatical Error Detection in L2 English and Italian: How Multilingual LLMs Handle Ambiguity in Learner Errors (pdf)
- K.D. Gerritsen, 2025. Exploring Implicit Abusive Speech Detection: A Comprehensive Analysis of Fine-Tuning BERT and Prompting Qwen2.5 (pdf)
- Hannah Goossens, 2025. Multitask Learning of Semantic Role Labeling and Named Entity Recognition for domain-specific documents from the Dutch East-India Company archives (pdf)
- Ningxuan Guo, 2025. Knowledge Distillation for Machine Translation Quality Estimation (pdf)
- Tessel Haagen, 2025. Trend and Popularity Analysis for Art Related Texts from 1600-1800 (pdf)
- Victoria Im, 2025. What’s in a phrase? Identifying implicit hate with generative AI (pdf)
- Urtė Jakubauskaitė, 2025. From Nine to One: Combining ICF Functioning Level Classifiers in the A-PROOF Project (pdf)
- Areumbyeol Kim, 2025. Modeling Offensive Language as a Distinct Class for Hate Speech Detection (pdf)
- Wayne Kuan, 2025. Evaluating the Impact of Continuous Pre-Training on ASR Models for Word-Level English Pronunciation Intelligibility (pdf)
- Shenglin Li, 2025. Evaluating the Impact of Linguistic Features in Harmful Meme Detection: A Systematic Ablation Study (pdf)
- Nikhil Mathews, 2025. Evaluating Generalisation in Named Entity Recognition through Robustness Testing (pdf)
- Melina Paxinou, 2025. Staying Relevant: Metaphor Detection and Domain Relevance Classification in Immunotherapy Texts (pdf)
- Maja Syrek, 2025. Evaluating Historical Language Models for literary research (pdf)
- Ino van de Wouw, 2025. Learning with Less: Contrastive Weight Tying on the BabyLM Challenge (pdf)
- Sanne van den Berg, 2025. Assessing the Role of Gesture Information in NLG: a Case Study with LLMs and Multimodal AMR (pdf)
- Selin Açikel, 2024. Lost in Translation: Analyzing Machine Translation Quality Estimation with Synthetic Challenges (pdf)
- Murat Ertas, 2024. Improving Medical Text Classifiers with Balanced Datasets (pdf)
- Payam Fakhraei, 2024. Context-Aware Hate Speech Detection using BERT: An Investigation with the Contextual Abuse Dataset (pdf)
- Chuqiao Guo, 2024. Extracting Activity Information with LLMs Using GPT-Generated Data (pdf)
- Long Ma, 2024. Chinese Healthcare Named Entity Recognition (CHNER) Using BiLSTM-CRF Classifiers (pdf)
- Alyssa MacGregor-Hastie, 2024. Chats, Agents and Lyrics (pdf)
- Csenge Szabó, 2024. Multi-Label Topic Classification of Client Feedback in the Governance Domain (pdf)
- Irma Tuinenga, 2024. Words Made Easy: a Comparative Study of Methods for English Lexical Simplification (pdf)
- Yijing Zhang, 2024. Usage of Generative Models to Ask Follow-up Questions for Health Monitoring (pdf)
- Furong Zou, 2024. Exploring An Existing ASR Model for a Binary Classification of Intelligibility on MOOC English Speech Data (pdf)
- Ajda Efendi, 2023. Document Classification on EQF levels with Multilingual datasets in English (pdf)
- Swarupa Hardikar, 2023. Exploring Open-source Generative Models for Lexical Simplification through Prompt Learning (pdf)
- Natalia Khaidanova, 2023. Machine-Translation Evaluation: Comparing Traditional and Neural Machine-Translation Evaluation Metrics for English–Russian (pdf)
- Sofia Lee, 2023. Incident in Zagreb: self-supervised task adaptation performed: Impact of Task Adapting on Transformer Models for Targeted Sentiment Analysis in Croatian Headlines (pdf)
- Quincy Liem, 2023. On the limits of entity linking on domain-specific data (pdf)
- Noah-Manuel Michael, 2023. Automated Verb Order Error Detection for Learners of Dutch as a Second Language (pdf)
- Siti Nurhalimah, 2023. Enhancing Wordnet Bahasa through Multilingual Sense Intersection (pdf)
- Cecilia Schramm, 2023. Using Semi-supervised Learning to Automatically Annotate Dutch Medical Notes for Patients’ Functioning Levels (pdf)
- Hasan Shahoud, 2023. Discovering Hidden Cues using TF-IDF and their Relevance on Cultural Inter-dependency (pdf)
- Saloni Singh, 2023. Leveraging university curricula and course descriptions to augment a knowledge graph with degree-skill relationships (pdf)
- Adam Tucker, 2023. Master Thesis An investigation of complex word identification (CWI) systems for English (pdf)
- Konstantina Andronikou, 2022. Automatic Retrieval of Topics Using Topic Modeling Techniques from Customer Conversations in the Airline Domain (pdf)
- Sharona Badloe, 2022. MedRoBERTa.nl: Transfer Learning From COVID-19 to Cancer Patients (pdf)
- Myrthe Buckens, 2022. Comparing and Evaluating Language Models for Conversational Data from the Medical Domain. (pdf)
- Felix den Heijer, 2022. NER Classification for old and modern Dutch biographies: A comparative study of finetuned BERT models and out-of-the-box tools (pdf)
- Ellemijn Galjaard, 2022. Evaluating Transfer of a Functional Level Classifier from Secondary to Primary Healthcare Notes (pdf)
- Yan Chung Li, 2022. A Challenge Set for Natural Language Inference on but-inferred propositions (pdf)
- Giorgio Malinverni, 2022. Analysing the Influence of Morphological Characteristics on the Performance of Few-Shot Prompting for Natural Language Inference in Cross-Lingual Settings (pdf)
- Lahorka Nikolovski, 2022. Synthetic Data for Domain Adaptation in Neural Machine Translation (pdf)
- Sylvia Pronk, 2022. A detailed comparison between two coreference systems and their effect on key-sentence extraction (pdf)
- Mira Reisiger, 2022. Context-based entity linking of biomedical text (pdf)
- Lois Rink, 2022. Automatic Classification of Speech Acts in tax service letters (pdf)
- Shuyi Shen, 2022. Data to text generation with a joint entity and relation based method for a job advertisement (pdf)
- Anouk Twilt, 2022. Sustainability in action: exploring automatically extracting actions from news-articles (pdf)
- Michiel van Nederpelt, 2022. Evaluating a transformer-based language model under increasingly challenging conditions for the task of offensive language detection (pdf)
- Elena Weber, 2022. Automatic Topic Classification of Customer Feedback in the Banking Domain (pdf)
- Tessel Wisman, 2022. Domain adaptation of end-to-end ASR via n-gram language modelling (pdf)
- Jingyue Zhang, 2022. Mapping text to learning objectives: A keyword-based text classification method (pdf)
- Guido Ansem, 2021. The Effect of Auxiliary Data on Low Resource Languages in Aspect Extraction (pdf)
- Gabriele Catanese, 2021. A Transfer Learning approach to Aspect Based Sentiment Analysis for airline customer feedbacks (pdf)
- Michelle Chan, 2021. An Empirical Framework for Topic Modelling for Dutch Texts based on Newspaper Articles on Soil Pollution (pdf)
- Eva den Uijl, 2021. Detecting Discriminatory Language in Job Advertising Texts (pdf)
- Stan Frinking, 2021. Using Text Mining Techniques to Detect Fall Events in Medical Patient Notes (pdf)
- Sanne Hamersma, 2021. Explorative analysis of precursors of physical aggression in a health care institute: a Text Mining approach (pdf)
- Melisha Lemain-van der Nest, 2021. Named Entity Recognition: identifying NER Indicators in Dutch Police Reports (pdf)
- Breta Micha, 2021. Automatic Terminology Extraction in domain specific texts: a comparison between a rule-based system and a BERT-based system. (pdf)
- Aju Shreshta, 2021. BERTje-based Automatic Anonymisation of Dutch Police Reports (pdf)
- Dyon van der Ende, 2021. Text Mining for Sustainability: Detecting Corporate Greenwashing with the Sustainable Development Goals (pdf)
- Jasmine van Vugt, 2021. Two Dutch fine-tuned BERT models: Named Entity Recognition and Named Entity Linking to increase findability of local geographical information. (pdf)
- Peter Caine, 2020. Mind the gap: A comparison of linguistic vs deep-learning approaches to aspect extraction and aspect category detection (pdf)
- Luca Meima, 2020. Finding potentially HIV defining conditions in medical reports (pdf)
- Jan van Casteren, 2020. Automatic Attribution Extraction From Dutch News Articles: A Beginning (pdf)
- Eva Zegelaar, 2020. An Automatic Emotion & Purpose Classifier for Dutch Tweets Written by Members of the Dutch Parliament (pdf)