Intro

Marcell Fekete

My name is Marcell Fekete, and I am a PhD fellow in Computational Linguistics at Aalborg University (Copenhagen), supervised by Prof. Johannes Bjerva. My research interests include multilinguality and the cognitive plausibility of language modelling.

Warning! Linguistics rant incoming!

Research

Google Scholar · ACL Anthology · ORCID

When Discourse Pressures Conflict: Information Structure in Vision-Language Model Outputs
arXiv (preprint), 2026

Marcell Fekete · Johannes Bjerva · Tamás Káldi

Vision-language models (VLMs) are increasingly evaluated for whether they identify the right visual content, but little is known about whether they express such content in a discourse-appropriate form. We address this research gap using information structure (IS),

Limited-Resource Adapters Are Regularizers, Not Linguists
Short Paper, ACL 2025 (Vienna, Austria)

Marcell Fekete · Nathaniel Romney Robinson · Ernests Lavrinovics · Djeride Jean-Baptiste · Raj Dabre · Johannes Bjerva · Heather Lent

Cross-lingual transfer from related high-resource languages is a well-established strategy to enhance low-resource language technologies. Prior work has shown that adapters show promise for, e.g., improving

Linguistically Grounded Analysis of Language Models using Shapley Values
Long Paper, NAACL 2025 (Albuquerque, New Mexico)

Marcell Fekete · Johannes Bjerva

Understanding how linguistic knowledge is encoded in language models is crucial for improving their generalisation capabilities. In this paper, we investigate the processing of morphosyntactic phenomena, by leveraging

Words

My CV


EXPERIENCE
PhD Fellow • Aalborg University (Department of Computer Science, Copenhagen) Sep 2022 – Present

PhD funded by the Carlsberg Foundation project Multilingual Modelling for Resource-Poor Languages, supervised by Prof. Johannes Bjerva and Heather Lent. Also supervising bachelor's and master's students in Software Design.

Junior Machine Learning Engineer • TAUS Sep 2021 – Jun 2022

Research projects on Dimensionality Reduction of Multilingual Sentence Embeddings Using Autoencoders and Cross-lingual Transfer of Multilingual Language Models Using Stacked Language Adapters.

NLP Intern • Underlined Mar 2021 – Sep 2021

Research into automatic topic modelling using Latent Dirichlet Allocation.

EDUCATION
MA Human Language Technology • Vrije Universiteit Amsterdam 2020 – 2022

cum laude

BA Linguistics • University of Cambridge 2015 – 2018

Upper Second-Class degree

GRANTS
DFF International Postdoc Grant (€287,000) Nov 2026 – Oct 2028

Personal grant awarded by the Independent Research Fund Denmark (Danmarks Frie Forskningsfond).

Otto Mønsteds Fonden Conference Participation (€1,000) Sep 2025

OM Fonden funding for conference attendance at ACL in Vienna, Austria.

Otto Mønsteds Fonden Conference Participation (€1,000) Jun 2025

OM Fonden funding for conference attendance at NAACL in Albuquerque, New Mexico.

UniDive COST Action Research Visit (€2,000) Feb – Apr 2025

UniDive-funded research visit to the HUN-REN Hungarian Research Centre for Linguistics.

Otto Mønsteds Fonden Research Stay (€1,800) Mar – Jun 2024

OM Fonden-funded research visit to the University of Edinburgh.

DISSEMINATION
Neural Networks (Workshop) Aug 2025

Introducing high school students to the basic principles of neural networks in Diósjenő, Hungary.

Analysing Language Model Knowledge using Linguistic Theory (Invited Talk) Feb 2025

Talk at the Department of Computer Science, University of Göttingen, invited by Prof. Dr. Lisa Beinborn.

Do Language Models Dream With Linguistics? (Presentation) Dec 2024

Presentation at the AAU NLP Symposium 2024.

Machine Intelligence (Lecture) Nov 2024

Guest lecture about large language modelling for P5 Software Design at Aalborg University.

Chatbots (Lecture) Nov 2024

Introducing high school students to shortcomings of chatbots at FalconNXT.

Chatbots and Large Language Models (Workshop) Aug 2024

Activity for high school students regarding chatbots and the inner workings of large language models in Diósjenő, Hungary.

Language Modelling (Lab) Nov 2023

Guest lecture and lab about large language modelling for P5 Software Design at Aalborg University.