Javi GG

AI Researcher

Hi, I'm Javi GG

Incoming PhD Student at University of Zürich, starting Fall 2026.

I do research on natural language processing.

About Me

I'm Javi, an AI researcher working on machine translation, multilingual language models, and large-scale language model training. These days I'm especially interested in document-level machine translation, test-time inference strategies, and trustworthy multilingual AI. I enjoy connecting research ideas with practical, open systems that can be useful beyond a single benchmark.

Latest Posts

Experience

Education and professional experience.

Professional Experience

Instituto de Telecomunicações logo
Instituto de Telecomunicações

Research Engineer in NLP

Feb. 2026 - Jun. 2026

Working as a research engineer in the SMURF4EU project to develop a sovereign European suite of Multimodal Reasoning Foundation Models, leveraging 1.9M GPU-hours via EuroHPC Extreme Scale Access to build open-source, long-context systems across 24 EU languages.

NLP
Multimodal Reasoning
Foundation Models
EuroHPC
Barcelona Supercomputing Center logo
Barcelona Supercomputing Center

Research Engineer in NLP

Jan. 2024 - Jan. 2026

Research engineer in the Language Technologies Group, contributing to European and national initiatives including Eloquence, ALIA, ILENIA, and AINA. Developed multilingual LLM-based neural machine translation models such as SalamandraTA-2b-instruct, SalamandraTA-7b-instruct, and Plume; worked on low-resource machine translation; and trained LLMs in multi-node distributed environments using DeepSpeed, NeMo, and Megatron.

Machine Translation
LLMs
Low-Resource MT
DeepSpeed
NeMo
Megatron
MT@UPC (Machine Translation Group at UPC) logo
MT@UPC (Machine Translation Group at UPC)

Research Engineer in NLP

Oct. 2022 - Sep. 2023

Worked on the Spanish national ROB-IN project, whose goal was to create a robot for continuous personalized assistance with self-explanatory capabilities. Contributed to the Natural Language Processing component of the system.

NLP
Dialogue Systems
Robotics
RASA
Department of Applied Statistics at UPV logo
Department of Applied Statistics at UPV

Undergraduate Research Assistant

Oct. 2019 - Aug. 2022

Worked mainly as a data scientist, handling data collection, processing, analysis, and the application of machine learning techniques.

Data Science
Machine Learning
Statistics
Data Analysis

Education

Aalto University logo
Aalto University

Master's Exchange Student (Erasmus)

Sep. 2023 - Jan. 2024

Exchange student during my Master's program.

Erasmus
Exchange Student Award
Artificial Intelligence
Polytechnic University of Catalonia (UPC), University of Barcelona (UB), and University of Rovira i Virgili (URV) logoPolytechnic University of Catalonia (UPC), University of Barcelona (UB), and University of Rovira i Virgili (URV) logoPolytechnic University of Catalonia (UPC), University of Barcelona (UB), and University of Rovira i Virgili (URV) logo
Polytechnic University of Catalonia (UPC), University of Barcelona (UB), and University of Rovira i Virgili (URV)

Master in Artificial Intelligence

2022 - 2024

Master Thesis: Improving Multilingual Neural Machine Translation by projecting language representations.

Artificial Intelligence
Polytechnic University of Valencia (UPV) logo
Polytechnic University of Valencia (UPV)

Bachelor's degree in Data Science

2018 - 2022

Bachelor Thesis: Image Captioning using pre-trained GPT-2 models.

Data Science
GPA 9.3/10

Publications

Hearing to Translate: The Effectiveness of Speech Modality Integration into LLMs

Hearing to Translate: The Effectiveness of Speech Modality Integration into LLMs

Sara Papi*, Javier Garcia Gilabert*, Zachary Hopton*, Vilém Zouhar*, Carlos Escolano, Gerard I. Gállego, Jorge Iranzo-Sánchez, Ahrii Kim, Dominik Macháček, Patricia Schmidtova, Maike Züfle

TACL2026
Machine TranslationSpeechSpeech Translation
ACADATA: Parallel Dataset of Academic Data for Machine Translation

ACADATA: Parallel Dataset of Academic Data for Machine Translation

Iñaki Lacunza*, Javier Garcia Gilabert*, Francesca De Luca Fornaciari*, Javier Aula-Blasco, Aitor Gonzalez-Agirre, Maite Melero, Marta Villegas

LREC2026
Machine TranslationDatasetsAcademic Translation
MT-LENS: An all-in-one Toolkit for Better Machine Translation Evaluation

MT-LENS: An all-in-one Toolkit for Better Machine Translation Evaluation

Javier García Gilabert, Carlos Escolano, Audrey Mash, Xixian Liao, Maite Melero

NAACL Demo2025
Machine TranslationEvaluationToolkit
ReSeTOX: Re-learning attention weights for toxicity mitigation in machine translation

ReSeTOX: Re-learning attention weights for toxicity mitigation in machine translation

Javier García Gilabert, Carlos Escolano, Marta R. Costa-Jussà

EAMT2024Oral Presentation
Machine TranslationToxicity MitigationInference

Get In Touch

Feel free to reach out for collaborations or questions.