Beca para realizar el doctoradoInfoling 8.12 (2019)

Beca para realizar el doctorado:Beca de doctorado: Dialectología y Lingüística computacional
Institución:Universidad de Gante
DescripciónPhD scholarship in Spanish Dialectology and Computational Linguistics

In the context of a research infrastructure project, a scholarship is offered for a PhD student in the LT3 Language and Translation Technology Team at the Ghent University Department of Translation, Interpreting and Communication. The successful applicant will participate in a multidisciplinary research collaboration between UGent (ΔiaLing and LT3) and UHasselt (Expertise Centre for Digital Media). The PhD research topic is part of a Hercules (FWO-funded) project (see description below), and focuses on the extension of Natural Language Processing tools for application in the domain of Spanish dialectology.

The successful candidate is appointed for two years. Further funding will be sought for the remainder of the PhD project. The starting date is as soon as possible.

Description of the Hercules project

The study of dialectal microvariation of Spanish spoken in Spain has until recently mainly focused on lexical and phonetic features. The morphosyntax of these dialects, on the contrary, remains largely unexplored, despite the recent surge in interest in dialect grammars. This is due to the lack of large annotated dialectal corpora. This project aims to fill this lacuna and will create the first morphosyntactically annotated and parsed corpus of the European Spanish dialects. This dialect corpus will be designed in a geographically balanced way and its material will proceed from the COSER corpus (Corpus Oral y Sonoro del Español Rural [Audible Corpus of Spoken Rural Spanish] , which is the largest collection of oral data in the Spanish-speaking world. As transcribing and annotating are expensive and labour-intensive, this project takes a collaborative game-based approach to building the parsed corpus of European Spanish dialects. In other words, a crowdsourced game will be built through which members of the public contribute to the co-creation of the parsed corpus by providing annotations in the context of a game.


• Fluent /(near) native in Spanish and English
• Master’s degree in a relevant field: Hispanic Linguistics, Computational Linguistics or Computer Science
• If no background in computational linguistics or computer science, a strong interest in language and speech technology is necessary
• Interested in research and having the intention to obtain a PhD degree
• Strong interpersonal and communication skills
• Eager to acquire new competences and knowledge
• Preferably knowledge of programming languages (e.g. Python, Java)
• The candidate should be able to work independently as well as in a multidisciplinary team, and will be guided by advisors with a computer science/computational linguistics background (UGent-UHasselt) and with a background in dialectology / linguistics (UGent).

How to apply

The application in English should include:
• a motivation letter, summarizing the candidate’s background and capabilities (e.g. language skills and programming skills incl.), and describing his/her motivation for this position
• attested copies of education certificates
• a list of courses with the grades obtained
• an extensive CV
• contact information (e-mail) of potential referees

Applications are to be sent by e-mail to Prof. Dr Veronique Hoste ( and Prof. Dr Miriam Bouzouita ( Application deadline: August 20th, 2019.
Plazo de envío de solicitudes: hasta el 20 de agosto de 2019
Área temática:Humanidades digitales, Lingüística computacional, Variedades del español

Fecha de publicación en Infoling:20 de agosto de 2019
Miriam Bouzouita