Home
Projects
Publications
People
Join the Lab
Contact
Login
Low-resource Languages
The Zeno’s Paradox of ‘Low-Resource’ Languages
The disparity in the languages commonly studied in Natural Language Processing (NLP) is typically reflected by referring to languages …
Hellina Hailu Nigatu
,
Atnafu Lambebo Tonja
,
Benjamin Rosman
,
Thamar Solorio
,
Monojit Choudhury
PDF
Cite
Preparing the Vuk'uzenzele and ZA-gov-multilingual South African Multilingual Corpora
This paper introduces two multilingual government themed corpora in various South African languages. The corpora were collected by …
Richard Lastrucci
,
Isheanesu Dzingirai
,
Jenalea Rajab
,
Andani Madodonga
,
Matimba Shingange
,
Daniel Njini
,
Vukosi Marivate
PDF
Cite
Participatory Translations of Oshiwambo: Towards Culture Preservation with Language Technology
In this paper, we describe a participatory, collaborative, and cost-effective process for creating translations in Oshiwambo, the most …
Wilhelmina Onyothi Nekoto
,
Julia Kreutzer
,
Jenalea Rajab
,
Millicent Ochieng
,
Jade Abbott
PDF
Cite
Analysing the Effects of Transfer Learning on Low-Resourced Named Entity Recognition Performance
Transfer learning has led to large gains in performance for nearly all NLP tasks while making downstream models easier and faster to …
Michael Beukman
PDF
Cite
Effect of Tokenisation Strategies for Low-Resourced Southern African Languages
Research into machine translation for African languages is very limited and low- resourced in terms of datasets and model evaluations. …
Jenalea Rajab
PDF
Cite
Cite
×