Lacuna Fund Announces its Second Round of Funding to Support a Community-led Movement Towards Locally Developed and Owned Language Datasets Across Africa

Languages/Datasets/Governance/Research

Lacuna Fund Announces its Second Round of Funding to Support a Community-led Movement Towards Locally Developed and Owned Language Datasets Across Africa
Apr 28 2021 Press Release
Supported projects will produce text and speech datasets for natural language processing (NLP) technologies that will have significant downstream impacts on education, financial inclusion, healthcare, agriculture, communication, and disaster response in Sub-Saharan Africa.

Lacuna Fund announces its second cohort of supported projects, whose teams will create openly accessible text and speech datasets that will fuel natural language processing (NLP) technologies in 29 languages across Africa. Funding recipients will produce training datasets in Eastern, Western, and Southern Africa that will support a range of needs for low resource languages, including machine translation, speech recognition, named entity recognition and part of speech tagging, sentiment analysis, and multi-modal datasets. All datasets produced will be locally developed and owned, and will be openly accessible to the international data community.

With over 50 impressive applications from, or in partnership with, organizations across Africa, there are many more initiatives poised for impact. This movement towards locally developed and owned datasets has only just begun, and with the right support and funding these initiatives will unlock the power of AI to deliver new social sector solutions and increase the presence of African countries on the international data map.

“In South Africa, the government uses chatbots to provide daily updates on COVID,” explains Vukosi Marivate, ABSA Chair of Data Science at University of Pretoria. “Right now, translating those updates to Latin languages is really easy, but the datasets necessary to translate those updates to a range of African languages don’t exist, which means that the government isn’t currently able to communicate with many of its people in their native languages. That is one of the many examples of why we need this work now.”

“The Rockefeller Foundation leverages science, innovation and technology to empower vulnerable families locked out of prosperity to improve their lives and prospects,” says William Asiko, Managing Director and Head of The Rockefeller Foundation’s Africa Regional Office. “We aim to enhance the delivery value to our key audiences through partnerships, meeting them where they are. In meeting the moment, embracing diversity and multiculturalism is precisely the kind of innovative, transformative change we seek to drive impact.”

Lacuna Fund began as a funder collaborative between The Rockefeller Foundation, Google.org, and Canada’s International Development Research Centre, with support from the German development agency GIZ on behalf of the Federal Ministry for Economic Cooperation and Development (BMZ) on this call for proposals. It has since evolved into a multi-stakeholder engagement composed of technical experts, thought leaders, local beneficiaries, and end users. Collectively, we are committed to creating and mobilizing training datasets that both solve urgent local problems and lead to a step change in machine learning’s potential worldwide.

See all the projects here.

About Lacuna Fund: Lacuna Fund is the world’s first collaborative effort to provide data scientists, researchers, and social entrepreneurs in low- and middle-income contexts globally with the resources they need to produce training datasets that address urgent problems in their communities. Lacuna Fund launched in July of 2020 with a pooled fund of $4 million to support the creation, expansion, and maintenance of datasets used for training or evaluation of machine learning models, initially in three key sectors: agriculture, health, and languages. Learn more at http://www.lacunafund.org