AI for Good blog

Advancing education and speech recognition in Nigeria with AI and machine learning

- Education
- Inclusivity
16 December 2020
0 Comment

As part of our participation in the ITU Focus Group on Machine Learning for Future Networks including 5G (FG‑ML5G), in March 2019, our WINEST (Wireless Networks and Embedded Systems Technologies) Research Group launched a study on “use cases and solutions for migrating to IMT‑2020/5G networks in emerging markets”.

Our focus was to determine how machine learning could help emerging markets to leapfrog technology generations to take advantage of emerging and future networks while optimizing energy consumption, network coverage, and communication overheads.

Improving education in Africa

This study led us to propose the “AI‑Based Classroom” project, designed to improve education for young pupils in Africa. Students are driving the project, under the guidance of James Agajo, Associate Professor and Head of WINEST Research Group in the Department of Computer Engineering at the Federal University of Technology in Minna, Nigeria.

With AI‑based natural language processing (NLP), classroom conversations between pupils and teachers are processed at the network edge to extract keywords while maintaining speaker anonymity.

These keywords are transmitted to a trained classifier in the central server, which is able to recommend captivating media content, providing students with intuitive examples, supporting a teacher’s explanations. The media content is then shared on a digital display in the classroom.

The system is designed to augment the efforts of elementary school teachers rather than attempting to replace them.

An efficient speech recognition library is a critical prerequisite for the development of an AI‑based classroom. This proved very difficult to find.

Automatic speech recognition for Africa

We were in search of a speech recognition library which was able to function locally, meet users’ privacy concerns, and was freely available. In view of the extraordinary number of languages spoken across Nigeria, and Africa at large, we also needed a library able to perform well in processing the English language accented in many different ways.

We evaluated many software libraries, but none of them succeeded in meeting all of these requirements. This led to the launch of a new WINEST Research Group project in February 2020 to develop a new speech recognition framework able to meet the unique requirements of the AI‑Based Classroom project. The project evolved from the discussions sparked by our presentation of the AI‑Based Classroom.

The project evolved from the discussions sparked by our presentation of the AI‑Based Classroom project at the 7th Regional Workshop on “Standardization of future networks towards building a better-connected Africa” in Abuja, Nigeria, 3–4 February 2020, convened by ITU’s standardization expert group for future networks and cloud computing, ITU Telecommunication Standardization Sector (ITU–T) Study Group 13.

The expert feedback provided by the Abuja workshop motivated our launch of a pilot project in Nigeria to develop an African automatic speech recognition (ASR) system.

We are collecting speech data and developing the ASR engine to deliver a prototype able to guide the development of a system ready for market deployment. We have developed the “Wazobia” mobile application, to support the necessary data collection, where Nigerian “voice donors” read displayed text aloud and donate the recording — anonymously.

“Wazobia” is an amalgam of three words meaning “come” in Yoruba (wa), Hausa (zo) and Igbo (bia), Nigeria’s three largest linguistics groups.

The speech data is stored on our server as “unvalidated” by default, pending the crowd-sourced validation of this data by volunteers via the Wazobia mobile app. This validation results in Boolean evaluations of the accuracy of the ASR engine’s transcriptions of recorded speech.

Architecture of the African automatic speech recognition project To date, the project has collected over three hours of speech corpus from over 170 voice donors.

The African ASR system development phase includes data pre-processing, training and software design. The project uses the Wav2letter++ ASR toolkit and looks to Facebook’s AI research article as a reference implementation.

We are progressing with the segmentation and pre-processing of the collected data for supervised and semi-supervised machine learning settings, but thus far the African ASR project only accepts English as an input language.

We aim to introduce African languages as inputs as the ASR project develops, and we plan to stimulate this key avenue of innovation by submitting our speech corpus of African languages to future ITU challenges on AI and Machine Learning in 5G and beyond.

AI and machine learning to help Africa manage pandemics

In future ITU AI/machine learning in 5G challenges we also plan to propose a new Bluetooth®-enabled contact-tracing application supported by machine learning.
This pandemic tracing application (PTA) project aims to build exposure-risk prediction models trained from anonymized user data, as explained in the table below:

Data collection for pandemic tracing application contact detection

Straight line distance between user equipment

Bluetooth signal strength

User equipment model

Operating system version

Indoor/outdoor (based on ambient light)

Radio-frequency interference (wireless local area network)

The proposed PTA deployment scenario would incorporate data collected from users with Bluetooth® discoverable, not requiring all users to install the application. However, a more detailed picture of the environment can be achieved by incorporating data from mobile devices’ gyroscopes and accelerometers when two Bluetooth®-connected devices both have the PTA installed.

Architecture of the Pandemic Tracing Application

The development of the proposed PTA will follow these guiding principles:

1. The contact tracing will be generic with configurable parameters to accommodate future pandemics.
2. It will reuse relevant features of existing frameworks but customize these features for application in Africa.
3. It will include privacy-preserving mechanisms by design.
4. The application will determine the appropriate scope of data sharing beyond a user’s device-based user-indicated preferences with respect to privacy.

The training data will be specific to pandemics, as recommended by the World Health Organization (WHO) and other health authorities.

The resulting exposure-risk prediction models will also be specific to pandemics (trained and deployed from a central server).

We are now focused on collecting the required data and we plan to submit this data to future ITU challenges on AI and Machine Learning in 5G and beyond.

Image credit: Federal University of Technology, Minna, Nigeria

Country or Area	ISO-alpha2 Code	ISO-alpha3 Code	Developed / Developing regions
Algeria	DZ	DZA	Developing
Egypt	EG	EGY	Developing
Libya	LY	LBY	Developing
Morocco	MA	MAR	Developing
Sudan	SD	SDN	Developing
Tunisia	TN	TUN	Developing
Western Sahara	EH	ESH	Developing
British Indian Ocean Territory	IO	IOT	Developing
Burundi	BI	BDI	Developing
Comoros	KM	COM	Developing
Djibouti	DJ	DJI	Developing
Eritrea	ER	ERI	Developing
Ethiopia	ET	ETH	Developing
French Southern Territories	TF	ATF	Developing
Kenya	KE	KEN	Developing
Madagascar	MG	MDG	Developing
Malawi	MW	MWI	Developing
Mauritius	MU	MUS	Developing
Mayotte	YT	MYT	Developing
Mozambique	MZ	MOZ	Developing
Réunion	RE	REU	Developing
Rwanda	RW	RWA	Developing
Seychelles	SC	SYC	Developing
Somalia	SO	SOM	Developing
South Sudan	SS	SSD	Developing
Uganda	UG	UGA	Developing
United Republic of Tanzania	TZ	TZA	Developing
Zambia	ZM	ZMB	Developing
Zimbabwe	ZW	ZWE	Developing
Angola	AO	AGO	Developing
Cameroon	CM	CMR	Developing
Central African Republic	CF	CAF	Developing
Chad	TD	TCD	Developing
Congo	CG	COG	Developing
Democratic Republic of the Congo	CD	COD	Developing
Equatorial Guinea	GQ	GNQ	Developing
Gabon	GA	GAB	Developing
Sao Tome and Principe	ST	STP	Developing
Botswana	BW	BWA	Developing
Eswatini	SZ	SWZ	Developing
Lesotho	LS	LSO	Developing
Namibia	NA	NAM	Developing
South Africa	ZA	ZAF	Developing
Benin	BJ	BEN	Developing
Burkina Faso	BF	BFA	Developing
Cabo Verde	CV	CPV	Developing
Côte d’Ivoire	CI	CIV	Developing
Gambia	GM	GMB	Developing
Ghana	GH	GHA	Developing
Guinea	GN	GIN	Developing
Guinea-Bissau	GW	GNB	Developing
Liberia	LR	LBR	Developing
Mali	ML	MLI	Developing
Mauritania	MR	MRT	Developing
Niger	NE	NER	Developing
Nigeria	NG	NGA	Developing
Saint Helena	SH	SHN	Developing
Senegal	SN	SEN	Developing
Sierra Leone	SL	SLE	Developing
Togo	TG	TGO	Developing
Anguilla	AI	AIA	Developing
Antigua and Barbuda	AG	ATG	Developing
Aruba	AW	ABW	Developing
Bahamas	BS	BHS	Developing
Barbados	BB	BRB	Developing
Bonaire, Sint Eustatius and Saba	BQ	BES	Developing
British Virgin Islands	VG	VGB	Developing
Cayman Islands	KY	CYM	Developing
Cuba	CU	CUB	Developing
Curaçao	CW	CUW	Developing
Dominica	DM	DMA	Developing
Dominican Republic	DO	DOM	Developing
Grenada	GD	GRD	Developing
Guadeloupe	GP	GLP	Developing
Haiti	HT	HTI	Developing
Jamaica	JM	JAM	Developing
Martinique	MQ	MTQ	Developing
Montserrat	MS	MSR	Developing
Puerto Rico	PR	PRI	Developing
Saint Barthélemy	BL	BLM	Developing
Saint Kitts and Nevis	KN	KNA	Developing
Saint Lucia	LC	LCA	Developing
Saint Martin (French Part)	MF	MAF	Developing
Saint Vincent and the Grenadines	VC	VCT	Developing
Sint Maarten (Dutch part)	SX	SXM	Developing
Trinidad and Tobago	TT	TTO	Developing
Turks and Caicos Islands	TC	TCA	Developing
United States Virgin Islands	VI	VIR	Developing
Belize	BZ	BLZ	Developing
Costa Rica	CR	CRI	Developing
El Salvador	SV	SLV	Developing
Guatemala	GT	GTM	Developing
Honduras	HN	HND	Developing
Mexico	MX	MEX	Developing
Nicaragua	NI	NIC	Developing
Panama	PA	PAN	Developing
Argentina	AR	ARG	Developing
Bolivia (Plurinational State of)	BO	BOL	Developing
Bouvet Island	BV	BVT	Developing
Brazil	BR	BRA	Developing
Chile	CL	CHL	Developing
Colombia	CO	COL	Developing
Ecuador	EC	ECU	Developing
Falkland Islands (Malvinas)	FK	FLK	Developing
French Guiana	GF	GUF	Developing
Guyana	GY	GUY	Developing
Paraguay	PY	PRY	Developing
Peru	PE	PER	Developing
South Georgia and the South Sandwich Islands	GS	SGS	Developing
Suriname	SR	SUR	Developing
Uruguay	UY	URY	Developing
Venezuela (Bolivarian Republic of)	VE	VEN	Developing
Kazakhstan	KZ	KAZ	Developing
Kyrgyzstan	KG	KGZ	Developing
Tajikistan	TJ	TJK	Developing
Turkmenistan	TM	TKM	Developing
Uzbekistan	UZ	UZB	Developing
China	CN	CHN	Developing
China, Hong Kong Special Administrative Region	HK	HKG	Developing
China, Macao Special Administrative Region	MO	MAC	Developing
Democratic People’s Republic of Korea	KP	PRK	Developing
Mongolia	MN	MNG	Developing
Brunei Darussalam	BN	BRN	Developing
Cambodia	KH	KHM	Developing
Indonesia	ID	IDN	Developing
Lao People’s Democratic Republic	LA	LAO	Developing
Malaysia	MY	MYS	Developing
Myanmar	MM	MMR	Developing
Philippines	PH	PHL	Developing
Singapore	SG	SGP	Developing
Thailand	TH	THA	Developing
Timor-Leste	TL	TLS	Developing
Viet Nam	VN	VNM	Developing
Afghanistan	AF	AFG	Developing
Bangladesh	BD	BGD	Developing
Bhutan	BT	BTN	Developing
India	IN	IND	Developing
Iran (Islamic Republic of)	IR	IRN	Developing
Maldives	MV	MDV	Developing
Nepal	NP	NPL	Developing
Pakistan	PK	PAK	Developing
Sri Lanka	LK	LKA	Developing
Armenia	AM	ARM	Developing
Azerbaijan	AZ	AZE	Developing
Bahrain	BH	BHR	Developing
Georgia	GE	GEO	Developing
Iraq	IQ	IRQ	Developing
Jordan	JO	JOR	Developing
Kuwait	KW	KWT	Developing
Lebanon	LB	LBN	Developing
Oman	OM	OMN	Developing
Qatar	QA	QAT	Developing
Saudi Arabia	SA	SAU	Developing
State of Palestine	PS	PSE	Developing
Syrian Arab Republic	SY	SYR	Developing
Turkey	TR	TUR	Developing
United Arab Emirates	AE	ARE	Developing
Yemen	YE	YEM	Developing
Fiji	FJ	FJI	Developing
New Caledonia	NC	NCL	Developing
Papua New Guinea	PG	PNG	Developing
Solomon Islands	SB	SLB	Developing
Vanuatu	VU	VUT	Developing
Guam	GU	GUM	Developing
Kiribati	KI	KIR	Developing
Marshall Islands	MH	MHL	Developing
Micronesia (Federated States of)	FM	FSM	Developing
Nauru	NR	NRU	Developing
Northern Mariana Islands	MP	MNP	Developing
Palau	PW	PLW	Developing
United States Minor Outlying Islands	UM	UMI	Developing
American Samoa	AS	ASM	Developing
Cook Islands	CK	COK	Developing
French Polynesia	PF	PYF	Developing
Niue	NU	NIU	Developing
Pitcairn	PN	PCN	Developing
Samoa	WS	WSM	Developing
Tokelau	TK	TKL	Developing
Tonga	TO	TON	Developing
Tuvalu	TV	TUV	Developing
Wallis and Futuna Islands	WF	WLF	Developing

AI for Good blog

Advancing education and speech recognition in Nigeria with AI and machine learning

Improving education in Africa

Automatic speech recognition for Africa

Join the #AIforGood community

Please fill in the form below

Please fill in the form below

Please fill in the form below

The registration to attend online will open shortly

The registration to attend in person will open shortly

AI for Good blog

Advancing education and speech recognition in Nigeria with AI and machine learning

Improving education in Africa

Automatic speech recognition for Africa

What impact is AI having on higher education?

AI technologies pioneering societal and artistic advances

The future of socially-assistive robots for good

Join the #AIforGood community