Kola Tubosun, curator of Yorubaname.com, is heading up a Yoruba text-to-speech initiative. The immediate goal is to create a Siri-like application that will serve millions of Yoruba-speaking people in Nigeria and elsewhere, but ultimately the project will help ensure the language’s longevity. Besides, a Yoruba Siri (maybe we’d call her Simi instead) is bound to have a lot of personality.
From the interview, which appeared on okayafrica.com:
For those who don’t know much about speech synthesis, can you elaborate on it some more and tell us how it’ll be utilized for the Yoruba text-to-speech application?
Speech synthesis is the process of creating human speech using software and audio segments. It’s called text-to-speech because the end product needs written text to put into action. Like those Bibles that read the words to you, or those GPS systems that talk, or even those Word applications that can read back what you have typed, the system picks out already written text and converts it to synthetic audio. It is usually created by training the computer to string segments of audio together into comprehensible speech. Watch this video to see it in action.
What we’re trying to create for Yoruba is similar, and the uses of the application are many. For instance, most artificial intelligence software uses spoken language as a means of activation. Siri, on the iPhone, for instance, can be spoken to and “she” speaks back. That voice is a manufactured voice. But because it can take commands and respond to them, it is useful in many other ways. Blind people, for instance, will be able to operate their phones if they can just talk to them and tell them what they want. You can use it at ATMs to help people who don’t speak English, etc.
Why is it so important that we have this software in Yoruba in particular?
Well, Yoruba has over 30 million speakers. That is already a huge population that can benefit from this kind of innovation. Many of those 30 million do not speak English at all, which means that they are shut out of a number of things involving technology. If a market woman can use an ATM in her local language, I think that empowers her. If she can speak to her phone in Yoruba and it does what she wants, that’s a leap forward.
But more importantly, African languages have been left out of global conversations in technology for too long, and that has always bothered me. Siri exists in Danish, Finnish, and Norwegian, three languages which, combined and multiplied by two, still aren’t as widely spoken as Yoruba, yet there is Siri in those languages. Why? Because we don’t care?
So, I’m working on Yoruba because that’s the language I speak and on which I have competence as a linguist to create anything. My overarching aim, however, is to show that more can be done for any African language, and more should be done. One of the ways to keep a language from being endangered is not only to speak it to our children, but also to have them capable of adapting to changing times, in this case with technology.