Skip to main content

USC Researchers Use AI to Help Translate Bible Into Very Rare Languages

Image:
The Bible in different languages.
Image courtesy of Creative Commons

The Banner has a subscription to republish articles from Religion News Service. This story by Fiona André was published on religionnews.com July 5, 2023


Out of 7,100 documented languages, the Bible has been translated into more than 700, making it the most-translated book in the world. Yet those remaining languages—many of them extremely rare—have vexed Bible translators for decades. Two scientists are looking to new advancements in artificial intelligence to help close the gap.

“We want to reach all the languages on earth; the goal is to reach everyone,” said Joel Mathew, a research engineer who alongside Ulf Hermjakob recently launched the Greek Room, an AI-powered technology to help streamline the highly technical process of biblical translation.

Combining Hermjakob’s long experience with natural language processing technologies and Mathew’s field knowledge of Bible translation, the two University of Southern California Information Sciences Institute researchers developed the technology with an aim to target “very low-resource languages that are not even in the top 500,” said Mathew.

The Greek Room includes three main tools: spell-checking, world alignment that ensures consistency in translation, and detection of improper characters in a script. 

Matthew and Hermjakob met in 2015 when Mathew joined USC to complete a master’s degree in computer science. There, he met Hermjakob in the AI division of the Information Sciences Institute. They bonded over a shared passion for languages and their Christian faith.

Mathew, the son of two Bible translators, has observed firsthand the difficulties that come with manual translation by local church members. In his hometown, New Delhi, he took notes of all the tasks that technology could accomplish.

Spell-checking usually requires many people and much time, he explained. In the context of translation into rare languages, only local church members are qualified, and they don’t have technology to back their work.

“These are not trivial problems; these are very hard problems. But big companies are not interested in solving them; it’s not their business model to target very rare languages,” he said.

When Mathew shared with him some of the problems Indian translators faced on the ground, Hermjakob jumped at the opportunity to help.

“I always had this feeling to know how, at some point, I could apply my skills to my faith,” said Hermjakob, who earned a Ph.D. in computer science from the University of Texas.

With their project, Mathew and Hermjakob want to work on languages that do not even have a written system, grammar codes, dictionaries, or spell-checkers.

“We are thinking of languages like Uyghur or Oromo (Oromo is spoken in Ethiopia and Northern Kenya),” said Hermjakob.

Recently, they have been approached by an Indian consultant specifically interested in the spell-checking and world-alignment tool for Bible translation in Kolami, a language spoken in western India that counts 130,000 native speakers.

The Greek Room also hopes to change the traditional model of Bible translation. Historically, translations were done by Western missionaries, who could only work on two languages at most in their lifetime, explained Hermjakob. With the Greek Room, the two researchers encourage a local church-driven model.

“Local churches and local language communities are asking for translations of the Bible in their heart language,” explained Mathew, adding that in a multilingual context, the heart language is the one in which people express their deepest feelings and is usually their native language.

This first version of the Greek Room focuses on quality control so translators can prioritize other tasks that require more judgment, such as finding a way to translate a concept that doesn’t exist in a given language. In their next version, the two researchers want the tool to suggest better translations.

Now that their codes and data are available on GitHub, they hope other users will integrate their research into their tools and innovate further.

Their initiative, supported by the Wycliffe Bible Translators USA organization, is part of a broader program directed by Every Tribe, Every Nation that hopes to make Scripture available in every language by 2033.

 

c. 2023 Religion News Service

 

We Are Counting on You

The Banner is more than a magazine; it’s a ministry that impacts lives and connects us all. Your gift helps provide this important denominational gathering space for every person and family in the CRC.

Give Now

X