* . * . . .
  • Contact Us
  • Privacy Policy
  • Terms of Use
  • Cookie Privacy Policy
  • DMCA
  • California Consumer Privacy Act (CCPA)
Friday, May 16, 2025
Love Europe
  • Politics
  • Business
  • Culture
  • Opinion
  • Lifestyle
  • Sports
  • Travel
No Result
View All Result
  • Politics
  • Business
  • Culture
  • Opinion
  • Lifestyle
  • Sports
  • Travel
No Result
View All Result
Love Europe
No Result
View All Result
Home News

Could AI save Europe’s rare and endangered languages from extinction?

June 30, 2024
in News
Could AI save Europe’s rare and endangered languages from extinction?
Share on FacebookShare on Twitter
ADVERTISEMENT

The project includes a dozen “low resource” European languages, like Scottish Gaelic, Galician, Irish, Lingurian, Bosnian, Icelandic and Welsh.

According to Meta, that’s a language that has less than one million sentences in data that can be used.

Experts say that to improve the service, Meta should consult with native speakers and language specialists as the tool still needs work.

How does the project work

Meta trains its artificial intelligence (AI) with data from the Opus repository, an open source platform with a collection of authentic text of speech or writing for various languages that can program machine learning.

Contributors to the dataset are experts in natural language processing (NLP): the subset of AI research that gives computers the ability to translate and understand human language.

Meta said they also use a combination of mined data from sources like Wikipedia in their databases. 

The data is used to create what Meta calls a multilingual language model (MLM), where the AI can translate “between any pair… of languages without relying on English data,” according to their website.

The NLLB team evaluates the quality of their translations with a benchmark of human-translated sentences they’ve created that is also open source. This includes a list of “toxicity” words or phrases that humans can teach the software to filter out when translating text. 

According to their latest paper, the NLLB team improved the accuracy of translations by 44 per cent from their first model, which was released in 2020. 

When the technology is fully implemented, Meta estimates there will be more than 25 billion translations every day on Facebook News Feed, Instagram, and other platforms. 

‘Talk to the people’

William Lamb, professor of Gaelic ethnology and linguistics at the University of Edinburgh, is an expert in Scottish Gaelic, one of the low-resource languages identified by Meta in its NLLB project. 

About 2.5 per cent of Scotland’s population, roughly 130,000 people, told the 2022 census that they have some skills in the 13th-century Celtic language.

There are also roughly 2,000 Gaelic speakers in eastern Canada, where it is a minority language. UNESCO classifies the language as “threatened” by extinction because of how few people speak it regularly. 

Lamb noted that Meta’s translations in Scottish Gaelic are “not very good yet,” because of the crowdsourced data they’re using, despite their “heart being in the right place”. 

“What they should do … if they really want to improve the translation is to talk to the people, the native Gaelic speakers that still live and breathe the language,” Lamb said. 

That’s easier said than done, Lamb continued. Most of the native speakers are in their 70s and do not use computers, and the young speakers “use Gaelic habitually not in the way their grandparents do”.

ADVERTISEMENT

A good replacement would be for Meta to strike a licensing agreement with the BBC, who work to preserve the language by creating high-quality, online content in it. 

‘This needs to be done by specialists’

Alberto Bugarín-Diz, professor of AI at the University of Santiago de Compostela in Spain, believes linguists like Lamb should work with Big Tech companies to refine the data sets available to them. 

“This needs to be done by specialists who can revise the texts, correct them and update them with metadata that we could use,” Bugarin-Diz said. 

“People from humanities and from a technical background like engineers need to work together, it’s a real need,” he added.

There is an advantage for Meta in using Wikipedia, Bugarin-Diz continued, because the data would reflect “almost every aspect of human life,” meaning that the quality of the language could be much better than just using more formal texts. 

ADVERTISEMENT

But, Bugarin-Diz suggests Meta and other AI companies take the time to look for quality data online and then go through the legal requirements necessary to use it, without breaking intellectual property laws. 

Lamb, meanwhile, said he won’t recommend that people use it due to errors in the data unless Meta makes some changes in their dataset.

“I wouldn’t say their translation abilities are at the point where the tools are actually useful,” Lamb said.

“I wouldn’t encourage anybody as reliable language tools yet; I think they would be upfront in saying that too”.

Bugarín-Diz takes a different stance. 

ADVERTISEMENT

He believes that, if no one uses the Meta translations, they “will not be willing” to invest time and resources into improving them.

Like other AI tools, Bugarin-Diz believes it’s a matter of knowing the weaknesses of the technology before using it. 

Source link : https://www.euronews.com/next/2024/06/30/meta-expands-ai-translation-to-200-languages-but-experts-suggest-talking-to-native-speaker

Author :

Publish date : 2024-06-30 14:10:18

Copyright for syndicated content belongs to the linked Source.

Tags: Europe
ADVERTISEMENT
Previous Post

Asian cities, off the beaten path in Europe

Next Post

Portugal’s prime minister fires chief of staff amid corruption-fueled political crisis

Related Posts

Kuehne+Nagel introduces new direct line hauls between Türkiye and Europe inside its groupage community – Kuehne + Nagel
News

Kuehne+Nagel introduces new direct line hauls between Türkiye and Europe inside its groupage community – Kuehne + Nagel

Europe slams ‘unlawful’ Trump tariffs, vows unified response – politico.eu
News

Europe slams ‘unlawful’ Trump tariffs, vows unified response – politico.eu

Report: Assaults on Catholics more and more widespread and tolerated in Europe and Latin America – Catholic Information Company
News

Report: Assaults on Catholics more and more widespread and tolerated in Europe and Latin America – Catholic Information Company

ADVERTISEMENT

Highlights

Three US Soldiers Discovered Dead in Armored Vehicle in Lithuania, One Soldier Still Unaccounted For – EUROP INFO

Explore the F1 Monaco Grand Prix Circuit in Breathtaking 3D with Apple Maps! – EUROP INFO

Montenegro Embraces a Greener Future by Joining the EU’s LIFE Programme! – EUROP INFO

Netherlands Strengthens Military Might with 46 State-of-the-Art Leopard 2A8 Tanks! – EUROP INFO

How the April 28, 2025 Power Outage Shook Internet Traffic in Portugal and Spain – EUROP INFO

Categories

Archives

June 2024
MTWTFSS
 12
3456789
10111213141516
17181920212223
24252627282930
    Jul »
  • Contact Us
  • Privacy Policy
  • Terms of Use
  • Cookie Privacy Policy
  • DMCA
  • California Consumer Privacy Act (CCPA)
No Result
View All Result
  • Home
  • Politics
  • News
  • Business
  • Culture
  • Sports
  • Lifestyle
  • Travel
  • Opinion

© 2024 Love-Europe

This website uses cookies. By continuing to use this website you are giving consent to cookies being used. Visit our Privacy and Cookie Policy.
Go to mobile version