Why AI needs to learn new languages
Briefly

The latest version of OpenAI's model, GPT-4, scored 85% on a common question-and-answer test. In other languages it is less impressive. When taking the test in Telugu, an Indian language spoken by around 100m people, for instance, it scored just 62%.
Large language models (LLMs) are trained on text scraped from the internet, where English is the lingua franca. Around 93% of GPT-3's training data was in English. In Common Crawl, just one of the datasets on which the model was trained, English makes up 47% of the corpus, with other (mostly related) European languages accounting for a further 38%. Chinese and Japanese combined, by contrast, make up just 9%. Telugu is not even a rounding error.
Read at The Economist