News Release

AI that can learn patterns of human language

The model automatically learns higher-level language patterns that can apply to different languages, enabling it to achieve better results

Peer-Reviewed Publication

McGill University

Researchers have developed an artificial intelligence system that can learn the rules and patterns of human languages on its own.

image: Researchers have developed an artificial intelligence system that can learn the rules and patterns of human languages on its own. view more 

Credit: Jose-Luis Olivares, MIT

Human languages are notoriously complex, and linguists have long thought it would be impossible to teach a machine how to analyze speech sounds and word structures in the way humans do. But researchers from McGill University, MIT, and Cornell University have taken a step in this direction. They have developed an artificial intelligence (AI) system that can learn the rules and patterns of human languages on its own.

The model automatically learns higher-level language patterns that can apply to different languages, enabling it to achieve better results. When given words and examples of how those words change to express different grammatical functions in one language – like tense, case, or gender – this machine-learning model comes up with rules that explain why the forms of those words change. For instance, it might learn that the letter “a” must be added to the end of a word to make the masculine form feminine in Serbo-Croatian.

This system could be used to put language theories to the test and investigate subtle similarities in the way diverse languages transform words, say the researchers. “We wanted to see if we can emulate the kinds of knowledge and reasoning that humans bring to the task,” says co-author Adam Albright, a professor of linguistics at MIT.

“The exciting thing about this work is that it shows how we can build algorithms that are able to generalize from really tiny samples of language data, more like human scientists and children,” says senior author Timothy O’Donnell, assistant professor in the Department of Linguistics at McGill University, and Canada CIFAR AI Chair at Mila - Quebec Artificial Intelligence Institute.

With files from MIT News Office

About this study

Synthesizing theories of human language with Bayesian program induction” by Kevin Ellis et al. was published in Nature Communications.


Disclaimer: AAAS and EurekAlert! are not responsible for the accuracy of news releases posted to EurekAlert! by contributing institutions or for the use of any information through the EurekAlert system.