Understanding how a robot will react under different conditions is essential to guaranteeing its safe operation. But how do you know what will break a robot without actually damaging it? A new method developed by scientists at the Institute of Science and Technology Austria (IST Austria) and the Max Planck Institute for Intelligent Systems (MPI for Intelligent Systems) is the first machine learning method that can use observations made under safe conditions to make accurate predictions for all possible conditions governed by the same physical dynamics. Designed specifically for real-life situations, their method provides simple, interpretable descriptions of the underlying physics. The researchers will present their findings tomorrow at this year's prestigious International Conference on Machine Learning (ICML).
In the past, machine learning was only capable of interpolating data--making predictions about situations that are "between" other, known situations. It was incapable of extrapolating--making predictions about situations outside of the known--because it learns to fit the known data as closely as possible locally, regardless of how it performs outside of these situations. In addition, collecting sufficient data for effective interpolation is both time- and resource-intensive, and requires data from extreme or dangerous situations. Now, three researchers have developed a new machine learning method that addresses these problems and is the first to accurately extrapolate to unseen situations: Georg Martius, a former ISTFELLOW and IST Austria postdoc who has been a group leader at MPI for Intelligent Systems in Tübingen since 2017; Subham S. Sahoo, a PhD student also at MPI for Intelligent Systems; and Christoph Lampert, a professor at IST Austria.
The key feature of the new method is that it strives to reveal the true dynamics of the situation: it takes in data and returns the equations that describe the underlying physics. "If you know those equations," says Georg Martius, "then you can say what will happen in all situations, even if you haven't seen them." In other words, this is what allows the method to extrapolate reliably, making it unique among machine learning methods.
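The idea that a recovered equation extrapolates where a purely local fit does not can be illustrated with a small sketch. This is not the researchers' method, whose details are not given here; it swaps in a much simpler technique (thresholded least squares over a fixed list of candidate terms) purely for illustration. The ground-truth function, the candidate terms, the threshold, and all names in the code are assumptions made for this example.

```python
import numpy as np


def true_dynamics(x):
    # Hypothetical ground truth, chosen only for this illustration.
    return 2.0 * np.sin(x) + 0.5 * x


# Assumed list of candidate terms an equation may be built from.
TERM_NAMES = ["1", "x", "x^2", "sin(x)", "cos(x)"]


def library(x):
    # Evaluate every candidate term at every input point.
    return np.column_stack([np.ones_like(x), x, x**2, np.sin(x), np.cos(x)])


def fit_sparse(features, targets, threshold=0.05, n_iter=10):
    # Least squares, then repeatedly zero out small coefficients and refit
    # the remaining terms, so that only a few terms survive.
    coef = np.linalg.lstsq(features, targets, rcond=None)[0]
    for _ in range(n_iter):
        keep = np.abs(coef) >= threshold
        coef[~keep] = 0.0
        if not keep.any():
            break
        coef[keep] = np.linalg.lstsq(features[:, keep], targets, rcond=None)[0]
    return coef


# Observations come only from a "safe" range; predictions are then
# requested for a range that was never observed.
x_safe = np.linspace(-2.0, 2.0, 200)
x_unseen = np.linspace(4.0, 8.0, 200)
y_safe = true_dynamics(x_safe)
y_unseen = true_dynamics(x_unseen)

# Equation-style model: a short, readable formula.
coef = fit_sparse(library(x_safe), y_safe)
equation = " + ".join(
    f"{c:.3f}*{name}" for c, name in zip(coef, TERM_NAMES) if c != 0.0
)
print("recovered equation: y =", equation)

# Baseline: a degree-7 polynomial that fits the safe range closely.
poly = np.polyfit(x_safe, y_safe, deg=7)

sparse_err = np.max(np.abs(library(x_unseen) @ coef - y_unseen))
poly_err = np.max(np.abs(np.polyval(poly, x_unseen) - y_unseen))
print("max error outside the safe range, equation model:", sparse_err)
print("max error outside the safe range, polynomial fit:", poly_err)
```

Both models fit the safe range well, but only the one that recovers the generating formula stays accurate outside it. The sketch works only because the true terms happen to be in the candidate list, which is a strong assumption that a real method cannot rely on in this form.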
The team's method sets itself apart in several other ways as well. First, the models previously produced by machine learning were far too complex for a human to understand or work with. In the new method, the resulting equations are far simpler: "Our method's equations are something you would see in a textbook--simple and intuitive," says Christoph Lampert. This interpretability is another key difference: other machine learning methods give no insight into the relationship between conditions and results--and thus, no intuition about whether the model is even plausible. "In every other area of research, we expect models that make physical sense, that tell us why," adds Lampert. "This is what we should expect from machine learning, and what our method provides." Finally, in order to guarantee interpretability and optimize for physical situations, the team based their learning method on a different type of framework. This new design is simpler than previous methods, which in practice means that less data is needed to give the same or even better results.
And it's not all theory: "In my group, we're actually working on developing a robot that uses this type of learning. In the future, the robot would experiment with different motions, then be able to use machine learning to uncover the equations that govern its body and movement, allowing it to avoid dangerous actions or situations," adds Martius. While robots are one active area of research, the method can be used with any type of data, from biological systems to X-ray transition energies, and can also be incorporated into larger machine learning networks.
S. S. Sahoo, C. H. Lampert, and G. Martius. Learning equations for extrapolation and control. In Proc. 35th International Conference on Machine Learning, ICML 2018, Stockholm, Sweden, 2018. PMLR, 2018. to appear
Arxiv Preprint: https:/
Link to conference: https:/
This project has received funding from the European Union's Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement No 291734.
The research received funding from the ISTFELLOW program, a Marie Skłodowska-Curie COFUND grant co-funded by IST Austria and the European Union through the Horizon 2020 research and innovation programme. This program has since been succeeded by another COFUND grant, the ISTplus program, which is open for applications from qualified postdocs all over the world.
About IST Austria
The Institute of Science and Technology Austria (IST Austria) is a PhD-granting research institution located in Klosterneuburg, 18 km from the center of Vienna, Austria. Inaugurated in 2009, the Institute is dedicated to basic research in the natural and mathematical sciences. IST Austria employs professors on a tenure-track system, postdoctoral fellows, and doctoral students. While dedicated to the principle of curiosity-driven research, the Institute owns the rights to all scientific discoveries and is committed to promoting their use. The first president of IST Austria is Thomas A. Henzinger, a leading computer scientist and former professor at the University of California in Berkeley, USA, and the EPFL in Lausanne, Switzerland. The graduate school of IST Austria offers fully funded PhD positions to highly qualified candidates with a bachelor's or master's degree in biology, neuroscience, mathematics, computer science, physics, and related areas. http://www.
At the Max Planck Institute for Intelligent Systems we aim to understand the principles of Perception, Action and Learning in Intelligent Systems. The Max Planck Institute for Intelligent Systems is located in two cities: Stuttgart and Tübingen. Research at the Stuttgart site of the Max Planck Institute for Intelligent Systems covers small-scale robotics, self-organization, haptic perception, bio-inspired systems, medical robotics, and physical intelligence. The Tübingen site of the institute concentrates on machine learning, computer vision, robotics, control, and the theory of intelligence.