News Release

KAIST develops robot learning technology capable of precisely imitating even “rough” demonstrations

Professor Daehyung Park’s research team in the School of Computing develops DiSPo, a robot AI model that can flexibly improve task precision from only coarse (low-frequency) demonstrations

Peer-Reviewed Publication

The Korea Advanced Institute of Science and Technology (KAIST)

Robots with increasingly precise dexterity are becoming essential in everyday life and industrial settings, from assembling tiny smartphone components to assisting doctors in surgery. However, teaching robots delicate human movements has traditionally required collecting vast amounts of data at extremely fine time intervals, resulting in significant costs and time burdens. KAIST researchers have developed a robot artificial intelligence technology that can perform sophisticated tasks by autonomously adjusting precision according to the situation, even when trained only on coarsely (sparsely) sampled demonstrations.

KAIST, led by President Kwang Hyung Lee, announced on the 24th that a research team led by Professor Daehyung Park of the School of Computing has developed DiSPo, a multi-granularity manipulation model that generates fine-grained robot motions tailored to a user’s desired level of precision, even from rough human demonstrations.

Existing robot learning methods, such as Behavior Transformer and Diffusion Policy, are limited by their dependence on the time intervals of the data used during training. As a result, learning precision manipulation tasks such as screw fastening or component insertion has required collecting large volumes of high-frequency data at very short time intervals. This has significantly increased data collection costs and slowed down the inference speed of robot AI models.

To overcome these limitations, the research team combined Mamba, a state-space model capable of predicting time intervals, with a diffusion model that enables rich action representation. The team also introduced a new Step-scale factor mechanism, which allows users to directly control the time intervals used by the robot.

As a result, even when trained on only low-frequency (coarse) demonstration data, the robot can generate high-precision motions during inference without additional training by autonomously subdividing actions through a discretization process.

DiSPo achieved up to an 81% higher task success rate compared to state-of-the-art models in simulation environments. In real-world experiments using a collaborative robot, DiSPo stably performed challenging tasks such as passing a clamp through a narrow gap with only a 2.5 mm radial clearance and accurately pressing a small shutter button on a smartphone. This performance was up to four times higher than that of existing AI models.

The technology is expected to make a significant contribution to automation in a wide range of everyday and industrial service fields that require high precision, including precision component assembly, cable connection, medical surgery, and precision machining.

“This study demonstrates that robots can learn precise motions from coarse demonstrations and autonomously adjust their level of precision according to the task situation,” said Professor Daehyung Park. “Moving forward, this technology is expected to dramatically reduce data collection costs while serving as a general-purpose robot learning technology for various industrial fields, including precision assembly and medical applications.”

The study was led by Nayoung Oh, a master’s student at the KAIST Graduate School of AI, as the first author, and was presented on June 1 at the 2026 IEEE International Conference on Robotics and Automation, or ICRA 2026, one of the world’s most prestigious robotics conferences, held in Vienna, Austria.

Paper Title: DiSPo: Diffusion-SSM based Policy Learning for Coarse-to-Fine Action Discretization
DOI: https://doi.org/10.48550/arXiv.2409.14719

Authors: Nayoung Oh, KAIST Graduate School of AI, first author; Jaehyeong Jang, KAIST School of Computing, co-author; Moonkyeong Jung, KAIST Robotics Program, co-author; and Daehyung Park, KAIST School of Computing, corresponding author.

This research was conducted as part of the “Core Software Technology Development for Complex-Intelligence Autonomous Agents” project supported by the Ministry of Science and ICT and the Institute of Information & Communications Technology Planning & Evaluation, or IITP, under the project “Development of Task Procedure Generation Technology for Autonomous Execution of Complex Tasks by Autonomous Agents.” The study also received support from IITP’s Global AI Frontier Lab program and the Technology Innovation Program of the Ministry of Trade, Industry and Energy.


Disclaimer: AAAS and EurekAlert! are not responsible for the accuracy of news releases posted to EurekAlert! by contributing institutions or for the use of any information through the EurekAlert system.