Figure 2 (IMAGE)
Caption
Interactive digital instructor system: end-to-end pipeline. MLLM: multimodal large language model, Q&A: question-and-answer.
Credit
Qi Liu, Yunhao Sha, Kai Zhang, Zhenya Huang, Linbo Zhu, Junyu Lu, Yu Su
Usage Restrictions
none
License
Original content