TY - GEN
T1 - Human-Intention Prediction with Visual-Language Model
AU - Liang, Yongshi
AU - Zheng, Pai
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024/8
Y1 - 2024/8
N2 - Human-intention prediction is an important part of human-machine interaction wildly utilized in industrial intelligent systems. In recent years, large language models have expanded to the image task with outstanding performance, leading to an increasing attraction to the application of multimodal Large Language Models. However, the exploration of visual-language models in human-intention prediction is still limited. To address this gap, this paper investigates the effectiveness of visual-language models in predicting human intentions and successfully transfers the knowledge in LLMs to downstream classification tasks. Finally, this paper takes traffic scenarios as an example to validate the feasibility of the video-LLaMA model in predicting pedestrian behavior intentions.
AB - Human-intention prediction is an important part of human-machine interaction wildly utilized in industrial intelligent systems. In recent years, large language models have expanded to the image task with outstanding performance, leading to an increasing attraction to the application of multimodal Large Language Models. However, the exploration of visual-language models in human-intention prediction is still limited. To address this gap, this paper investigates the effectiveness of visual-language models in predicting human intentions and successfully transfers the knowledge in LLMs to downstream classification tasks. Finally, this paper takes traffic scenarios as an example to validate the feasibility of the video-LLaMA model in predicting pedestrian behavior intentions.
KW - Human-intention prediction
KW - Human-machine interaction
KW - Pedestrian intention
KW - Visual-language model
UR - http://www.scopus.com/inward/record.url?scp=105001922060&partnerID=8YFLogxK
U2 - 10.1109/ICaMaL62577.2024.10919824
DO - 10.1109/ICaMaL62577.2024.10919824
M3 - Conference article published in proceeding or book
AN - SCOPUS:105001922060
SN - 9798350378665
T3 - 2024 International Conference on Automation in Manufacturing, Transportation and Logistics, ICaMaL 2024
BT - 2024 International Conference on Automation in Manufacturing, Transportation and Logistics, ICaMaL 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2024 International Conference on Automation in Manufacturing, Transportation and Logistics, ICaMaL 2024
Y2 - 7 August 2024 through 9 August 2024
ER -