TY - GEN
T1 - A2X
T2 - 14th ACM SIGGRAPH Conference on Motion, Interaction, and Games, MIG 2021
AU - Sohn, Samuel S.
AU - Lee, Mihee
AU - Moon, Seonghyeon
AU - Qiao, Gang
AU - Usman, Muhammad
AU - Yoon, Sejong
AU - Pavlovic, Vladimir
AU - Kapadia, Mubbasir
N1 - Funding Information:
The research was supported in part by NSF awards: IIS-1703883, IIS-1955404, IIS-1955365, RETTL-2119265, and EAGER-2122119. The authors acknowledge use of the TCNJ ELSA HPC cluster, funded by NSF grant OAC-1828163, for conducting the research reported in this paper.
Publisher Copyright:
© 2021 ACM.
PY - 2021/11/10
Y1 - 2021/11/10
N2 - In recent years, human trajectory prediction (HTP) has garnered attention in computer vision literature. Although this task has much in common with the longstanding task of crowd simulation, there is little from crowd simulation that has been borrowed, especially in terms of evaluation protocols. The key difference between the two tasks is that HTP is concerned with forecasting multiple steps at a time and capturing the multimodality of real human trajectories. A majority of HTP models are trained on the same few datasets, which feature small, transient interactions between real people and little to no interaction between people and the environment. Unsurprisingly, when tested on crowd egress scenarios, these models produce erroneous trajectories that accelerate too quickly and collide too frequently, but the metrics used in HTP literature cannot convey these particular issues. To address these challenges, we propose (1) the A2X dataset, which has simulated crowd egress and complex navigation scenarios that compensate for the lack of agent-to-environment interaction in existing real datasets, and (2) evaluation metrics that convey model performance with more reliability and nuance. A subset of these metrics are novel multiverse metrics, which are better-suited for multimodal models than existing metrics. The dataset is available at: https://mubbasir.github.io/HTP-benchmark/.
AB - In recent years, human trajectory prediction (HTP) has garnered attention in computer vision literature. Although this task has much in common with the longstanding task of crowd simulation, there is little from crowd simulation that has been borrowed, especially in terms of evaluation protocols. The key difference between the two tasks is that HTP is concerned with forecasting multiple steps at a time and capturing the multimodality of real human trajectories. A majority of HTP models are trained on the same few datasets, which feature small, transient interactions between real people and little to no interaction between people and the environment. Unsurprisingly, when tested on crowd egress scenarios, these models produce erroneous trajectories that accelerate too quickly and collide too frequently, but the metrics used in HTP literature cannot convey these particular issues. To address these challenges, we propose (1) the A2X dataset, which has simulated crowd egress and complex navigation scenarios that compensate for the lack of agent-to-environment interaction in existing real datasets, and (2) evaluation metrics that convey model performance with more reliability and nuance. A subset of these metrics are novel multiverse metrics, which are better-suited for multimodal models than existing metrics. The dataset is available at: https://mubbasir.github.io/HTP-benchmark/.
KW - datasets
KW - evaluation metrics
KW - human trajectory prediction
UR - http://www.scopus.com/inward/record.url?scp=85119505488&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85119505488&partnerID=8YFLogxK
U2 - 10.1145/3487983.3488302
DO - 10.1145/3487983.3488302
M3 - Conference contribution
AN - SCOPUS:85119505488
T3 - Proceedings - MIG 2021: 14th ACM SIGGRAPH Conference on Motion, Interaction, and Games
BT - Proceedings - MIG 2021
A2 - Spencer, Stephen N.
PB - Association for Computing Machinery, Inc
Y2 - 10 November 2021 through 12 November 2021
ER -