TY - JOUR
T1 - A2X
T2 - An end-to-end framework for assessing agent and environment interactions in multimodal human trajectory prediction
AU - Sohn, Samuel S.
AU - Lee, Mihee
AU - Moon, Seonghyeon
AU - Qiao, Gang
AU - Usman, Muhammad
AU - Yoon, Sejong
AU - Pavlovic, Vladimir
AU - Kapadia, Mubbasir
N1 - Funding Information:
The research was supported in part by NSF awards: IIS-1703883 , IIS-1955404 , IIS-1955365 , RETTL-2119265 , and EAGER-2122119 . The authors acknowledge use of the TCNJ ELSA HPC cluster, funded by NSF grant OAC-1828163, for conducting the research reported in this paper. This material is based upon work supported by the U.S. Department of Homeland Security under Grant Award Number 22STESE00001 01 01 . Disclaimer. The views and conclusions contained in this document are those of the authors and should not be interpreted as necessarily representing the official policies, either expressed or implied, of the U.S. Department of Homeland Security.
Publisher Copyright:
© 2022 Elsevier Ltd
PY - 2022/8
Y1 - 2022/8
N2 - In recent years, human trajectory prediction (HTP) has garnered attention in computer vision literature. Although this task has much in common with the longstanding task of crowd simulation, there is little from crowd simulation that has been borrowed, especially in terms of evaluation protocols. The key difference between the two tasks is that HTP is concerned with forecasting multiple steps at a time and capturing the multimodality of real human trajectories. A majority of HTP models are trained on the same few datasets, which feature small, transient interactions between real people and little to no interaction between people and the environment. Unsurprisingly, when tested on crowd egress scenarios, these models produce erroneous trajectories that accelerate too quickly and collide too frequently, but the metrics used in HTP literature cannot convey these particular issues. To address these challenges, we propose (1) the A2X dataset, which has simulated crowd egress and complex navigation scenarios that compensate for the lack of agent-to-environment interaction in existing real datasets, (2) evaluation metrics that convey model performance with more reliability and nuance, and (3) a guideline for future data acquisition in HTP. A subset of the proposed metrics are novel multiverse metrics, which are better suited for multimodal models than existing metrics. The dataset is available at: https://mubbasir.github.io/HTP-benchmark.
AB - In recent years, human trajectory prediction (HTP) has garnered attention in computer vision literature. Although this task has much in common with the longstanding task of crowd simulation, there is little from crowd simulation that has been borrowed, especially in terms of evaluation protocols. The key difference between the two tasks is that HTP is concerned with forecasting multiple steps at a time and capturing the multimodality of real human trajectories. A majority of HTP models are trained on the same few datasets, which feature small, transient interactions between real people and little to no interaction between people and the environment. Unsurprisingly, when tested on crowd egress scenarios, these models produce erroneous trajectories that accelerate too quickly and collide too frequently, but the metrics used in HTP literature cannot convey these particular issues. To address these challenges, we propose (1) the A2X dataset, which has simulated crowd egress and complex navigation scenarios that compensate for the lack of agent-to-environment interaction in existing real datasets, (2) evaluation metrics that convey model performance with more reliability and nuance, and (3) a guideline for future data acquisition in HTP. A subset of the proposed metrics are novel multiverse metrics, which are better suited for multimodal models than existing metrics. The dataset is available at: https://mubbasir.github.io/HTP-benchmark.
KW - Datasets
KW - Evaluation metrics
KW - Human trajectory prediction
UR - http://www.scopus.com/inward/record.url?scp=85132718271&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85132718271&partnerID=8YFLogxK
U2 - 10.1016/j.cag.2022.05.010
DO - 10.1016/j.cag.2022.05.010
M3 - Article
AN - SCOPUS:85132718271
SN - 0097-8493
VL - 106
SP - 130
EP - 140
JO - Computers and Graphics (Pergamon)
JF - Computers and Graphics (Pergamon)
ER -