GATraj: A graph- and attention-based multi-agent trajectory prediction model

Research output: Contribution to journalArticleResearchpeer review

Authors

  • Hao Cheng
  • Mengmeng Liu
  • Lin Chen
  • Hellward Broszio
  • Monika Sester
  • Michael Ying Yang

External Research Organisations

  • International Institute for Geo-Information Science and Earth Observation - ITC
  • VISCODA GmbH
View graph of relations

Details

Original languageEnglish
Pages (from-to)163-175
Number of pages13
JournalISPRS Journal of Photogrammetry and Remote Sensing
Volume205
Early online date12 Oct 2023
Publication statusPublished - Nov 2023

Abstract

Trajectory prediction has been a long-standing problem in intelligent systems like autonomous driving and robot navigation. Models trained on large-scale benchmarks have made significant progress in improving prediction accuracy. However, the importance on efficiency for real-time applications has been less emphasized. This paper proposes an attention-based graph model, named GATraj, which achieves a good balance of prediction accuracy and inference speed. We use attention mechanisms to model the spatial–temporal dynamics of agents, such as pedestrians or vehicles, and a graph convolutional network to model their interactions. Additionally, a Laplacian mixture decoder is implemented to mitigate mode collapse and generate diverse multimodal predictions for each agent. GATraj achieves state-of-the-art prediction performance at a much higher speed when tested on the ETH/UCY datasets for pedestrian trajectories, and good performance at about 100 Hz inference speed when tested on the nuScenes dataset for autonomous driving. We conduct extensive experiments to analyze the probability estimation of the Laplacian mixture decoder and compare it with a Gaussian mixture decoder for predicting different multimodalities. Furthermore, comprehensive ablation studies demonstrate the effectiveness of each proposed module in GATraj.

Keywords

    Autonomous driving, Graph model, Mixture density network, Pedestrian, Trajectory prediction

ASJC Scopus subject areas

Cite this

GATraj: A graph- and attention-based multi-agent trajectory prediction model. / Cheng, Hao; Liu, Mengmeng; Chen, Lin et al.
In: ISPRS Journal of Photogrammetry and Remote Sensing, Vol. 205, 11.2023, p. 163-175.

Research output: Contribution to journalArticleResearchpeer review

Cheng H, Liu M, Chen L, Broszio H, Sester M, Yang MY. GATraj: A graph- and attention-based multi-agent trajectory prediction model. ISPRS Journal of Photogrammetry and Remote Sensing. 2023 Nov;205:163-175. Epub 2023 Oct 12. doi: 10.48550/arXiv.2209.07857, 10.1016/j.isprsjprs.2023.10.001
Cheng, Hao ; Liu, Mengmeng ; Chen, Lin et al. / GATraj : A graph- and attention-based multi-agent trajectory prediction model. In: ISPRS Journal of Photogrammetry and Remote Sensing. 2023 ; Vol. 205. pp. 163-175.
Download
@article{f2b657db57434a57b2c4720e61f7564a,
title = "GATraj: A graph- and attention-based multi-agent trajectory prediction model",
abstract = "Trajectory prediction has been a long-standing problem in intelligent systems like autonomous driving and robot navigation. Models trained on large-scale benchmarks have made significant progress in improving prediction accuracy. However, the importance on efficiency for real-time applications has been less emphasized. This paper proposes an attention-based graph model, named GATraj, which achieves a good balance of prediction accuracy and inference speed. We use attention mechanisms to model the spatial–temporal dynamics of agents, such as pedestrians or vehicles, and a graph convolutional network to model their interactions. Additionally, a Laplacian mixture decoder is implemented to mitigate mode collapse and generate diverse multimodal predictions for each agent. GATraj achieves state-of-the-art prediction performance at a much higher speed when tested on the ETH/UCY datasets for pedestrian trajectories, and good performance at about 100 Hz inference speed when tested on the nuScenes dataset for autonomous driving. We conduct extensive experiments to analyze the probability estimation of the Laplacian mixture decoder and compare it with a Gaussian mixture decoder for predicting different multimodalities. Furthermore, comprehensive ablation studies demonstrate the effectiveness of each proposed module in GATraj.",
keywords = "Autonomous driving, Graph model, Mixture density network, Pedestrian, Trajectory prediction",
author = "Hao Cheng and Mengmeng Liu and Lin Chen and Hellward Broszio and Monika Sester and Yang, {Michael Ying}",
note = "Funding Information: This work is partially supported by the MSCA European Postdoctoral Fellowships under the 101062870 – VeVuSafety project and partially performed in the framework of project KaBa (Kamerabasierte Bewegungsanalyse aller Verkehrsteilnehmer f{\"u}r automatisiertes Fahren) supported by the European Regional Development Fund at VISCODA company.",
year = "2023",
month = nov,
doi = "10.48550/arXiv.2209.07857",
language = "English",
volume = "205",
pages = "163--175",
journal = "ISPRS Journal of Photogrammetry and Remote Sensing",
issn = "0924-2716",
publisher = "Elsevier",

}

Download

TY - JOUR

T1 - GATraj

T2 - A graph- and attention-based multi-agent trajectory prediction model

AU - Cheng, Hao

AU - Liu, Mengmeng

AU - Chen, Lin

AU - Broszio, Hellward

AU - Sester, Monika

AU - Yang, Michael Ying

N1 - Funding Information: This work is partially supported by the MSCA European Postdoctoral Fellowships under the 101062870 – VeVuSafety project and partially performed in the framework of project KaBa (Kamerabasierte Bewegungsanalyse aller Verkehrsteilnehmer für automatisiertes Fahren) supported by the European Regional Development Fund at VISCODA company.

PY - 2023/11

Y1 - 2023/11

N2 - Trajectory prediction has been a long-standing problem in intelligent systems like autonomous driving and robot navigation. Models trained on large-scale benchmarks have made significant progress in improving prediction accuracy. However, the importance on efficiency for real-time applications has been less emphasized. This paper proposes an attention-based graph model, named GATraj, which achieves a good balance of prediction accuracy and inference speed. We use attention mechanisms to model the spatial–temporal dynamics of agents, such as pedestrians or vehicles, and a graph convolutional network to model their interactions. Additionally, a Laplacian mixture decoder is implemented to mitigate mode collapse and generate diverse multimodal predictions for each agent. GATraj achieves state-of-the-art prediction performance at a much higher speed when tested on the ETH/UCY datasets for pedestrian trajectories, and good performance at about 100 Hz inference speed when tested on the nuScenes dataset for autonomous driving. We conduct extensive experiments to analyze the probability estimation of the Laplacian mixture decoder and compare it with a Gaussian mixture decoder for predicting different multimodalities. Furthermore, comprehensive ablation studies demonstrate the effectiveness of each proposed module in GATraj.

AB - Trajectory prediction has been a long-standing problem in intelligent systems like autonomous driving and robot navigation. Models trained on large-scale benchmarks have made significant progress in improving prediction accuracy. However, the importance on efficiency for real-time applications has been less emphasized. This paper proposes an attention-based graph model, named GATraj, which achieves a good balance of prediction accuracy and inference speed. We use attention mechanisms to model the spatial–temporal dynamics of agents, such as pedestrians or vehicles, and a graph convolutional network to model their interactions. Additionally, a Laplacian mixture decoder is implemented to mitigate mode collapse and generate diverse multimodal predictions for each agent. GATraj achieves state-of-the-art prediction performance at a much higher speed when tested on the ETH/UCY datasets for pedestrian trajectories, and good performance at about 100 Hz inference speed when tested on the nuScenes dataset for autonomous driving. We conduct extensive experiments to analyze the probability estimation of the Laplacian mixture decoder and compare it with a Gaussian mixture decoder for predicting different multimodalities. Furthermore, comprehensive ablation studies demonstrate the effectiveness of each proposed module in GATraj.

KW - Autonomous driving

KW - Graph model

KW - Mixture density network

KW - Pedestrian

KW - Trajectory prediction

UR - http://www.scopus.com/inward/record.url?scp=85174064136&partnerID=8YFLogxK

U2 - 10.48550/arXiv.2209.07857

DO - 10.48550/arXiv.2209.07857

M3 - Article

VL - 205

SP - 163

EP - 175

JO - ISPRS Journal of Photogrammetry and Remote Sensing

JF - ISPRS Journal of Photogrammetry and Remote Sensing

SN - 0924-2716

ER -

By the same author(s)