Traffic Control Recognition with an Attention Mechanism Using Speed-Profile and Satellite Imagery data

Hao Cheng; Haoran Lei; Stefania Zourlidou; Monika Sester

doi:10.5194/isprs-archives-XLIII-B4-2022-287-2022

Details

Original language	English
Pages (from - to)	287-293
Number of pages	7
Journal	International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives
Volume	43
Issue number	B4-2022
Publication status	Published - 1 June 2022
Event	2022 24th ISPRS Congress on Imaging Today, Foreseeing Tomorrow, Commission IV - Nice, France. Duration: 6 June 2022 → 11 June 2022

Abstract

Traffic regulators at intersections are an essential factor influencing traffic flow and, subsequently, the route choices of commuters. A digital map that provides up-to-date traffic control information is beneficial not only for facilitating commuters' trips, but also for saving energy and protecting the environment. In this paper, instead of using expensive surveying methods, we propose an automatic method based on a Conditional Variational Autoencoder (CVAE) to recognize traffic regulators, i.e., arm rules at intersections, by leveraging GPS data collected from vehicles and satellite imagery retrieved from digital maps, i.e., Google Maps. We apply a Long Short-Term Memory (LSTM) network to extract the motion dynamics over a GPS sequence traversing the intersection. Simultaneously, we build a Convolutional Neural Network (CNN) to extract the grid-based local imagery information associated with each step of the GPS positions. Moreover, a self-attention mechanism is adopted to extract the spatial and temporal features over both the GPS and grid sequences. The extracted temporal and spatial features are then combined to detect the traffic arm rules. To analyze the performance of our method, we tested it on a GPS dataset collected by driving vehicles in Hannover, a medium-sized German city. Compared to a Random Forest model and an Encoder-Decoder model, our proposed model achieved better results, with an accuracy and an F1-score of 0.90 each on the three-class task (arm rules of uncontrolled, traffic light, and priority sign). We also carried out ablation studies to further investigate the effectiveness of the GPS input branch, the image input branch, and the self-attention mechanism in our model.
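
The self-attention step mentioned in the abstract can be illustrated with a minimal sketch of generic scaled dot-product self-attention over a short sequence of per-step features. This is not the authors' implementation: the identity projections (queries, keys, and values are the raw inputs), the function names, and the example speed and acceleration values are all assumptions made for illustration, and a trained model would use learned projection matrices.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(sequence):
    """Scaled dot-product self-attention with identity projections.

    `sequence` is a list of equal-length feature vectors, one per time step.
    Queries, keys, and values are the raw inputs (an assumption for brevity).
    Returns (outputs, weights), where weights[i][j] is how strongly step i
    attends to step j.
    """
    dim = len(sequence[0])
    scale = math.sqrt(dim)
    outputs, weights = [], []
    for query in sequence:
        # Similarity of this step to every step, scaled by sqrt(dim).
        scores = [
            sum(q * k for q, k in zip(query, key)) / scale
            for key in sequence
        ]
        row = softmax(scores)
        weights.append(row)
        # Each output is the attention-weighted average of all steps.
        outputs.append([
            sum(w * value[d] for w, value in zip(row, sequence))
            for d in range(dim)
        ])
    return outputs, weights


# Hypothetical per-step features: (speed in m/s, acceleration in m/s^2).
track = [[12.0, 0.0], [6.0, -3.0], [0.5, -2.5], [4.0, 2.0]]
outputs, weights = self_attention(track)
```

Each row of `weights` sums to one, so every output vector is a convex combination of the input steps. In a setup like the one the abstract describes, the inputs would presumably be learned GPS and image-grid embeddings rather than raw values.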

ASJC Scopus subject areas

Computer Science (all)
Information systems
Social Sciences (all)
Geography, Planning and Development

Cite this

Traffic Control Recognition with an Attention Mechanism Using Speed-Profile and Satellite Imagery data. / Cheng, Hao; Lei, Haoran; Zourlidou, Stefania et al.
In: International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives, Vol. 43, No. B4-2022, 01.06.2022, p. 287-293.

Research output: Contribution to journal › Conference article in journal › Research › Peer review

Cheng, H, Lei, H, Zourlidou, S & Sester, M 2022, 'Traffic Control Recognition with an Attention Mechanism Using Speed-Profile and Satellite Imagery data', International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives, vol. 43, no. B4-2022, pp. 287-293. https://doi.org/10.5194/isprs-archives-XLIII-B4-2022-287-2022

Cheng, H., Lei, H., Zourlidou, S., & Sester, M. (2022). Traffic Control Recognition with an Attention Mechanism Using Speed-Profile and Satellite Imagery data. International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives, 43(B4-2022), 287-293. https://doi.org/10.5194/isprs-archives-XLIII-B4-2022-287-2022

Cheng H, Lei H, Zourlidou S, Sester M. Traffic Control Recognition with an Attention Mechanism Using Speed-Profile and Satellite Imagery data. International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives. 2022 Jun 1;43(B4-2022):287-293. doi: 10.5194/isprs-archives-XLIII-B4-2022-287-2022

Cheng, Hao ; Lei, Haoran ; Zourlidou, Stefania et al. / Traffic Control Recognition with an Attention Mechanism Using Speed-Profile and Satellite Imagery data. In: International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives. 2022 ; Vol. 43, No. B4-2022. pp. 287-293.

Download

@article{7f30637fe8974a6fb77e27daf8c0fd6f,

title = "Traffic Control Recognition with an Attention Mechanism Using Speed-Profile and Satellite Imagery data",

abstract = "Traffic regulators at intersections act as an essential factor that influences traffic flow and, subsequently, the route choices of commuters. A digital map that provides up-to-date traffic control information is beneficial not only for facilitating the commuters' trips, but also for energy-saving and environmental protection. In this paper, instead of using expensive surveying methods, we propose an automatic way based on a Conditional Variational Autoencoder (CVAE) to recognize traffic regulators, i. e., arm rules at intersections, by leveraging the GPS data collected from vehicles and the satellite imagery retrieved from digital maps, i. e., Google Maps. We apply a Long Short-Term Memory to extract the motion dynamics over a GPS sequence traversed through the intersection. Simultaneously, we build a Convolutional Neural Network (CNN) to extract the grid-based local imagery information associated with each step of the GPS positions. Moreover, a self-attention mechanism is adopted to extract the spatial and temporal features over both the GPS and grid sequences. The extracted temporal and spatial features are then combined for detecting the traffic arm rules. To analyze the performance of our method, we tested it on a GPS dataset collected by driving vehicles in Hannover, a medium-sized German city. Compared to a Random Forest model and an Encoder-Decoder model, our proposed model achieved better results with both accuracy and F1-score of 0.90 for the three-class (arm rules of uncontrolled, traffic light, and priority sign) task. We also carried out ablation studies to further investigate the effectiveness of the GPS input branch, the image input branch, and the self-attention mechanism in our model. ",

keywords = "Attention Mechanism, Classification, Deep Learning, Generative Model, Traffic Regulation",

author = "Hao Cheng and Haoran Lei and Stefania Zourlidou and Monika Sester",

note = "Funding Information: This work is supported by the German Research Foundation (DFG) via the project GRK2159 i.c.sens.; 2022 24th ISPRS Congress on Imaging Today, Foreseeing Tomorrow, Commission IV ; Conference date: 06-06-2022 Through 11-06-2022",

year = "2022",

month = jun,

day = "1",

doi = "10.5194/isprs-archives-XLIII-B4-2022-287-2022",

language = "English",

volume = "43",

pages = "287--293",

number = "B4-2022",

}

Download

TY - JOUR

T1 - Traffic Control Recognition with an Attention Mechanism Using Speed-Profile and Satellite Imagery data

AU - Cheng, Hao

AU - Lei, Haoran

AU - Zourlidou, Stefania

AU - Sester, Monika

N1 - Funding Information: This work is supported by the German Research Foundation (DFG) via the project GRK2159 i.c.sens.

PY - 2022/6/1

Y1 - 2022/6/1

N2 - Traffic regulators at intersections act as an essential factor that influences traffic flow and, subsequently, the route choices of commuters. A digital map that provides up-to-date traffic control information is beneficial not only for facilitating the commuters' trips, but also for energy-saving and environmental protection. In this paper, instead of using expensive surveying methods, we propose an automatic way based on a Conditional Variational Autoencoder (CVAE) to recognize traffic regulators, i. e., arm rules at intersections, by leveraging the GPS data collected from vehicles and the satellite imagery retrieved from digital maps, i. e., Google Maps. We apply a Long Short-Term Memory to extract the motion dynamics over a GPS sequence traversed through the intersection. Simultaneously, we build a Convolutional Neural Network (CNN) to extract the grid-based local imagery information associated with each step of the GPS positions. Moreover, a self-attention mechanism is adopted to extract the spatial and temporal features over both the GPS and grid sequences. The extracted temporal and spatial features are then combined for detecting the traffic arm rules. To analyze the performance of our method, we tested it on a GPS dataset collected by driving vehicles in Hannover, a medium-sized German city. Compared to a Random Forest model and an Encoder-Decoder model, our proposed model achieved better results with both accuracy and F1-score of 0.90 for the three-class (arm rules of uncontrolled, traffic light, and priority sign) task. We also carried out ablation studies to further investigate the effectiveness of the GPS input branch, the image input branch, and the self-attention mechanism in our model.

AB - Traffic regulators at intersections act as an essential factor that influences traffic flow and, subsequently, the route choices of commuters. A digital map that provides up-to-date traffic control information is beneficial not only for facilitating the commuters' trips, but also for energy-saving and environmental protection. In this paper, instead of using expensive surveying methods, we propose an automatic way based on a Conditional Variational Autoencoder (CVAE) to recognize traffic regulators, i. e., arm rules at intersections, by leveraging the GPS data collected from vehicles and the satellite imagery retrieved from digital maps, i. e., Google Maps. We apply a Long Short-Term Memory to extract the motion dynamics over a GPS sequence traversed through the intersection. Simultaneously, we build a Convolutional Neural Network (CNN) to extract the grid-based local imagery information associated with each step of the GPS positions. Moreover, a self-attention mechanism is adopted to extract the spatial and temporal features over both the GPS and grid sequences. The extracted temporal and spatial features are then combined for detecting the traffic arm rules. To analyze the performance of our method, we tested it on a GPS dataset collected by driving vehicles in Hannover, a medium-sized German city. Compared to a Random Forest model and an Encoder-Decoder model, our proposed model achieved better results with both accuracy and F1-score of 0.90 for the three-class (arm rules of uncontrolled, traffic light, and priority sign) task. We also carried out ablation studies to further investigate the effectiveness of the GPS input branch, the image input branch, and the self-attention mechanism in our model.

KW - Attention Mechanism

KW - Classification

KW - Deep Learning

KW - Generative Model

KW - Traffic Regulation

UR - http://www.scopus.com/inward/record.url?scp=85132154897&partnerID=8YFLogxK

U2 - 10.5194/isprs-archives-XLIII-B4-2022-287-2022

DO - 10.5194/isprs-archives-XLIII-B4-2022-287-2022

M3 - Conference article

AN - SCOPUS:85132154897

VL - 43

SP - 287

EP - 293

JO - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives

JF - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives

SN - 1682-1750

IS - B4-2022

T2 - 2022 24th ISPRS Congress on Imaging Today, Foreseeing Tomorrow, Commission IV

Y2 - 6 June 2022 through 11 June 2022

ER -


By the same authors

Integrated Multi-Stereo Camera System for Robust Indoor Localization with Temporal Fusion

Investigating Effects of Future Path Visualisation on Path Choices During Collision Encounters

3D Uncertain Implicit Surface Mapping Using GMM and GP

Visualising Collision Spot Uncertainty with Augmented Reality

StreamLTS: Query-Based Temporal-Spatial LiDAR Fusion for Cooperative Object Detection
