Design Space Exploration of Semantic Segmentation CNN SalsaNext for Constrained Architectures

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

View graph of relations

Details

Original languageEnglish
Title of host publicationProceedings - 2024 IEEE 35th International Conference on Application-Specific Systems, Architectures and Processors, ASAP 2024
Pages28-29
Number of pages2
ISBN (electronic)979-8-3503-4963-4
Publication statusPublished - 2024

Publication series

NameIEEE International Conference on Application-Specific Systems, Architectures, and Processors
ISSN (Print)2160-0511
ISSN (electronic)2160-052X

Abstract

The growing use of LiDAR systems and constrained computing resources in the automotive sector require efficient LiDAR processing. SalsaNext, a convolutional neural network for semantic segmentation, is a promising candidate for deployment in that area. To extend the research regarding its quantization and investigate its adaptability to constrained resources, a design space exploration is performed. The design space, defined by model size, topology, and compute precision, is evaluated on a Jetson AGX Orin regarding classification accuracy, latency, and energy efficiency. The results display a trade-off between classification accuracy and runtime. The smallest model evaluated in INT8 on the GPU provides the smallest latency of 14.48 ms with a mloU score of 43.2%. A mloU score of 47.7% at a latency of 26.92 ms can be achieved with the medium-sized model and modified topology evaluated in INT8 on the DLA. The medium-sized model with modified topology provides good classification accuracy evaluated in FP32 on the GPU with a mloU score of 55.2% in 67.85 ms.

Keywords

    CNN Optimization, CNN Quantization, Design Space Exploration, SalsaNext, Semantic Segmentation

ASJC Scopus subject areas

Sustainable Development Goals

Cite this

Design Space Exploration of Semantic Segmentation CNN SalsaNext for Constrained Architectures. / Renke, Oliver; Riggers, Christoph; Karrenbauer, Jens et al.
Proceedings - 2024 IEEE 35th International Conference on Application-Specific Systems, Architectures and Processors, ASAP 2024. 2024. p. 28-29 (IEEE International Conference on Application-Specific Systems, Architectures, and Processors).

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Renke, O, Riggers, C, Karrenbauer, J & Blume, H 2024, Design Space Exploration of Semantic Segmentation CNN SalsaNext for Constrained Architectures. in Proceedings - 2024 IEEE 35th International Conference on Application-Specific Systems, Architectures and Processors, ASAP 2024. IEEE International Conference on Application-Specific Systems, Architectures, and Processors, pp. 28-29. https://doi.org/10.1109/asap61560.2024.00016
Renke, O., Riggers, C., Karrenbauer, J., & Blume, H. (2024). Design Space Exploration of Semantic Segmentation CNN SalsaNext for Constrained Architectures. In Proceedings - 2024 IEEE 35th International Conference on Application-Specific Systems, Architectures and Processors, ASAP 2024 (pp. 28-29). (IEEE International Conference on Application-Specific Systems, Architectures, and Processors). https://doi.org/10.1109/asap61560.2024.00016
Renke O, Riggers C, Karrenbauer J, Blume H. Design Space Exploration of Semantic Segmentation CNN SalsaNext for Constrained Architectures. In Proceedings - 2024 IEEE 35th International Conference on Application-Specific Systems, Architectures and Processors, ASAP 2024. 2024. p. 28-29. (IEEE International Conference on Application-Specific Systems, Architectures, and Processors). doi: 10.1109/asap61560.2024.00016
Renke, Oliver ; Riggers, Christoph ; Karrenbauer, Jens et al. / Design Space Exploration of Semantic Segmentation CNN SalsaNext for Constrained Architectures. Proceedings - 2024 IEEE 35th International Conference on Application-Specific Systems, Architectures and Processors, ASAP 2024. 2024. pp. 28-29 (IEEE International Conference on Application-Specific Systems, Architectures, and Processors).
Download
@inproceedings{bd329bb3c7b643b3aa62454a4bec48ca,
title = "Design Space Exploration of Semantic Segmentation CNN SalsaNext for Constrained Architectures",
abstract = "The growing use of LiDAR systems and constrained computing resources in the automotive sector require efficient LiDAR processing. SalsaNext, a convolutional neural network for semantic segmentation, is a promising candidate for deployment in that area. To extend the research regarding its quantization and investigate its adaptability to constrained resources, a design space exploration is performed. The design space, defined by model size, topology, and compute precision, is evaluated on a Jetson AGX Orin regarding classification accuracy, latency, and energy efficiency. The results display a trade-off between classification accuracy and runtime. The smallest model evaluated in INT8 on the GPU provides the smallest latency of 14.48 ms with a mloU score of 43.2%. A mloU score of 47.7% at a latency of 26.92 ms can be achieved with the medium-sized model and modified topology evaluated in INT8 on the DLA. The medium-sized model with modified topology provides good classification accuracy evaluated in FP32 on the GPU with a mloU score of 55.2% in 67.85 ms.",
keywords = "CNN Optimization, CNN Quantization, Design Space Exploration, SalsaNext, Semantic Segmentation",
author = "Oliver Renke and Christoph Riggers and Jens Karrenbauer and Holger Blume",
note = "Publisher Copyright: {\textcopyright} 2024 IEEE.",
year = "2024",
doi = "10.1109/asap61560.2024.00016",
language = "English",
isbn = "979-8-3503-4964-1",
series = "IEEE International Conference on Application-Specific Systems, Architectures, and Processors",
pages = "28--29",
booktitle = "Proceedings - 2024 IEEE 35th International Conference on Application-Specific Systems, Architectures and Processors, ASAP 2024",

}

Download

TY - GEN

T1 - Design Space Exploration of Semantic Segmentation CNN SalsaNext for Constrained Architectures

AU - Renke, Oliver

AU - Riggers, Christoph

AU - Karrenbauer, Jens

AU - Blume, Holger

N1 - Publisher Copyright: © 2024 IEEE.

PY - 2024

Y1 - 2024

N2 - The growing use of LiDAR systems and constrained computing resources in the automotive sector require efficient LiDAR processing. SalsaNext, a convolutional neural network for semantic segmentation, is a promising candidate for deployment in that area. To extend the research regarding its quantization and investigate its adaptability to constrained resources, a design space exploration is performed. The design space, defined by model size, topology, and compute precision, is evaluated on a Jetson AGX Orin regarding classification accuracy, latency, and energy efficiency. The results display a trade-off between classification accuracy and runtime. The smallest model evaluated in INT8 on the GPU provides the smallest latency of 14.48 ms with a mloU score of 43.2%. A mloU score of 47.7% at a latency of 26.92 ms can be achieved with the medium-sized model and modified topology evaluated in INT8 on the DLA. The medium-sized model with modified topology provides good classification accuracy evaluated in FP32 on the GPU with a mloU score of 55.2% in 67.85 ms.

AB - The growing use of LiDAR systems and constrained computing resources in the automotive sector require efficient LiDAR processing. SalsaNext, a convolutional neural network for semantic segmentation, is a promising candidate for deployment in that area. To extend the research regarding its quantization and investigate its adaptability to constrained resources, a design space exploration is performed. The design space, defined by model size, topology, and compute precision, is evaluated on a Jetson AGX Orin regarding classification accuracy, latency, and energy efficiency. The results display a trade-off between classification accuracy and runtime. The smallest model evaluated in INT8 on the GPU provides the smallest latency of 14.48 ms with a mloU score of 43.2%. A mloU score of 47.7% at a latency of 26.92 ms can be achieved with the medium-sized model and modified topology evaluated in INT8 on the DLA. The medium-sized model with modified topology provides good classification accuracy evaluated in FP32 on the GPU with a mloU score of 55.2% in 67.85 ms.

KW - CNN Optimization

KW - CNN Quantization

KW - Design Space Exploration

KW - SalsaNext

KW - Semantic Segmentation

UR - http://www.scopus.com/inward/record.url?scp=85203107748&partnerID=8YFLogxK

U2 - 10.1109/asap61560.2024.00016

DO - 10.1109/asap61560.2024.00016

M3 - Conference contribution

SN - 979-8-3503-4964-1

T3 - IEEE International Conference on Application-Specific Systems, Architectures, and Processors

SP - 28

EP - 29

BT - Proceedings - 2024 IEEE 35th International Conference on Application-Specific Systems, Architectures and Processors, ASAP 2024

ER -

By the same author(s)