Loading [MathJax]/extensions/tex2jax.js

A Novel Chaining-Based Indirect Addressing Mode in a Vertical Vector Processor

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

  • Sven Gesper
  • Daniel Köhler
  • Gia Bao Thieu
  • Jasper Homann
  • Holger Blume
  • Guillermo Payá-Vayá

Research Organisations

External Research Organisations

  • Technische Universität Braunschweig
  • Robert Bosch GmbH

Details

Original languageEnglish
Title of host publicationEmbedded Computer Systems
Subtitle of host publicationArchitectures, Modeling, and Simulation - 24th International Conference, SAMOS 2024, Proceedings
EditorsLuigi Carro, Francesco Regazzoni, Christian Pilato
PublisherSpringer Science and Business Media Deutschland GmbH
Pages167-182
Number of pages16
ISBN (electronic)978-3-031-78377-7
ISBN (print)9783031783760
Publication statusPublished - 28 Jan 2025
Event24th International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation, SAMOS 2024 - Samos, Greece
Duration: 29 Jun 20244 Jul 2024

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume15226 LNCS
ISSN (Print)0302-9743
ISSN (electronic)1611-3349

Abstract

Efficient processing architectures for irregular data patterns require vector element addressing with flexible indices. Therefore, state-of-the-art SIMD vector extensions implement gather and scatter instructions for indexed addressing of data in memory. In vertical vector processors, different data is processed sequentially in parallel lanes and can be exchanged via chaining. This paper proposes an extension of such chaining mechanisms in a vertical vector processor architecture (V2PRO) to flexibly chain not only data but also address offsets between vector lanes. The indirect addressing enables vector access patterns with irregular strides for both register file and memory. The extension has a low hardware overhead of +4.8 % lookup tables and +1.8% registers on a Xilinx Ultrascale+ FPGA. A runtime evaluation for two applications from computer vision, namely Deformable Convolutions and point cloud encoding with PointPillars, demonstrates speedups of at least an order of magnitude with the proposed extension.

Keywords

    Computer Vision, Indirect Addressing Mode, Radar Object Detection, Vector Processor Architecture

ASJC Scopus subject areas

Cite this

A Novel Chaining-Based Indirect Addressing Mode in a Vertical Vector Processor. / Gesper, Sven; Köhler, Daniel; Thieu, Gia Bao et al.
Embedded Computer Systems: Architectures, Modeling, and Simulation - 24th International Conference, SAMOS 2024, Proceedings. ed. / Luigi Carro; Francesco Regazzoni; Christian Pilato. Springer Science and Business Media Deutschland GmbH, 2025. p. 167-182 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 15226 LNCS).

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Gesper, S, Köhler, D, Thieu, GB, Homann, J, Meinl, F, Blume, H & Payá-Vayá, G 2025, A Novel Chaining-Based Indirect Addressing Mode in a Vertical Vector Processor. in L Carro, F Regazzoni & C Pilato (eds), Embedded Computer Systems: Architectures, Modeling, and Simulation - 24th International Conference, SAMOS 2024, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 15226 LNCS, Springer Science and Business Media Deutschland GmbH, pp. 167-182, 24th International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation, SAMOS 2024, Samos, Greece, 29 Jun 2024. https://doi.org/10.1007/978-3-031-78377-7_12
Gesper, S., Köhler, D., Thieu, G. B., Homann, J., Meinl, F., Blume, H., & Payá-Vayá, G. (2025). A Novel Chaining-Based Indirect Addressing Mode in a Vertical Vector Processor. In L. Carro, F. Regazzoni, & C. Pilato (Eds.), Embedded Computer Systems: Architectures, Modeling, and Simulation - 24th International Conference, SAMOS 2024, Proceedings (pp. 167-182). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 15226 LNCS). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-78377-7_12
Gesper S, Köhler D, Thieu GB, Homann J, Meinl F, Blume H et al. A Novel Chaining-Based Indirect Addressing Mode in a Vertical Vector Processor. In Carro L, Regazzoni F, Pilato C, editors, Embedded Computer Systems: Architectures, Modeling, and Simulation - 24th International Conference, SAMOS 2024, Proceedings. Springer Science and Business Media Deutschland GmbH. 2025. p. 167-182. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-031-78377-7_12
Gesper, Sven ; Köhler, Daniel ; Thieu, Gia Bao et al. / A Novel Chaining-Based Indirect Addressing Mode in a Vertical Vector Processor. Embedded Computer Systems: Architectures, Modeling, and Simulation - 24th International Conference, SAMOS 2024, Proceedings. editor / Luigi Carro ; Francesco Regazzoni ; Christian Pilato. Springer Science and Business Media Deutschland GmbH, 2025. pp. 167-182 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
Download
@inproceedings{3ab21c68795d443992fdf1b01243e9fd,
title = "A Novel Chaining-Based Indirect Addressing Mode in a Vertical Vector Processor",
abstract = "Efficient processing architectures for irregular data patterns require vector element addressing with flexible indices. Therefore, state-of-the-art SIMD vector extensions implement gather and scatter instructions for indexed addressing of data in memory. In vertical vector processors, different data is processed sequentially in parallel lanes and can be exchanged via chaining. This paper proposes an extension of such chaining mechanisms in a vertical vector processor architecture (V2PRO) to flexibly chain not only data but also address offsets between vector lanes. The indirect addressing enables vector access patterns with irregular strides for both register file and memory. The extension has a low hardware overhead of +4.8 % lookup tables and +1.8% registers on a Xilinx Ultrascale+ FPGA. A runtime evaluation for two applications from computer vision, namely Deformable Convolutions and point cloud encoding with PointPillars, demonstrates speedups of at least an order of magnitude with the proposed extension.",
keywords = "Computer Vision, Indirect Addressing Mode, Radar Object Detection, Vector Processor Architecture",
author = "Sven Gesper and Daniel K{\"o}hler and Thieu, {Gia Bao} and Jasper Homann and Frank Meinl and Holger Blume and Guillermo Pay{\'a}-Vay{\'a}",
note = "Publisher Copyright: {\textcopyright} The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.; 24th International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation, SAMOS 2024 ; Conference date: 29-06-2024 Through 04-07-2024",
year = "2025",
month = jan,
day = "28",
doi = "10.1007/978-3-031-78377-7_12",
language = "English",
isbn = "9783031783760",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer Science and Business Media Deutschland GmbH",
pages = "167--182",
editor = "Luigi Carro and Francesco Regazzoni and Christian Pilato",
booktitle = "Embedded Computer Systems",
address = "Germany",

}

Download

TY - GEN

T1 - A Novel Chaining-Based Indirect Addressing Mode in a Vertical Vector Processor

AU - Gesper, Sven

AU - Köhler, Daniel

AU - Thieu, Gia Bao

AU - Homann, Jasper

AU - Meinl, Frank

AU - Blume, Holger

AU - Payá-Vayá, Guillermo

N1 - Publisher Copyright: © The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.

PY - 2025/1/28

Y1 - 2025/1/28

N2 - Efficient processing architectures for irregular data patterns require vector element addressing with flexible indices. Therefore, state-of-the-art SIMD vector extensions implement gather and scatter instructions for indexed addressing of data in memory. In vertical vector processors, different data is processed sequentially in parallel lanes and can be exchanged via chaining. This paper proposes an extension of such chaining mechanisms in a vertical vector processor architecture (V2PRO) to flexibly chain not only data but also address offsets between vector lanes. The indirect addressing enables vector access patterns with irregular strides for both register file and memory. The extension has a low hardware overhead of +4.8 % lookup tables and +1.8% registers on a Xilinx Ultrascale+ FPGA. A runtime evaluation for two applications from computer vision, namely Deformable Convolutions and point cloud encoding with PointPillars, demonstrates speedups of at least an order of magnitude with the proposed extension.

AB - Efficient processing architectures for irregular data patterns require vector element addressing with flexible indices. Therefore, state-of-the-art SIMD vector extensions implement gather and scatter instructions for indexed addressing of data in memory. In vertical vector processors, different data is processed sequentially in parallel lanes and can be exchanged via chaining. This paper proposes an extension of such chaining mechanisms in a vertical vector processor architecture (V2PRO) to flexibly chain not only data but also address offsets between vector lanes. The indirect addressing enables vector access patterns with irregular strides for both register file and memory. The extension has a low hardware overhead of +4.8 % lookup tables and +1.8% registers on a Xilinx Ultrascale+ FPGA. A runtime evaluation for two applications from computer vision, namely Deformable Convolutions and point cloud encoding with PointPillars, demonstrates speedups of at least an order of magnitude with the proposed extension.

KW - Computer Vision

KW - Indirect Addressing Mode

KW - Radar Object Detection

KW - Vector Processor Architecture

UR - http://www.scopus.com/inward/record.url?scp=85218467582&partnerID=8YFLogxK

U2 - 10.1007/978-3-031-78377-7_12

DO - 10.1007/978-3-031-78377-7_12

M3 - Conference contribution

AN - SCOPUS:85218467582

SN - 9783031783760

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 167

EP - 182

BT - Embedded Computer Systems

A2 - Carro, Luigi

A2 - Regazzoni, Francesco

A2 - Pilato, Christian

PB - Springer Science and Business Media Deutschland GmbH

T2 - 24th International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation, SAMOS 2024

Y2 - 29 June 2024 through 4 July 2024

ER -

By the same author(s)