A Novel Chaining-Based Indirect Addressing Mode in a Vertical Vector Processor

Sven Gesper; Daniel Köhler; Gia Bao Thieu; Jasper Homann; Frank Meinl; Holger Blume; Guillermo Payá-Vayá

doi:10.1007/978-3-031-78377-7_12

Details

Original language	English
Title of host publication	Embedded Computer Systems
Subtitle of host publication	Architectures, Modeling, and Simulation - 24th International Conference, SAMOS 2024, Proceedings
Editors	Luigi Carro, Francesco Regazzoni, Christian Pilato
Publisher	Springer Science and Business Media Deutschland GmbH
Pages	167-182
Number of pages	16
ISBN (electronic)	978-3-031-78377-7
ISBN (print)	9783031783760
Publication status	Published - 28 Jan 2025
Event	24th International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation, SAMOS 2024 - Samos, Greece Duration: 29 Jun 2024 → 4 Jul 2024

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	15226 LNCS
ISSN (Print)	0302-9743
ISSN (electronic)	1611-3349

Abstract

Efficient processing architectures for irregular data patterns require vector element addressing with flexible indices. Therefore, state-of-the-art SIMD vector extensions implement gather and scatter instructions for indexed addressing of data in memory. In vertical vector processors, different data is processed sequentially in parallel lanes and can be exchanged via chaining. This paper proposes an extension of such chaining mechanisms in a vertical vector processor architecture (V2PRO) to flexibly chain not only data but also address offsets between vector lanes. The indirect addressing enables vector access patterns with irregular strides for both register file and memory. The extension has a low hardware overhead of +4.8 % lookup tables and +1.8% registers on a Xilinx Ultrascale+ FPGA. A runtime evaluation for two applications from computer vision, namely Deformable Convolutions and point cloud encoding with PointPillars, demonstrates speedups of at least an order of magnitude with the proposed extension.

Keywords

Computer Vision, Indirect Addressing Mode, Radar Object Detection, Vector Processor Architecture

ASJC Scopus subject areas

Mathematics(all)
Theoretical Computer Science
Computer Science(all)
General Computer Science

Cite this

A Novel Chaining-Based Indirect Addressing Mode in a Vertical Vector Processor. / Gesper, Sven; Köhler, Daniel; Thieu, Gia Bao et al.
Embedded Computer Systems: Architectures, Modeling, and Simulation - 24th International Conference, SAMOS 2024, Proceedings. ed. / Luigi Carro; Francesco Regazzoni; Christian Pilato. Springer Science and Business Media Deutschland GmbH, 2025. p. 167-182 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 15226 LNCS).

Research output: Chapter in book/report/conference proceeding › Conference contribution › Research › peer review

Gesper, S, Köhler, D, Thieu, GB, Homann, J, Meinl, F, Blume, H & Payá-Vayá, G 2025, A Novel Chaining-Based Indirect Addressing Mode in a Vertical Vector Processor. in L Carro, F Regazzoni & C Pilato (eds), Embedded Computer Systems: Architectures, Modeling, and Simulation - 24th International Conference, SAMOS 2024, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 15226 LNCS, Springer Science and Business Media Deutschland GmbH, pp. 167-182, 24th International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation, SAMOS 2024, Samos, Greece, 29 Jun 2024. https://doi.org/10.1007/978-3-031-78377-7_12

Gesper, S., Köhler, D., Thieu, G. B., Homann, J., Meinl, F., Blume, H., & Payá-Vayá, G. (2025). A Novel Chaining-Based Indirect Addressing Mode in a Vertical Vector Processor. In L. Carro, F. Regazzoni, & C. Pilato (Eds.), Embedded Computer Systems: Architectures, Modeling, and Simulation - 24th International Conference, SAMOS 2024, Proceedings (pp. 167-182). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 15226 LNCS). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-78377-7_12

Gesper S, Köhler D, Thieu GB, Homann J, Meinl F, Blume H et al. A Novel Chaining-Based Indirect Addressing Mode in a Vertical Vector Processor. In Carro L, Regazzoni F, Pilato C, editors, Embedded Computer Systems: Architectures, Modeling, and Simulation - 24th International Conference, SAMOS 2024, Proceedings. Springer Science and Business Media Deutschland GmbH. 2025. p. 167-182. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-031-78377-7_12

Gesper, Sven ; Köhler, Daniel ; Thieu, Gia Bao et al. / A Novel Chaining-Based Indirect Addressing Mode in a Vertical Vector Processor. Embedded Computer Systems: Architectures, Modeling, and Simulation - 24th International Conference, SAMOS 2024, Proceedings. editor / Luigi Carro ; Francesco Regazzoni ; Christian Pilato. Springer Science and Business Media Deutschland GmbH, 2025. pp. 167-182 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

Download

@inproceedings{3ab21c68795d443992fdf1b01243e9fd,

title = "A Novel Chaining-Based Indirect Addressing Mode in a Vertical Vector Processor",

abstract = "Efficient processing architectures for irregular data patterns require vector element addressing with flexible indices. Therefore, state-of-the-art SIMD vector extensions implement gather and scatter instructions for indexed addressing of data in memory. In vertical vector processors, different data is processed sequentially in parallel lanes and can be exchanged via chaining. This paper proposes an extension of such chaining mechanisms in a vertical vector processor architecture (V2PRO) to flexibly chain not only data but also address offsets between vector lanes. The indirect addressing enables vector access patterns with irregular strides for both register file and memory. The extension has a low hardware overhead of +4.8 % lookup tables and +1.8% registers on a Xilinx Ultrascale+ FPGA. A runtime evaluation for two applications from computer vision, namely Deformable Convolutions and point cloud encoding with PointPillars, demonstrates speedups of at least an order of magnitude with the proposed extension.",

keywords = "Computer Vision, Indirect Addressing Mode, Radar Object Detection, Vector Processor Architecture",

author = "Sven Gesper and Daniel K{\"o}hler and Thieu, {Gia Bao} and Jasper Homann and Frank Meinl and Holger Blume and Guillermo Pay{\'a}-Vay{\'a}",

note = "Publisher Copyright: {\textcopyright} The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.; 24th International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation, SAMOS 2024 ; Conference date: 29-06-2024 Through 04-07-2024",

year = "2025",

month = jan,

day = "28",

doi = "10.1007/978-3-031-78377-7_12",

language = "English",

isbn = "9783031783760",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

publisher = "Springer Science and Business Media Deutschland GmbH",

pages = "167--182",

editor = "Luigi Carro and Francesco Regazzoni and Christian Pilato",

booktitle = "Embedded Computer Systems",

address = "Germany",

}

Download

TY - GEN

T1 - A Novel Chaining-Based Indirect Addressing Mode in a Vertical Vector Processor

AU - Gesper, Sven

AU - Köhler, Daniel

AU - Thieu, Gia Bao

AU - Homann, Jasper

AU - Meinl, Frank

AU - Blume, Holger

AU - Payá-Vayá, Guillermo

N1 - Publisher Copyright: © The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.

PY - 2025/1/28

Y1 - 2025/1/28

N2 - Efficient processing architectures for irregular data patterns require vector element addressing with flexible indices. Therefore, state-of-the-art SIMD vector extensions implement gather and scatter instructions for indexed addressing of data in memory. In vertical vector processors, different data is processed sequentially in parallel lanes and can be exchanged via chaining. This paper proposes an extension of such chaining mechanisms in a vertical vector processor architecture (V2PRO) to flexibly chain not only data but also address offsets between vector lanes. The indirect addressing enables vector access patterns with irregular strides for both register file and memory. The extension has a low hardware overhead of +4.8 % lookup tables and +1.8% registers on a Xilinx Ultrascale+ FPGA. A runtime evaluation for two applications from computer vision, namely Deformable Convolutions and point cloud encoding with PointPillars, demonstrates speedups of at least an order of magnitude with the proposed extension.

AB - Efficient processing architectures for irregular data patterns require vector element addressing with flexible indices. Therefore, state-of-the-art SIMD vector extensions implement gather and scatter instructions for indexed addressing of data in memory. In vertical vector processors, different data is processed sequentially in parallel lanes and can be exchanged via chaining. This paper proposes an extension of such chaining mechanisms in a vertical vector processor architecture (V2PRO) to flexibly chain not only data but also address offsets between vector lanes. The indirect addressing enables vector access patterns with irregular strides for both register file and memory. The extension has a low hardware overhead of +4.8 % lookup tables and +1.8% registers on a Xilinx Ultrascale+ FPGA. A runtime evaluation for two applications from computer vision, namely Deformable Convolutions and point cloud encoding with PointPillars, demonstrates speedups of at least an order of magnitude with the proposed extension.

KW - Computer Vision

KW - Indirect Addressing Mode

KW - Radar Object Detection

KW - Vector Processor Architecture

UR - http://www.scopus.com/inward/record.url?scp=85218467582&partnerID=8YFLogxK

U2 - 10.1007/978-3-031-78377-7_12

DO - 10.1007/978-3-031-78377-7_12

M3 - Conference contribution

AN - SCOPUS:85218467582

SN - 9783031783760

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 167

EP - 182

BT - Embedded Computer Systems

A2 - Carro, Luigi

A2 - Regazzoni, Francesco

A2 - Pilato, Christian

PB - Springer Science and Business Media Deutschland GmbH

T2 - 24th International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation, SAMOS 2024

Y2 - 29 June 2024 through 4 July 2024

ER -

Research@Leibniz University

A Novel Chaining-Based Indirect Addressing Mode in a Vertical Vector Processor

Authors

Research Organisations

External Research Organisations

Details

Publication series

Abstract

Keywords

ASJC Scopus subject areas

Cite this

By the same author(s)

Towards real-time LiDAR processing on RISC-V-based ASIPs: fast trigonometric approximations via parabolic synthesis

Hardware und Software vereint: Innovation durch Co-Design

RRNS Arith Lib – An Open-Source Redundant Residue Number System Arithmetic VHDL Library

Fiber deviation and optimized toolpath strategies in melt electrowriting of tubular scaffolds

SmartHeaP- A High-level Programmable and Customized Hearing Aid System on Chip Integrated in a Research Hearing Aid Prototype

Towards real-time LiDAR processing on RISC-V-based ASIPs: fast trigonometric approximations via parabolic synthesis

Hardware und Software vereint: Innovation durch Co-Design

RRNS Arith Lib – An Open-Source Redundant Residue Number System Arithmetic VHDL Library

Fiber deviation and optimized toolpath strategies in melt electrowriting of tubular scaffolds

SmartHeaP- A High-level Programmable and Customized Hearing Aid System on Chip Integrated in a Research Hearing Aid Prototype

Towards real-time LiDAR processing on RISC-V-based ASIPs: fast trigonometric approximations via parabolic synthesis