A Novel Chaining-Based Indirect Addressing Mode in a Vertical Vector Processor

Sven Gesper; Daniel Köhler; Gia Bao Thieu; Jasper Homann; Frank Meinl; Holger Blume; Guillermo Payá-Vayá

doi:10.1007/978-3-031-78377-7_12

Details

Originalsprache	Englisch
Titel des Sammelwerks	Embedded Computer Systems
Untertitel	Architectures, Modeling, and Simulation - 24th International Conference, SAMOS 2024, Proceedings
Herausgeber/-innen	Luigi Carro, Francesco Regazzoni, Christian Pilato
Herausgeber (Verlag)	Springer Science and Business Media Deutschland GmbH
Seiten	167-182
Seitenumfang	16
ISBN (elektronisch)	978-3-031-78377-7
ISBN (Print)	9783031783760
Publikationsstatus	Veröffentlicht - 28 Jan. 2025
Veranstaltung	24th International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation, SAMOS 2024 - Samos, Griechenland Dauer: 29 Juni 2024 → 4 Juli 2024

Publikationsreihe

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Band	15226 LNCS
ISSN (Print)	0302-9743
ISSN (elektronisch)	1611-3349

Abstract

Efficient processing architectures for irregular data patterns require vector element addressing with flexible indices. Therefore, state-of-the-art SIMD vector extensions implement gather and scatter instructions for indexed addressing of data in memory. In vertical vector processors, different data is processed sequentially in parallel lanes and can be exchanged via chaining. This paper proposes an extension of such chaining mechanisms in a vertical vector processor architecture (V2PRO) to flexibly chain not only data but also address offsets between vector lanes. The indirect addressing enables vector access patterns with irregular strides for both register file and memory. The extension has a low hardware overhead of +4.8 % lookup tables and +1.8% registers on a Xilinx Ultrascale+ FPGA. A runtime evaluation for two applications from computer vision, namely Deformable Convolutions and point cloud encoding with PointPillars, demonstrates speedups of at least an order of magnitude with the proposed extension.

ASJC Scopus Sachgebiete

Mathematik (insg.)
Theoretische Informatik
Informatik (insg.)
Allgemeine Computerwissenschaft

Zitieren

A Novel Chaining-Based Indirect Addressing Mode in a Vertical Vector Processor. / Gesper, Sven; Köhler, Daniel; Thieu, Gia Bao et al.
Embedded Computer Systems: Architectures, Modeling, and Simulation - 24th International Conference, SAMOS 2024, Proceedings. Hrsg. / Luigi Carro; Francesco Regazzoni; Christian Pilato. Springer Science and Business Media Deutschland GmbH, 2025. S. 167-182 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Band 15226 LNCS).

Publikation: Beitrag in Buch/Bericht/Sammelwerk/Konferenzband › Aufsatz in Konferenzband › Forschung › Peer-Review

Gesper, S, Köhler, D, Thieu, GB, Homann, J, Meinl, F, Blume, H & Payá-Vayá, G 2025, A Novel Chaining-Based Indirect Addressing Mode in a Vertical Vector Processor. in L Carro, F Regazzoni & C Pilato (Hrsg.), Embedded Computer Systems: Architectures, Modeling, and Simulation - 24th International Conference, SAMOS 2024, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Bd. 15226 LNCS, Springer Science and Business Media Deutschland GmbH, S. 167-182, 24th International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation, SAMOS 2024, Samos, Griechenland, 29 Juni 2024. https://doi.org/10.1007/978-3-031-78377-7_12

Gesper, S., Köhler, D., Thieu, G. B., Homann, J., Meinl, F., Blume, H., & Payá-Vayá, G. (2025). A Novel Chaining-Based Indirect Addressing Mode in a Vertical Vector Processor. In L. Carro, F. Regazzoni, & C. Pilato (Hrsg.), Embedded Computer Systems: Architectures, Modeling, and Simulation - 24th International Conference, SAMOS 2024, Proceedings (S. 167-182). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Band 15226 LNCS). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-78377-7_12

Gesper S, Köhler D, Thieu GB, Homann J, Meinl F, Blume H et al. A Novel Chaining-Based Indirect Addressing Mode in a Vertical Vector Processor. in Carro L, Regazzoni F, Pilato C, Hrsg., Embedded Computer Systems: Architectures, Modeling, and Simulation - 24th International Conference, SAMOS 2024, Proceedings. Springer Science and Business Media Deutschland GmbH. 2025. S. 167-182. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-031-78377-7_12

Gesper, Sven ; Köhler, Daniel ; Thieu, Gia Bao et al. / A Novel Chaining-Based Indirect Addressing Mode in a Vertical Vector Processor. Embedded Computer Systems: Architectures, Modeling, and Simulation - 24th International Conference, SAMOS 2024, Proceedings. Hrsg. / Luigi Carro ; Francesco Regazzoni ; Christian Pilato. Springer Science and Business Media Deutschland GmbH, 2025. S. 167-182 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

Download

@inproceedings{3ab21c68795d443992fdf1b01243e9fd,

title = "A Novel Chaining-Based Indirect Addressing Mode in a Vertical Vector Processor",

abstract = "Efficient processing architectures for irregular data patterns require vector element addressing with flexible indices. Therefore, state-of-the-art SIMD vector extensions implement gather and scatter instructions for indexed addressing of data in memory. In vertical vector processors, different data is processed sequentially in parallel lanes and can be exchanged via chaining. This paper proposes an extension of such chaining mechanisms in a vertical vector processor architecture (V2PRO) to flexibly chain not only data but also address offsets between vector lanes. The indirect addressing enables vector access patterns with irregular strides for both register file and memory. The extension has a low hardware overhead of +4.8 % lookup tables and +1.8% registers on a Xilinx Ultrascale+ FPGA. A runtime evaluation for two applications from computer vision, namely Deformable Convolutions and point cloud encoding with PointPillars, demonstrates speedups of at least an order of magnitude with the proposed extension.",

keywords = "Computer Vision, Indirect Addressing Mode, Radar Object Detection, Vector Processor Architecture",

author = "Sven Gesper and Daniel K{\"o}hler and Thieu, {Gia Bao} and Jasper Homann and Frank Meinl and Holger Blume and Guillermo Pay{\'a}-Vay{\'a}",

note = "Publisher Copyright: {\textcopyright} The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.; 24th International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation, SAMOS 2024 ; Conference date: 29-06-2024 Through 04-07-2024",

year = "2025",

month = jan,

day = "28",

doi = "10.1007/978-3-031-78377-7_12",

language = "English",

isbn = "9783031783760",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

publisher = "Springer Science and Business Media Deutschland GmbH",

pages = "167--182",

editor = "Luigi Carro and Francesco Regazzoni and Christian Pilato",

booktitle = "Embedded Computer Systems",

address = "Germany",

}

Download

TY - GEN

T1 - A Novel Chaining-Based Indirect Addressing Mode in a Vertical Vector Processor

AU - Gesper, Sven

AU - Köhler, Daniel

AU - Thieu, Gia Bao

AU - Homann, Jasper

AU - Meinl, Frank

AU - Blume, Holger

AU - Payá-Vayá, Guillermo

N1 - Publisher Copyright: © The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.

PY - 2025/1/28

Y1 - 2025/1/28

N2 - Efficient processing architectures for irregular data patterns require vector element addressing with flexible indices. Therefore, state-of-the-art SIMD vector extensions implement gather and scatter instructions for indexed addressing of data in memory. In vertical vector processors, different data is processed sequentially in parallel lanes and can be exchanged via chaining. This paper proposes an extension of such chaining mechanisms in a vertical vector processor architecture (V2PRO) to flexibly chain not only data but also address offsets between vector lanes. The indirect addressing enables vector access patterns with irregular strides for both register file and memory. The extension has a low hardware overhead of +4.8 % lookup tables and +1.8% registers on a Xilinx Ultrascale+ FPGA. A runtime evaluation for two applications from computer vision, namely Deformable Convolutions and point cloud encoding with PointPillars, demonstrates speedups of at least an order of magnitude with the proposed extension.

AB - Efficient processing architectures for irregular data patterns require vector element addressing with flexible indices. Therefore, state-of-the-art SIMD vector extensions implement gather and scatter instructions for indexed addressing of data in memory. In vertical vector processors, different data is processed sequentially in parallel lanes and can be exchanged via chaining. This paper proposes an extension of such chaining mechanisms in a vertical vector processor architecture (V2PRO) to flexibly chain not only data but also address offsets between vector lanes. The indirect addressing enables vector access patterns with irregular strides for both register file and memory. The extension has a low hardware overhead of +4.8 % lookup tables and +1.8% registers on a Xilinx Ultrascale+ FPGA. A runtime evaluation for two applications from computer vision, namely Deformable Convolutions and point cloud encoding with PointPillars, demonstrates speedups of at least an order of magnitude with the proposed extension.

KW - Computer Vision

KW - Indirect Addressing Mode

KW - Radar Object Detection

KW - Vector Processor Architecture

UR - http://www.scopus.com/inward/record.url?scp=85218467582&partnerID=8YFLogxK

U2 - 10.1007/978-3-031-78377-7_12

DO - 10.1007/978-3-031-78377-7_12

M3 - Conference contribution

AN - SCOPUS:85218467582

SN - 9783031783760

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 167

EP - 182

BT - Embedded Computer Systems

A2 - Carro, Luigi

A2 - Regazzoni, Francesco

A2 - Pilato, Christian

PB - Springer Science and Business Media Deutschland GmbH

T2 - 24th International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation, SAMOS 2024

Y2 - 29 June 2024 through 4 July 2024

ER -

Research@Leibniz University

A Novel Chaining-Based Indirect Addressing Mode in a Vertical Vector Processor

Autorschaft

Organisationseinheiten

Externe Organisationen

Details

Publikationsreihe

Abstract

ASJC Scopus Sachgebiete

Zitieren

Von denselben Autoren

Towards real-time LiDAR processing on RISC-V-based ASIPs: fast trigonometric approximations via parabolic synthesis

Hardware und Software vereint: Innovation durch Co-Design

RRNS Arith Lib – An Open-Source Redundant Residue Number System Arithmetic VHDL Library

Fiber deviation and optimized toolpath strategies in melt electrowriting of tubular scaffolds

SmartHeaP- A High-level Programmable and Customized Hearing Aid System on Chip Integrated in a Research Hearing Aid Prototype

Towards real-time LiDAR processing on RISC-V-based ASIPs: fast trigonometric approximations via parabolic synthesis

Hardware und Software vereint: Innovation durch Co-Design

RRNS Arith Lib – An Open-Source Redundant Residue Number System Arithmetic VHDL Library

Fiber deviation and optimized toolpath strategies in melt electrowriting of tubular scaffolds

SmartHeaP- A High-level Programmable and Customized Hearing Aid System on Chip Integrated in a Research Hearing Aid Prototype

Towards real-time LiDAR processing on RISC-V-based ASIPs: fast trigonometric approximations via parabolic synthesis