Details
Originalsprache | Englisch |
---|---|
Titel des Sammelwerks | Embedded Computer Systems |
Untertitel | Architectures, Modeling, and Simulation - 24th International Conference, SAMOS 2024, Proceedings |
Herausgeber/-innen | Luigi Carro, Francesco Regazzoni, Christian Pilato |
Herausgeber (Verlag) | Springer Science and Business Media Deutschland GmbH |
Seiten | 167-182 |
Seitenumfang | 16 |
ISBN (elektronisch) | 978-3-031-78377-7 |
ISBN (Print) | 9783031783760 |
Publikationsstatus | Veröffentlicht - 28 Jan. 2025 |
Veranstaltung | 24th International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation, SAMOS 2024 - Samos, Griechenland Dauer: 29 Juni 2024 → 4 Juli 2024 |
Publikationsreihe
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Band | 15226 LNCS |
ISSN (Print) | 0302-9743 |
ISSN (elektronisch) | 1611-3349 |
Abstract
Efficient processing architectures for irregular data patterns require vector element addressing with flexible indices. Therefore, state-of-the-art SIMD vector extensions implement gather and scatter instructions for indexed addressing of data in memory. In vertical vector processors, different data is processed sequentially in parallel lanes and can be exchanged via chaining. This paper proposes an extension of such chaining mechanisms in a vertical vector processor architecture (V2PRO) to flexibly chain not only data but also address offsets between vector lanes. The indirect addressing enables vector access patterns with irregular strides for both register file and memory. The extension has a low hardware overhead of +4.8 % lookup tables and +1.8% registers on a Xilinx Ultrascale+ FPGA. A runtime evaluation for two applications from computer vision, namely Deformable Convolutions and point cloud encoding with PointPillars, demonstrates speedups of at least an order of magnitude with the proposed extension.
ASJC Scopus Sachgebiete
- Mathematik (insg.)
- Theoretische Informatik
- Informatik (insg.)
- Allgemeine Computerwissenschaft
Zitieren
- Standard
- Harvard
- Apa
- Vancouver
- BibTex
- RIS
Embedded Computer Systems: Architectures, Modeling, and Simulation - 24th International Conference, SAMOS 2024, Proceedings. Hrsg. / Luigi Carro; Francesco Regazzoni; Christian Pilato. Springer Science and Business Media Deutschland GmbH, 2025. S. 167-182 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Band 15226 LNCS).
Publikation: Beitrag in Buch/Bericht/Sammelwerk/Konferenzband › Aufsatz in Konferenzband › Forschung › Peer-Review
}
TY - GEN
T1 - A Novel Chaining-Based Indirect Addressing Mode in a Vertical Vector Processor
AU - Gesper, Sven
AU - Köhler, Daniel
AU - Thieu, Gia Bao
AU - Homann, Jasper
AU - Meinl, Frank
AU - Blume, Holger
AU - Payá-Vayá, Guillermo
N1 - Publisher Copyright: © The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.
PY - 2025/1/28
Y1 - 2025/1/28
N2 - Efficient processing architectures for irregular data patterns require vector element addressing with flexible indices. Therefore, state-of-the-art SIMD vector extensions implement gather and scatter instructions for indexed addressing of data in memory. In vertical vector processors, different data is processed sequentially in parallel lanes and can be exchanged via chaining. This paper proposes an extension of such chaining mechanisms in a vertical vector processor architecture (V2PRO) to flexibly chain not only data but also address offsets between vector lanes. The indirect addressing enables vector access patterns with irregular strides for both register file and memory. The extension has a low hardware overhead of +4.8 % lookup tables and +1.8% registers on a Xilinx Ultrascale+ FPGA. A runtime evaluation for two applications from computer vision, namely Deformable Convolutions and point cloud encoding with PointPillars, demonstrates speedups of at least an order of magnitude with the proposed extension.
AB - Efficient processing architectures for irregular data patterns require vector element addressing with flexible indices. Therefore, state-of-the-art SIMD vector extensions implement gather and scatter instructions for indexed addressing of data in memory. In vertical vector processors, different data is processed sequentially in parallel lanes and can be exchanged via chaining. This paper proposes an extension of such chaining mechanisms in a vertical vector processor architecture (V2PRO) to flexibly chain not only data but also address offsets between vector lanes. The indirect addressing enables vector access patterns with irregular strides for both register file and memory. The extension has a low hardware overhead of +4.8 % lookup tables and +1.8% registers on a Xilinx Ultrascale+ FPGA. A runtime evaluation for two applications from computer vision, namely Deformable Convolutions and point cloud encoding with PointPillars, demonstrates speedups of at least an order of magnitude with the proposed extension.
KW - Computer Vision
KW - Indirect Addressing Mode
KW - Radar Object Detection
KW - Vector Processor Architecture
UR - http://www.scopus.com/inward/record.url?scp=85218467582&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-78377-7_12
DO - 10.1007/978-3-031-78377-7_12
M3 - Conference contribution
AN - SCOPUS:85218467582
SN - 9783031783760
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 167
EP - 182
BT - Embedded Computer Systems
A2 - Carro, Luigi
A2 - Regazzoni, Francesco
A2 - Pilato, Christian
PB - Springer Science and Business Media Deutschland GmbH
T2 - 24th International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation, SAMOS 2024
Y2 - 29 June 2024 through 4 July 2024
ER -