Performance Optimization and Modeling of Blocked Sparse Kernels - Algorithmes Parallèles et Optimisation Accéder directement au contenu
Article Dans Une Revue International Journal of High Performance Computing Applications Année : 2016

Performance Optimization and Modeling of Blocked Sparse Kernels

Résumé

We present a method for automatically selecting optimal implementations of sparse matrix-vector operations. Our software “AcCELS” (Accelerated Compress-storage Elements for Linear Solvers) involves a setup phase that probes machine characteristics, and a run-time phase where stored characteristics are combined with a measure of the actual sparse matrix to find the optimal kernel implementation. We present a performance model that is shown to be accurate over a large range of matrices.

Dates et versions

hal-02421133 , version 1 (20-12-2019)

Identifiants

Citer

Alfredo Buttari, Victor Eijkhout, Julien Langou, Salvatore Filippone. Performance Optimization and Modeling of Blocked Sparse Kernels. International Journal of High Performance Computing Applications, 2016, 21 (4), pp.467-484. ⟨10.1177/1094342007083801⟩. ⟨hal-02421133⟩
41 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More