Report (Research Report) Year: 2015

Task-based multifrontal QR solver for GPU-accelerated multicore architectures

Abstract

Recent studies have shown the potential of task-based programming paradigms for implementing robust, scalable sparse direct solvers for modern computing platforms. Yet designing task flows that efficiently exploit heterogeneous architectures remains highly challenging. In this paper we first tackle the issue of data partitioning using a method suited to heterogeneous platforms. On the one hand, we design tasks of sufficiently large granularity to obtain a good acceleration factor on the GPU. On the other hand, we limit that size in order both to fit the GPU memory constraints and to generate enough parallelism in the task graph. Second, we handle task scheduling with a strategy capable of taking into account workload and architecture heterogeneity at a reduced cost. Finally, we propose an original evaluation of the performance obtained with our solver on a test set of matrices. We show that the proposed approach allows for processing extremely large input problems on GPU-accelerated platforms and that the overall performance is competitive with equivalent state-of-the-art solvers designed and optimized for GPU-only use.
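
The granularity trade-off described in the abstract can be illustrated with a minimal sketch. This is not the report's partitioning method: the function name `choose_block_width`, the candidate widths, the memory budget and the `min_tasks` threshold are all hypothetical, chosen only to show how an upper bound (GPU memory) and a lower bound on the number of tasks (parallelism) jointly constrain the block size.

```python
def choose_block_width(n_rows, n_cols, bytes_per_entry,
                       gpu_budget_bytes, min_tasks, candidates):
    """Pick the largest candidate block-column width that satisfies both bounds.

    Hypothetical model: a dense front of size n_rows x n_cols is split into
    block-columns of width nb, and each block-column is one task.
    """
    feasible = []
    for nb in sorted(candidates):
        block_bytes = n_rows * nb * bytes_per_entry  # memory of one block-column
        n_blocks = -(-n_cols // nb)                  # ceiling division: task count
        fits_gpu = block_bytes <= gpu_budget_bytes   # upper bound on granularity
        enough_tasks = n_blocks >= min_tasks         # enough parallelism left
        if fits_gpu and enough_tasks:
            feasible.append(nb)
    # Assumption: fall back to the finest granularity if nothing is feasible.
    return max(feasible) if feasible else min(candidates)


# Illustrative numbers only: a 10000 x 4000 double-precision front,
# a 64 MiB per-block budget, and at least 8 tasks requested.
width = choose_block_width(10_000, 4_000, 8, 64 * 1024**2, 8,
                           [128, 256, 512, 1024])
print(width)
```

With these made-up numbers, a width of 1024 exceeds the memory budget, so the largest width that still leaves at least 8 block-columns is selected. A real solver would also account for the shape of the elimination tree and the relative CPU and GPU kernel speeds, which this sketch ignores.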
Main file
IRI-RT--2015--02--FR-r1.pdf (407.39 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-01166312, version 1 (22-06-2015)
hal-01166312, version 2 (22-09-2015)

Identifiers

  • HAL Id: hal-01166312, version 2

Cite

Emmanuel Agullo, Alfredo Buttari, Abdou Guermouche, Florent Lopez. Task-based multifrontal QR solver for GPU-accelerated multicore architectures. [Research Report] IRI/RT--2015--02--FR-r1, IRIT, Toulouse. 2015. ⟨hal-01166312v2⟩
