Please use this identifier to cite or link to this item: http://hdl.handle.net/10201/141179

Full metadata record

DC Field: Value
dc.contributor.author: Shivdikar, Kaustubh
dc.contributor.author: Agostini, Nicolas Bohm
dc.contributor.author: Jayaweera, Malith
dc.contributor.author: Jonatan, Gilbert
dc.contributor.author: Abellán Miguel, José Luis
dc.contributor.author: Joshi, Ajay
dc.contributor.author: Kim, John
dc.contributor.author: Kaeli, David
dc.date.accessioned: 2024-04-26T11:28:09Z
dc.date.available: 2024-04-26T11:28:09Z
dc.date.issued: 2024-04-23
dc.identifier.uri: http://hdl.handle.net/10201/141179
dc.description: ©2024 ISCA. This manuscript version is made available under the CC-BY 4.0 license (http://creativecommons.org/licenses/by/4.0/). This document is the pre-print version published on arXiv. It will appear as a lecture at ISCA 2024.
dc.description.abstract: Graph Neural Networks (GNNs) are emerging as a formidable tool for processing non-Euclidean data across various domains, ranging from social network analysis to bioinformatics. Despite their effectiveness, their adoption has not been pervasive because of scalability challenges associated with large-scale graph datasets, particularly when leveraging message passing. These datasets exhibit irregular sparsity patterns, resulting in unbalanced compute resource utilization. Prior accelerators investigating Gustavson’s technique adopted look-ahead buffers for prefetching data, aiming to prevent compute stalls. However, these solutions lead to inefficient use of on-chip memory, leaving redundant data resident in cache. To tackle these challenges, we introduce NeuraChip, a novel GNN spatial accelerator based on Gustavson’s algorithm. NeuraChip decouples the multiplication and addition computations in sparse matrix multiplication. This separation allows their distinct data dependencies to be exploited independently, facilitating efficient resource allocation. We introduce a rolling eviction strategy to mitigate data idling in on-chip memory and to address the prevalent issue of memory bloat in sparse graph computations. Furthermore, compute resource load balancing is achieved through a dynamic reseeding hash-based mapping, ensuring uniform utilization of computing resources regardless of sparsity patterns. Finally, we present NeuraSim, an open-source, cycle-accurate, multi-threaded, modular simulator for comprehensive performance analysis. Overall, NeuraChip presents a significant improvement, yielding an average speedup of 22.1× over Intel’s MKL, 17.1× over NVIDIA’s cuSPARSE, 16.7× over AMD’s hipSPARSE, 1.5× over a prior state-of-the-art SpGEMM accelerator, and 1.3× over a prior GNN accelerator. The source code for our open-sourced simulator and performance visualizer is publicly accessible on GitHub.
dc.format: application/pdf
dc.format.extent: 15
dc.language: eng
dc.publisher: ArXiv
dc.relation: Institute for Experiential AI and the NSF IUCRC Center for Hardware and Embedded Systems Security and Trust (CHEST), NSF CNS 2312275, NSF CNS 2312276, and by Samsung Advanced Institute of Technology, Samsung Electronics Co., Ltd. Additionally, we acknowledge the financial assistance from grant RYC2021-031966-I funded by MCIN/AEI/10.13039/501100011033 and the “European Union NextGenerationEU/PRTR”, and grant PID2022-136315OB-I00 funded by MCIN/AEI/10.13039/501100011033 and by “ERDF A way of making Europe”, EU.
dc.relation.ispartof: ISCA 2024: International Symposium on Computer Architecture, Argentina
dc.rights: info:eu-repo/semantics/openAccess
dc.rights: Attribution 4.0 International
dc.rights.uri: http://creativecommons.org/licenses/by/4.0/
dc.subject: Graph Neural Networks (GNN)
dc.subject: Decoupled Computations
dc.subject: Spatial Accelerators
dc.subject: Sparse Matrix Multiplication (SpGEMM)
dc.subject: On-chip Memory
dc.subject: Hardware-software co-design
dc.subject.other: CDU::6 - Applied sciences::62 - Engineering. Technology
dc.title: NeuraChip: Accelerating GNN Computations with a Hash-based Decoupled Spatial Accelerator
dc.type: info:eu-repo/semantics/article
dc.type: info:eu-repo/semantics/lecture
dc.relation.publisherversion: https://arxiv.org/abs/2404.15510
dc.contributor.department: Departamento de Ingeniería y Tecnología de Computadores
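The abstract above hinges on two mechanisms: a row-wise (Gustavson) SpGEMM whose multiply and merge/accumulate steps are decoupled, and a hash-based mapping that spreads work across compute resources. The Python sketch below is a minimal, software-only illustration of those ideas, assuming generic CSR inputs and hypothetical num_units and seed parameters; it is not NeuraChip’s or NeuraSim’s actual implementation.

# Minimal sketch (assumptions noted above): row-wise Gustavson SpGEMM over CSR
# inputs, split into a multiply phase and a merge/accumulate phase, with a
# seeded hash spreading output rows across a hypothetical pool of compute units.
from collections import defaultdict

def spgemm_gustavson_decoupled(A, B, num_units=4, seed=0):
    """Compute C = A @ B, where A and B are CSR triples (indptr, indices, data)."""
    a_ptr, a_idx, a_val = A
    b_ptr, b_idx, b_val = B
    n_rows = len(a_ptr) - 1

    # Phase 1 (multiply): every nonzero A[i, k] scales the nonzeros of row k of B,
    # emitting partial products tagged with their output coordinates (i, j).
    partials = []
    for i in range(n_rows):
        for p in range(a_ptr[i], a_ptr[i + 1]):
            k, a_ik = a_idx[p], a_val[p]
            for q in range(b_ptr[k], b_ptr[k + 1]):
                partials.append((i, b_idx[q], a_ik * b_val[q]))

    # Hash-based mapping: a seeded hash assigns each output row to a compute unit;
    # an accelerator could reseed this hash when per-unit load becomes skewed.
    buckets = defaultdict(list)
    for i, j, v in partials:
        buckets[hash((i, seed)) % num_units].append((i, j, v))

    # Phase 2 (merge/accumulate): each unit independently reduces its partial
    # products into final C entries, decoupled from the multiply phase above.
    C = defaultdict(float)
    for unit_partials in buckets.values():
        for i, j, v in unit_partials:
            C[(i, j)] += v
    return dict(C)

# Tiny usage example with A = [[1, 0], [2, 3]] and B = [[0, 4], [5, 0]] in CSR form;
# the result equals {(0, 1): 4.0, (1, 0): 15.0, (1, 1): 8.0}.
A = ([0, 1, 3], [0, 0, 1], [1.0, 2.0, 3.0])
B = ([0, 1, 2], [1, 0], [4.0, 5.0])
print(spgemm_gustavson_decoupled(A, B))

In the sketch, the multiply phase only streams rows of B, while the accumulate phase only reduces partial products into C; keeping the two phases separate is what allows their different data dependencies to be scheduled independently, which is the property the abstract attributes to NeuraChip’s decoupled design.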
Appears in collections: Artículos

Files in this item:
File: NeuraChip GNN Accelerator.pdf (8.07 MB, Adobe PDF)


This item is licensed under a Creative Commons License.