PIMnet: A Domain-Specific Network for Efficient Collective Communication in Scalable PIM

Son, Hyojun; Jonatan, Gilbert; Wu, Xiangyu; Cho, Haeyoon; Shivdikar, Kaustubh; Abellán, José L.; Joshi, Ajay; Kaeli, David; Kim , John

Por favor, use este identificador para citar o enlazar este ítem: http://dx.doi.org/10.1109/HPCA61900.2025.00116

RefMan EndNote BibTex RefWorks Excel CSV PDF Mendeley

Registro completo de metadatos

Campo DC	Valor	Lengua/Idioma
dc.contributor.author	Son, Hyojun	-
dc.contributor.author	Jonatan, Gilbert	-
dc.contributor.author	Wu, Xiangyu	-
dc.contributor.author	Cho, Haeyoon	-
dc.contributor.author	Shivdikar, Kaustubh	-
dc.contributor.author	Abellán, José L.	-
dc.contributor.author	Joshi, Ajay	-
dc.contributor.author	Kaeli, David	-
dc.contributor.author	Kim , John	-
dc.date.accessioned	2025-03-25T11:39:12Z	-
dc.date.available	2025-03-25T11:39:12Z	-
dc.date.issued	2025	-
dc.identifier.issn	2378-203X	-
dc.identifier.uri	http://hdl.handle.net/10201/152085	-
dc.description	© 2025 IEEE This document is the published version of a published work that appeared in final form in 2025 IEEE International Symposium on High Performance Computer Architecture (HPCA). . To access the final edited and published work see: http://dx.doi.org/10.1109/HPCA61900.2025.00116	es
dc.description.abstract	Processing-in-memory (PIM), where compute is moved closer to memory or data, has been explored to accelerate emerging workloads. Different PIM-based systems have been announced, each offering a unique microarchitectural organization of their compute units, ranging from fixed functional units to programmable general-purpose compute cores near memory. However, one fundamental limitation of PIM is that each compute unit can only access its local memory; access to “remote” memory must occur through the host CPU – potentially limiting application performance scalability. In this work, we first characterize the scalability of real PIM architectures using the UPMEM PIM system. We analyze how the overhead of communicating through the host (instead of providing direct communication between the PIM compute units) can become a bottleneck for collective communications that are commonly used in many workloads. To overcome this inter-PIM bank communication, we propose PIMnet – a PIM interconnection network for PIM banks that provides direct connectivity between compute units and removes the overhead of communicating through the host. PIMnet exploits bandwidth parallelism where communication across the different PIM bank/chips can occur in parallel to maximize communication performance. PIMnet also matches the DRAM packaging hierarchy with a multi-tier network architecture. Unlike traditional interconnection networks, PIMnet is a PIMcontrolled network where communication is managed by the PIM logic, optimizing collective communications and minimizing the hardware overhead of PIMnet. Our evaluation of PIMnet shows that it provides up to 85× speedup on collective communications and achieves a 11.8× improvement on real applications compared to the baseline PIM.	es
dc.format	application/pdf	es
dc.format.extent	16	es
dc.language	eng	es
dc.publisher	HPCA 2025	es
dc.relation	Sin financiación externa a la Universidad	es
dc.relation.ispartof	2025 IEEE International Symposium on High Performance Computer Architecture (HPCA)	es
dc.rights	info:eu-repo/semantics/embargoedAccess	es
dc.title	PIMnet: A Domain-Specific Network for Efficient Collective Communication in Scalable PIM	es
dc.type	info:eu-repo/semantics/article	es
dc.embargo.terms	Si	-
dc.identifier.doi	http://dx.doi.org/10.1109/HPCA61900.2025.00116	-
dc.contributor.department	Departamento de Ingeniería y Tecnología de Computadores	-
Aparece en las colecciones:	Artículos

Ficheros en este ítem:

Fichero	Descripción	Tamaño	Formato
064700b557.pdf		5,28 MB	Adobe PDF	Visualizar/Abrir Solicitar una copia

Mostrar el registro sencillo del ítem Mostrar el registro PREMIS del ítem Estadísticas