Please use this identifier to cite or link to this item: http://dx.doi.org/10.1109/HPCA61900.2025.00116

Full metadata record
DC Field | Value | Language
dc.contributor.author | Son, Hyojun | -
dc.contributor.author | Jonatan, Gilbert | -
dc.contributor.author | Wu, Xiangyu | -
dc.contributor.author | Cho, Haeyoon | -
dc.contributor.author | Shivdikar, Kaustubh | -
dc.contributor.author | Abellán, José L. | -
dc.contributor.author | Joshi, Ajay | -
dc.contributor.author | Kaeli, David | -
dc.contributor.author | Kim, John | -
dc.date.accessioned | 2025-03-25T11:39:12Z | -
dc.date.available | 2025-03-25T11:39:12Z | -
dc.date.issued | 2025 | -
dc.identifier.issn | 2378-203X | -
dc.identifier.uri | http://hdl.handle.net/10201/152085 | -
dc.description | © 2025 IEEE. This document is the published version of a work that appeared in final form in the 2025 IEEE International Symposium on High Performance Computer Architecture (HPCA). To access the final edited and published work, see: http://dx.doi.org/10.1109/HPCA61900.2025.00116 | es
dc.description.abstract | Processing-in-memory (PIM), where compute is moved closer to memory or data, has been explored to accelerate emerging workloads. Different PIM-based systems have been announced, each offering a unique microarchitectural organization of their compute units, ranging from fixed functional units to programmable general-purpose compute cores near memory. However, one fundamental limitation of PIM is that each compute unit can only access its local memory; access to “remote” memory must occur through the host CPU, potentially limiting application performance scalability. In this work, we first characterize the scalability of real PIM architectures using the UPMEM PIM system. We analyze how the overhead of communicating through the host (instead of providing direct communication between the PIM compute units) can become a bottleneck for collective communications that are commonly used in many workloads. To overcome this inter-PIM bank communication bottleneck, we propose PIMnet, a PIM interconnection network for PIM banks that provides direct connectivity between compute units and removes the overhead of communicating through the host. PIMnet exploits bandwidth parallelism where communication across the different PIM banks/chips can occur in parallel to maximize communication performance. PIMnet also matches the DRAM packaging hierarchy with a multi-tier network architecture. Unlike traditional interconnection networks, PIMnet is a PIM-controlled network where communication is managed by the PIM logic, optimizing collective communications and minimizing the hardware overhead of PIMnet. Our evaluation of PIMnet shows that it provides up to 85× speedup on collective communications and achieves an 11.8× improvement on real applications compared to the baseline PIM. | es
dc.format | application/pdf | es
dc.format.extent | 16 | es
dc.language | eng | es
dc.publisher | HPCA 2025 | es
dc.relation | No external funding to the University | es
dc.relation.ispartof | 2025 IEEE International Symposium on High Performance Computer Architecture (HPCA) | es
dc.rights | info:eu-repo/semantics/embargoedAccess | es
dc.title | PIMnet: A Domain-Specific Network for Efficient Collective Communication in Scalable PIM | es
dc.type | info:eu-repo/semantics/article | es
dc.embargo.terms | Yes | -
dc.identifier.doi | http://dx.doi.org/10.1109/HPCA61900.2025.00116 | -
dc.contributor.department | Departamento de Ingeniería y Tecnología de Computadores | -
Appears in collections: Articles

Files in this item:
File | Description | Size | Format
064700b557.pdf | | 5.28 MB | Adobe PDF


Items in Digitum are protected by copyright, with all rights reserved, unless otherwise indicated.