A complexity-effective local delta prefetcher

Navarro-Torres, Agustín; Panda, Biswabandan; Alastruey-Benedé, Jesús; Ibáñez, Pablo; Viñals-Yúfera, Víctor; Ros Bardisa, Alberto

Por favor, use este identificador para citar o enlazar este ítem: https://doi.org/10.1109/TC.2025.3533086

RefMan EndNote BibTex RefWorks Excel CSV PDF Mendeley

Registro completo de metadatos

Campo DC	Valor	Lengua/Idioma
dc.contributor.author	Navarro-Torres, Agustín	-
dc.contributor.author	Panda, Biswabandan	-
dc.contributor.author	Alastruey-Benedé, Jesús	-
dc.contributor.author	Ibáñez, Pablo	-
dc.contributor.author	Viñals-Yúfera, Víctor	-
dc.contributor.author	Ros Bardisa, Alberto	-
dc.date.accessioned	2025-02-28T06:56:08Z	-
dc.date.available	2025-02-28T06:56:08Z	-
dc.date.issued	2025-01-31	-
dc.identifier.citation	IEEE Transactions on Computers 2025	es
dc.identifier.issn	Print: 0018-9340	-
dc.identifier.issn	Electronic: 1557-9956	-
dc.identifier.uri	http://hdl.handle.net/10201/151244	-
dc.description	© 2025, IEEE all right reserved. This manuscript version is made available under the CC-BY 4.0 license http://creativecommons.org/licenses/by/4.0/. This document is the Accepted version of a Published Work that appeared in final form in IEEE Transactions on Computers. To access the final edited and published work see https://doi.org/10.1109/TC.2025.3533086	es
dc.description.abstract	Data prefetching is crucial for performance in modern processors by effectively masking long-latency memory accesses. Over the past decades, numerous data prefetching mechanisms have been proposed, which have continuously reduced the access latency to the memory hierarchy. Several state-of-the-art prefetchers, namely Instruction Pointer Classifier Prefetcher (IPCP) and Berti, target the first-level data cache, and thus, they are able to completely hide the miss latency for timely prefetched cache lines. Berti exploits timely local deltas to achieve high accuracy and performance. This paper extends Berti with a larger evaluation and with extra optimizations on top of the previous conference paper. The result is a complexity-effective version of Berti that outperforms it for a large amount of workloads and simplifies its control logic. The key for those advancements is a simple mechanism for learning timely deltas without the need to track the fetch latency of each cache miss. Our experiments conducted with a wide range of workloads (CVP traces by Qualcomm, SPEC CPU2017, and GAP) show performance improvements by 4.0% over a mainstream stride prefetcher, and by a non-negligible 1.4% over the previously published version of Berti requiring similar storage.	es
dc.format	application/pdf	es
dc.format.extent	12	es
dc.language	eng	es
dc.publisher	Institute of Electrical and Electronics Engineers	es
dc.relation	This work was supported by the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (Berti-Chip, GA No 101158023, ECHO, GA No.819134), by the MCIN/AEI/10.13039/501100011033/ and the “ERDF A way of making Europe”, EU (grants PID2022-136315OB-I00, PID2022-136454NB-C22, RTI2018-098156-B-C53), by the MCIN/AEI/10.13039/501100011033/ the European Union NextGenerationEU/PRTR (grant TED2021-130233B-C33), and by Government of Aragon (T58 _23R research group)	es
dc.rights	info:eu-repo/semantics/openAccess	es
dc.rights	Atribución 4.0 Internacional	*
dc.rights.uri	http://creativecommons.org/licenses/by/4.0/	*
dc.subject	Data prefetching	es
dc.subject	Hardware prefetching	es
dc.subject	First-level cache	es
dc.subject	Stride	es
dc.subject	Local deltas	es
dc.subject	Accuracy	es
dc.subject	Timeliness	es
dc.title	A complexity-effective local delta prefetcher	es
dc.type	info:eu-repo/semantics/article	es
dc.relation.publisherversion	https://ieeexplore.ieee.org/document/10859166	es
dc.identifier.doi	https://doi.org/10.1109/TC.2025.3533086	-
dc.contributor.department	Departamento de Ingeniería y Tecnología de Computadores	-
Aparece en las colecciones:	Artículos

Ficheros en este ítem:

Fichero	Descripción	Tamaño	Formato
anavarrotorres-tc25.pdf		2,68 MB	Adobe PDF	Visualizar/Abrir

Mostrar el registro sencillo del ítem Mostrar el registro PREMIS del ítem Estadísticas

Este ítem está sujeto a una licencia Creative Commons Licencia Creative Commons