Por favor, use este identificador para citar o enlazar este ítem:
10.1109/TC.2023.3337308
Twittear
Título: | Wrong-Path-Aware Entangling Instruction Prefetcher |
Fecha de publicación: | 2024 |
Editorial: | Institute of Electrical and Electronics Engineers |
Cita bibliográfica: | IEEE Transactions on computers |
Palabras clave: | Instruction prefetching Processor front-end Performance Energy efficiency |
Resumen: | Instruction prefetching is instrumental for guaranteeing a high flow of instructions through the processor front end for applications whose working set does not fit in the lowerlevel caches. Examples of such applications are server workloads, whose instruction footprints are constantly growing. There are two main techniques to mitigate this problem: fetch directed prefetching (or decoupled front end) and instruction cache (L1I) prefetching. This work extends the state-of-the-art Entangling prefetcher to avoid training during wrong-path execution. Our Entangling wrong-path-aware prefetcher is equipped with microarchitectural techniques that eliminate more than 99% of wrong-path pollution, thus reaching 98.9% of the performance of an ideal wrongpath-aware solution. Next, we propose two microarchitectural optimizations able to further increase performance benefits by 1.8%, on average. All this is achieved with just 304 bytes. Finally, we study the interplay between the L1I prefetcher and a decoupled front end. Our analysis shows that due to pollution caused by wrong-path instructions, the degree of decoupling cannot be increased unlimitedly without negative effects on the energy-delay product (EDP). Furthermore, the closer to ideal is the L1I prefetcher, the less decoupling is required. For example, our Entangling prefetcher reaches an optimal EDP with a decoupling degree of 64 instructions. |
Autor/es principal/es: | Ros, Alberto |
Autor/es secundario/s: | Jimborean, Alexandra |
Facultad/Departamentos/Servicios: | Facultades, Departamentos, Servicios y Escuelas::Departamentos de la UMU::Ingeniería y Tecnología de Computadores |
URI: | http://hdl.handle.net/10201/136528 |
DOI: | 10.1109/TC.2023.3337308 |
Tipo de documento: | info:eu-repo/semantics/article |
Número páginas / Extensión: | 12 |
Derechos: | info:eu-repo/semantics/openAccess Atribución 4.0 Internacional |
Descripción: | © 2023.IEEE. This document is made available under the CC-BY 4.0 license http://creativecommons.org/licenses/by /4.0/ This document is the Accepted version of a Published Work that appeared in final form in IEEE Transactions on Computers. To access the final edited and published work see DOI 10.1109/TC.2023.3337308 |
Aparece en las colecciones: | Artículos: Ingeniería y Tecnología de Computadores |
Ficheros en este ítem:
Fichero | Descripción | Tamaño | Formato | |
---|---|---|---|---|
aros-tc24.pdf | 641,09 kB | Adobe PDF | Visualizar/Abrir |
Este ítem está sujeto a una licencia Creative Commons Licencia Creative Commons