Please use this identifier to cite or link to this item:
https://doi.org/10.1109/PACT58117.2023.00019
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Joseph, D. | - |
dc.contributor.author | Aragón, J.L. | - |
dc.contributor.author | Parcerisa, J.M. | - |
dc.contributor.author | González, A. | - |
dc.contributor.other | Facultades, Departamentos, Servicios y Escuelas::Departamentos de la UMU::Ingeniería y Tecnología de Computadores | es |
dc.date.accessioned | 2023-11-21T07:33:47Z | - |
dc.date.available | 2023-11-21T07:33:47Z | - |
dc.date.issued | 2023-10-21 | - |
dc.identifier.uri | http://hdl.handle.net/10201/135884 | - |
dc.description | © 2023 Copyright held by the owner/author(s). This document is made available under the CC-BY-NC-ND 4.0 license http://creativecommons.org/licenses/by-nc-nd/4.0/ This document is the Accepted version of a Published Work that appeared in final form in 32nd International Conference on Parallel Architectures and Compilation Techniques (PACT), Vienna, Austria, October 2023. To access the final edited and published work see https://doi.org/10.1109/PACT58117.2023.00019 | es |
dc.description.abstract | Literature is plentiful in works exploiting cache locality for GPUs. A majority of them explore replacement or bypassing policies. In this paper, however, we surpass this exploration by fabricating a formal proof for a no-overhead quasi-optimal caching technique for caching textures in graphics workloads. Textures make up a significant part of main memory traffic in mobile GPUs, which contributes to the total GPU energy consumption. Since texture accesses use a shared L2 cache, improving the L2 texture caching efficiency would decrease main memory traffic, thus improving energy efficiency, which is crucial for mobile GPUs. Our proposal reaches quasi-optimality by exploiting the frame-to-frame reuse of textures in graphics. We do this by traversing frames in a boustrophedonic manner w.r.t. the frame-to-frame tile order. We first approximate the texture access trace to a circular trace and then forge a formal proof for our proposal being optimal for such traces. We also complement the proof with empirical data that demonstrates the quasi-optimality of our no-cost proposal. | es |
dc.format | application/pdf | es |
dc.format.extent | 13 | es |
dc.language | eng | es |
dc.publisher | ACM | es |
dc.relation | This work has been supported by the CoCoUnit ERC Advanced Grant of the EU’s Horizon 2020 program (grant No 833057), the Spanish State Research Agency (MCIN/AEI) under grant PID2020-113172RB-I00, the ICREA Academia program and the AGAUR grant 2020-FISDU-00287. | es |
dc.relation.ispartof | 32nd International Conference on Parallel Architectures and Compilation Techniques (PACT), Vienna, Austria, ISBN: 979-8-3503-4254-3 | es |
dc.rights | info:eu-repo/semantics/openAccess | es |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | * |
dc.subject | GPU | es |
dc.subject | Caches | es |
dc.subject | Graphics | es |
dc.subject | Texture | es |
dc.subject | Low-power | es |
dc.title | Boustrophedonic Frames: Quasi-Optimal L2 Caching for Textures in GPUs | es |
dc.type | info:eu-repo/semantics/lecture | es |
dc.identifier.doi | https://doi.org/10.1109/PACT58117.2023.00019 | - |
Appears in collections: | Articles: Ingeniería y Tecnología de Computadores |
Files in this item:
File | Description | Size | Format | |
---|---|---|---|---|
Boustrophedonic Frames-PACT23-Camera Ready.pdf | | 459.38 kB | Adobe PDF | View/Open |
This item is licensed under a Creative Commons License
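
The abstract's central idea, reversing the tile order on alternate frames so that a circular access trace stops thrashing the cache, can be illustrated with a toy simulation. The following is only a minimal sketch under stated assumptions, not the paper's method or evaluation: it assumes a fully associative cache with LRU replacement, models each frame as one pass over the same `num_blocks` texture blocks, and uses the hypothetical helper names `lru_hits` and `frame_trace`, none of which come from the record.

```python
from collections import OrderedDict


def lru_hits(trace, capacity):
    """Count hits for a fully associative LRU cache (an assumed toy model)."""
    cache = OrderedDict()  # keys ordered from least to most recently used
    hits = 0
    for block in trace:
        if block in cache:
            hits += 1
            cache.move_to_end(block)       # mark as most recently used
        else:
            if len(cache) >= capacity:
                cache.popitem(last=False)  # evict the least recently used block
            cache[block] = True
    return hits


def frame_trace(num_blocks, num_frames, boustrophedonic):
    """Build a trace in which every frame touches blocks 0..num_blocks-1 once.

    With boustrophedonic=True, odd-numbered frames visit the blocks in
    reverse order; otherwise every frame repeats the same (circular) order.
    """
    trace = []
    for frame in range(num_frames):
        order = list(range(num_blocks))
        if boustrophedonic and frame % 2 == 1:
            order.reverse()
        trace.extend(order)
    return trace


if __name__ == "__main__":
    blocks, frames, capacity = 8, 4, 3
    print(lru_hits(frame_trace(blocks, frames, False), capacity))  # 0
    print(lru_hits(frame_trace(blocks, frames, True), capacity))   # 9
```

In this toy setting the circular order never hits, because each block is evicted before it is touched again, whereas the alternating order reuses the `capacity` most recently touched blocks at the start of every frame after the first. The sketch says nothing about the optimality result itself, which the paper establishes by formal proof.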