Por favor, use este identificador para citar o enlazar este ítem: 10.1145/3410463.3414645

Título: Regional Out-of-Order Writes in Total Store Order
Fecha de publicación: oct-2020
ISBN: 978-1-4503-8075-1
Palabras clave: Memory Consistency Models
Total Store Order
Store Buffer
Resumen: The store buffer, an essential component in today’s processors, is designed to hide memory latency by moving stores off the processor’s critical path. Furthermore, under the Total Store Order (TSO) memory model, the store buffer ensures the in-order retirement of stores. Problems arise when the store buffer is full or, under TSO, when the leading store encounters a cache miss, which blocks all subsequent stores and incurs severe performance bottlenecks.This work presents a software-hardware co-designed approach to cope with this bottleneck for processors with strong consistency guarantees. Our proposal is driven by the insight that store operations can be reordered if their reordering does not change the observable program behavior. The compiler delineates safe regions within which stores can be shuffled while still delivering the same observable behavior as if they performed in program order and unsafe regions within which stores must be kept in program order. This is leveraged by a novel dual-mode store buffer that switches between the out-of-order and in-order execution of stores within the safe and respectively unsafe regions. Correctness is preserved through well-placed fences inserted by the compiler, which impede the execution of stores from the following regions until all stores of the current region complete. Our dual-mode store buffer only requires one extra bit per entry, significantly decreases processor stall cycles, and brings 8.13% performance improvements compared to a mainstream store buffer.
Autor/es principal/es: Singh, Sawan
Jimborean, Alexandra
Ros, Alberto
Facultad/Departamentos/Servicios: Facultades, Departamentos, Servicios y Escuelas::Facultades de la UMU::Facultad de Informática
Forma parte de: 29th International Conference on Parallel Architectures and Compilation Techniques (PACT)
URI: http://hdl.handle.net/10201/106161
DOI: 10.1145/3410463.3414645
Tipo de documento: info:eu-repo/semantics/article
info:eu-repo/semantics/lecture
Derechos: info:eu-repo/semantics/openAccess
Atribución 4.0 Internacional
Aparece en las colecciones:Artículos: Ingeniería y Tecnología de Computadores

Ficheros en este ítem:
Fichero Descripción TamañoFormato 
ssingh-pact20.pdf1,06 MBAdobe PDFVista previa
Visualizar/Abrir


Este ítem está sujeto a una licencia Creative Commons Licencia Creative Commons Creative Commons