Por favor, use este identificador para citar o enlazar este ítem: 10.1145/3410463.3414645

Registro completo de metadatos
Campo DCValorLengua/Idioma
dc.contributor.authorSingh, Sawan-
dc.contributor.authorJimborean, Alexandra-
dc.contributor.authorRos, Alberto-
dc.contributor.otherFacultades, Departamentos, Servicios y Escuelas::Facultades de la UMU::Facultad de Informáticaes
dc.date.accessioned2021-04-08T21:33:36Z-
dc.date.available2021-04-08T21:33:36Z-
dc.date.issued2020-10-
dc.identifier.isbn978-1-4503-8075-1-
dc.identifier.urihttp://hdl.handle.net/10201/106161-
dc.description.abstractThe store buffer, an essential component in today’s processors, is designed to hide memory latency by moving stores off the processor’s critical path. Furthermore, under the Total Store Order (TSO) memory model, the store buffer ensures the in-order retirement of stores. Problems arise when the store buffer is full or, under TSO, when the leading store encounters a cache miss, which blocks all subsequent stores and incurs severe performance bottlenecks.This work presents a software-hardware co-designed approach to cope with this bottleneck for processors with strong consistency guarantees. Our proposal is driven by the insight that store operations can be reordered if their reordering does not change the observable program behavior. The compiler delineates safe regions within which stores can be shuffled while still delivering the same observable behavior as if they performed in program order and unsafe regions within which stores must be kept in program order. This is leveraged by a novel dual-mode store buffer that switches between the out-of-order and in-order execution of stores within the safe and respectively unsafe regions. Correctness is preserved through well-placed fences inserted by the compiler, which impede the execution of stores from the following regions until all stores of the current region complete. Our dual-mode store buffer only requires one extra bit per entry, significantly decreases processor stall cycles, and brings 8.13% performance improvements compared to a mainstream store buffer.es
dc.formatapplication/pdfes
dc.languageenges
dc.relationEuropean Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (ECHO: Extending Coherence for Hardware-Driven Optimizations in Multicore Architectures, grant agreement No 819134, Consolidator Grant, 2018).es
dc.relation.ispartof29th International Conference on Parallel Architectures and Compilation Techniques (PACT)es
dc.rightsinfo:eu-repo/semantics/openAccesses
dc.rightsAtribución 4.0 Internacional*
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/*
dc.subjectMemory Consistency Modelses
dc.subjectTotal Store Orderes
dc.subjectStore Bufferes
dc.titleRegional Out-of-Order Writes in Total Store Orderes
dc.typeinfo:eu-repo/semantics/articlees
dc.typeinfo:eu-repo/semantics/lecturees
dc.identifier.doi10.1145/3410463.3414645-
Aparece en las colecciones:Artículos: Ingeniería y Tecnología de Computadores

Ficheros en este ítem:
Fichero Descripción TamañoFormato 
ssingh-pact20.pdf1,06 MBAdobe PDFVista previa
Visualizar/Abrir


Este ítem está sujeto a una licencia Creative Commons Licencia Creative Commons Creative Commons