Regional Out-of-Order Writes in Total Store Order

Singh, Sawan; Jimborean, Alexandra; Ros, Alberto

Please use this identifier to cite or link to this item: 10.1145/3410463.3414645


Full metadata record

DC Field	Value	Language
dc.contributor.author	Singh, Sawan	-
dc.contributor.author	Jimborean, Alexandra	-
dc.contributor.author	Ros, Alberto	-
dc.date.accessioned	2021-04-08T21:33:36Z	-
dc.date.available	2021-04-08T21:33:36Z	-
dc.date.issued	2020-10	-
dc.identifier.isbn	978-1-4503-8075-1	-
dc.identifier.uri	http://hdl.handle.net/10201/106161	-
dc.description.abstract	The store buffer, an essential component in today’s processors, is designed to hide memory latency by moving stores off the processor’s critical path. Furthermore, under the Total Store Order (TSO) memory model, the store buffer ensures the in-order retirement of stores. Problems arise when the store buffer is full or, under TSO, when the leading store encounters a cache miss, which blocks all subsequent stores and incurs severe performance bottlenecks. This work presents a software-hardware co-designed approach to cope with this bottleneck for processors with strong consistency guarantees. Our proposal is driven by the insight that store operations can be reordered if their reordering does not change the observable program behavior. The compiler delineates safe regions within which stores can be shuffled while still delivering the same observable behavior as if they performed in program order and unsafe regions within which stores must be kept in program order. This is leveraged by a novel dual-mode store buffer that switches between the out-of-order and in-order execution of stores within the safe and respectively unsafe regions. Correctness is preserved through well-placed fences inserted by the compiler, which impede the execution of stores from the following regions until all stores of the current region complete. Our dual-mode store buffer only requires one extra bit per entry, significantly decreases processor stall cycles, and brings 8.13% performance improvements compared to a mainstream store buffer.	es
dc.format	application/pdf	es
dc.language	eng	es
dc.relation	European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (ECHO: Extending Coherence for Hardware-Driven Optimizations in Multicore Architectures, grant agreement No 819134, Consolidator Grant, 2018).	es
dc.relation.ispartof	29th International Conference on Parallel Architectures and Compilation Techniques (PACT)	es
dc.rights	info:eu-repo/semantics/openAccess	es
dc.rights	Attribution 4.0 International	*
dc.rights.uri	http://creativecommons.org/licenses/by/4.0/	*
dc.subject	Memory Consistency Models	es
dc.subject	Total Store Order	es
dc.subject	Store Buffer	es
dc.title	Regional Out-of-Order Writes in Total Store Order	es
dc.type	info:eu-repo/semantics/article	es
dc.type	info:eu-repo/semantics/lecture	es
dc.identifier.doi	10.1145/3410463.3414645	-
dc.contributor.department	Departamento de Ingeniería y Tecnología de Computadores	-
Appears in collections:	Articles
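
The dual-mode behavior described in the abstract can be illustrated with a toy model. The sketch below is not the authors' design: the `Store` fields, the `drain` function, and the latency numbers are all hypothetical, and the model ignores prefetching and other overlap that real store buffers provide. It only assumes what the abstract states: one extra bit per entry marks stores in safe regions, stores inside a safe region may complete out of order, and a fence at each region boundary holds back later regions until the current one has drained.

```python
from dataclasses import dataclass
from itertools import groupby


@dataclass
class Store:
    name: str
    region: int   # hypothetical region id, assumed to come from the compiler
    safe: bool    # the extra per-entry bit: True = reorderable within its region
    latency: int  # cycles to write the cache (1 = hit, larger = miss)


def drain(stores, dual_mode):
    """Return (total_cycles, completion_order) for stores given in program order."""
    cycle = 0
    order = []
    # Regions drain one after another: the region boundary plays the role of the fence.
    for _, group in groupby(stores, key=lambda s: s.region):
        group = list(group)
        if dual_mode and all(s.safe for s in group):
            # Out-of-order mode: stores of a safe region proceed in parallel,
            # so hits complete while a miss is still outstanding.
            order += [s.name for s in sorted(group, key=lambda s: s.latency)]
            cycle += max(s.latency for s in group)
        else:
            # In-order mode: each store waits for the one before it.
            order += [s.name for s in group]
            cycle += sum(s.latency for s in group)
    return cycle, order


# Region 0 is safe and starts with a miss; region 1 is unsafe.
stores = [
    Store("A", 0, True, 100),
    Store("B", 0, True, 1),
    Store("C", 0, True, 1),
    Store("D", 1, False, 1),
    Store("E", 1, False, 100),
]

print(drain(stores, dual_mode=False))  # in-order everywhere
print(drain(stores, dual_mode=True))   # out-of-order inside the safe region
```

In this toy run, the miss on `A` no longer blocks `B` and `C` in dual mode, while `D` and `E` still wait for region 0 to finish and then complete in program order. The cycle counts are artifacts of the made-up latencies and say nothing about the 8.13% figure reported in the abstract.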

Files in this item:

File	Description	Size	Format
ssingh-pact20.pdf		1.06 MB	Adobe PDF


This item is licensed under a Creative Commons license.