A hybrid static-dynamic classification for dual-consistency cache coherence

Jimborean, Alexandra; Ros Bardisa, Alberto

Por favor, use este identificador para citar o enlazar este ítem: 10.1109/TPDS.2016.2528241

RefMan EndNote BibTex RefWorks Excel CSV PDF Mendeley

Registro completo de metadatos

Campo DC	Valor	Lengua/Idioma
dc.contributor.author	Jimborean, Alexandra	-
dc.contributor.author	Ros Bardisa, Alberto	-
dc.date.accessioned	2024-02-08T12:09:02Z	-
dc.date.available	2024-02-08T12:09:02Z	-
dc.date.issued	2016	-
dc.identifier.citation	IEEE Transactions on Parallel and Distributed Systems, 27(11), 3101-3115.	es
dc.identifier.issn	1045-9219	-
dc.identifier.uri	http://hdl.handle.net/10201/138999	-
dc.description	Ros, A., & Jimborean, A. (2016). A hybrid static-dynamic classification for dual-consistency cache coherence. IEEE Transactions on Parallel and Distributed Systems, 27(11), 3101-3115.	es
dc.description.abstract	Traditional cache coherence protocols manage all memory accesses equally and ensure the strongest memory model, namely, sequential consistency. Recent cache coherence protocols based on self-invalidation advocate for the model sequential consistency for data-race-free, which enables powerful optimizations for race-free code. However, for racy code these cache coherence protocols provide sub-optimal performance compared to traditional protocols. This paper proposes SPEL++, a dual-consistency cache coherence protocol that supports two execution modes: a traditional sequential-consistent protocol and a protocol that provides weak consistency (or sequential consistency for data-race-free). SPEL++ exploits a static-dynamic hybrid classification of memory accesses based on (i) a compile-time identification of extended data-race-free code regions for OpenMP applications and (ii) a runtime classification of accesses based on the operating system’s memory page management. By executing racy code under the sequential-consistent protocol and race-free code under the cache coherence protocol that provides sequential consistency for data-race-free, the end result is an efficient execution of the applications while still providing sequential consistency. Compared to a traditional protocol, we show improvements in performance from 19% to 38% and reductions in energy consumption from 47% to 53%, on average for different benchmark suites, on a 64-core chip multiprocessor	es
dc.format	application/pdf	es
dc.format.extent	14	es
dc.language	eng	es
dc.publisher	IEEE	es
dc.relation	This work was supported in part by the ”Fundaci´on Seneca- Agencia de Ciencia y Tecnolog´ıa de la Regi´on de Murcia” under grant ”J´ovenes L´ıderes en Investigaci´on” 18956/JLI/13, as well as by the Swedish Research Council UPMARC Linnaeus Centre, the VR frame project ”Efficient Modeling of Heterogeneity in the Era of Dark Silicon”: 106201305/C0533201 and by the European 7th Framework Programme (EU ICT-287759) through a collaboration grant from HiPEAC.	es
dc.rights	info:eu-repo/semantics/openAccess	es
dc.rights	Attribution-NonCommercial-NoDerivatives 4.0 Internacional	*
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/4.0/	*
dc.subject	Multiprocessors	es
dc.subject	Cache coherence	es
dc.subject	Classification of accesses	es
dc.subject	Runtime	es
dc.subject	Compiler	es
dc.subject	Consistency model	es
dc.subject	Data races	es
dc.title	A hybrid static-dynamic classification for dual-consistency cache coherence	es
dc.type	info:eu-repo/semantics/article	es
dc.relation.publisherversion	http://webs.um.es/aros/papers/pdfs/aros-tpds16.pdf	es
dc.identifier.doi	10.1109/TPDS.2016.2528241	-
dc.contributor.department	Departamento de Ingeniería y Tecnología de Computadores	-
Aparece en las colecciones:	Artículos

Ficheros en este ítem:

Fichero	Descripción	Tamaño	Formato
2016_TPDS_Alberto_Ros_author.pdf		881,99 kB	Adobe PDF	Visualizar/Abrir

Mostrar el registro sencillo del ítem Mostrar el registro PREMIS del ítem Estadísticas

Este ítem está sujeto a una licencia Creative Commons Licencia Creative Commons