Por favor, use este identificador para citar o enlazar este ítem: 10.1109/TPDS.2016.2528241

Registro completo de metadatos
Campo DCValorLengua/Idioma
dc.contributor.authorJimborean, Alexandra-
dc.contributor.authorRos Bardisa, Alberto-
dc.date.accessioned2024-02-08T12:09:02Z-
dc.date.available2024-02-08T12:09:02Z-
dc.date.issued2016-
dc.identifier.citationIEEE Transactions on Parallel and Distributed Systems, 27(11), 3101-3115.es
dc.identifier.issn1045-9219-
dc.identifier.urihttp://hdl.handle.net/10201/138999-
dc.descriptionRos, A., & Jimborean, A. (2016). A hybrid static-dynamic classification for dual-consistency cache coherence. IEEE Transactions on Parallel and Distributed Systems, 27(11), 3101-3115.es
dc.description.abstractTraditional cache coherence protocols manage all memory accesses equally and ensure the strongest memory model, namely, sequential consistency. Recent cache coherence protocols based on self-invalidation advocate for the model sequential consistency for data-race-free, which enables powerful optimizations for race-free code. However, for racy code these cache coherence protocols provide sub-optimal performance compared to traditional protocols. This paper proposes SPEL++, a dual-consistency cache coherence protocol that supports two execution modes: a traditional sequential-consistent protocol and a protocol that provides weak consistency (or sequential consistency for data-race-free). SPEL++ exploits a static-dynamic hybrid classification of memory accesses based on (i) a compile-time identification of extended data-race-free code regions for OpenMP applications and (ii) a runtime classification of accesses based on the operating system’s memory page management. By executing racy code under the sequential-consistent protocol and race-free code under the cache coherence protocol that provides sequential consistency for data-race-free, the end result is an efficient execution of the applications while still providing sequential consistency. Compared to a traditional protocol, we show improvements in performance from 19% to 38% and reductions in energy consumption from 47% to 53%, on average for different benchmark suites, on a 64-core chip multiprocessores
dc.formatapplication/pdfes
dc.format.extent14es
dc.languageenges
dc.publisherIEEEes
dc.relationThis work was supported in part by the ”Fundaci´on Seneca- Agencia de Ciencia y Tecnolog´ıa de la Regi´on de Murcia” under grant ”J´ovenes L´ıderes en Investigaci´on” 18956/JLI/13, as well as by the Swedish Research Council UPMARC Linnaeus Centre, the VR frame project ”Efficient Modeling of Heterogeneity in the Era of Dark Silicon”: 106201305/C0533201 and by the European 7th Framework Programme (EU ICT-287759) through a collaboration grant from HiPEAC.es
dc.rightsinfo:eu-repo/semantics/openAccesses
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 Internacional*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/*
dc.subjectMultiprocessorses
dc.subjectCache coherencees
dc.subjectClassification of accesseses
dc.subjectRuntimees
dc.subjectCompileres
dc.subjectConsistency modeles
dc.subjectData raceses
dc.titleA hybrid static-dynamic classification for dual-consistency cache coherencees
dc.typeinfo:eu-repo/semantics/articlees
dc.relation.publisherversionhttp://webs.um.es/aros/papers/pdfs/aros-tpds16.pdfes
dc.identifier.doi10.1109/TPDS.2016.2528241-
dc.contributor.departmentDepartamento de Ingeniería y Tecnología de Computadores-
Aparece en las colecciones:Artículos

Ficheros en este ítem:
Fichero Descripción TamañoFormato 
2016_TPDS_Alberto_Ros_author.pdf881,99 kBAdobe PDFVista previa
Visualizar/Abrir


Este ítem está sujeto a una licencia Creative Commons Licencia Creative Commons Creative Commons