Please use this identifier to cite or link to this item: https://doi.org/10.1109/ACCESS.2024.3372853

Full metadata record
DC Field | Value | Language
dc.contributor.author | Martínez Sánchez, Pablo Antonio | -
dc.contributor.author | Bernabé García, Gregorio | -
dc.contributor.author | García Carrasco, José Manuel | -
dc.contributor.other | Facultades, Departamentos, Servicios y Escuelas::Departamentos de la UMU::Ingeniería y Tecnología de Computadores | es
dc.date.accessioned | 2024-03-26T09:19:19Z | -
dc.date.available | 2024-03-26T09:19:19Z | -
dc.date.issued | 2024-03-01 | -
dc.identifier.citation | IEEE Access. Volume 12, 2024 | es
dc.identifier.issn | Electronic: 2169-3536 | -
dc.identifier.uri | http://hdl.handle.net/10201/140462 | -
dc.description | ©2024. This manuscript version is made available under the CC-BY-NC-ND 4.0 license http://creativecommons.org/licenses/by-nc-nd/4.0/ This document is the published version of a Published Work that appeared in final form in IEEE Access. To access the final edited and published work see https://doi.org/10.1109/ACCESS.2024.3372853 | es
dc.description.abstract | Large language models (LLMs) have been massively applied to many tasks, often surpassing state-of-the-art approaches. While their effectiveness in code generation has been extensively studied (e.g., AlphaCode), their potential for code detection remains unexplored. This work presents the first analysis of code detection using LLMs. Our study examines essential kernels, including matrix multiplication, convolution, fast Fourier transform and LU factorization, implemented in C/C++. We propose both a preliminary, naive prompt and a novel prompting strategy for code detection. Results reveal that conventional prompting achieves great precision but poor accuracy (67.5%, 22.5%, 79.5% and 64% for GEMM, convolution, FFT and LU factorization, respectively) due to a high number of false positives. Our novel prompting strategy substantially reduces false positives, resulting in excellent overall accuracy (91.2%, 98%, 99.7% and 99.7%, respectively). These results pose a considerable challenge to existing state-of-the-art code detection methods. | es
dc.format | application/pdf | es
dc.format.extent | 11 | es
dc.language | eng | es
dc.relation | This work was supported in part by the Ministerio de Ciencia e Innovación (MCIN)/Agencia Estatal de Investigación (AEI)/10.13039/501100011033 under Grant TED2021-129221B-I00 and Grant PID2022-136315OB-I00; in part by "European Union (EU) NextGenerationEU/Plan de Recuperación, Transformación y Resiliencia (PRTR)"; and in part by "European Regional Development Fund (ERDF) A way of making Europe," EU. | es
dc.rights | info:eu-repo/semantics/openAccess | es
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | *
dc.subject | Code detection | es
dc.subject | Compilers | es
dc.subject | Heterogeneous computing | es
dc.subject | High-performance computing | es
dc.subject | Large language model | es
dc.title | Code Detection for Hardware Acceleration Using Large Language Models | es
dc.type | info:eu-repo/semantics/article | es
dc.identifier.doi | https://doi.org/10.1109/ACCESS.2024.3372853 | -
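
As a rough illustration of the naive prompting approach mentioned in the abstract above, the sketch below assembles a yes/no code-detection prompt around a textbook GEMM triple loop. The prompt wording, the GEMM_CANDIDATE snippet and the query_llm() helper are illustrative assumptions, not the prompts or tooling used in the paper, and the paper's improved prompting strategy is not reproduced here.

    # Minimal sketch of a naive yes/no code-detection prompt (illustrative only;
    # not the prompt or tooling from the paper).

    GEMM_CANDIDATE = """\
    void kernel(int n, const float *A, const float *B, float *C) {
        for (int i = 0; i < n; i++)
            for (int j = 0; j < n; j++) {
                float acc = 0.0f;
                for (int k = 0; k < n; k++)
                    acc += A[i * n + k] * B[k * n + j];
                C[i * n + j] = acc;
            }
    }
    """

    def build_naive_prompt(code: str, kernel: str = "general matrix multiplication (GEMM)") -> str:
        """Assemble a naive detection prompt: show the code, ask for a bare YES/NO."""
        return (
            f"Does the following C/C++ function implement {kernel}?\n"
            "Answer strictly with YES or NO.\n\n"
            f"{code}"
        )

    def query_llm(prompt: str) -> str:
        """Hypothetical placeholder for an LLM API call; wire this to your provider's client."""
        raise NotImplementedError

    if __name__ == "__main__":
        prompt = build_naive_prompt(GEMM_CANDIDATE)
        print(prompt)                      # inspect the prompt that would be sent
        # answer = query_llm(prompt)       # e.g. "YES" or "NO"
        # detected = answer.strip().upper().startswith("YES")

According to the abstract, this kind of bare yes/no prompting yields many false positives; the accuracy gains reported come from the paper's more elaborate prompting strategy, described in the full text.
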
Appears in collections: Artículos: Ingeniería y Tecnología de Computadores

Files in this item:
File | Description | Size | Format
IEEE24.pdf |  | 887,05 kB | Adobe PDF


This item is licensed under a Creative Commons License.