A latency-conscious SMT branch prediction architecture

Falcón, Ayose; Santana, Oliverio J.; Ramirez, Alex; Valero, Mateo

Persistent identifier to cite or link to this item: https://accedacris.ulpgc.es/jspui/handle/10553/50504

DC Field	Value	Language
dc.contributor.author	Falcón, Ayose	en_US
dc.contributor.author	Santana, Oliverio J.	en_US
dc.contributor.author	Ramirez, Alex	en_US
dc.contributor.author	Valero, Mateo	en_US
dc.date.accessioned	2018-11-24T16:33:07Z	-
dc.date.available	2018-11-24T16:33:07Z	-
dc.date.issued	2004	en_US
dc.identifier.issn	1740-0562	en_US
dc.identifier.uri	https://accedacris.ulpgc.es/handle/10553/50504	-
dc.description.abstract	Executing multiple threads has proved to be an effective solution to partially hide latencies that appear in a processor. When a thread is stalled because a long-latency operation, such as a memory access or a floating-point calculation, is being processed, the processor can switch to another context so that another thread can take advantage of the idle resources. However, fetch stall conditions caused by branch predictor delay are not hidden by current simultaneous multithreading (SMT) fetch designs, causing a performance drop due to the absence of instructions to execute. In this paper, we propose several solutions to reduce the effect of branch predictor delay on the performance of SMT processors. Firstly, we analyse the impact of varying the number of access ports. Secondly, we describe a decoupled implementation of an SMT fetch unit that helps to tolerate the predictor delay. Finally, we present an interthread pipelined branch predictor, based on creating a pipeline of interleaved predictions from different threads. Our results show that, when all the proposed techniques are combined, the performance obtained is similar to that obtained using an ideal, 1-cycle access branch predictor.	en_US
dc.language	eng	en_US
dc.relation.ispartof	International Journal of High Performance Computing and Networking	en_US
dc.source	International Journal of High Performance Computing and Networking [ISSN 1740-0562], v. 2 (1), p. 11-21	en_US
dc.subject	330406 Computer architecture	en_US
dc.subject.other	Branch predictor delay	en_US
dc.subject.other	Decoupled predictor	en_US
dc.subject.other	Fetch engine	en_US
dc.subject.other	Predictor pipelining	en_US
dc.subject.other	SMT	en_US
dc.title	A latency-conscious SMT branch prediction architecture	en_US
dc.type	info:eu-repo/semantics/article	es
dc.type	Article	es
dc.identifier.doi	10.1504/IJHPCN.2004.009264	en_US
dc.identifier.scopus	84951714939	-
dc.contributor.authorscopusid	9733156400	-
dc.contributor.authorscopusid	7003605046	-
dc.contributor.authorscopusid	55837529000	-
dc.contributor.authorscopusid	24475914200	-
dc.identifier.eissn	1740-0570	-
dc.description.lastpage	21	-
dc.identifier.issue	1	-
dc.description.firstpage	11	-
dc.relation.volume	2	-
dc.investigacion	Engineering and Architecture	en_US
dc.type2	Article	en_US
dc.identifier.ulpgc	Yes	es
item.grantfulltext	none	-
item.fulltext	No full text	-
crisitem.author.dept	GIR SIANI: Inteligencia Artificial, Robótica y Oceanografía Computacional	-
crisitem.author.dept	IU de Sistemas Inteligentes y Aplicaciones Numéricas en Ingeniería	-
crisitem.author.dept	Departamento de Informática y Sistemas	-
crisitem.author.orcid	0000-0001-7511-5783	-
crisitem.author.parentorg	IU de Sistemas Inteligentes y Aplicaciones Numéricas en Ingeniería	-
crisitem.author.fullName	Santana Jaria, Oliverio Jesús	-
Collection:	Articles

Scopus citations

1

updated on 08-Jun-2025

Views

280

updated on 16-Jan-2026