A latency-conscious SMT branch prediction architecture

Falcón, Ayose; Santana, Oliverio J.; Ramirez, Alex; Valero, Mateo

Please use this identifier to cite or link to this item: https://accedacris.ulpgc.es/jspui/handle/10553/50504

DC Field	Value	Language
dc.contributor.author	Falcón, Ayose	en_US
dc.contributor.author	Santana, Oliverio J.	en_US
dc.contributor.author	Ramirez, Alex	en_US
dc.contributor.author	Valero, Mateo	en_US
dc.date.accessioned	2018-11-24T16:33:07Z	-
dc.date.available	2018-11-24T16:33:07Z	-
dc.date.issued	2004	en_US
dc.identifier.issn	1740-0562	en_US
dc.identifier.uri	https://accedacris.ulpgc.es/handle/10553/50504	-
dc.description.abstract	Executing multiple threads has proved to be an effective solution to partially hide latencies that appear in a processor. When a thread is stalled because of a long-latency operation is being processed, such as a memory access or a floating-point calculation, the processor can switch to another context so that another thread can take advantage of the idle resources. However, fetch stall conditions caused by a branch predictor delay are not hidden by current simultaneous multithreading (SMT) fetch designs, causing a performance drop due to the absence of instructions to execute. In this paper, we propose several solutions to reduce the effect of branch predictor delay in the performance of SMT processors. Firstly, we analyse the impact of varying the number of access ports. Secondly, we describe a decoupled implementation of an SMT fetch unit that helps to tolerate the predictor delay. Finally, we present an interthread pipelined branch predictor, based on creating a pipeline of interleaved predictions from different threads. Our results show that, combining all the proposed techniques, the performance obtained is similar to that obtained using an ideal, 1-cycle access branch predictor.	en_US
dc.language	eng	en_US
dc.relation.ispartof	International Journal of High Performance Computing and Networking	en_US
dc.source	International Journal of High Performance Computing and Networking [ISSN 1740-0562], v. 2 (1), p. 11-21	en_US
dc.subject	330406 Arquitectura de ordenadores	en_US
dc.subject.other	Branch predictor delay	en_US
dc.subject.other	Decoupled predictor	en_US
dc.subject.other	Fetch engine	en_US
dc.subject.other	Predictor pipelining	en_US
dc.subject.other	SMT	en_US
dc.title	A latency-conscious SMT branch prediction architecture	en_US
dc.type	info:eu-repo/semantics/article	es
dc.type	Article	es
dc.identifier.doi	10.1504/IJHPCN.2004.009264	en_US
dc.identifier.scopus	84951714939	-
dc.contributor.authorscopusid	9733156400	-
dc.contributor.authorscopusid	7003605046	-
dc.contributor.authorscopusid	55837529000	-
dc.contributor.authorscopusid	24475914200	-
dc.identifier.eissn	1740-0570	-
dc.description.lastpage	21	-
dc.identifier.issue	1	-
dc.description.firstpage	11	-
dc.relation.volume	2	-
dc.investigacion	Ingeniería y Arquitectura	en_US
dc.type2	Artículo	en_US
dc.identifier.ulpgc	Sí	es
item.grantfulltext	none	-
item.fulltext	Sin texto completo	-
crisitem.author.dept	GIR SIANI: Inteligencia Artificial, Robótica y Oceanografía Computacional	-
crisitem.author.dept	IU de Sistemas Inteligentes y Aplicaciones Numéricas en Ingeniería	-
crisitem.author.dept	Departamento de Informática y Sistemas	-
crisitem.author.orcid	0000-0001-7511-5783	-
crisitem.author.parentorg	IU de Sistemas Inteligentes y Aplicaciones Numéricas en Ingeniería	-
crisitem.author.fullName	Santana Jaria, Oliverio Jesús	-
Appears in Collections:	Artículos

Show simple item record

SCOPUS^TM
Citations

1

checked on Jun 8, 2025

Page view(s)

280

checked on Jan 16, 2026

SCOPUS^TM
Citations

Page view(s)

Google Scholar^TM

Altmetric

Share

Export metadata

Dirección

Contacto

Legal

De interés

SCOPUSTM Citations

Page view(s)

Google ScholarTM

Altmetric

Share

Export metadata

Dirección

SCOPUS^TM
Citations

Google Scholar^TM