Identificador persistente para citar o vincular este elemento: http://hdl.handle.net/10553/50508
Campo DC Valoridioma
dc.contributor.authorRamirez, Alexen_US
dc.contributor.authorSantana, Oliverio J.en_US
dc.contributor.authorLarriba-Pey, Josep L.en_US
dc.contributor.authorValero, Mateoen_US
dc.date.accessioned2018-11-24T16:34:56Z-
dc.date.available2018-11-24T16:34:56Z-
dc.date.issued2002en_US
dc.identifier.isbn0-7695-1859-1en_US
dc.identifier.issn1072-4451en_US
dc.identifier.urihttp://hdl.handle.net/10553/50508-
dc.description.abstractFetch performance is a very important factor because it effectively limits the overall processor performance. However there is little performance advantage in increasing front-end performance beyond what the back-end can consume. For each processor design, the target is to build the best possible fetch engine for the required performance level. A fetch engine will be better if it provides better performance, but also if it takes fewer resources, requires less chip area, or consumes less power. In this paper we propose a novel fetch architecture based on the execution of long streams of sequential instructions, taking maximum advantage of code layout optimizations. We describe our architecture in detail, and show that it requires less complexity and resources than other high performance fetch architectures like the trace cache, while providing a high fetch performance suitable for wide-issue superscalar processors. Our results show that using our fetch architecture and code layout optimizations obtains 10% higher performance than the EV8 fetch architecture, and 4% higher than the FTB architecture using state-of-the-art branch predictors, while being only 1.5% slower than the trace cache. Even in the absence of code layout optimizations, fetching instruction streams is still 10% faster than the EV8, and only 4% slower than the trace cache. Fetching instruction streams effectively exploits the special characteristics of layout optimized codes to provide a high fetch performance, close to that of a trace cache, but has a much lower cost and complexity, similar to that of a basic block architecture.en_US
dc.languageengen_US
dc.sourceProceedings of the Annual International Symposium on Microarchitecture, MICRO [ISSN 1072-4451], v. 2002-January (1176264), p. 371-382en_US
dc.subject330406 Arquitectura de ordenadoresen_US
dc.subject.otherComputer architectureen_US
dc.subject.otherCost functionen_US
dc.subject.otherEnginesen_US
dc.subject.otherHipen_US
dc.subject.otherProcess designen_US
dc.titleFetching instruction streamsen_US
dc.typeinfo:eu-repo/semantics/conferenceObjectes
dc.typeConferenceObjectes
dc.identifier.doi10.1109/MICRO.2002.1176264en_US
dc.identifier.scopus84948990915-
dc.identifier.isi000179945200031-
dc.contributor.authorscopusid55837529000-
dc.contributor.authorscopusid7003605046-
dc.contributor.authorscopusid6603588897-
dc.contributor.authorscopusid24475914200-
dc.description.lastpage382-
dc.identifier.issue1176264-
dc.description.firstpage371-
dc.relation.volume2002-January-
dc.investigacionIngeniería y Arquitecturaen_US
dc.type2Actas de congresosen_US
dc.identifier.ulpgces
item.grantfulltextnone-
item.fulltextSin texto completo-
crisitem.author.deptGIR SIANI: Inteligencia Artificial, Robótica y Oceanografía Computacional-
crisitem.author.deptIU Sistemas Inteligentes y Aplicaciones Numéricas-
crisitem.author.deptDepartamento de Informática y Sistemas-
crisitem.author.orcid0000-0001-7511-5783-
crisitem.author.parentorgIU Sistemas Inteligentes y Aplicaciones Numéricas-
crisitem.author.fullNameSantana Jaria, Oliverio Jesús-
Colección:Actas de congresos
Vista resumida

Citas SCOPUSTM   

36
actualizado el 30-mar-2025

Citas de WEB OF SCIENCETM
Citations

13
actualizado el 30-mar-2025

Visitas

65
actualizado el 27-ene-2024

Google ScholarTM

Verifica

Altmetric


Comparte



Exporta metadatos



Los elementos en ULPGC accedaCRIS están protegidos por derechos de autor con todos los derechos reservados, a menos que se indique lo contrario.