Vis enkel innførsel

dc.contributor.authorGrannæs, Mariusnb_NO
dc.date.accessioned2014-12-19T13:36:05Z
dc.date.available2014-12-19T13:36:05Z
dc.date.created2010-10-06nb_NO
dc.date.issued2010nb_NO
dc.identifier355273nb_NO
dc.identifier.isbn978-82-471-2177-1 (printed version)nb_NO
dc.identifier.isbn978-82-471-2178-8 (electronic ver.)nb_NO
dc.identifier.urihttp://hdl.handle.net/11250/252194
dc.description.abstractIntegrated circuits have been in constant progression since the first prototype in 1958, with the semiconductor industry maintaining a constant rate of miniaturisation of transistors and wires. Up until about the year 2002, processor performance increased by about 55% per year. Since then, limitations on power, ILP and memory latency have slowed the increase in uniprocessor performance to about 20% per year. Although the capacity of DRAM increases by about 40% per year, the latency only decreases by about 6 { 7% per year. This performance gap between the processor and DRAM leads to a problem known as the memory wall. This thesis aims to improve system memory latency by leveraging available resources with excess capacity. This has been achieved through multiple techniques, but mainly by using excess bandwidth and improving scheduling policies. The first approach presented, destructive read DRAM, changes the underlying assumptions about the contents of a DRAM cell being unchanged after a read. The latency of a read is reduced, but the rest of the memory system requires changes to conserve data. Prefetching predicts what data is needed in the future and fetches that data into the cache before it is referenced. This dissertation presents a technique for generating highly accurate prefetches with good timeliness called Delta Correlating Prediction Tables (DCPT). DCPT uses a table indexed by the load's address to store the delta history of individual loads. Delta correlation is then used to predict future misses. Delta Correlating Prediction Tables with Partial Matching (DCPT-P) extends DCPT by introducing L1 hoisting which moves data from the L2 to the L1 to further increase performance. In addition, DCPT-P leverages partial matching which reduces the spatial resolution of deltas to expose more patterns. The interaction between the memory controller and the prefetcher is especially important, because of the complex 3D structure of modern DRAM. Utilizing open pages can increase the performance of the system significantly. Memory controllers can increase bandwidth utilization and reduce latency at the same time by scheduling prefetches such that the number of page hits are maximized. The interaction between the program, prefetcher and the memory controller is explored. This thesis examines the impact of having a shared memory system in a CMP. When resources are shared, one core might interfere with another core's execution by delaying memory requests or displacing useful data in the cache. This effect is quantified and which components are most prone to interference between cores identified. Finally, we present a framework for measuring interference at runtime.nb_NO
dc.languageengnb_NO
dc.publisherNTNUnb_NO
dc.relation.ispartofseriesDoktoravhandlinger ved NTNU, 1503-8181; 2010:106nb_NO
dc.relation.haspartDybdahl, H; Grannaes, M; Natvig, L. Cache write-back schemes for embedded destructive-read DRAM. Lecture Nnotes in Computer Science - ARCHITECTURE OF COMPUTING SYSTEMS - ARCS 2006, PROCEEDINGS   : 145-159, 2006.nb_NO
dc.relation.haspartDybdahl, Haakon; Kjeldsberg, Per Gunnar; Grannæs, Marius; Natvig, Lasse. Destructive-Read in Embedded DRAM, Impacton Power Consumption. Journal of Embedded Computing. (ISSN 1740-4460). 2(2): 249-260, 2006.nb_NO
dc.relation.haspartGrannæs, Marius; Natvig, Lasse. Hardware Prefetching Using Shadow Tagging. In CMP-MSI: 2nd Workshop on Chip Multiprocessor Memory Systems and Interconnects, 2008, 2008.nb_NO
dc.relation.haspartGrannæs, Marius; Jahre, Magnus; Natvig, Lasse. Low-Cost Open-Page Prefetch Scheduling in Chip Multiprocessors. XXVI IEEE International Conference on Computer Design (ICCD) 2008: 390-396, 2008. <a href='http://dx.doi.org/10.1109/ICCD.2008.4751890'>10.1109/ICCD.2008.4751890</a>.nb_NO
dc.relation.haspartGrannæs, Marius; Jahre, Magnus; Natvig, Lasse. Storage Efficient Hardware Prefetching using Delta Correlating Prediction Tables. Data Prefetching Chamionship - 1, 2009, 2009.nb_NO
dc.relation.haspartJahre, M; Grannæs, M; Natvig, L. A Quantitative Study of Memory System Interference in Chip Multiprocessor Architectures. 11th IEEE International Conference on High Performance Computing and Communications (HPCC) 2009: 622-629, 2009. <a href='http://dx.doi.org/10.1109/HPCC.2009.77'>10.1109/HPCC.2009.77</a>.nb_NO
dc.relation.haspartGrannæs, Marius; Jahre, Magnus; Natvig, Lasse. Multi-Level Hardware Prefetching using Low Complexity Delta Correlating Prediction Tables with Partial Matching.  Lecture Notes in Computer Science = Lecture notes in artificial intelligence: 247-261, 2010. <a href='http://dx.doi.org/10.1007/978-3-642-11515-8_19'>10.1007/978-3-642-11515-8_19</a>.nb_NO
dc.relation.haspartJahre, Magnus; Grannaes, Marius; Natvig, Lasse. DIEF: An Accurate Interference Feedback Mechanism for Chip Multiprocessor Memory Systems. Lecture Notes in Computer Science. (ISSN 0302-9743). 5952: 292-306, 2010. <a href='http://dx.doi.org/10.1007/978-3-642-11515-8_22'>10.1007/978-3-642-11515-8_22</a>.nb_NO
dc.relation.haspartGrannæs, Marius; Jahre, Magnus; Natvig, Lasse. Exploring the Prefetcher/Memory Controller Design Space: An Opportunistic Prefetch Scheduling Strategy. .nb_NO
dc.titleReducing Memory Latency by Improving Resource Utilizationnb_NO
dc.typeDoctoral thesisnb_NO
dc.contributor.departmentNorges teknisk-naturvitenskapelige universitet, Fakultet for informasjonsteknologi, matematikk og elektroteknikk, Institutt for datateknikk og informasjonsvitenskapnb_NO
dc.description.degreePhD i informasjons- og kommunikasjonsteknologinb_NO
dc.description.degreePhD in Information and Communications Technologyen_GB


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel