1. Memory technology & Hierarchy



Similar documents
Computer Architecture

A N. O N Output/Input-output connection

Computer Systems Structure Main Memory Organization

RAM & ROM Based Digital Design. ECE 152A Winter 2012

Homework # 2. Solutions. 4.1 What are the differences among sequential access, direct access, and random access?

Memory Basics. SRAM/DRAM Basics

Chapter 5 :: Memory and Logic Arrays

SOLVING HIGH-SPEED MEMORY INTERFACE CHALLENGES WITH LOW-COST FPGAS

Module 2. Embedded Processors and Memory. Version 2 EE IIT, Kharagpur 1

Memory unit. 2 k words. n bits per word

Memory. The memory types currently in common usage are:

Memory ICS 233. Computer Architecture and Assembly Language Prof. Muhamed Mudawar

Configuring Memory on the HP Business Desktop dx5150

ADVANCED PROCESSOR ARCHITECTURES AND MEMORY ORGANISATION Lesson-17: Memory organisation, and types of memory

Computer Architecture

Semiconductor Device Technology for Implementing System Solutions: Memory Modules

Understanding Memory TYPES OF MEMORY

Chapter 2 Basic Structure of Computers. Jin-Fu Li Department of Electrical Engineering National Central University Jungli, Taiwan

Random-Access Memory (RAM) The Memory Hierarchy. SRAM vs DRAM Summary. Conventional DRAM Organization. Page 1

Switch Fabric Implementation Using Shared Memory

Features. DDR SODIMM Product Datasheet. Rev. 1.0 Oct. 2011

Chapter 6. Inside the System Unit. What You Will Learn... Computers Are Your Future. What You Will Learn... Describing Hardware Performance

Table 1: Address Table

Memory Hierarchy. Arquitectura de Computadoras. Centro de Investigación n y de Estudios Avanzados del IPN. adiaz@cinvestav.mx. MemoryHierarchy- 1

Open Flow Controller and Switch Datasheet

Objectives. Units of Memory Capacity. CMPE328 Microprocessors (Spring ) Memory and I/O address Decoders. By Dr.

Byte Ordering of Multibyte Data Items

The Central Processing Unit:

Random Access Memory (RAM) Types of RAM. RAM Random Access Memory Jamie Tees SDRAM. Micro-DIMM SO-DIMM

With respect to the way of data access we can classify memories as:

CHAPTER 7: The CPU and Memory

NAND Flash FAQ. Eureka Technology. apn5_87. NAND Flash FAQ

User s Manual HOW TO USE DDR SDRAM

Price/performance Modern Memory Hierarchy

Algorithms and Methods for Distributed Storage Networks 3. Solid State Disks Christian Schindelhauer

Memory Basics ~ ROM, DRAM, SRAM, Cache Memory The article was added by Kyle Duke

Low Power AMD Athlon 64 and AMD Opteron Processors

RAM. Overview DRAM. What RAM means? DRAM

Chapter 9 Semiconductor Memories. Jin-Fu Li Department of Electrical Engineering National Central University Jungli, Taiwan

361 Computer Architecture Lecture 14: Cache Memory

Chapter 2: Computer-System Structures. Computer System Operation Storage Structure Storage Hierarchy Hardware Protection General System Architecture

Computer-System Architecture

Memory technology evolution: an overview of system memory technologies

CSCA0102 IT & Business Applications. Foundation in Business Information Technology School of Engineering & Computing Sciences FTMS College Global

Logical Operations. Control Unit. Contents. Arithmetic Operations. Objectives. The Central Processing Unit: Arithmetic / Logic Unit.

Slide Set 8. for ENCM 369 Winter 2015 Lecture Section 01. Steve Norman, PhD, PEng

COMPUTER HARDWARE. Input- Output and Communication Memory Systems

Atmel Norway XMEGA Introduction

White Paper Utilizing Leveling Techniques in DDR3 SDRAM Memory Interfaces

Agenda. Michele Taliercio, Il circuito Integrato, Novembre 2001

Computers. Hardware. The Central Processing Unit (CPU) CMPT 125: Lecture 1: Understanding the Computer

DDR4 Memory Technology on HP Z Workstations

Technical Note. Initialization Sequence for DDR SDRAM. Introduction. Initializing DDR SDRAM

Memory Systems. Static Random Access Memory (SRAM) Cell

ThinkServer PC DDR3 1333MHz UDIMM and RDIMM PC DDR3 1066MHz RDIMM options for the next generation of ThinkServer systems TS200 and RS210

Technical Note. Micron NAND Flash Controller via Xilinx Spartan -3 FPGA. Overview. TN-29-06: NAND Flash Controller on Spartan-3 Overview

Am186ER/Am188ER AMD Continues 16-bit Innovation

OpenSPARC T1 Processor

AMD Opteron Quad-Core

1.1 Silicon on Insulator a brief Introduction

A New Chapter for System Designs Using NAND Flash Memory

ADQYF1A08. DDR2-1066G(CL6) 240-Pin O.C. U-DIMM 1GB (128M x 64-bits)

Semiconductor Memories

Chapter Introduction. Storage and Other I/O Topics. p. 570( 頁 585) Fig I/O devices can be characterized by. I/O bus connections

NTE2053 Integrated Circuit 8 Bit MPU Compatible A/D Converter

DDR3 memory technology

Chapter 7 Memory and Programmable Logic

Lecture 3: Modern GPUs A Hardware Perspective Mohamed Zahran (aka Z) mzahran@cs.nyu.edu

Class 18: Memories-DRAMs

Modeling Sequential Elements with Verilog. Prof. Chien-Nan Liu TEL: ext: Sequential Circuit

Intel 965 Express Chipset Family Memory Technology and Configuration Guide

Lecture 9: Memory and Storage Technologies

Lecture 2: Computer Hardware and Ports.

SLC vs. MLC: An Analysis of Flash Memory

COMPUTER SCIENCE AND ENGINEERING - Microprocessor Systems - Mitchell Aaron Thornton

Table 1 SDR to DDR Quick Reference

GR2DR4B-EXXX/YYY/LP 1GB & 2GB DDR2 REGISTERED DIMMs (LOW PROFILE)

Introduction To Computers: Hardware and Software

The Quest for Speed - Memory. Cache Memory. A Solution: Memory Hierarchy. Memory Hierarchy

Communicating with devices

Technical Product Specifications Dell Dimension 2400 Created by: Scott Puckett

Computing for Data-Intensive Applications:

Storage Class Memory and the data center of the future

7a. System-on-chip design and prototyping platforms

McPAT: An Integrated Power, Area, and Timing Modeling Framework for Multicore and Manycore Architectures

Lizy Kurian John Electrical and Computer Engineering Department, The University of Texas as Austin

TECHNOLOGY BRIEF. Compaq RAID on a Chip Technology EXECUTIVE SUMMARY CONTENTS

Intel X38 Express Chipset Memory Technology and Configuration Guide

Intel Q35/Q33, G35/G33/G31, P35/P31 Express Chipset Memory Technology and Configuration Guide

In-Memory Databases Algorithms and Data Structures on Modern Hardware. Martin Faust David Schwalb Jens Krüger Jürgen Müller

Bandwidth Calculations for SA-1100 Processor LCD Displays

Why Latency Lags Bandwidth, and What it Means to Computing

QUICK START GUIDE FOR DEMONSTRATION CIRCUIT BIT DIFFERENTIAL ADC WITH I2C LTC2485 DESCRIPTION

Driving The Need For Innovative Memory Solutions

Unit A451: Computer systems and programming. Section 2: Computing Hardware 1/5: Central Processing Unit

EE482: Advanced Computer Organization Lecture #11 Processor Architecture Stanford University Wednesday, 31 May ILP Execution

Computer Organization. and Instruction Execution. August 22

Mobile SDRAM. MT48H16M16LF 4 Meg x 16 x 4 banks MT48H8M32LF 2 Meg x 32 x 4 banks

1.55V DDR2 SDRAM FBDIMM

Power-Aware High-Performance Scientific Computing

Transcription:

1. Memory technology & Hierarchy RAM types Advances in Computer Architecture Andy D. Pimentel

Memory wall Memory wall = divergence between CPU and RAM speed We can increase bandwidth by introducing concurrency in memory access (e.g. through pipelining accesses) but this requires regular access patterns random accesses to main memory can cause severe performance degradation

Memory hierarchy - issues Conflicting requirements in a memory system: we want both large and fast Electronic systems have higher latencies as they increase in size Speed of light is approximately 1ns for 30cms N.b. 1 ns is 3 clock cycles in a state of the art processor Pins, wires, connectors etc. all add resistance and capacitance which delay signals significantly A memory hierarchy attempts to make a large slow memory appear fast by buffering data in smaller faster memories close to the processor

Memory hierarchy - issues Can make memories faster but this requires more power Need to drive long wires on chip across memory array to do this fast needs more power Memory performance is a compromise between power and performance (As is processor performance today)

RAM cell designs

Components - DRAM DRAM - is a very dense form of RAM - it is volatile access is destructive & data must be re-written charge also leaks from the capacitor which stores the data so data must be refreshed periodically ( dynamic RAM) Typical DRAM chip characteristics 256-1024 Mbit and 2-800MHz cycle Uses two cycle Row/Column addressing (RAS/CAS) two-stage access and requirement to rewrite data contribute to slow cycle time but good design can help for regular accesses

DRAM - RAS/CAS addressing

DRAM refresh Voltage for 1 1 Written Refreshed Refreshed Refreshed Threshold voltage Voltage for 0 0 Stored 10s of ms before needing refresh cycle Time

Components - SRAM SRAM - is less dense but faster than DRAM uses four transistors to store data and two transistors to access it - access is non-destructive data is stable while the RAM has power connected ( static RAM) Typical SRAM chip characteristics ~64Mbit and 100MHz-1GHz cycle although SRAM cycle times are similar to DRAM, SRAM is true random access memory DRAM can only read consecutive bits at the cycle rate, therefore DRAM has a much larger latency time SRAM is also used for memory on the processor chip registers and cache both use SRAM technology the smaller the memory the faster it operates: lower latency

New technology Memristor a circuit element in which the resistance is a function of the history of the current through the device Memristor theory was formulated and named by Leon Chua in 1971 Ref: Chua, Leon O (1971), "Memristor -The Missing Circuit Element", IEEE Transactions on Circuit Theory CT-18 (5): 507 519, doi:10.1109/tct.1971.1083337 HP announced they teamed up with Hynix to produce a commercial product dubbed "ReRam" August 2010 http://www.reuters.com/article/idus254583059320100901 has the potential to be dense and non-volatile

Bandwidth vs. Latency Memory latency is the time delay required to obtain a specific item of data This is larger in DRAM than in SRAM SRAM can access any bit each cycle DRAM is restricted to bits in the same row, CAS cycles Memory Bandwidth is the rate at which data can be accessed (e.g. bits per second) Bandwidth unit is normally 1/cycle time This rate can be improved by concurrent access

Improving DRAM bandwidth Using locality to get maximum bandwidth One RAS multiple CAS e.g. Fast page mode DRAM E(xtended) D(ata) O(utput) RAMs Burst-mode DRAMs Burst EDO RAM S(ynchronous)DRAM By improving the interface DDR SDRAM and RAMBUS

Example - Synchronous DRAM SDRAM changed the memory interface from asynchronous to synchronous and uses a form of pipelining will return to this concept later DDR SDRAM - double data rate uses transfers on both rising and falling clock edges

Despite the performance improvement in the overall system due to use of SDRAM, the growing performance gap between the memory and processor must be filled by more advanced memory Different technologies. These technologies, SDRAM which are described technologies on the following pages, boost the overall performance of systems using the latest high-speed processors (Figure 8). Figure 8. Peak bandwidth comparison of SDRAM and advanced SDRAM technologies

RAMBUS DRAMs This is an interface improvement using a pipelined bus interface sometimes called a split-transaction Bus comprises row and column address line + 18 bits of data 3 transactions on bus simultaneously (RAS/CAS/Data) High clock rate (400MHz) with data transfers on both edges Note that neither technique (SDRAM or RAMBUS) can improve the latency to access a single item of data

Flash memory S o u r c e l i n e s Control gate Floating gate Source Word lines n p substrate n+ B i t l i n e s Drain

Memory wall solutions The most common solution to the memory wall is to cache data requires locality of access or memory reuse compiler optimisations can help to localise data Can also design banked memory systems to provide high bandwidth to random memory locations Some access patterns will still break the memory Can design processors that tolerate high-latency memory accesses don t wait do something else Requires concurrency in instruction execution

Summary Need for new dense + fast technology Most common solution: cache data Requires locality of access, or memory reuse Design processors that tolerate high-latency memory accesses don t wait do something else Trend: hardware multithreading in current and future chips

Summary Limits of RAM components cause Memory wall problem Caches & hw multithreading solutions New components (?)