Introduction to RISC Processor. ni logic Pvt. Ltd., Pune



Similar documents
ARM Microprocessor and ARM-Based Microcontrollers

İSTANBUL AYDIN UNIVERSITY

ADVANCED PROCESSOR ARCHITECTURES AND MEMORY ORGANISATION Lesson-12: ARM

Chapter 2 Logic Gates and Introduction to Computer Architecture

Overview. CISC Developments. RISC Designs. CISC Designs. VAX: Addressing Modes. Digital VAX

CISC, RISC, and DSP Microprocessors

OpenSPARC T1 Processor

4 RISC versus CISC Architecture

AMD Opteron Quad-Core

Pentium vs. Power PC Computer Architecture and PCI Bus Interface

Generations of the computer. processors.

A Survey on ARM Cortex A Processors. Wei Wang Tanima Dey

CPU Organization and Assembly Language

Intel 8086 architecture

Processor Architectures

Computer Architecture TDTS10

Am186ER/Am188ER AMD Continues 16-bit Innovation

CHAPTER 4 MARIE: An Introduction to a Simple Computer

Instruction Set Architecture. or How to talk to computers if you aren t in Star Trek

Chapter 6. Inside the System Unit. What You Will Learn... Computers Are Your Future. What You Will Learn... Describing Hardware Performance

Logical Operations. Control Unit. Contents. Arithmetic Operations. Objectives. The Central Processing Unit: Arithmetic / Logic Unit.

Board Notes on Virtual Memory

Advanced Computer Architecture-CS501. Computer Systems Design and Architecture 2.1, 2.2, 3.2

Central Processing Unit (CPU)

The Central Processing Unit:

Instruction Set Architecture (ISA)

An Overview of Stack Architecture and the PSC 1000 Microprocessor

VHDL DESIGN OF EDUCATIONAL, MODERN AND OPEN- ARCHITECTURE CPU

CHAPTER 2: HARDWARE BASICS: INSIDE THE BOX

EE482: Advanced Computer Organization Lecture #11 Processor Architecture Stanford University Wednesday, 31 May ILP Execution

A New, High-Performance, Low-Power, Floating-Point Embedded Processor for Scientific Computing and DSP Applications

The Orca Chip... Heart of IBM s RISC System/6000 Value Servers

CS 61C: Great Ideas in Computer Architecture Virtual Memory Cont.

CHAPTER 7: The CPU and Memory

Full and Para Virtualization

Intel StrongARM SA-110 Microprocessor

Computer System: User s View. Computer System Components: High Level View. Input. Output. Computer. Computer System: Motherboard Level

Computer Organization and Architecture. Characteristics of Memory Systems. Chapter 4 Cache Memory. Location CPU Registers and control unit memory

Computer Organization and Architecture

In the Beginning The first ISA appears on the IBM System 360 In the good old days

Solution: start more than one instruction in the same clock cycle CPI < 1 (or IPC > 1, Instructions per Cycle) Two approaches:

Lecture 3: Modern GPUs A Hardware Perspective Mohamed Zahran (aka Z) mzahran@cs.nyu.edu

Microprocessor and Microcontroller Architecture

Unit A451: Computer systems and programming. Section 2: Computing Hardware 1/5: Central Processing Unit

A Lab Course on Computer Architecture

W4118: segmentation and paging. Instructor: Junfeng Yang

Operating System Impact on SMT Architecture

SOC architecture and design

Management Challenge. Managing Hardware Assets. Central Processing Unit. What is a Computer System?

The AVR Microcontroller and C Compiler Co-Design Dr. Gaute Myklebust ATMEL Corporation ATMEL Development Center, Trondheim, Norway

EE361: Digital Computer Organization Course Syllabus

BEAGLEBONE BLACK ARCHITECTURE MADELEINE DAIGNEAU MICHELLE ADVENA

MPSoC Designs: Driving Memory and Storage Management IP to Critical Importance

The ARM Architecture. With a focus on v7a and Cortex-A8

Chapter 2 Basic Structure of Computers. Jin-Fu Li Department of Electrical Engineering National Central University Jungli, Taiwan

Programming Logic controllers

MC68060 MC68LC060 MC68EC060

ARM Architecture. ARM history. Why ARM? ARM Ltd developed by Acorn computers. Computer Organization and Assembly Languages Yung-Yu Chuang

CPUs - CPU PN/DP

DEPARTMENT OF COMPUTER SCIENCE & ENGINEERING Question Bank Subject Name: EC Microprocessor & Microcontroller Year/Sem : II/IV

Making Multicore Work and Measuring its Benefits. Markus Levy, president EEMBC and Multicore Association

RISC AND CISC. Computer Architecture. Farhat Masood BE Electrical (NUST) COLLEGE OF ELECTRICAL AND MECHANICAL ENGINEERING

Chapter 13. PIC Family Microcontroller

SPARC64 VIIIfx: CPU for the K computer

Design Cycle for Microprocessors

Lecture 11: Multi-Core and GPU. Multithreading. Integration of multiple processor cores on a single chip.

OC By Arsene Fansi T. POLIMI

VLIW Processors. VLIW Processors

Computers. Hardware. The Central Processing Unit (CPU) CMPT 125: Lecture 1: Understanding the Computer

This Unit: Putting It All Together. CIS 501 Computer Architecture. Sources. What is Computer Architecture?

Architecture of Hitachi SR-8000

Introducción. Diseño de sistemas digitales.1

Let s put together a Manual Processor

LSN 2 Computer Processors

Computer Architecture

361 Computer Architecture Lecture 14: Cache Memory


Thread level parallelism

picojava TM : A Hardware Implementation of the Java Virtual Machine

The i860 XP Second Generation of the i860 Supercomputing Microprocessor Family. Presentation Outline

Unit 4: Performance & Benchmarking. Performance Metrics. This Unit. CIS 501: Computer Architecture. Performance: Latency vs.

CS:APP Chapter 4 Computer Architecture. Wrap-Up. William J. Taffe Plymouth State University. using the slides of

AC : A PROCESSOR DESIGN PROJECT FOR A FIRST COURSE IN COMPUTER ORGANIZATION

Instruction Set Design

Bindel, Spring 2010 Applications of Parallel Computers (CS 5220) Week 1: Wednesday, Jan 27

Chapter 5 Instructor's Manual

An Introduction to the ARM 7 Architecture

Introduction to Microprocessors

COS 318: Operating Systems. Virtual Memory and Address Translation

Introduction to Cloud Computing

Faculty of Engineering Student Number:

How To Understand The Design Of A Microprocessor

Open Architecture Design for GPS Applications Yves Théroux, BAE Systems Canada

Why Latency Lags Bandwidth, and What it Means to Computing

Computer Architecture Lecture 2: Instruction Set Principles (Appendix A) Chih Wei Liu 劉 志 尉 National Chiao Tung University

on an system with an infinite number of processors. Calculate the speedup of

THE BASICS OF PERFORMANCE- MONITORING HARDWARE

Digitale Signalverarbeitung mit FPGA (DSF) Soft Core Prozessor NIOS II Stand Mai Jens Onno Krah

(Refer Slide Time: 00:01:16 min)

Software Pipelining. for (i=1, i<100, i++) { x := A[i]; x := x+1; A[i] := x

Transcription:

Introduction to RISC Processor ni logic Pvt. Ltd., Pune

AGENDA What is RISC & its History What is meant by RISC Architecture of MIPS-R4000 Processor Difference Between RISC and CISC Pros and Cons of RISC Why RISC 2

What is RISC and its History RISC stands for Reduced Instruction Set Computer. The IBM was the first company to define the RISC architecture in the 1970s. This research was further developed by the universities of Berkeley and Stanford to give basic architectural models. 3

What is RISC and its History RISC can be described as a philosophy with three basic levels : All instruction will be executed in a single cycle. Memory will only be accessed via load and store instruction. All executions units will be hardwired with no micro coding. 4

What is meant by RISC The instruction set is the hardware language in which the software tells the processor what to do. Surprisingly, reducing the size of instruction set -- eliminating certain instructions based upon a careful quantitative analysis, and requiring these seldom - used instructions to be emulated in software -- can lead to higher performance. For several reasons : 5

What is meant by RISC Some of the uses for the space : Additional registers. on-chip caches which are are clocked as fast as the Processor. Additional functional units for superscalar execution. On-chip support for floating-point operations. Increased pipeline depth. Branch prediction. Additional non-risc (but fast) instructions. 6

What is meant by RISC The vacated area of chip can be used in ways that accelerate the performance of more commonly used instructions. It becomes easier to optimize the design. It allows MP to use techniques hitherto restricted to the largest computers. 7

What is meant by RISC Basically the philosophy is, that instructions are handled in parts: Fetch the instruction. Get the arguments Perform the action Write back the result 8

What is meant by RISC Another view point was that processors currently use a 32-bit data bus anyway, so why not make every instruction 32-bits wide? Every instruction cycle fetches 32-bits anyway. Using these 32-bits efficiently makes it logical, to use one byte to code the actual instruction and the other three bytes, to code the registers to act on. This makes using triadic instruction logical: 9

What is meant by RISC For example: add ro,r1,r2 which means : r0 = r1 + r2. 10

RISC characteristics Simple instruction set. Same length instructions. 1 machine-cycle instructions. 11

R4000 Internal Block Diagram 12

MIPS-R4000 Processor Processor General Features : CPU Overview Register and Inst. Set Overview Efficient pipeline. Memory Management Unit (MMU). Memory Organization. System control coprocessor (CP0). Floating - point Unit (CP1). 13

64 - Bit Architecture The R4000 processor contains 32 general purpose 64-bit registers. The natural mode of operation for the R4000 processor is as a 64-bit MP; however, 32-bit applications maintain compatibility even when the processor operates as a 64-bit processor. All instructions are 32 bits wide. 14

64-Bit Architecture The R4000 processor provides the following : 64-bit on-chip floating point unit(fpu). 64-bit integer arithmetic logic unit(alu). 64-bit integer registers. 64-bit virtual address space. 64-bit system bus. 15

CPU Register Overview The CPU provides the following registers : 32 general purpose register. A program counter(pc) register. 2 registers that hold the results of integer multiply and divide operations( HI & LO). The R4000 has no Program Status Word(PSW) register, as such this is covered by the status and cause registers incorporated within the system control coprocessor(cp0). 16

CPU Register Overview 17

CPU Instruction Set Overview Each CPU instruction is 32-bits long. There are three instruction formats : immediate ( I - type) jump (J - type) register (r - type) 18

CPU Instruction Set Overview 19

CPU Instruction Set Overview Instruction Types : Load and store Instructions. Computational Instructions. Jump and Branch Instructions. Special Instructions. Exception Instructions. Coprocessor Instructions 20

Superpipeline Architecture The processor exploits instructions parallelism by using an eightstage superpipeline which places no restrictions on the instructions issued. Under normal circumstances, two instructions are issued each cycle. The internal pipeline operates at the twice the frequency of the master clock. 21

Superpipeline Architecture The CPU has an eight-stage instruction pipeline; each stage takes one PCycle. Thus, the execution of each instruction takes at least eight PCycles. Once the pipeline has been filled, eight instructions are executed simultaneously. 22

Superpipeline Architecture 23

Superpipeline Architecture IF IS - Instruction Fetch, First half. - Instruction Fetch, second half. RF - Register Fetch. EX - Execution. DF - Data Fetch, First half. DS - Data Fetch, second half. TC - Tag check. WB - Write Back. 24

Memory Management Unit(MMU) The MIPS R4000 processor provides a full-featured MMU which uses an on-chip translation lookaside buffer(tlb) to translate virtual addresses into physical addresses. 25

Memory Management Unit(MMU) Translation Lookaside Buffer(TLB): Mapped virtual address are translated into physical address using an on-chip TLB. The TLB is a fully associative memory that holds 48 entries. Which provide mapping to 48 odd/even page pairs(96 pages). The address mapped to a page ranges in size from 4Kbytes to 16 Mbytes, in multiples of 4 - i.e., 4K, 16K, 256K, 1M, 4M, 16M. 26

Memory Management Unit(MMU) Virtual address space: The processor virtual address space can be either 32 or 64 bits wide, depending on whether the processor is operating in 32-bit or 64-bit mode. In 32-bit mode, addresses are 32 bits wide. The maximum user process size is 2 Gbytes(231). In 64-bit mode, addresses are 64-bits wide. The maximum user process size is 1 Terabyte(240). 27

Memory Management Unit(MMU) 28

Memory Organization 29

Memory Organization The R4000 Primary instruction and data caches resides on-chip, and can each hold 8Kbytes. Architecturally, each primary cache can be increased to hold up to 32 Kbytes. An off-chip secondary cache (R4000SC and R4000MC only) can hold from 128 Kbytes to 4Mbytes. 30

Memory Organization 31

Memory Organization 32

Memory Organization 33

System Control Coprocessor(CP0) CP0 translates virtual addresses into physical addresses and manages exceptions and transitions between kernel, supervisor, and user states. CP0 also controls the cache subsystem, as well as providing diagnostic control and error recovery facilities. 34

System Control Coprocessor(CP0) 35

Floating Point Unit(FPU), CP1 R4000 has on-chip floating point unit designated as CP1. The FPU extends the CPU instruction set to perform arithmetic operations on floating-point values. The FPU features include : Full 64-bit Operation. Load and store instruction set. Tightly coupled coprocessor Interface. 36

Floating Point Unit(FPU) 37

RISC Vs CISC What really distinguishes RISC from CISC these days is more deeply rooted in the chip architectures, among them : RISC microprocessors have more general purpose registers. RISC microprocessors uses uniform instruction length. RISC microprocessors emphasize floating-point performance. 38

Advantages of RISC Speed. Simpler hardware. Shorter design cycle. User(programmer) benefits. 39

Hazards of RISC Code quality. Debugging. Code expansion. System design. 40

Why RISC...? Which processor will I use? And how should I choose it? Applications : High speed data transmission, Real time processing, controlling applications,..etc. 41

Remember...! Reducing or simplifying the instruction set was not the primary goal of RISC architecture; it is a pleasant side effect of techniques used to gain the highest performance possible from available technology. 42