Machine-Level Programming II: Arithmetic & Control

Similar documents
Machine Programming II: Instruc8ons

Lecture 7: Machine-Level Programming I: Basics Mohamed Zahran (aka Z)

Instruction Set Architecture

CS:APP Chapter 4 Computer Architecture Instruction Set Architecture. CS:APP2e

Assembly Language: Function Calls" Jennifer Rexford!

CS61: Systems Programing and Machine Organization

x64 Cheat Sheet Fall 2015

Chapter 4 Processor Architecture

Return-oriented programming without returns

Lecture 27 C and Assembly

CS412/CS413. Introduction to Compilers Tim Teitelbaum. Lecture 20: Stack Frames 7 March 08

l C-Programming l A real computer language l Data Representation l Everything goes down to bits and bytes l Machine representation Language

Hacking Techniques & Intrusion Detection. Ali Al-Shemery arabnix [at] gmail

CS:APP Chapter 4 Computer Architecture Instruction Set Architecture

Intel Assembler. Project administration. Non-standard project. Project administration: Repository

Off-by-One exploitation tutorial

Machine-Level Programming I: Basics

X86-64 Architecture Guide

x86-64 Machine-Level Programming

64-Bit NASM Notes. Invoking 64-Bit NASM

For a 64-bit system. I - Presentation Of The Shellcode

Giving credit where credit is due

CS:APP Chapter 4 Computer Architecture. Wrap-Up. William J. Taffe Plymouth State University. using the slides of

CHAPTER 6 TASK MANAGEMENT

A Tiny Guide to Programming in 32-bit x86 Assembly Language

Introduction. Application Security. Reasons For Reverse Engineering. This lecture. Java Byte Code

Overview of IA-32 assembly programming. Lars Ailo Bongo University of Tromsø

Introduction. Compiler Design CSE 504. Overview. Programming problems are easier to solve in high-level languages

Computer Architectures

Complete 8086 instruction set

Boolean Expressions, Conditions, Loops, and Enumerations. Precedence Rules (from highest to lowest priority)

Computer Organization and Assembly Language

Computer Organization and Components

Intel 8086 architecture

Computer Organization and Architecture

Systems Design & Programming Data Movement Instructions. Intel Assembly

TODAY, FEW PROGRAMMERS USE ASSEMBLY LANGUAGE. Higher-level languages such

Abysssec Research. 1) Advisory information. 2) Vulnerable version

Chapter 7D The Java Virtual Machine

Buffer Overflows. Security 2011

Andreas Herrmann. AMD Operating System Research Center

Faculty of Engineering Student Number:

Unpacked BCD Arithmetic. BCD (ASCII) Arithmetic. Where and Why is BCD used? From the SQL Server Manual. Packed BCD, ASCII, Unpacked BCD

Language Processing Systems

Stack Overflows. Mitchell Adair

Instruction Set Architecture

Automating Mimicry Attacks Using Static Binary Analysis

Reduced Instruction Set Computer (RISC)

Notes on x86-64 programming

Overview. CISC Developments. RISC Designs. CISC Designs. VAX: Addressing Modes. Digital VAX

Programming from the Ground Up. Jonathan Bartlett

Instruction Set Architecture. or How to talk to computers if you aren t in Star Trek

Programming Languages

Stack machines The MIPS assembly language A simple source language Stack-machine implementation of the simple language Readings:

Programming from the Ground Up

MACHINE ARCHITECTURE & LANGUAGE

Encryption Wrapper. on OSX

How to make the computer understand? Lecture 15: Putting it all together. Example (Output assembly code) Example (input program) Anatomy of a Computer

Programming from the Ground Up

Using the RDTSC Instruction for Performance Monitoring

G-Free: Defeating Return-Oriented Programming through Gadget-less Binaries

An Introduction to Assembly Programming with the ARM 32-bit Processor Family

Automated Repair of Binary and Assembly Programs for Cooperating Embedded Devices

Programming Languages and Compilers

The Beast is Resting in Your Memory On Return-Oriented Programming Attacks and Mitigation Techniques To appear at USENIX Security & BlackHat USA, 2014

Heap-based Buffer Overflow Vulnerability in Adobe Flash Player

Software Fingerprinting for Automated Malicious Code Analysis

Where we are CS 4120 Introduction to Compilers Abstract Assembly Instruction selection mov e1 , e2 jmp e cmp e1 , e2 [jne je jgt ] l push e1 call e

Assembly Language Tutorial

High-speed image processing algorithms using MMX hardware

Simple C Programs. Goals for this Lecture. Help you learn about:

Under The Hood: The System Call

Embedded Systems. Review of ANSI C Topics. A Review of ANSI C and Considerations for Embedded C Programming. Basic features of C

CPU performance monitoring using the Time-Stamp Counter register

Instruction Set Architecture (ISA)

Stitching the Gadgets On the Ineffectiveness of Coarse-Grained Control-Flow Integrity Protection

Anatomy and Performance of SSL Processing

Recovery of Jump Table Case Statements from Binary Code

TitanMist: Your First Step to Reversing Nirvana TitanMist. mist.reversinglabs.com

The programming language C. sws1 1

esrever gnireenigne tfosorcim seiranib

Very Large Instruction Word Architectures (VLIW Processors and Trace Scheduling)

Compiler and Language Processing Tools

Compilers I - Chapter 4: Generating Better Code

Return-oriented Programming: Exploitation without Code Injection

Dynamic Behavior Analysis Using Binary Instrumentation

M A S S A C H U S E T T S I N S T I T U T E O F T E C H N O L O G Y DEPARTMENT OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCE

Intel Architecture Software Developer s Manual

About the Tutorial. Audience. Prerequisites. Copyright & Disclaimer

What to do when I have a load/store instruction?

PART B QUESTIONS AND ANSWERS UNIT I

COMPUTERS ORGANIZATION 2ND YEAR COMPUTE SCIENCE MANAGEMENT ENGINEERING JOSÉ GARCÍA RODRÍGUEZ JOSÉ ANTONIO SERRA PÉREZ

The AVR Microcontroller and C Compiler Co-Design Dr. Gaute Myklebust ATMEL Corporation ATMEL Development Center, Trondheim, Norway

CPU Organization and Assembly Language

612 CHAPTER 11 PROCESSOR FAMILIES (Corrisponde al cap Famiglie di processori) PROBLEMS

Upcompiling Legacy Code to Java

Software Vulnerabilities

Exceptions in MIPS. know the exception mechanism in MIPS be able to write a simple exception handler for a MIPS machine

CS 16: Assembly Language Programming for the IBM PC and Compatibles

1 The Java Virtual Machine

Transcription:

Mellon Machine-Level Programming II: Arithmetic & Control 15-213 / 18-213: Introduction to Computer Systems 6 th Lecture, Jan 29, 2015 Instructors: Seth Copen Goldstein, Franz Franchetti, Greg Kesden 1

Today Complete addressing mode, address computation (leal) Arithmetic operations Control: Condition codes Conditional branches Loops 2

Complete Memory Addressing Modes Most General Form D(Rb,Ri,S) Mem[Reg[Rb]+S*Reg[Ri]+ D] D: Constant displacement 1, 2, or 4 bytes Rb: Base register: Any of 8 integer registers Ri: Index register: Any, except for %esp Unlikely you d use %ebp, either S: Scale: 1, 2, 4, or 8 (why these numbers?) Special Cases (Rb,Ri) Mem[Reg[Rb]+Reg[Ri]] D(Rb,Ri) Mem[Reg[Rb]+Reg[Ri]+D] (Rb,Ri,S) Mem[Reg[Rb]+S*Reg[Ri]] 3

Address Computation Examples %edx 0xf000 %ecx 0x0100 Expression Address Computation Address 0x8(%edx) (%edx,%ecx) (%edx,%ecx,4) 0x80(,%edx,2) 0xf000 + 0x8 0xf000 + 0x100 0xf000 + 4*0x100 2*0xf000 + 0x80 0xf008 0xf100 0xf400 0x1e080 4

Address Computation Instruction leal Src,Dest Src is address mode expression Set Dest to address denoted by expression Uses Computing addresses without a memory reference E.g., translation of p = &x[i]; Computing arithmetic expressions of the form x + k*y k = 1, 2, 4, or 8 Example int mul12(int x) { return x*12; Converted to ASM by compiler: leal (%eax,%eax,2), %eax ;t <- x+x*2 sall $2, %eax ;return t<<2 5

Today Complete addressing mode, address computation (leal) Arithmetic operations Control: Condition codes Conditional branches Loops 6

Some Arithmetic Operations Two Operand Instructions: Format Computation addl Src,Dest Dest = Dest + Src subl Src,Dest Dest = Dest Src imull Src,Dest Dest = Dest * Src sall Src,Dest Dest = Dest << Src Also called shll sarl Src,Dest Dest = Dest >> Src Arithmetic shrl Src,Dest Dest = Dest >> Src Logical xorl Src,Dest Dest = Dest ^ Src andl Src,Dest Dest = Dest & Src orl Src,Dest Dest = Dest Src Watch out for argument order! No distinction between signed and unsigned int (why?) 7

Some Arithmetic Operations One Operand Instructions incl Dest Dest = Dest + 1 decl Dest Dest = Dest 1 negl Dest Dest = Dest notl Dest Dest = ~Dest See book for more instructions 8

Arithmetic Expression Example int arith(int x, int y, int z) { int t1 = x+y; int t2 = z+t1; int t3 = x+4; int t4 = y * 48; int t5 = t3 + t4; int rval = t2 * t5; return rval; arith: pushl %ebp movl %esp, %ebp movl 8(%ebp), %ecx movl 12(%ebp), %edx leal (%edx,%edx,2), %eax sall $4, %eax leal 4(%ecx,%eax), %eax addl %ecx, %edx addl 16(%ebp), %edx imull %edx, %eax Set Up Body popl ret %ebp Finish 9

Understanding arith int arith(int x, int y, int z) { int t1 = x+y; int t2 = z+t1; int t3 = x+4; int t4 = y * 48; int t5 = t3 + t4; int rval = t2 * t5; return rval; Offset 16 z 12 y 8 x 4 Rtn Addr 0 Old %ebp %ebp movl 8(%ebp), %ecx movl 12(%ebp), %edx leal (%edx,%edx,2), %eax sall $4, %eax leal 4(%ecx,%eax), %eax addl %ecx, %edx addl 16(%ebp), %edx imull %edx, %eax 10

Understanding arith int arith(int x, int y, int z) { int t1 = x+y; int t2 = z+t1; int t3 = x+4; int t4 = y * 48; int t5 = t3 + t4; int rval = t2 * t5; return rval; Offset 16 z 12 y 8 x 4 Rtn Addr 0 Old %ebp Stack %ebp movl 8(%ebp), %ecx # ecx = x movl 12(%ebp), %edx # edx = y leal (%edx,%edx,2), %eax # eax = y*3 sall $4, %eax # eax *= 16 (t4) leal 4(%ecx,%eax), %eax # eax = t4 +x+4 (t5) addl %ecx, %edx # edx = x+y (t1) addl 16(%ebp), %edx # edx += z (t2) imull %edx, %eax # eax = t2 * t5 (rval) 12

Understanding arith int arith(int x, int y, int z) { int t1 = x+y; int t2 = z+t1; int t3 = x+4; int t4 = y * 48; int t5 = t3 + t4; int rval = t2 * t5; return rval; Offset 16 z 12 y 8 x 4 Rtn Addr 0 Old %ebp Stack %ebp movl 8(%ebp), %ecx # ecx = x movl 12(%ebp), %edx # edx = y leal (%edx,%edx,2), %eax # eax = y*3 sall $4, %eax # eax *= 16 (t4) leal 4(%ecx,%eax), %eax # eax = t4 +x+4 (t5) addl %ecx, %edx # edx = x+y (t1) addl 16(%ebp), %edx # edx += z (t2) imull %edx, %eax # eax = t2 * t5 (rval) 13

Observations about arith int arith(int x, int y, int z) { int t1 = x+y; int t2 = z+t1; int t3 = x+4; int t4 = y * 48; int t5 = t3 + t4; int rval = t2 * t5; return rval; Instructions in different order from C code Some expressions require multiple instructions Some instructions cover multiple expressions Get exact same code when compile: (x+y+z)*(x+4+48*y) movl 8(%ebp), %ecx # ecx = x movl 12(%ebp), %edx # edx = y leal (%edx,%edx,2), %eax # eax = y*3 sall $4, %eax # eax *= 16 (t4) leal 4(%ecx,%eax), %eax # eax = t4 +x+4 (t5) addl %ecx, %edx # edx = x+y (t1) addl 16(%ebp), %edx # edx += z (t2) imull %edx, %eax # eax = t2 * t5 (rval) 14

Another Example int logical(int x, int y) { int t1 = x^y; int t2 = t1 >> 17; int mask = (1<<13) - 7; int rval = t2 & mask; return rval; logical: pushl %ebp movl %esp,%ebp movl 12(%ebp),%eax xorl 8(%ebp),%eax sarl $17,%eax andl $8185,%eax popl %ebp ret Set Up Body Finish movl 12(%ebp),%eax # eax = y xorl 8(%ebp),%eax # eax = x^y (t1) sarl $17,%eax # eax = t1>>17 (t2) andl $8185,%eax # eax = t2 & mask (rval) 15

Another Example int logical(int x, int y) { int t1 = x^y; int t2 = t1 >> 17; int mask = (1<<13) - 7; int rval = t2 & mask; return rval; logical: pushl %ebp movl %esp,%ebp movl 12(%ebp),%eax xorl 8(%ebp),%eax sarl $17,%eax andl $8185,%eax popl %ebp ret Set Up Body Finish movl 12(%ebp),%eax # eax = y xorl 8(%ebp),%eax # eax = x^y (t1) sarl $17,%eax # eax = t1>>17 (t2) andl $8185,%eax # eax = t2 & mask (rval) 16

Another Example int logical(int x, int y) { int t1 = x^y; int t2 = t1 >> 17; int mask = (1<<13) - 7; int rval = t2 & mask; return rval; logical: pushl %ebp movl %esp,%ebp movl 12(%ebp),%eax xorl 8(%ebp),%eax sarl $17,%eax andl $8185,%eax popl %ebp ret Set Up Body Finish movl 12(%ebp),%eax # eax = y xorl 8(%ebp),%eax # eax = x^y (t1) sarl $17,%eax # eax = t1>>17 (t2) andl $8185,%eax # eax = t2 & mask (rval) 17

Another Example int logical(int x, int y) { int t1 = x^y; int t2 = t1 >> 17; int mask = (1<<13) - 7; int rval = t2 & mask; return rval; logical: pushl %ebp movl %esp,%ebp movl 12(%ebp),%eax xorl 8(%ebp),%eax sarl $17,%eax andl $8185,%eax popl %ebp ret Set Up Body Finish 2 13 = 8192, 2 13 7 = 8185 movl 12(%ebp),%eax # eax = y xorl 8(%ebp),%eax # eax = x^y (t1) sarl $17,%eax # eax = t1>>17 (t2) andl $8185,%eax # eax = t2 & mask (rval) 18

Another Example int logical(int x, int y) { int t1 = x^y; int t2 = t1 >> 17; int mask = (1<<13) - 7; int rval = t2 & mask; return rval; logical: pushl %ebp movl %esp,%ebp movl 12(%ebp),%eax xorl 8(%ebp),%eax sarl $17,%eax andl $8185,%eax popl %ebp ret Set Up Body Finish 2 13 = 8192, 2 13 7 = 8185 movl 12(%ebp),%eax # eax = y xorl 8(%ebp),%eax # eax = x^y (t1) sarl $17,%eax # eax = t1>>17 (t2) andl $8185,%eax # eax = t2 & mask (rval) 19

Today Complete addressing mode, address computation (leal) Arithmetic operations Control: Condition codes Conditional branches Loops 20

Processor State (IA32, Partial) Information about currently executing program Temporary data ( %eax, ) Location of runtime stack ( %ebp,%esp ) Location of current code control point ( %eip, ) Status of recent tests ( CF, ZF, SF, OF ) %eax %ecx %edx %ebx %esi %edi %esp %ebp %eip General purpose registers Current stack top Current stack frame Instruction pointer CF ZF SF OF Condition codes 21

Condition Codes (Implicit Setting) Single bit registers CF Carry Flag (for unsigned) SF Sign Flag (for signed) ZF Zero Flag OF Overflow Flag (for signed) Implicitly set (think of it as side effect) by arithmetic operations Example: addl/addq Src,Dest t = a+b CF set if carry out from most significant bit (unsigned overflow) ZF set if t == 0 SF set if t < 0 (as signed) OF set if two s-complement (signed) overflow (a>0 && b>0 && t<0) (a<0 && b<0 && t>=0) Not set by lea instruction Full documentation (IA32), link on course website 22

Condition Codes (Explicit Setting: Compare) Explicit Setting by Compare Instruction cmpl/cmpq Src2, Src1 cmpl b,a like computing a-b without setting destination CF set if carry out from most significant bit (used for unsigned comparisons) ZF set if a == b SF set if (a-b) < 0 (as signed) OF set if two s-complement (signed) overflow (a>0 && b<0 && (a-b)<0) (a<0 && b>0 && (a-b)>0) 23

Condition Codes (Explicit Setting: Test) Explicit Setting by Test instruction testl/testq Src2, Src1 testl b,a like computing a&b without setting destination Sets condition codes based on value of Src1 & Src2 Useful to have one of the operands be a mask ZF set when a&b == 0 SF set when a&b < 0 24

Reading Condition Codes SetX Instructions Set low-order byte to 0 or 1 based on combinations of condition codes Does not alter remaining 3 bytes SetX Condition Description sete ZF Equal / Zero setne ~ZF Not Equal / Not Zero sets SF Negative setns ~SF Nonnegative setg ~(SF^OF)&~ZF Greater (Signed) setge ~(SF^OF) Greater or Equal (Signed) setl (SF^OF) Less (Signed) setle (SF^OF) ZF Less or Equal (Signed) seta ~CF&~ZF Above (unsigned) setb CF Below (unsigned) 25

Reading Condition Codes (Cont.) SetX Instructions: Set single byte based on combination of condition codes One of 8 addressable byte registers Does not alter remaining 3 bytes Typically use movzbl to finish job int gt (int x, int y) { return x > y; %eax %ecx %edx %ebx %esi %ah %al %ch %cl %dh %dl %bh %bl Body %edi movl 12(%ebp),%eax cmpl %eax,8(%ebp) setg %al movzbl %al,%eax # eax = y # Compare x : y # al = x > y # Zero rest of %eax %esp %ebp 26

Reading Condition Codes: x86-64 SetX Instructions: Set single byte based on combination of condition codes Does not alter remaining 3 bytes int gt (int x, int y) { return x > y; long lgt (long x, long y) { return x > y; cmpl %esi, %edi setg %al movzbl %al, %eax Bodies cmpq %rsi, %rdi setg %al movzbl %al, %eax Is %rax zero? Yes: 32-bit instructions set high order 32 bits to 0! 27

Today Complete addressing mode, address computation (leal) Arithmetic operations Control: Condition codes Conditional branches Loops 28

Jumping jx Instructions Jump to different part of code depending on condition codes jx Condition Description jmp 1 Unconditional je ZF Equal / Zero jne ~ZF Not Equal / Not Zero js SF Negative jns ~SF Nonnegative jg ~(SF^OF)&~ZF Greater (Signed) jge ~(SF^OF) Greater or Equal (Signed) jl (SF^OF) Less (Signed) jle (SF^OF) ZF Less or Equal (Signed) ja ~CF&~ZF Above (unsigned) jb CF Below (unsigned) 29

Conditional Branch Example (Old Style) Generation shark> gcc O1 m32 fno-if-conversion control.c int absdiff(int x, int y) { int result; if (x > y) { result = x-y; else { result = y-x; return result; absdiff: pushl %ebp movl %esp, %ebp movl 8(%ebp), %edx movl 12(%ebp), %eax cmpl %eax, %edx jle.l6 subl %eax, %edx movl %edx, %eax jmp.l7.l6: subl %edx, %eax.l7: popl %ebp ret Setup Body1 Body2a Body2b Finish 30

Conditional Branch Example (Cont.) int goto_ad(int x, int y) { int result; if (x <= y) goto Else; result = x-y; goto Exit; Else: result = y-x; Exit: return result; C allows goto as means of transferring control Closer to machine-level programming style Generally considered bad coding style absdiff: pushl %ebp movl %esp, %ebp movl 8(%ebp), %edx movl 12(%ebp), %eax cmpl %eax, %edx jle.l6 subl %eax, %edx movl %edx, %eax jmp.l7.l6: subl %edx, %eax.l7: popl %ebp ret Setup Body1 Body2a Body2b Finish 31

Conditional Branch Example (Cont.) int goto_ad(int x, int y) { int result; if (x <= y) goto Else; result = x-y; goto Exit; Else: result = y-x; Exit: return result; absdiff: pushl %ebp movl %esp, %ebp movl 8(%ebp), %edx movl 12(%ebp), %eax cmpl %eax, %edx jle.l6 subl %eax, %edx movl %edx, %eax jmp.l7.l6: subl %edx, %eax.l7: popl %ebp ret Setup Body1 Body2a Body2b Finish 32

Conditional Branch Example (Cont.) int goto_ad(int x, int y) { int result; if (x <= y) goto Else; result = x-y; goto Exit; Else: result = y-x; Exit: return result; absdiff: pushl %ebp movl %esp, %ebp movl 8(%ebp), %edx movl 12(%ebp), %eax cmpl %eax, %edx jle.l6 subl %eax, %edx movl %edx, %eax jmp.l7.l6: subl %edx, %eax.l7: popl %ebp ret Setup Body1 Body2a Body2b Finish 33

Conditional Branch Example (Cont.) int goto_ad(int x, int y) { int result; if (x <= y) goto Else; result = x-y; goto Exit; Else: result = y-x; Exit: return result; absdiff: pushl %ebp movl %esp, %ebp movl 8(%ebp), %edx movl 12(%ebp), %eax cmpl %eax, %edx jle.l6 subl %eax, %edx movl %edx, %eax jmp.l7.l6: subl %edx, %eax.l7: popl %ebp ret Setup Body1 Body2a Body2b Finish 34

General Conditional Expression Translation (Using Branches) C Code val = Test? Then_Expr : Else_Expr; val = x>y? x-y : y-x; Goto Version nt =!Test; if (nt) goto Else; val = Then_Expr; goto Done; Else: val = Else_Expr; Done:... Test is expression returning integer = 0 interpreted as false 0 interpreted as true Create separate code regions for then & else expressions Execute appropriate one 35

Using Conditional Moves Conditional Move Instructions Instruction supports: if (Test) Dest Src Supported in post-1995 x86 processors GCC tries to use them Enabled for IA32 & x86-64 Why? Branches are very disruptive to instruction flow through pipelines Conditional move do not require control transfer C Code val = Test? Then_Expr : Else_Expr; Goto Version tval = Then_Expr; result = Else_Expr; t = Test; if (t) result = tval; return result; 36

Conditional Move Example: x86-64 int absdiff(int x, int y) { int result; if (x > y) { result = x-y; else { result = y-x; return result; x in %edi y in %esi absdiff: movl %edi, %edx subl %esi, %edx # tval = x-y movl %esi, %eax subl %edi, %eax # result = y-x cmpl %esi, %edi # Compare x:y cmovg %edx, %eax # If >, result = tval ret 37

Conditional Move Example: x86-64 int absdiff(int x, int y) { int result; if (x > y) { result = x-y; else { result = y-x; return result; x in %edi y in %esi absdiff: movl %edi, %edx subl %esi, %edx # tval = x-y movl %esi, %eax subl %edi, %eax # result = y-x cmpl %esi, %edi # Compare x:y cmovg %edx, %eax # If >, result = tval ret 38

Conditional Move Example: x86-64 int absdiff(int x, int y) { int result; if (x > y) { result = x-y; else { result = y-x; return result; x in %edi y in %esi absdiff: movl %edi, %edx subl %esi, %edx # tval = x-y movl %esi, %eax subl %edi, %eax # result = y-x cmpl %esi, %edi # Compare x:y cmovg %edx, %eax # If >, result = tval ret 39

Bad Cases for Conditional Move Expensive Computations val = Test(x)? Hard1(x) : Hard2(x); Both values get computed Only makes sense when computations are very simple Risky Computations val = p? *p : 0; Both values get computed May have undesirable effects Computations with side effects val = x > 0? x*=7 : x+=3; Both values get computed Must be side-effect free 40

Today Complete addressing mode, address computation (leal) Arithmetic operations Control: Condition codes Conditional branches Loops 41

Do-While Loop Example C Code int pcount_do(unsigned x) { int result = 0; do { result += x & 0x1; x >>= 1; while (x); return result; Goto Version int pcount_do(unsigned x) { int result = 0; loop: result += x & 0x1; x >>= 1; if (x) goto loop; return result; Count number of 1 s in argument x ( popcount ) Use conditional branch to either continue looping or to exit loop 42

Do-While Loop Compilation Goto Version int pcount_do(unsigned x) { int result = 0; loop: result += x & 0x1; x >>= 1; if (x) goto loop; return result; Registers: %edx %ecx x result movl $0, %ecx # result = 0.L2: # loop: movl %edx, %eax andl $1, %eax # t = x & 1 addl %eax, %ecx # result += t shrl %edx # x >>= 1 jne.l2 # If!0, goto loop 43

General Do-While Translation C Code do Body while (Test); Body: { Statement 1 ; Statement 2 ; Statement n ; Goto Version loop: Body if (Test) goto loop Test returns integer = 0 interpreted as false 0 interpreted as true 44

While Loop Example C Code for while loop int pcount_while(unsigned x) { int result = 0; while (x) { result += x & 0x1; x >>= 1; return result; C Code for do loop int pcount_do(unsigned x) { int result = 0; do { result += x & 0x1; x >>= 1; while (x); return result; Is while loop code equivalent to the do-while version? Must jump out of loop if test fails 45

While Loop Example C Code int pcount_while(unsigned x) { int result = 0; while (x) { result += x & 0x1; x >>= 1; return result; Goto Version int pcount_do(unsigned x) { int result = 0; if (!x) goto done; loop: result += x & 0x1; x >>= 1; if (x) goto loop; done: return result; Is this code equivalent to the do-while version? Must jump out of loop if test fails 46

General While Translation While version while (Test) Body Do-While Version if (!Test) goto done; do Body while(test); done: Goto Version if (!Test) goto done; loop: Body if (Test) goto loop; done: 47

For Loop Example C Code #define WSIZE 8*sizeof(int) int pcount_for(unsigned x) { int i; int result = 0; for (i = 0; i < WSIZE; i++) { unsigned mask = 1 << i; result += (x & mask)!= 0; return result; Is this code equivalent to other versions? What do we mean by equivalent? 48

For Loop Form General Form for (Init; Test; Update ) Body for (i = 0; i < WSIZE; i++) { unsigned mask = 1 << i; result += (x & mask)!= 0; i = 0 i < WSIZE i++ { Init Test Update Body unsigned mask = 1 << i; result += (x & mask)!= 0; 49

For Loop While Loop For Version for (Init; Test; Update ) Body While Version Init; while (Test ) { Body Update; 50

For Loop Goto For Version for (Init; Test; Update ) Body While Version Init; while (Test ) { Body Update; Init; if (!Test) goto done; do Body Update while(test); done: Init; if (!Test) goto done; loop: Body Update if (Test) goto loop; done: 51

For Loop Conversion Example C Code #define WSIZE 8*sizeof(int) int pcount_for(unsigned x) { int i; int result = 0; for (i = 0; i < WSIZE; i++) { unsigned mask = 1 << i; result += (x & mask)!= 0; return result; Initial test can be optimized away Goto Version int pcount_for_gt(unsigned x) { int i; int result = 0; i = 0; if (!(i < WSIZE)) goto done; loop: { unsigned mask = 1 << i; result += (x & mask)!= 0; i++; Update if (i < WSIZE) goto loop; done: return result; Test Init!Test Body 52

The Actual For Loop Code (Body Only) How Should I Decode This? Look at branching structure Identify registers Work through detailed logic %eax %ebx %ecx %edi %edx movl 8(%ebp), %edi movl $0, %eax movl $0, %ecx movl $1, %edx.l13: movl %edx, %esi sall %cl, %esi testl %edi, %esi setne %bl movl %ebx, %esi andl $255, %esi addl %esi, %eax addl $1, %ecx cmpl $32, %ecx jne.l13 %esi 53

Summary Today Complete addressing mode, address computation (leal) Arithmetic operations Control: Condition codes Conditional branches & conditional moves Loops Next Time Switch statements Stack Call / return Procedure call discipline 54