AVR200: Multiply and Divide Routines. 8-bit Microcontrollers. Application Note. Features. 1 Introduction

Similar documents
AVR134: Real Time Clock (RTC) using the Asynchronous Timer. 8-bit Microcontrollers. Application Note. Features. 1 Introduction

AVR32138: How to optimize the ADC usage on AT32UC3A0/1, AT32UC3A3 and AT32UC3B0/1 series. 32-bit Microcontrollers. Application Note.

AVR1922: Xplain Board Controller Firmware. 8-bit Microcontrollers. Application Note. Features. 1 Introduction

AVR1318: Using the XMEGA built-in AES accelerator. 8-bit Microcontrollers. Application Note. Features. 1 Introduction

AVR1510: Xplain training - XMEGA USART. 8-bit Microcontrollers. Application Note. Prerequisites. 1 Introduction

AVR1900: Getting started with ATxmega128A1 on STK bit Microcontrollers. Application Note. 1 Introduction

AVR353: Voltage Reference Calibration and Voltage ADC Usage. 8-bit Microcontrollers. Application Note. Features. 1 Introduction

AVR1309: Using the XMEGA SPI. 8-bit Microcontrollers. Application Note. Features. 1 Introduction SCK MOSI MISO SS

AVR32701: AVR32AP7 USB Performance. 32-bit Microcontrollers. Application Note. Features. 1 Introduction

AVR305: Half Duplex Compact Software UART. 8-bit Microcontrollers. Application Note. Features. 1 Introduction

Using CryptoMemory in Full I 2 C Compliant Mode. Using CryptoMemory in Full I 2 C Compliant Mode AT88SC0104CA AT88SC0204CA AT88SC0404CA AT88SC0808CA

8-bit. Application Note. Microcontrollers. AVR282: USB Firmware Upgrade for AT90USB

AVR1600: Using the XMEGA Quadrature Decoder. 8-bit Microcontrollers. Application Note. Features. 1 Introduction. Sensors

AVR115: Data Logging with Atmel File System on ATmega32U4. Microcontrollers. Application Note. 1 Introduction. Atmel

AVR32788: AVR 32 How to use the SSC in I2S mode. 32-bit Microcontrollers. Application Note. Features. 1 Introduction

8-bit Microcontroller. Application Note. AVR222: 8-point Moving Average Filter

8-bit Microcontroller. Application. Note. AVR204: BCD Arithmetics. Features. Introduction. 16-bit Binary to 5-digit BCD Conversion bin2bcd16

AVR033: Getting Started with the CodeVisionAVR C Compiler. 8-bit Microcontrollers. Application Note. Features. 1 Introduction

AVR1301: Using the XMEGA DAC. 8-bit Microcontrollers. Application Note. Features. 1 Introduction

AVR287: USB Host HID and Mass Storage Demonstration. 8-bit Microcontrollers. Application Note. Features. 1 Introduction

AVR2006: Design and characterization of the Radio Controller Board's 2.4GHz PCB Antenna. Application Note. Features.

Application Note. 8-bit Microcontrollers. AVR270: USB Mouse Demonstration

8-bit RISC Microcontroller. Application Note. AVR236: CRC Check of Program Memory

Atmel AVR4921: ASF - USB Device Stack Differences between ASF V1 and V2. 8-bit Atmel Microcontrollers. Application Note. Features.

Application Note. 8-bit Microcontrollers. AVR272: USB CDC Demonstration UART to USB Bridge

8-bit Microcontroller. Application Note. AVR201: Using the AVR Hardware Multiplier

APPLICATION NOTE. Atmel AVR134: Real Time Clock (RTC) Using the Asynchronous Timer. Atmel AVR 8-bit Microcontroller. Introduction.

Atmel AVR4920: ASF - USB Device Stack - Compliance and Performance Figures. Atmel Microcontrollers. Application Note. Features.

AVR055: Using a 32kHz XTAL for run-time calibration of the internal RC. 8-bit Microcontrollers. Application Note. Features.

Atmel AVR4903: ASF - USB Device HID Mouse Application. Atmel Microcontrollers. Application Note. Features. 1 Introduction

8-bit Microcontroller. Application Note. AVR400: Low Cost A/D Converter

AVR319: Using the USI module for SPI communication. 8-bit Microcontrollers. Application Note. Features. Introduction

AVR317: Using the Master SPI Mode of the USART module. 8-bit Microcontrollers. Application Note. Features. Introduction

32-bit AVR UC3 Microcontrollers. 32-bit AtmelAVR Application Note. AVR32769: How to Compile the standalone AVR32 Software Framework in AVR32 Studio V2

Application Note. 8-bit Microcontrollers. AVR293: USB Composite Device

AT91SAM ARM-based Flash MCU. Application Note

General Porting Considerations. Memory EEPROM XRAM

How To Use An Atmel Atmel Avr32848 Demo For Android (32Bit) With A Microcontroller (32B) And An Android Accessory (32D) On A Microcontroller (32Gb) On An Android Phone Or

3-output Laser Driver for HD-DVD/ Blu-ray/DVD/ CD-ROM ATR0885. Preliminary. Summary. Features. Applications. 1. Description

8051 Flash Microcontroller. Application Note. A Digital Thermometer Using the Atmel AT89LP2052 Microcontroller

8-bit RISC Microcontroller. Application Note. AVR182: Zero Cross Detector

Application Note. Atmel ATSHA204 Authentication Modes. Prerequisites. Overview. Introduction

AVR2004: LC-Balun for AT86RF230. Application Note. Features. 1 Introduction

AVR1003: Using the XMEGA Clock System. 8-bit Microcontrollers. Application Note. Features. 1 Introduction

APPLICATION NOTE. Atmel LF-RFID Kits Overview. Atmel LF-RFID Kit. LF-RFID Kit Introduction

256K (32K x 8) Battery-Voltage Parallel EEPROMs AT28BV256

Application Note. Atmel CryptoAuthentication Product Uses. Atmel ATSHA204. Abstract. Overview

AVR106: C functions for reading and writing to Flash memory. 8-bit Microcontrollers. Application Note. Features. Introduction

AVR223: Digital Filters with AVR. 8-bit Microcontrollers. Application Note. Features. 1 Introduction

8-bit Microcontroller. Application Note. AVR134: Real-Time Clock (RTC) using the Asynchronous Timer. Features. Theory of Operation.

Atmel AVR1017: XMEGA - USB Hardware Design Recommendations. 8-bit Atmel Microcontrollers. Application Note. Features.

8-bit Microcontroller. Application Note. AVR415: RC5 IR Remote Control Transmitter. Features. Introduction. Figure 1.

AVR1321: Using the Atmel AVR XMEGA 32-bit Real Time Counter and Battery Backup System. 8-bit Microcontrollers. Application Note.

AVR030: Getting Started with IAR Embedded Workbench for Atmel AVR. 8-bit Microcontrollers. Application Note. Features.

AT91 ARM Thumb Microcontrollers. AT91SAM CAN Bootloader. AT91SAM CAN Bootloader User Notes. 1. Description. 2. Key Features

8-bit Microcontroller. Application Note. AVR314: DTMF Generator

APPLICATION NOTE. Atmel AVR443: Sensor-based Control of Three Phase Brushless DC Motor. Atmel AVR 8-bit Microcontrollers. Features.

Atmel AVR4027: Tips and Tricks to Optimize Your C Code for 8-bit AVR Microcontrollers. 8-bit Atmel Microcontrollers. Application Note.

How to Calculate the Capacitor of the Reset Input of a C51 Microcontroller 80C51. Application Note. Microcontrollers. Introduction

AVR245: Code Lock with 4x4 Keypad and I2C LCD. 8-bit Microcontrollers. Application Note. Features. 1 Introduction

APPLICATION NOTE. Atmel AT02985: User s Guide for USB-CAN Demo on SAM4E-EK. Atmel AVR 32-bit Microcontroller. Features. Description.

APPLICATION NOTE. Atmel AT04389: Connecting SAMD20E to the AT86RF233 Transceiver. Atmel SAMD20. Description. Features

AT89C5131A Starter Kit... Software User Guide

Step Motor Controller. Application Note. AVR360: Step Motor Controller. Theory of Operation. Features. Introduction

64K (8K x 8) Parallel EEPROM with Page Write and Software Data Protection AT28C64B

8-bit RISC Microcontroller. Application Note. AVR910: In-System Programming

Application Note. C51 Bootloaders. C51 General Information about Bootloader and In System Programming. Overview. Abreviations

How To Design An Ism Band Antenna For 915Mhz/2.4Ghz Ism Bands On A Pbbb (Bcm) Board

APPLICATION NOTE Atmel AT02509: In House Unit with Bluetooth Low Energy Module Hardware User Guide 8-bit Atmel Microcontroller Features Description

AT91 ARM Thumb Microcontrollers. Application Note. Interfacing a PC Card to an AT91RM9200-DK. Introduction. Hardware Interface

AVR034: Mixing C and Assembly Code with IAR Embedded Workbench for AVR. 8-bit Microcontroller. Application Note. Features.

AVR ONE!... Quick-start Guide. EVK Windows 32104B AVR ONE! 02/10

AVR1324: XMEGA ADC Selection Guide. 8-bit Atmel Microcontrollers. Application Note. Features. 1 Introduction

AVR125: ADC of tinyavr in Single Ended Mode. 8-bit Microcontrollers. Application Note. Features. 1 Introduction

AVR32110: Using the AVR32 Timer/Counter. 32-bit Microcontrollers. Application Note. Features. 1 Introduction

8-bit Microcontroller. Application Note. AVR105: Power Efficient High Endurance Parameter Storage in Flash Memory

Introducing a platform to facilitate reliable and highly productive embedded developments

3-output Laser Driver for HD-DVD/ Blu-ray/DVD/ CD-ROM ATR0885. Preliminary. Summary

AT89LP Flash Data Memory. Application Note. AT89LP Flash Data Memory API. 1. Introduction. 2. Theory of Operation. 2.1 Flash Memory Operation

AVR442: PC Fan Control using ATtiny13. 8-bit Microcontrollers. Application Note. Features. 1 Introduction

AVR120: Characterization and Calibration of the ADC on an AVR. 8-bit Microcontrollers. Application Note. Features. Introduction

APPLICATION NOTE. Authentication Counting. Atmel CryptoAuthentication. Features. Introduction

Quick Start Guide. CAN Microcontrollers. ATADAPCAN01 - STK501 CAN Extension. Requirements

AVR241: Direct driving of LCD display using general IO. 8-bit Microcontrollers. Application Note. Features. Introduction AVR

Tag Tuning/RFID. Application Note. Tag Tuning. Introduction. Antenna Equivalent Circuit

1Mb (64K x 16) One-time Programmable Read-only Memory

USER GUIDE EDBG. Description

8-bit RISC Microcontroller. Application Note. AVR155: Accessing an I 2 C LCD Display using the AVR 2-wire Serial Interface

Application Note. USB Microcontrollers. USB PC Drivers Based on Generic HID Class. 1. Introduction

APPLICATION NOTE. Atmel AT01095: Joystick Game Controller Reference Design. 8-/16-bit Atmel Microcontrollers. Features.

AVR32100: Using the AVR32 USART. 32-bit Microcontrollers. Application Note. Features. 1 Introduction

APPLICATION NOTE. Secure Personalization with Transport Key Authentication. ATSHA204A, ATECC108A, and ATECC508A. Introduction.

Application Note. 1. Introduction. 2. Associated Documentation. 3. Gigabit Ethernet Implementation on SAMA5D3 Series. AT91SAM ARM-based Embedded MPU

APPLICATION NOTE. Atmel AVR600: STK600 Expansion, Routing and Socket Boards. Atmel Microcontrollers. Introduction

AVR106: C Functions for Reading and Writing to Flash Memory. Introduction. Features. AVR 8-bit Microcontrollers APPLICATION NOTE

Lecture 8: Binary Multiplication & Division

AVR151: Setup and Use of the SPI. Introduction. Features. Atmel AVR 8-bit Microcontroller APPLICATION NOTE

USER GUIDE. ZigBit USB Stick User Guide. Introduction

APPLICATION NOTE. Atmel AT01180: Barcode and QR code scanner User Guide. Atmel 32-bit Microcontroller. Features. Introduction

AT91SAM ARM-based Flash MCU. Application Note. Analog-to-Digital Converter in the SAM3S4. 1. Introduction

Transcription:

AVR200: Multiply and Divide Routines Features 8 and 16-bit Implementations Signed & Unsigned Routines Speed & Code Size Optimized Routines Runable Example Programs Speed is Comparable with HW Multiplicators/Dividers Example: 8 x 8 Mul in 2.8 µs, 16 x 16 Mul in 8.7 µs (12 MHz) Extremely Compact Code 8-bit Microcontrollers Application ote 1 Introduction This application note lists subroutines for multiplication and division of 8- and 16-bit signed and unsigned numbers. A listing of all implementations with key performance specifications is given in Table 1-1. Table 1-1. Performance Figures Summary Application Code Size (Words) Execution Time (Cycles) 8 x 8 = 16 bit unsigned (Code Optimized) 9 58 8 x 8 = 16 bit unsigned (Speed Optimized) 34 34 8 x 8 = 16 bit signed (Code Optimized) 10 73 16 x 16 = 32 bit unsigned (Code Optimized) 14 153 16 x 16 = 32 bit unsigned (Speed Optimized) 105 105 16 x 16 = 32 bit signed (Code Optimized) 16 218 8 / 8 = 8 + 8 bit unsigned (Code Optimized) 14 97 8 / 8 = 8 + 8 bit unsigned (Speed Optimized) 66 58 8 / 8 = 8 + 8 bit signed (Code Optimized) 22 103 16 / 16 = 16 + 16 bit unsigned (Code Optimized) 19 243 16 / 16 = 16 + 16 bit unsigned (Speed Optimized) 196 173 16 / 16 = 16 + 16 bit signed (Code Optimized) 39 255 The application note listing consists of two files: avr200.asm : Code size optimized multiplied and divide routines. avr200b.asm : Speed optimized multiply and divide routines. Rev.

2 8 x 8 = 16 Unsigned Multiplication mpy8u 2.1 Algorithm Description Both program files contain a routine called mpy8u which performs unsigned 8-bit multiplication. Both implementations are based on the same algorithm. The code size optimized implementation, however, uses looped code whereas the speed optimized code is a straight-line code implementation. Figure 2-1 shows the flow chart for the code size optimized version. The algorithm for the Code Size optimized version is as follows: 1. Clear result High byte. 2. Load Loop counter with eight. 3. Shift right multiplier 4. If carry (previous bit 0 of multiplier) set, add multiplicand to result High byte. 5. Shift right result High byte into result Low byte/multiplier. 6. Shift right result Low byte/multiplier. 7. Decrement Loop counter. 8. If Loop counter not zero, go to Step 4. Figure 2-1. mpy8u Flow Chart (Code Size Optimized Implementation) MP8U CLEAR RESULT HIGH BTE LOOP COUTER 8 SHIFT MULTIPLIER RIGHT ADD MULTIPLICAD TO RESULT HIGH BTE CARR SET? SHIFT RIGHT RESULT HIGH BTE SHIFT RIGHT RESULT LOW BTE AD MULTIPLIER DECREMET LOOP COUTER LOOP COUTER = 0? RETUR 2 AVR200

AVR200 2.2 Usage The usage of mpy8u is the same for both versions: 1. Load register variables mp8u and mc8u with the multiplier and multiplicand, respectively. 2. Call mpy8u. 3. The 16 -bit result is found in the two register variables m8uh (High byte) and m8ul (Low byte). Observe that to minimize register usage, code and execution time, the multiplier and result Low byte share the same register. 2.3 Performance Table 2-1. mpy8u (Code Size Optimized Implementation) R16 mc8u Multiplicand R17 mp8u Multiplier m8ul Result Low Byte R18 m8uh Result High Byte R19 mcnt8u Loop Counter Table 2-2. mpy8u Performance Figures (Code Size Optimized Implementation) Code Size (Words) 9 + return Execution Time (Cycles) 58 + return : :4 : Table 2-3. mpy8u (Straight-line Implementation) R16 mc8u Multiplicand R17 mp8u Multiplier m8ul Result Low Byte R18 m8uh Result High Byte 3

Table 2-4. mpy8u Performance Figures (Straight-line Implementation) Code Size (Words) Execution Time (Cycles) 34 + return 34 + return : :3 : 3 8 x 8 = 16 Signed Multiplication mpy8s 3.1 Algorithm Description This subroutine, which is found in avr200.asm implements signed 8 x 8 multiplication. egative numbers are represented as 2 s complement numbers. The application is an implementation of Booth's algorithm. The algorithm provides both small and fast code. However, it has one limitation that the user should bear in mind; If all 16 bits of the result is needed, the algorithm fails when used with the most negative number (-128) as the multiplicand. The algorithm for signed 8 x 8 multiplication is as follows: 1. Clear result High byte and carry. 2. Load Loop counter with eight. 3. If carry (previous bit 0 of multiplier) set, add multiplicand to result High byte. 4. If current bit 0 of multiplier set, subtract multiplicand from result High byte. 5. Shift right result High byte into result Low byte/multiplier. 6. Shift right result Low byte/multiplier. 7. Decrement Loop counter. 8. If Loop counter not zero, go to Step 3. 4 AVR200

AVR200 Figure 3-1. mpy8s Flow Chart MP8S CLEAR RESULT HIGH BTE AD CARR LOOP COUTER 8 ADD MULTIPLICAD TO RESULT HIGH BTE CARR = 1? SUBTRACT MULTIPLICAD FROM RESULT HIGH BTE BIT 0 OF MULTIPLIER SET? SHIFT RIGHT RESULT HIGH BTE SHIFT RIGHT RESULT LOW BTE AD MULTIPLIER DECREMET LOOP COUTER LOOP COUTER = 0? RETUR 3.2 Usage The usage of mpy8s is as follows: 1. Load register variables mp8s and mc8s with the multiplier and multiplicand, respectively. 2. Call mpy8s. 3. The 16 -bit result is found in the two register variables m8sh (High byte) and m8sl (Low byte). Observe that to minimize register usage, code and execution time, the multiplier and result Low byte share the same register. 3.3 Performance Table 3-1. mpy8s R16 mc8s Multiplicand R17 mp8s Multiplier m8sl Result Low Byte R18 m8sh Result High Byte R19 mcnt8s Loop Counter 5

Table 3-2. mpy8s Performance Figures Code Size (Words) 10 + return Execution Time (Cycles) 73 + return : :4 : 4 16 x 16 = 32 Unsigned Multiplication mpy16u 4.1 Algorithm Description Both program files contain a routine called mpy16u which performs unsigned 16-bit multiplication. Both implementations are based on the same algorithm. The code size optimized implementation, however, uses looped code whereas the speed optimized code is a straight-line code implementation. Figure 4-1 shows the flow chart for the Code Size optimized (looped) version. The algorithm for the Code Size optimized version is as follows: 1. Clear result High word (Bytes 2 and 3) 2. Load Loop counter with 16. 3. Shift multiplier right 4. If carry (previous bit 0 of multiplier Low byte) set, add multiplicand to result High word. 5. Shift right result High word into result Low word/multiplier. 6. Shift right Low word/multiplier. 7. Decrement Loop counter. 8. If Loop counter not zero, go to Step 4. 6 AVR200

Figure 4-1. mpy16u Flow Chart (Code Size Optimized Implementation) MP16U AVR200 CLEAR RESULT HIGH WORD LOOP COUTER 16 SHIFT MULTIPLIER RIGHT ADD MULTIPLICAD TO RESULT HIGH WORD CARR SET? SHIFT RIGHT RESULT HIGH WORD SHIFT RIGHT RESULT LOW WORD AD MULTIPLIER DECREMET LOOP COUTER LOOP COUTER = 0? RETUR 4.2 Usage The usage of mpy16u is the same for both versions: 1. Load register variables mp16ul / mp16uh with multiplier Low and High byte, respectively. 2. Load register variables mc16uh / mc16uh with multiplicand Low and High byte, respectively. 3. Call mpy16u. 4. The 32-bit result is found in the 4-byte register variable m16u3:m16u2:m16u1:m16u0. Observe that to minimize register usage, code and execution time, the multiplier and result Low word share the same registers. 7

4.3 Performance Table 4-1. mpy16u (Code Size Optimized Implementation) R16 R17 mc16ul Multiplicand Low Byte mc16uh Multiplicand High Byte R18 mp16ul Multiplier Low Byte m16u0 Result Byte 0 R19 mp16uh Multiplier High Byte m16u1 Result Byte 1 R20 m16u2 Result Byte 2 R21 m16u2 Result Byte 2 R22 mcnt16u Loop Counter Table 4-2. mpy16u Performance Figures (Code Size Optimized Implementation) Code Size (Words) 14 + return Execution Time (Cycles) 153 + return : :7 : Table 4-3. mpy16u (Straight-line Implementation) R16 mc16ul Multiplicand Low Byte R17 mc16uh Multiplicand High Byte R18 mp16ul Multiplier Low Byte m16u0 Result Byte 0 R19 mp16uh Multiplier High Byte m16u1 Result Byte 1 R20 m16u2 Result Byte 2 R21 m16u2 Result Byte 2 Table 4-4. mpy16u Performance Figures (Straight-line Implementation) Code Size (Words) 105 + return Execution Time (Cycles) 105 + return : :6 : 8 AVR200

AVR200 5 16 x 16 = 32 Signed Multiplication - mpy16s 5.1 Algorithm Description This subroutine, which is found in avr200.asm implements signed 16 x 16 multiplication. egative numbers are represented as 2 s complement numbers. The application is an implementation of Booth s algorithm. The algorithm provides both small and fast code. However, it has one limitation that the user should bear in mind; If all 32 bits of the result is needed, the algorithm fails when used with the most negative number (-32768) as the multiplicand. The algorithm for signed 16 x 16 multiplication is as follows: 1. Clear result High word (Bytes 2&3) and carry. 2. Load Loop counter with 16. 3. If carry (previous bit 0 of multiplier Low byte) set, add multiplicand to result High word. 4. If current bit 0 of multiplier Low byte set, subtract multiplicand from result High word. 5. Shift right result High word into result Low word/multiplier. 6. Shift right Low word/multiplier. 7. Decrement Loop counter. 8. If Loop counter not zero, go to Step 3. Figure 5-1. mpy16s Flow Chart MP16S CLEAR RESULT HIGH WORD AD CARR LOOP COUTER 8 ADD MULTIPLICAD TO RESULT HIGH WORD CARR = 1? SUBTRACT MULTIPLICAD FROM RESULT HIGH WORD BIT 0 OF MULTIPLIER LOW BTE SET? SHIFT RIGHT RESULT HIGH WORD SHIFT RIGHT RESULT LOW WORD AD MULTIPLIER DECREMET LOOP COUTER LOOP COUTER = 0? RETUR 9

5.2 Usage The usage of mpy16s is as follows: 1. Load register variables mp16sl / mp16sh with multiplier Low and High byte, respectively. 2. Load register variables mc16sh / mc16sh with multiplicand Low and High byte, respectively. 3. Call mpy16s. 4. The 32-bit result is found in the 4-byte register variable m16s3:m16s2:m16s1:m16s0. Observe that to minimize register usage, code and execution time, the multiplier and result Low byte share the same register. 5.3 Performance Table 5-1. mpy16s R16 mc16sl Multiplicand Low Byte R17 mc16sh Multiplicand High Byte R18 mp16sl Multiplier Low Byte m16s0 Result Byte 0 R19 mp16sh Multiplier High Byte m16s1 Result Byte 1 R20 m16s2 Result Byte 2 R21 m16s2 Result Byte 2 R22 mcnt16s Loop Counter Table 5-2. mpy16s Performance Figures Code Size (Words) 16 + return Execution Time (Cycles) 218 + return : :7 : 10 AVR200

AVR200 6 8 / 8 = 8 + 8 Unsigned Division div8u 6.1 Algorithm Description Both program files contain a routine called div8u which performs unsigned 8-bit division. Both implementations are based on the same algorithm. The code size optimized implementation, however, uses looped code, whereas the speed optimized code is a straight-line code implementation. Figure 6-1 shows the flow chart for the code size optimized version. The algorithm for unsigned 8/8 division (Code Size optimized code) is as follows: 1. Clear remainder and carry. 2. Load Loop counter with nine. 3. Shift left dividend into carry. 4. Decrement Loop counter. 5. If Loop counter = 0, return. 6. Shift left carry (from dividend/result) into remainder 7. Subtract divisor from remainder. 8. If result negative, add back divisor, clear carry and goto Step 3. 9. Set carry and go to Step 3. Figure 6-1. div8u Flow Chart (Code Size Optimized Implementation) DIV8U CLEAR REMAIDER AD CARR LOOP COUTER 9 SHIFT LEFT DIVIDED DECREMET LOOP COUTER SHIFT LEFT REMAIDER LOOP COUTER = 0? REMAIDER REMAIDER DIVISOR RETUR RESULT EGATIVE? REMAIDER REMAIDER + DIVISOR CLEAR CARR SET CARR 11

6.2 Usage The usage of div8u is the same for both implementations and is described in the following procedure: 1. Load register variable dd8u with the dividend (the number to be divided). 2. Load register variable dv8u with the divisor (the dividing number). 3. Call div8u. 4. The result is found in dres8u and the remainder in drem8u. Observe that to minimize register usage, code and execution time, the dividend and result share the same register. 6.3 Performance Table 6-1. div8u (Code Size Optimized Version) R15 drem8u Remainder R16 dd8u Dividend dres8u Result R17 dv8u Divisor R18 dcnt8u Loop Counter Table 6-2. div8u Performance Figures (Code Size Optimized Version) Code Size (Words) 14 Execution Time (Cycles) 97 :1 :3 : Table 6-3. div8u (Speed Optimized Version) R15 drem8u Remainder R16 dd8u Dividend dres8u Result R17 dv8u Divisor Table 6-4. div8u Performance Figures (Speed Optimized Version) Code Size (Words) 66 Execution Time (Cycles) 58 :1 :2 : 12 AVR200

AVR200 7 8 / 8 = 8 + 8 Signed Division div8s 7.1 Algorithm Description The subroutine mpy8s implements signed 8-bit division. The implementation is Code Size optimized. If negative, the input values shall be represented on 2 s complement's form. The algorithm for signed 8/8 division is as follows: 1. XOR dividend and divisor and store in a Sign Register. 2. If MSB of dividend set, negate dividend. 3. If MSB if divisor set, negate dividend. 4. Clear remainder and carry. 5. Load Loop counter with nine. 6. Shift left dividend into carry. 7. Decrement Loop counter. 8. If Loop counter ¼ 0, goto step 11. 9. If MSB of Sign Register set, negate result. 10. Return 11. Shift left carry (from dividend/result) into remainder. 12. Subtract divisor from remainder. 13. If result negative, add back divisor, clear carry and go to Step 6. 14. Set carry and go to Step 6. 13

Figure 7-1. div8s Flow Chart DIV8S SIG REGISTER DIVIDED XOR DIVISOR MSB OF DIVISOR SET? EGATE DIVISOR MSB OF DIVIDED SET? EGATE DIVIDED LOOP COUTER 9 SHIFT LEFT DIVIDED DECREMET LOOP COUTER SHIFT LEFT REMAIDER LOOP COUTER = 0? MSB OF SIG REGISTER SET? EGATE RESULT REMAIDER REMAIDER DIVISOR RETUR RESULT EGATIVE? REMAIDER REMAIDER + DIVISOR CLEAR CARR SET CARR 7.2 Usage The usage of div8s follows the procedure below: 1. Load register variable dd8s with the dividend (the number to be divided). 2. Load register variable dv8s with the divisor (the dividing number). 3. Call div8s. 4. The result is found in dres8s and the remainder in drem8s. Observe that to minimize register usage, code and execution time, the dividend and result share the same register. 14 AVR200

AVR200 7.3 Performance Table 7-1. div8u R14 d8s Sign Register R15 drem8s Remainder R16 dd8s Dividend dres8s Result R17 dv8s Divisor R18 dcnt8s Loop Counter Table 7-2. div8s Performance Figures (Code Size Optimized Version) Code Size (Words) 22 Execution Time (Cycles) 103 :2 :3 : 8 16 / 16 = 16 + 16 Unsigned Division div16u 8.1 Algorithm Description Both program files contain a routine called div16u which performs unsigned 16-bit division Both implementations are based on the same algorithm. The code size optimized implementation, however, uses looped code whereas the speed optimized code is a straight-line code implementation. Figure 8-1 shows the flow chart for the code size optimized version. The algorithm for unsigned 16 / 16 division (Code Size optimized code) is as follows: 1. Clear remainder and carry. 2. Load Loop counter with 17. 3. Shift left dividend into carry 4. Decrement Loop counter. 5. If Loop counter = 0, return. 6. Shift left carry (from dividend/result) into remainder 7. Subtract divisor from remainder. 8. If result negative, add back divisor, clear carry and go to Step 3. 9. Set carry and go to Step 3. 15

Figure 8-1. div16u Flow Chart (Code Size Optimized Implementation) DIV16U CLEAR REMAIDER AD CARR LOOP COUTER 17 SHIFT LEFT DIVIDED DECREMET LOOP COUTER SHIFT LEFT REMAIDER LOOP COUTER = 0? REMAIDER REMAIDER DIVISOR RETUR RESULT EGATIVE? REMAIDER REMAIDER + DIVISOR CLEAR CARR SET CARR 8.2 Usage The usage of div16u is the same for both implementations and is described in the following procedure: 1. Load the 16-bit register variable dd16uh:dd16ul with the dividend (the number to be divided). 2. Load the 16-bit register variable dv16uh:dv16ul with the divisor (the dividing number). 3. Call div16u. 4. The result is found in dres16u and the remainder in drem16u. Observe that to minimize register usage, code and execution time, the dividend and result share the same registers. 16 AVR200

AVR200 8.3 Performance Table 8-1. div16u (Code Size Optimized Version) R14 R15 R16 R17 R18 R19 R20 dd16ul Dividend Low Byte dd16uh Dividend High Byte dv16ul Divisor Low Byte dv16uh Divisor High Byte dcnt16u Loop Counter drem16ul Remainder Low Byte drem16uh Remainder High Byte dres16ul Result Low Byte dres16uh Result High Byte drem16ul Remainder Low Byte Table 8-2. div16u Performance Figures (Code Size Optimized Version) Code Size (Words) 19 Execution Time (Cycles) 243 :2 :5 : Table 8-3. div16u (Speed Optimized Version) R14 drem16ul Remainder Low Byte R15 R16 R17 R18 R19 dd16ul Dividend Low Byte dd16uh Dividend High Byte dv16ul Divisor Low Byte dv16uh Divisor High Byte drem16uh Remainder High Byte dres16ul Result Low Byte dres16uh Result High Byte 17

Table 8-4. div16u Performance Figures (Speed Optimized Version) Code Size (Words) Execution Time (Cycles) 173 196 + return :2 :4 : 9 16 / 16 = 16 + 16 Signed Division div16s 9.1 Algorithm Description The subroutine mpy16s implements signed 16-bit division. The implementation is Code Size optimized. If negative, the input values shall be represented on 2 s complement s form. The algorithm for signed 16 / 16 division is as follows: 1. XOR dividend and divisor High bytes and store in a Sign Register. 2. If MSB of dividend High byte set, negate dividend. 3. If MSB if divisor set High byte, negate dividend. 4. Clear remainder and carry. 5. Load Loop counter with 17. 6. Shift left dividend into carry. 7. Decrement Loop counter. 8. If Loop counter ¼ 0, go to step 11. 9. If MSB of Sign register set, negate result. 10. Return 11. Shift left carry (from dividend/result) into remainder 12. Subtract divisor from remainder. 13. If result negative, add back divisor, clear carry and go to Step 6. 14. Set carry and go to Step 6. 18 AVR200

AVR200 Figure 9-1. div16s Flow Chart DIV16S SIG REGISTER DIVIDEDH XOR DIVISORH MSB OF DIVISOR SET? EGATE DIVISOR MSB OF DIVIDED SET? EGATE DIVIDED LOOP COUTER 17 SHIFT LEFT DIVIDED DECREMET LOOP COUTER SHIFT LEFT REMAIDER LOOP COUTER = 0? MSB OF SIG REGISTER SET? EGATE RESULT REMAIDER REMAIDER DIVISOR RETUR RESULT EGATIVE? REMAIDER REMAIDER + DIVISOR CLEAR CARR SET CARR 9.2 Usage The usage of div16s is described in the following procedure: 1. Load the 16-bit register variable dd16sh:dd16sl with the dividend (the number to be divided). 2. Load the 16-bit register variable dv16sh:dv16sl with the divisor (the dividing number). 3. Call div16s. 4. The result is found in dres16s and the remainder in drem16s. Observe that to minimize register usage, code and execution time, the dividend and result share the same registers. 19

9.3 Performance Table 9-1. div16s R14 R15 R16 R17 R18 R19 R20 dd16sl Dividend Low Byte dd16sh Dividend High Byte dv16sl Divisor Low Byte dv16sh Divisor High Byte dcnt16s Loop Counter drem16sl Remainder Low Byte drem16sh Remainder High Byte dres16sl Result Low Byte dres16sh Result High Byte Table 9-2. div16s Performance Figures Code Size (Words) 39 Execution Time (Cycles) 255 :2 :5 : 20 AVR200

Disclaimer Headquarters International Atmel Corporation 2325 Orchard Parkway San Jose, CA 95131 USA Tel: 1(408) 441-0311 Fax: 1(408) 487-2600 Atmel Asia Unit 1-5 & 16, 19/F BEA Tower, Millennium City 5 418 Kwun Tong Road Kwun Tong, Kowloon Hong Kong Tel: (852) 2245-6100 Fax: (852) 2722-1369 Atmel Europe Le Krebs 8, Rue Jean-Pierre Timbaud BP 309 78054 Saint-Quentin-en- velines Cedex France Tel: (33) 1-30-60-70-00 Fax: (33) 1-30-60-71-11 Atmel Japan 9F, Tonetsu Shinkawa Bldg. 1-24-8 Shinkawa Chuo-ku, Tokyo 104-0033 Japan Tel: (81) 3-3523-3551 Fax: (81) 3-3523-7581 Product Contact Web Site http://www.atmel.com/ Technical Support avr@atmel.com Sales Contact www.atmel.com/contacts Literature Request www.atmel.com/literature Disclaimer: The information in this document is provided in connection with Atmel products. o license, express or implied, by estoppel or otherwise, to any intellectual property right is granted by this document or in connection with the sale of Atmel products. EXCEPT AS SET FORTH I ATMEL S TERMS AD CODITIOS OF SALE LOCATED O ATMEL S WEB SITE, ATMEL ASSUMES O LIABILIT WHATSOEVER AD DISCLAIMS A EXPRESS, IMPLIED OR STATUTOR WARRAT RELATIG TO ITS PRODUCTS ICLUDIG, BUT OT LIMITED TO, THE IMPLIED WARRAT OF MERCHATABILIT, FITESS FOR A PARTICULAR PURPOSE, OR O-IFRIGEMET. I O EVET SHALL ATMEL BE LIABLE FOR A DIRECT, IDIRECT, COSEQUETIAL, PUITIVE, SPECIAL OR ICIDETAL DAMAGES (ICLUDIG, WITHOUT LIMITATIO, DAMAGES FOR LOSS OF PROFITS, BUSIESS ITERRUPTIO, OR LOSS OF IFORMATIO) ARISIG OUT OF THE USE OR IABILIT TO USE THIS DOCUMET, EVE IF ATMEL HAS BEE ADVISED OF THE POSSIBILIT OF SUCH DAMAGES. Atmel makes no representations or warranties with respect to the accuracy or completeness of the contents of this document and reserves the right to make changes to specifications and product descriptions at any time without notice. Atmel does not make any commitment to update the information contained herein. Unless specifically provided otherwise, Atmel products are not suitable for, and shall not be used in, automotive applications. Atmel s products are not intended, authorized, or warranted for use as components in applications intended to support or sustain life. 2009 Atmel Corporation. All rights reserved. Atmel, Atmel logo and combinations thereof, AVR, AVR logo and others, are registered trademarks, XMEGA and others are trademarks of Atmel Corporation or its subsidiaries. Other terms and product names may be trademarks of others.