Representação de Caracteres



Similar documents
ASCII Code. Numerous codes were invented, including Émile Baudot's code (known as Baudot

Memory is implemented as an array of electronic switches

URL encoding uses hex code prefixed by %. Quoted Printable encoding uses hex code prefixed by =.

BAR CODE 39 ELFRING FONTS INC.

Voyager 9520/40 Voyager GS9590 Eclipse 5145

The ASCII Character Set

plc numbers Encoded values; BCD and ASCII Error detection; parity, gray code and checksums

Symbols in subject lines. An in-depth look at symbols

Xi2000 Series Configuration Guide

BI-300. Barcode configuration and commands Manual

BARCODE READER V 2.1 EN USER MANUAL

Numeral Systems. The number twenty-five can be represented in many ways: Decimal system (base 10): 25 Roman numerals:

Create!form Barcodes. User Guide

ASCII CODES WITH GREEK CHARACTERS

This is great when speed is important and relatively few words are necessary, but Max would be a terrible language for writing a text editor.

Barcode Magstripe. Decoder & Scanner. Programming Manual

DEBT COLLECTION SYSTEM ACCOUNT SUBMISSION FILE

Chapter 5. Binary, octal and hexadecimal numbers

Chapter 1. Binary, octal and hexadecimal numbers

Command Emulator STAR Line Mode Command Specifications

TELOCATOR ALPHANUMERIC PROTOCOL (TAP)

Part No. : MUL PROGRAMMING GUIDE

7-Bit coded Character Set

Teletypewriter Communication Codes

INTERNATIONAL STANDARD

Model 200 / 250 / 260 Programming Guide

DL910 SERIES. Instruction Manual

Barcode Scanning Made Easy. Programming Guide

MK-SERIE 1000/1500/2000 AllOfBarcode.de Michael Krug Traunstein BARCODE SCANNER

MINIMAG. Magnetic Stripe Reader Keyboard Wedge. User s Manual

Security Protection of Software Programs by Information Sharing and Authentication Techniques Using Invisible ASCII Control Codes

CHAPTER 8 BAR CODE CONTROL

Characters & Strings Lesson 1 Outline

Digital Logic Design. Introduction

Enter/Exit programming

ASCII control characters (character code 0-31)

Barcode Scanning Made Easy. WWS500 Programming Guide

Chapter 4: Computer Codes

Scanner Configuration

Index...1. Introduction...3. Installation- Keyboard Wedge...3 RS USB...3. Default Setting for each barcode shown as below:...

Talk Encoding Issues. An overview to understand and be able to handle encoding issues in a better way. Susanne Ebrecht


Bar Code CCD Scanner OPERATION MANUAL

Levent EREN A-306 Office Phone: INTRODUCTION TO DIGITAL LOGIC

Applied Data Communication Lecture 14

Systems I: Computer Organization and Architecture

ESPA Nov 1984 PROPOSAL FOR SERIAL DATA INTERFACE FOR PAGING EQUIPMENT CONTENTS 1. INTRODUCTION 2. CHARACTER DESCRIPTION

Counting in base 10, 2 and 16

Barcode reader setup manual

PRINTED MANUAL AGG Software (

Chapter 1: Digital Systems and Binary Numbers

HANDHELD LASER SCANNER

BRMO 80 / ETH-IP. User Manual. Réf : MU-BRMO 80-ETH-IP-1.4-EN

ASCII Characters. 146 CHAPTER 3 Information Representation. The sign bit is 1, so the number is negative. Converting to decimal gives

S302D. Programming Guide. 2D Imaging Barcode Scanner. Advanced Handheld High-Speed Laser Scanner

IBM Emulation Mode Printer Commands

ESC/POS Command Specifications

Today s topics. Digital Computers. More on binary. Binary Digits (Bits)

MK D Imager Barcode Scanner Configuration Guide

PROPERTY MANAGEMENT SYSTEM

2011, The McGraw-Hill Companies, Inc. Chapter 3

Number Representation

NVT (Network Virtual Terminal) description

Communication Concepts

Allen-Bradley. Bar Code. 2-D Hand-Held. Programming Guide. Bar Code. Scanners. (Cat. No HTG-4)

The use of binary codes to represent characters

TAP Interface Specifications

MetroSelect Programming Guide. MLPN 2407/December 1998

CD-3860 Bar Code Scanner User s Manual

!"#$$$$First in Document Technology BARCODE User Guide & Programming Manual

How To Use A Microsoft Powerbook With A Microtron 2 (Ios) On A Microsatellite (Ipl) On An Iphone Or Ipro (Iphones) On Your Computer Or Ipo (Iphone)

Scanner Configuration Manager Manual. for PC type computers

C Examples! Jennifer Rexford!

Preservation Handbook

S PT-H500LI ELECTRONIC E C LABELING L SYSTEM INTRODUCTION EDITING A LABEL LABEL PRINTING USING THE FILE MEMORY USING P-TOUCH SOFTWARE

Technical Reference DYMO LabelWriter SE450 Label Printer

Code. Barc. ber 20100

ASCII Character Set and Numeric Values The American Standard Code for Information Interchange

TEXT FILES. Format Description / Properties Usage and Archival Recommendations

Logix5000 Controllers ASCII Strings

Data Hiding in s and Applications Using Unused ASCII Control Codes

How to represent characters?

Computer Peripherals

Design of a Bluetooth Enabled Android Application for a Microcontroller Driven Robot

ELECTRONIC DOCUMENT IMAGING

Encoding Text with a Small Alphabet

Digital System Design Prof. D Roychoudhry Department of Computer Science and Engineering Indian Institute of Technology, Kharagpur

Management Challenge. Managing Hardware Assets. Central Processing Unit. What is a Computer System?

Data Storage. Chapter 3. Objectives. 3-1 Data Types. Data Inside the Computer. After studying this chapter, students should be able to:

Docklight Pro Monitor User Manual 10/2015

XR-500 [Receipt Printer User s Manual ]

ROBO CYLINDER. Serial Communications Protocol. Intelligent Actuator, Inc.

QuickScan i. QD2100 Barcode Imager. Product Reference Guide

Control Functions for Coded Character Sets

Binary Representation

Programming Menu V3.9

Transcription:

Representação de Caracteres IFBA Instituto Federal de Educ. Ciencia e Tec Bahia Curso de Analise e Desenvolvimento de Sistemas Introdução à Ciência da Computação Prof. Msc. Antonio Carlos Souza Coletânea York University - ITEC 1011

Introdução Exemplos Real World Data Input device Computer Data Dear Mom: Keyboard 10110010 Digital camera 10110010

Formatos Apropriados A representação interna deve ser apropriada para o tipo de processamento (texto, imagem e som)

Tipos de Dados Números Inteiro ou ponto fixo Ponto Flutuante Número Decimal (BCD) Caracteres ACSII (American Standard Code for Information Interchange) EBCDIC (Extended binary Coded Decimal Interchange Code) Dados Lógicos Endereços

Convenções Formatos Apropriados Unique to a product or company E.g., Microsoft Word, Corel Word Perfect, IBM Lotus Notes Padrões Evolve two ways: Proprietary formats become de facto standards (e.g., Adobe PostScript, Apple Quick Time) Committee is struck to solve a problem (Motion Pictures Experts Group, MPEG)

Organizações Padrões ISO International Standards Organization CSA Canadian Standards Association ANSI American National Standards Institute IEEE Institute for Electrical and Electronics Engineers Etc.

Exemplos de Padrões Type of Data Standards Alphanumeric Image Motion picture Sound Outline graphics/fonts ASCII, EBCDIC, Unicode JPEG, GIF, PCX, TIFF MPEG-2, Quick Time Sound Blaster, WAV, AU PostScript, TrueType, PDF

Por que Padrões? Padrões são arbitrary Eles existem porque são: Convenient Efficient Flexible Appropriate Etc.

Representação de Caracteres Em geral, usa-se códigos alfanuméricos Código de 6 bits Código de 7 bits (ASCII) EBCDIC ASCII estendido ISO Latin - 1 Caracteres ANSI Caracteres Unicode

Dados Alfanuméricos Problema: Distinguir entre o número 123 (one hundred and twenty-three) and the characters 123 (one, two, three) Quatro padrões para representar letras (alpha) and números BCD Binary-coded decimal ASCII American standard code for information interchange EBCDIC Extended binary-coded decimal interchange code Unicode

Código de 6 bits Permite representar de 2 6 64 caracteres 26 letras maiúsculas 10 algarismos ( 0 1 2 3 4 5 6 7 8 9 ) 28 caracteres especiais, incluindo Space

7 bits (ASCII)

Binary-Coded Decimal (BCD) 4 bits por dígito Note: the following bit patterns are not used: 1010 1011 1100 1101 1110 1111 Digit 0 1 2 3 4 5 6 7 8 9 Bit pattern 0000 0001 0010 0011 0100 0101 0110 0111 1000 1001

Example 7093 10? (in BCD) 7 0 9 3 0111 0000 1001 0011

Standard Alphanumeric Formats BCD ASCII EBCDIC Unicode Next 22 slides

The Problem Representing text strings, such as Hello, world, in a computer

Codes and Characters Each character is coded as a byte Most common coding system is ASCII (Pronounced ass-key) ASCII American National Standard Code for Information Interchange Defined in ANSI document X3.4-1977

ASCII Features 7-bit code 8 th bit is unused (or used for a parity bit) 2 7 128 codes Two general types of codes: 95 are Graphic codes (displayable on a console) 33 are Control codes (control features of the console or communications channel)

ASCII Chart 000 001 010 011 100 101 110 111 0000 NULL DLE 0 @ P ` p 0001 SOH DC1! 1 A Q a q 0010 STX DC2 " 2 B R b r 0011 ETX DC3 # 3 C S c s 0100 EDT DC4 $ 4 D T d t 0101 ENQ NAK % 5 E U e u 0110 ACK SYN & 6 F V f v 0111 BEL ETB ' 7 G W g w 1000 BS CAN ( 8 H X h x 1001 HT EM ) 9 I Y i y 1010 LF SUB * : J Z j z 1011 VT ESC + ; K [ k { 1100 FF FS, < L \ l 1101 CR GS - M ] m } 1110 SO RS. > N ^ n ~ 1111 SI US /? O _ o DEL

000 001 010 011 100 101 110 111 0000 NULL DLE 0 @ P ` p 0001 SOH DC1! 1 A Q a q 0010 STX DC2 " 2 B R b r 0011 ETX DC3 # 3 C S c s 0100 EDT DC4 $ 4 D T d t 0101 ENQ NAK % 5 E U e u 0110 ACK SYN & 6 F V f v 0111 BEL ETB ' 7 G W g w 1000 BS CAN ( 8 H X h x 1001 HT EM ) 9 I Y i y 1010 LF SUB * : J Z j z 1011 VT ESC + ; K [ k { 1100 FF FS, < L \ l 1101 CR GS - M ] m } 1110 SO RS. > N ^ n ~ 1111 SI US /? O _ o DEL

000 001 010 011 100 101 110 111 0000 NULL DLE 0 @ P ` p 0001 SOH DC1! 1 A Q a q 0010 STX DC2 " 2 B R b r 0011 ETX DC3 # 3 C S c s 0100 EDT DC4 Most $ significant 4 bit D T d t 0101 ENQ NAK % 5 E U e u 0110 ACK SYN & 6 F V f v 0111 BEL ETB ' 7 G W g w 1000 BS CAN ( 8 H X h x 1001 HT EM ) 9 I Y i y 1010 LF SUB * : J Z j z 1011 Least significant VT ESC bit + ; K [ k { 1100 FF FS, < L \ l 1101 CR GS - M ] m } 1110 SO RS. > N ^ n ~ 1111 SI US /? O _ o DEL

e.g., a 1100001 000 001 010 011 100 101 110 111 0000 NULL DLE 0 @ P ` p 0001 SOH DC1! 1 A Q a q 0010 STX DC2 " 2 B R b r 0011 ETX DC3 # 3 C S c s 0100 EDT DC4 $ 4 D T d t 0101 ENQ NAK % 5 E U e u 0110 ACK SYN & 6 F V f v 0111 BEL ETB ' 7 G W g w 1000 BS CAN ( 8 H X h x 1001 HT EM ) 9 I Y i y 1010 LF SUB * : J Z j z 1011 VT ESC + ; K [ k { 1100 FF FS, < L \ l 1101 CR GS - M ] m } 1110 SO RS. > N ^ n ~ 1111 SI US /? O _ o DEL

95 Graphic codes 000 001 010 011 100 101 110 111 0000 NULL DLE 0 @ P ` p 0001 SOH DC1! 1 A Q a q 0010 STX DC2 " 2 B R b r 0011 ETX DC3 # 3 C S c s 0100 EDT DC4 $ 4 D T d t 0101 ENQ NAK % 5 E U e u 0110 ACK SYN & 6 F V f v 0111 BEL ETB ' 7 G W g w 1000 BS CAN ( 8 H X h x 1001 HT EM ) 9 I Y i y 1010 LF SUB * : J Z j z 1011 VT ESC + ; K [ k { 1100 FF FS, < L \ l 1101 CR GS - M ] m } 1110 SO RS. > N ^ n ~ 1111 SI US /? O _ o DEL

33 Control codes 000 001 010 011 100 101 110 111 0000 NULL DLE 0 @ P ` p 0001 SOH DC1! 1 A Q a q 0010 STX DC2 " 2 B R b r 0011 ETX DC3 # 3 C S c s 0100 EDT DC4 $ 4 D T d t 0101 ENQ NAK % 5 E U e u 0110 ACK SYN & 6 F V f v 0111 BEL ETB ' 7 G W g w 1000 BS CAN ( 8 H X h x 1001 HT EM ) 9 I Y i y 1010 LF SUB * : J Z j z 1011 VT ESC + ; K [ k { 1100 FF FS, < L \ l 1101 CR GS - M ] m } 1110 SO RS. > N ^ n ~ 1111 SI US /? O _ o DEL

Alphabetic codes 000 001 010 011 100 101 110 111 0000 NULL DLE 0 @ P ` p 0001 SOH DC1! 1 A Q a q 0010 STX DC2 " 2 B R b r 0011 ETX DC3 # 3 C S c s 0100 EDT DC4 $ 4 D T d t 0101 ENQ NAK % 5 E U e u 0110 ACK SYN & 6 F V f v 0111 BEL ETB ' 7 G W g w 1000 BS CAN ( 8 H X h x 1001 HT EM ) 9 I Y i y 1010 LF SUB * : J Z j z 1011 VT ESC + ; K [ k { 1100 FF FS, < L \ l 1101 CR GS - M ] m } 1110 SO RS. > N ^ n ~ 1111 SI US /? O _ o DEL

Numeric codes 000 001 010 011 100 101 110 111 0000 NULL DLE 0 @ P ` p 0001 SOH DC1! 1 A Q a q 0010 STX DC2 " 2 B R b r 0011 ETX DC3 # 3 C S c s 0100 EDT DC4 $ 4 D T d t 0101 ENQ NAK % 5 E U e u 0110 ACK SYN & 6 F V f v 0111 BEL ETB ' 7 G W g w 1000 BS CAN ( 8 H X h x 1001 HT EM ) 9 I Y i y 1010 LF SUB * : J Z j z 1011 VT ESC + ; K [ k { 1100 FF FS, < L \ l 1101 CR GS - M ] m } 1110 SO RS. > N ^ n ~ 1111 SI US /? O _ o DEL

Punctuation, etc. 000 001 010 011 100 101 110 111 0000 NULL DLE 0 @ P ` p 0001 SOH DC1! 1 A Q a q 0010 STX DC2 " 2 B R b r 0011 ETX DC3 # 3 C S c s 0100 EDT DC4 $ 4 D T d t 0101 ENQ NAK % 5 E U e u 0110 ACK SYN & 6 F V f v 0111 BEL ETB ' 7 G W g w 1000 BS CAN ( 8 H X h x 1001 HT EM ) 9 I Y i y 1010 LF SUB * : J Z j z 1011 VT ESC + ; K [ k { 1100 FF FS, < L \ l 1101 CR GS - M ] m } 1110 SO RS. > N ^ n ~ 1111 SI US /? O _ o DEL

Hello, world Example Binary 01001000 01100101 01101100 01101100 01101111 00101100 00100000 01110111 01100111 01110010 01101100 01100100 Hexadecimal 48 65 6C 6C 6F 2C 20 77 67 72 6C 64 Decimal 72 101 108 108 111 44 32 119 103 114 108 100 H el l o, w or l d

Common Control Codes CR 0D carriage return LF 0A line feed HT 09 horizontal tab DEL 7F delete NULL 00 null Hexadecimal code

000 001 010 011 100 101 110 111 0000 NULL DLE 0 @ P ` p 0001 SOH DC1! 1 A Q a q 0010 STX DC2 " 2 B R b r 0011 ETX DC3 # 3 C S c s 0100 EDT DC4 $ 4 D T d t 0101 ENQ NAK % 5 E U e u 0110 ACK SYN & 6 F V f v 0111 BEL ETB ' 7 G W g w 1000 BS CAN ( 8 H X h x 1001 HT EM ) 9 I Y i y 1010 LF SUB * : J Z j z 1011 VT ESC + ; K [ k { 1100 FF FS, < L \ l 1101 CR GS - M ] m } 1110 SO RS. > N ^ n ~ 1111 SI US /? O _ o DEL

000 001 010 011 100 101 110 111 0000 NULL DLE 0 @ P ` p 0001 SOH DC1! 1 A Q a q 0010 STX DC2 " 2 B R b r 0011 ETX DC3 # 3 C S c s 0100 EDT DC4 $ 4 D T d t 0101 ENQ NAK % 5 E U e u 0110 ACK SYN & 6 F V f v 0111 BEL ETB ' 7 G W g w 1000 BS CAN ( 8 H X h x 1001 HT EM ) 9 I Y i y 1010 LF SUB * : J Z j z 1011 VT ESC + ; K [ k { 1100 FF FS, < L \ l 1101 CR GS - M ] m } 1110 SO RS. > N ^ n ~ 1111 SI US /? O _ o DEL

Standard Alphanumeric Formats BCD ASCII EBCDIC Unicode Next 1 slides

EBCDIC Extended BCD Interchange Code (pronounced ebb -se-dick) 8-bit code Developed by IBM Rarely used today IBM mainframes only

8 bits (EBCDIC) Extended Binary Coded Decimal Interchange Code

8 bits (ASCII Estendido)

ISO Latin-1

Caracteres ANSI Windows 9x suporta caracteres ANSI American National Standards Institute Representação de 8 bits (256 caracteres) 0 a 255 Valores de 0 a 127: mesmos de ASCII Entre 128 a 255: similar a ISO Latin-1 Tem extensões e incompatibilidades

Standard Alphanumeric Formats BCD ASCII EBCDIC Unicode Next 2 slides

Unicode 16-bit standard Developed by a consortia Intended to supercede older 7- and 8-bit codes

Unicode Version 2.1 1998 Improves on version 2.0 Includes the Euro sign (20AC 16 ) From the standard: contains 38,887 distinct coded characters derived from the supported scripts. These characters cover the principal written languages of the Americas, Europe, the Middle East, Africa, India, Asia, and Pacifica. http://www.unicode.org

Caracteres Unicode Windows NT usa Unicode 16-bits Cobre grande parte das línguas vivas Também linguas mortas (uso escolar) Detalhes http://www.unicode.org

Keyboard Input Key ( scan ) codes are converted to ASCII ASCII code sent to host computer Received by the host as a stream of data Stored in buffer Processed Etc.

Outras Entradas OCR optical character recognition Bar code readers Voice/audio input Punched cards Images / objects Pointing devices