Download Lecture 7

Chapter Three : Arithmetic for Computers 2004 Morgan Kaufmann Publishers 1 Outline • • • • • • • • • 3.1 3.2 3.3 3.4 3.5 3.6 3.7 3.8 3.9 Introduction Signed and Unsigned Numbers Addition and Subtraction Multiplication Division Floating Point Real Stuff: Floating Point in the IA-32 Fallacies and Pitfalls Concluding Remarks 2004 Morgan Kaufmann Publishers 2 3.1 Introduction 2004 Morgan Kaufmann Publishers 3 Numbers • • • • Bits are just bits (no inherent meaning) — conventions define relationship between bits and numbers Binary numbers (base 2) 0000 0001 0010 0011 0100 0101 0110 0111 1000 1001... decimal: 0...2n-1 Of course it gets more complicated: numbers are finite (overflow) fractions and real numbers negative numbers e.g., no MIPS subi instruction; addi can add a negative number How do we represent negative numbers? i.e., which bit patterns will represent which numbers? 2004 Morgan Kaufmann Publishers 4 Possible Representations • Sign Magnitude: 000 = +0 001 = +1 010 = +2 011 = +3 100 = -0 101 = -1 110 = -2 111 = -3 • • One's Complement Two's Complement 000 = +0 001 = +1 010 = +2 011 = +3 100 = -3 101 = -2 110 = -1 111 = -0 000 = +0 001 = +1 010 = +2 011 = +3 100 = -4 101 = -3 110 = -2 111 = -1 Issues: balance, number of zeros, ease of operations Which one is best? Why? 2004 Morgan Kaufmann Publishers 5 3.2 Signed and Unsigned Numbers 2004 Morgan Kaufmann Publishers 6 Keywords • Least significant bit The rightmost bit in a MIPS word. • Most significant bit The leftmost bit in a MIPS word. • Biased notation A notation that represents the most negative value by 00 … 000 two and the most positive value by 11 … 111two with 0 typically having the value 10 … 00 two , thereby biasing the number bias has a nonnegative representation. 2004 Morgan Kaufmann Publishers 7 Biased notation 11…111two ; the most positive value . . . 10…000two ; 0, biasing the number . . . 00…000two ; the most negative value 2004 Morgan Kaufmann Publishers 8 MIPS • 32 bit signed numbers: 0000 0000 0000 ... 0111 0111 1000 1000 1000 ... 1111 1111 1111 0000 0000 0000 0000 0000 0000 0000two = 0ten 0000 0000 0000 0000 0000 0000 0001two = + 1ten 0000 0000 0000 0000 0000 0000 0010two = + 2ten 1111 1111 0000 0000 0000 1111 1111 0000 0000 0000 1111 1111 0000 0000 0000 1111 1111 0000 0000 0000 1111 1111 0000 0000 0000 1111 1111 0000 0000 0000 1110two 1111two 0000two 0001two 0010two = = = = = + + – – – 2,147,483,646ten 2,147,483,647ten 2,147,483,648ten 2,147,483,647ten 2,147,483,646ten maxint minint 1111 1111 1111 1111 1111 1111 1101two = – 3ten 1111 1111 1111 1111 1111 1111 1110two = – 2ten 1111 1111 1111 1111 1111 1111 1111two = – 1ten 2004 Morgan Kaufmann Publishers 9 Two's Complement Operations • Negating a two's complement number: invert all bits and add 1 – remember: “negate” and “invert” are quite different! • Converting n bit numbers into numbers with more than n bits: – MIPS 16 bit immediate gets converted to 32 bits for arithmetic – copy the most significant bit (the sign bit) into the other bits 0010 -> 0000 0010 1010 -> 1111 1010 – "sign extension" (lbu vs. lb) 2004 Morgan Kaufmann Publishers 10 Signed versus Unsigned Comparison 1111 1111 1111 1111 1111 1111 1111 1111two # $s0 0000 0000 0000 0000 0000 0000 0000 0000two #$t1 slt $t0, $s0, $s1 #signed comparison, -1ten< 1ten, $t0=1 sltu $t0, $s0, $s1 #unsigned comparison, 4,294,967,295ten< 1ten, $t0=0 2004 Morgan Kaufmann Publishers 11 Negation Shortcut Negate 2ten and then check the result by negating -2ten 1111 1111 1111 1111 1111 1111 1111 1101two + 1two ---------------------------------------------------------------------------1111 1111 1111 1111 1111 1111 1111 1110two=-2ten 0000 0000 0000 0000 0000 0000 0000 0001two + 1two ---------------------------------------------------------------------------0000 0000 0000 0000 0000 0000 0000 0010two=2ten 2004 Morgan Kaufmann Publishers 12 Sign Extension Shortcut Convert 16-bit binary versions of 2ten and -2ten to 32-bit binary numbers. 0000 0000 0000 0010two= 2 ten 1111 1111 1111 1110two= -2 ten 0000 0000 0000 0000 0000 0000 0000 0010two= 2 ten 1111 1111 1111 1111 1111 1111 1111 1110two= -2 ten 2004 Morgan Kaufmann Publishers 13 FIGURE 3.1 MIPS architecture revealed thus far. MIPS operands Name Example Comments 32 registers $s0-$s7, $t 0- $t 9, $zero, $a0-a3, $v0-$v1, $gp, $fp, $sp, $ra, $at Fast locations for data. In MIPS, data must be in registers to perform arithmetic. MIPS register $zero always equals 0. Register $at is reserved for the assembler to handle large constants. 30memory 2 words Memory[0], Memory[4], …, Memory[4294967292] Accessed only by data transfer instructions in MIPS. MIPS uses byte addresses, so sequential word addresses differ by 4. Memory holds data structures, arrays, and spilled registers, such as those saved on procedure calls. 2004 Morgan Kaufmann Publishers 14 MIPS assembly language Category Arithmetic Data transfer Logical Instruction Example Meaning Comments add add $s1, $s2, $s3 $s1 = $s2 + $s3 Three operands; Overflow detected subtract sub $s1, $s2, $s3 $s1 = $s2 - $s3 Three operands; Overflow detected add immediate addi $s1, $s2, 100 $s1 = $s2+ 100 Used to add constants load word lw $s1, 100($s2) $s1 = Memory [$s2 + 100] Word from memory to register store word sw $s1, 100 ($s2) Memory [$s2 + 100] = $s1 Word from register to memory load half unsigned lh $s1, 100($s2) $s1 = Memory [$s2 + 100] Halfword memory to register store half sh $s1, 100 ($s2) Memory [$s2 + 100] = $s1 Halfword register to memory load byte unsigned lb $s1, 100($s2) $s1 = Memory [$s2 + 100] Byte from memory to register store byte sb $s1, 100 ($s2) Memory [$s2 + 100] = $s1 Byte from register to memory load upper immed. lui $s1, 100 $s1 = 100 * 2^16 Loads constant in upper 16 bits and add $s1, $s2, $s3 $s1 = $s2 & $s3 Three reg. operands; bit-by-bit AND or or $s1, $s2, $s3 $s1 = $s2 | $s3 Three reg. operands; bit-by-bit OR nor nor $s1, $s2, $s3 $s1 = ~($s2 | $s3) Three reg. operands; bit-by-bit NOR and immediate andi $s1, $s2, 100 $s1 = $s2 & 100 Bit-by-bit AND reg with constant or immediate ori $s1, $s2, 100 $s1 = $s2 | 100 Bit-by-bit OR reg with constant shift left logical sll $s1, $s2, 10 $s1 = $s2 << 10 Shift left by constant shift right logical srl $$s1, $s2, 10 $s1 = $s2 >> 10 Shift right by constant 2004 Morgan Kaufmann Publishers 15 Continue.. Condition al branch Unconditi onal jump branch on equal beq $s1, $s2, 25 if ($s1 == $s2) go to PC+4+100 Equal test; PC-relative branch branch on not equal bne $s1, $s2, 25 if ($s1 != $s2) go to L PC+4+100 Not equal test; PC-relative set on less than slt $s1, $s2, $s3 if ($s2 < $s3) $s1 = 1; else $s1 = 0 Compare less than; two’s complement set on less than immediate slt $s1, $s2, 100 if ($s2 < 100) $s1 = 1; else $s1 = 0 Compare < constant; Two’s complement set less than unsigned sltu $s1, $s2, $s3 if ($s2 < $s3) $s1 = 1; else $s1 = 0 Compare less than; unsigned numbers set less than immediate unsigned sltuiu $s1, $s2, 100 if ($s2 < 100) $s1 = 1; else $s1 = 0 Compare< constant; unsigned numbers jump j 2500 go to 10000 Jump to target address jump register jr $ra go to $ra For procedure return jump and link jal 2500 $ra = PC + 4; go to 10000 For procedure call 2004 Morgan Kaufmann Publishers 16 3.3 Addition and Subtraction 2004 Morgan Kaufmann Publishers 17 Keywords • Exception Also called interrupt. An unscheduled event that disrupts program execution; used to detect overflow. • Interrupt An exception that comes from outside of the processor. (Some architectures use the term interrupt for all exceptions.) 2004 Morgan Kaufmann Publishers 18 Addition & Subtraction • Just like in grade school (carry/borrow 1s) 0111 0111 0110 + 0110 - 0110 - 0101 • Two's complement operations easy – subtraction using addition of negative numbers 0111 + 1010 • Overflow (result too large for finite computer word): – e.g., adding two n-bit numbers does not yield an n-bit number 0111 + 0001 note that overflow term is somewhat misleading, 1000 it does not mean a carry “overflowed” 2004 Morgan Kaufmann Publishers 19 FIGURE 3.2 Binary addition, showing carries from right to left. 2004 Morgan Kaufmann Publishers 20 Detecting Overflow • • • • No overflow when adding a positive and a negative number No overflow when signs are the same for subtraction Overflow occurs when the value affects the sign: – overflow when adding two positives yields a negative – or, adding two negatives gives a positive – or, subtract a negative from a positive and get a negative – or, subtract a positive from a negative and get a positive Consider the operations A + B, and A – B – Can overflow occur if B is 0 ? – Can overflow occur if A is 0 ? 2004 Morgan Kaufmann Publishers 21 FIGURE 3.3 Overflow conditions for addition and subtraction. Operation Operand A Operand B Result indicating overflow A+B 0 0 A+B 0 A–B 0 0 0 0 A–B 0 0 0 0 0 2004 Morgan Kaufmann Publishers 22 Effects of Overflow • • • An exception (interrupt) occurs – Control jumps to predefined address for exception – Interrupted address is saved for possible resumption Details based on software system / language – example: flight control vs. homework assignment Don't always want to detect overflow — new MIPS instructions: addu, addiu, subu note: addiu still sign-extends! note: sltu, sltiu for unsigned comparisons 2004 Morgan Kaufmann Publishers 23 addu $t0, $t1, $t2 xor $t3, $t1, $t2 slt $t3, $t3, $zero bne $t3, $zero, no_overflow xor 4t3, $t0, $t1 slt $t3, $t3, $zero bne $t3, $zero, overflow 2004 Morgan Kaufmann Publishers 24 addu $t0, $t1, $t2 nor $t3, $t1, $zero sltu $t3, $t3, $t2 bne $t3, $zero, overflow 2004 Morgan Kaufmann Publishers 25 FIGURE 3.4 MIPS architecture revealed thus far MIPS assembly language Category Arithmetic Data transfer Logical Instruction Example Meaning Comments add add $s1, $s2, $s3 $s1 = $s2 + $s3 Three operands; Overflow detected subtract sub $s1, $s2, $s3 $s1 = $s2 - $s3 Three operands; Overflow detected add immediate addi $s1, $s2, 100 $s1 = $s2+ 100 + constants; overflow detected add unsigned addu $s1, $s2, $s3 $s1 = $s2 + $s3 Three operands; overflow undetected subtract unsigned subu $s1, $s2, $s3 $s1 = $s2 - $s3 Three operands; overflow undetected add immediate unsigned addiu $s1, $s2, 100 $s1 = $s2+ 100 + constants; overflow detected move from coprocessor register mfc0 $s1, $epc $s1 = $epc Used to copy Exception PC plus other special registers load word lw $s1, 100($s2) $s1 = Memory [$s2 + 100] Word from memory to register store word sw $s1, 100 ($s2) Memory [$s2 + 100] = $s1 Word from register to memory load half unsigned lh $s1, 100($s2) $s1 = Memory [$s2 + 100] Halfword memory to register store half sh $s1, 100 ($s2) Memory [$s2 + 100] = $s1 Halfword register to memory load byte unsigned lb $s1, 100($s2) $s1 = Memory [$s2 + 100] Byte from memory to register store byte sb $s1, 100 ($s2) Memory [$s2 + 100] = $s1 Byte from register to memory load upper immed. lui $s1, 100 $s1 = 100 * 2^16 Loads constant in upper 16 bits and add $s1, $s2, $s3 $s1 = $s2 & $s3 Three reg. operands; bit-by-bit AND or or $s1, $s2, $s3 $s1 = $s2 | $s3 Three reg. operands; bit-by-bit OR nor nor $s1, $s2, $s3 $s1 = ~($s2 | $s3) Three reg. operands; bit-by-bit NOR and immediate andi $s1, $s2, 100 $s1 = $s2 & 100 Bit-by-bit AND reg with constant 2004 Morgan Kaufmann Publishers 26 Continue.. Condition al branch Unconditi onal jump or immediate ori $s1, $s2, 100 $s1 = $s2 | 100 Bit-by-bit OR reg with constant shift left logical sll $s1, $s2, 10 $s1 = $s2 << 10 Shift left by constant shift right logical srl $$s1, $s2, 10 $s1 = $s2 >> 10 Shift right by constant branch on equal beq $s1, $s2, 25 if ($s1 == $s2) go to PC+4+100 Equal test; PC-relative branch branch on not equal bne $s1, $s2, 25 if ($s1 != $s2) go to L PC+4+100 Not equal test; PC-relative set on less than slt $s1, $s2, $s3 if ($s2 < $s3) $s1 = 1; else $s1 = 0 Compare less than; two’s complement set on less than immediate slt $s1, $s2, 100 if ($s2 < 100) $s1 = 1; else $s1 = 0 Compare < constant; Two’s complement set less than unsigned sltu $s1, $s2, $s3 if ($s2 < $s3) $s1 = 1; else $s1 = 0 Compare less than; unsigned numbers set less than immediate unsigned sltuiu $s1, $s2, 100 if ($s2 < 100) $s1 = 1; else $s1 = 0 Compare< constant; unsigned numbers jump j 2500 go to 10000 Jump to target address jump register jr $ra go to $ra For procedure return jump and link jal 2500 $ra = PC + 4; go to 10000 For procedure call 2004 Morgan Kaufmann Publishers 27 3.4 Multiplication 2004 Morgan Kaufmann Publishers 28 Multiplication • • • More complicated than addition – accomplished via shifting and addition More time and more area Let's look at 3 versions based on a gradeschool algorithm 0010 __x_1011 • (multiplicand) (multiplier) Negative numbers: convert and multiply – there are better techniques, we won’t look at them 2004 Morgan Kaufmann Publishers 29 Multiplying 1000two by 1001two Multiplicand Multiplier 1000two 1001two ------------------------------------------------------------------------1000 (1xmultiplicand) 0000 (0xmultiplicand) 0000 (0xmultiplicand) 1000 (1xmultiplicand) -------------------------------------------------------------------------- Product 1001000two 2004 Morgan Kaufmann Publishers 30 Multiplication: Implementation Datapath Multiplicand Shift left 64 bits Multiplier Shift right 64-bit ALU 32 bits Product Write Control test 64 bits 2004 Morgan Kaufmann Publishers 31 Start Control Multiplier0 = 1 1. Test Multiplier0 = 0 Multiplier0 1a. Add multiplicand to product and place the result in Product register 2. Shift the Multiplicand register left 1 bit 3. Shift the Multiplier register right 1 bit No: < 32 repetitions 32nd repetition? Yes: 32 repetitions Done 2004 Morgan Kaufmann Publishers 32 Final Version •Multiplier starts in right half of product Multiplicand 32 bits 32-bit ALU Product Shift right Write Control test 64 bits 2004 Morgan Kaufmann Publishers 33 Start Product0 = 1 1. Test Product0 = 0 Product0 1a. Add multiplicand to the left half of the product and place the result in the left half of the Product register 3. Shift the Product register right 1 bit No: < 32 repetitions 32nd repetition? Yes: 32 repetitions Done 2004 Morgan Kaufmann Publishers 34 FIGURE 3.8 Multiply example using algorithm in Figure 3.6. 2004 Morgan Kaufmann Publishers 35 FIGURE 3.9 Fast multiplication hardware. 2004 Morgan Kaufmann Publishers 36 3.5 Division 2004 Morgan Kaufmann Publishers 37 Keywords • Dividend A number being divided. • Divisor A number that the dividend is divided by. • Quotient The primary result of a division; a number that when multiplied by the divisor and added to the remainder produces the dividend. • Remainder The secondary result of a division; a number that when added to the product of the quotient and the divisor produces the dividend. 2004 Morgan Kaufmann Publishers 38 A example is dividing, 1,001,010two by 1000two 1001 two Quotient ---------------------------Divisor 1000two | 1001010 two - 1000 ----------------------------10 101 1010 -1000 ------------------------------10 Remainder 2004 Morgan Kaufmann Publishers 39 FIGURE 3.10 First version of the division hardware. 2004 Morgan Kaufmann Publishers 40 FIGURE 3.11 A division algorithm, using the hardware in Figure 3.10. 2004 Morgan Kaufmann Publishers 41 FIGURE 3.12 Division example using the algorithm in Figure 3.11. 2004 Morgan Kaufmann Publishers 42 FIGURE 3.13 An improved version of the division hardware. 2004 Morgan Kaufmann Publishers 43 Start 1. Shift the Remainder register left 1 bit 2. Subtract the Divisor register from the left half of the Remainder register and place the result in the left half of the Remainder register Remainder ≥ 0 Test Remainder Remainder < 0 3b. Restore the original value by adding the Divisor register to the left half of the Remainder register and place the sum in the left half of the Remainder register. Also shift the Remainder register to the left, setting the new rightmost bit to 0 3a. Shift the Remainder register to the left, setting the new rightmost bit to 1 32nd repetition No: < 32 repetitions Yes: 32 repetitions Done. Shift left half of Remainder right 1 bit 2004 Morgan Kaufmann Publishers 44 Signed Division The simplest solution is to remember the signs of the divisor and dividend and then negate the quotient if the signs disagree. Dividend = Quotient * Divisor + Remainder +7 / +2: Quotient = +3, Remainder = +1. -7 / +2: Quotient = -3, Remainder = -1. +7 / -2: Quotient = -3, Remainder = +1. -7 / -2: Quotient = +3, Remainder = -1. 2004 Morgan Kaufmann Publishers 45 3.6 Floating Point 2004 Morgan Kaufmann Publishers 46 Keywords (1) • Scientific notation A notation that renders numbers with a single digit to the left of the decimal point. • Normalized A number in floating-point notation that has no leading 0. • Floating point Computer arithmetic that represents numbers in which the binary point is not fixed. • Fraction The value, generally between 0 and 1, placed in the fraction field. 2004 Morgan Kaufmann Publishers 47 • • • • • • 3.14159265…ten 2.71828…(℮) 0.000000001ten or 1.0X10-9 3,155,760,000ten or 3.15576tenX109 1.0twoX2-1 1.xxxxxxxxtwox2yyyy 2004 Morgan Kaufmann Publishers 48 Keywords (2) • Exponent In the numerical representation system of floating-point arithmetic, the value that is placed in the exponent field. • Overflow (floating-point) A situation in which a positive exponent becomes too large to fit in the exponent field. • Underflow (floating-point) A situation in which a negative exponent becomes too large to fit in the exponent field. • Double precision A floating-point value represented in two 32-bit word. 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 s exponent fraction 1 bit 8 bits 23 bits 8 7 6 5 4 3 2 1 2004 Morgan Kaufmann Publishers 0 49 Keywords (3) • Single precision A floating-point value represented in a single 32-bit word. 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 s exponent fraction 1 bit 11 bits 7 6 5 4 3 2 1 0 20 bits Fraction (continued) 32 bits • Units in the last place (ulp) The number of bits in error in the least significant bits of the significant between the actual number and the number that can be prepresented. • Sticky bit A bit used in rounding in addition to guard and round that is set whenever there are nonzero bits to the right of the round bit. 2004 Morgan Kaufmann Publishers 50 Floating Point (a brief look) • We need a way to represent – numbers with fractions, e.g., 3.1416 – very small numbers, e.g., .000000001 – very large numbers, e.g., 3.15576 ´ 109 • Representation: – sign, exponent, significand: (–1)sign ´ significand ´ 2exponent – more bits for significand gives more accuracy – more bits for exponent increases range • IEEE 754 floating point standard: – single precision: 8 bit exponent, 23 bit significand – double precision: 11 bit exponent, 52 bit significand 2004 Morgan Kaufmann Publishers 51 FIGURE 3.14 MIPS architecture revealed thus far MIPS assembly language Category Arithmetic Data transfer Instruction Example Meaning Comments add add $s1, $s2, $s3 $s1 = $s2 + $s3 Three operands; Overflow detected subtract sub $s1, $s2, $s3 $s1 = $s2 - $s3 Three operands; Overflow detected add immediate addi $s1, $s2, 100 $s1 = $s2+ 100 + constants; overflow detected add unsigned addu $s1, $s2, $s3 $s1 = $s2 + $s3 Three operands; overflow undetected subtract unsigned subu $s1, $s2, $s3 $s1 = $s2 - $s3 Three operands; overflow undetected add immediate unsigned addiu $s1, $s2, 100 $s1 = $s2+ 100 + constants; overflow detected move from coprocessor register mfc0 $s1, $epc $s1 = $epc Copy Exception PC + special regs multiply mult Hi, Lo = $s2 x $s3 64-bit signed product in Hi, Lo multiply unsigned multu Hi, Lo = $s2 x $s3 64-bit unsigned product in Hi, Lo divide div Lo = $s2 / $s3 Hi = $s2 mod $s3 Lo = quotient, Hi = remainder divide unsigned divu $s2, $s3 Lo = $s2 / $s3 Hi = $s2 mod $s3 Unsigned quotient and remainder move from Hi mfhi $s1 $s1 = Hi Used to get copy of Hi move from Lo mflo $s1 $s1 = Lo Used to get copy of Lo load word lw $s1, 100($s2) $s1 = Memory [$s2 + 100] Word from memory to register store word sw $s1, 100 ($s2) Memory [$s2 + 100] = $s1 Word from register to memory load half unsigned lh $s1, 100($s2) $s1 = Memory [$s2 + 100] Halfword memory to register store half sh $s1, 100 ($s2) Memory [$s2 + 100] = $s1 Halfword register to memory load byte unsigned lb $s1, 100($s2) $s1 = Memory [$s2 + 100] Byte from memory to register store byte sb $s1, 100 ($s2) Memory [$s2 + 100] = $s1 Byte from register to memory load upper immed. lui $s1, 100 $s1 = 100 * 2^16 Loads constant in upper 16 bits $s2, $s3 $s2, $s3 $s2, $s3 2004 Morgan Kaufmann Publishers 52 Continue.. Logical Condition al branch Unconditi onal jump and add $s1, $s2, $s3 $s1 = $s2 & $s3 Three reg. operands; bit-by-bit AND or or $s1, $s2, $s3 $s1 = $s2 | $s3 Three reg. operands; bit-by-bit OR nor nor $s1, $s2, $s3 $s1 = ~($s2 | $s3) Three reg. operands; bit-by-bit NOR and immediate andi $s1, $s2, 100 $s1 = $s2 & 100 Bit-by-bit AND reg with constant or immediate ori $s1, $s2, 100 $s1 = $s2 | 100 Bit-by-bit OR reg with constant shift left logical sll $s1, $s2, 10 $s1 = $s2 << 10 Shift left by constant shift right logical srl $$s1, $s2, 10 $s1 = $s2 >> 10 Shift right by constant branch on equal beq $s1, $s2, 25 if ($s1 == $s2) go to PC+4+100 Equal test; PC-relative branch branch on not equal bne $s1, $s2, 25 if ($s1 != $s2) go to L PC+4+100 Not equal test; PC-relative set on less than slt $s1, $s2, $s3 if ($s2 < $s3) $s1 = 1; else $s1 = 0 Compare less than; two’s complement set on less than immediate slt $s1, $s2, 100 if ($s2 < 100) $s1 = 1; else $s1 = 0 Compare < constant; Two’s complement set less than unsigned sltu $s1, $s2, $s3 if ($s2 < $s3) $s1 = 1; else $s1 = 0 Compare less than; natural numbers set less than immediate unsigned sltuiu $s1, $s2, 100 if ($s2 < 100) $s1 = 1; else $s1 = 0 Compare< constant; natural numbers jump j 2500 go to 10000 Jump to target address jump register jr $ra go to $ra For switch, procedure return jump and link jal 2500 $ra = PC + 4; go to 10000 2004 call Morgan Kaufmann Publishers For procedure 53 IEEE 754 floating-point standard • Leading “1” bit of significand is implicit • Exponent is “biased” to make sorting easier – all 0s is smallest exponent all 1s is largest – bias of 127 for single precision and 1023 for double precision – summary: (–1)sign ´ (1+significand) ´ 2exponent – bias • Example: – decimal: -.75 = - ( ½ + ¼ ) – binary: -.11 = -1.1 x 2-1 – floating point: exponent = 126 = 01111110 – IEEE single precision: 10111111010000000000000000000000 2004 Morgan Kaufmann Publishers 54 FIGURE 3.15 IEEE 754 encoding of floating-point numbers. Single precision Double precision Object representation Exponent Fraction Exponent Fraction 0 0 0 0 0 0 nonzero 0 nonzero 1 – 254 anything 1 – 2046 anything  renormalized number  floating-point number 255 0 2047 0  infinity 255 nonzero 2047 nonzero NaN (Not a Number) 2004 Morgan Kaufmann Publishers 55 Floating-Point Representation • Show the IEEE 754 binary representation of the number -0.75 in single and double precision. • Step1: The number -0.75ten is also -3/4ten = -(1/2+1/4) = -0.11two Step2: In scientific notation, the value is  0.11two  20 and in normalized scientific notation, it is  1.1two  2 1 Step3: (1) The general representation for a single precision number is (1) s  (1 + Fraction)  2( Exeponent127) Then the result is (1)1  (1 + .1000 0000 0000 0000 0000 000two )  2(126127) 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 1 0 1 1 1 1 1 1 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 8 0 7 0 6 0 5 0 4 0 3 0 2 1 0 2004 Morgan Kaufmann Publishers 0 0 0 56 • (2) The general representation for a double precision number is (1) s  (1 + Fraction)  2( Exeponent1023) Then the result is  12   (1)1  (1 + .1000 0000  0000two )  2(10221023) 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 1 0 1 1 1 1 1 1 1 1 1 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 bit 0 11 bits 0 0 0 0 0 0 20 bits 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 32 bits 2004 Morgan Kaufmann Publishers 57 Converting Binary to Decimal Floating Point • What decimal number is represented by this single precision float? 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 1 1 0 0 0 0 0 0 1 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 The sign bit is 1, the exponent field contains 129, and the fraction field contains 1 2 2  1 / 4 , or 0.25. Using the basic equation, (1) s  (1 + Fraction)  2( ExponentBias)  (1)1  (1 + 0.25)  2(129127)  11.25  2 2  1.25  4  5.0 2004 Morgan Kaufmann Publishers 58 Floating-Point Addition Step1: Align the decimal point of the number that has the smaller exponent. 9.999 101 + 1.610  10 1 ten ten The number on the right is the version we desire, since its exponent matches the exponent of the larger number, 9.999ten 101 . Step2: Next comes the addition of the significands: 1.610ten 10 1  0.1610ten 100  0.01610ten 101 9.999ten + 0.016ten 10.015ten The sum is 10.015ten 101 Step3: Normalize the sum into a scientific notation. 10.015ten 101  1.0015ten 10 2 Step4: Round the number (we assume that the significand can be only four digits long.) 1.0015ten 102  1.002ten 102 2004 Morgan Kaufmann Publishers 59 Decimal Floating-Point Addition – [ 0.5ten + ( - 0.4375ten) ] • First: Convert two decimal numbers into binary numbers that are in normalized scientific notation. 0.5ten  1 / 2ten  1 / 21ten  0.1two  0.1two  20  1.000two  2 1  0.4375ten  7 / 16ten  7 / 2 4 ten  0.0111two  0.0111two  20  1.110two  2  2 Then we follow the forward algorithm: • Step1:  1.110two  2 2  0.111two  2 1 • Step2: 1.000two  2 1 + (0.111two  2 1 )  0.001two  2 1 • Step3: 0.001two  2 1  0.010two  2 2  0.100two  2 3  1.000two  2 4 Since 127  4  126 , there is no overflow or underflow. 1.000two  24  0.0001000two  0.0001two  1 / 24ten  1 / 16ten  0.0625ten 2004 Morgan Kaufmann Publishers 60 Floating point addition Sign Exponent Fraction Sign Exponent Fraction • Small ALU Exponent difference 0 1 0 1 0 1 Shift right Control Big ALU 0 0 1 Increment or decrement 1 Shift left or right Rounding hardware Sign Exponent Fraction 2004 Morgan Kaufmann Publishers 61 Start 1. Compare the exponents of the two numbers. Shift the smaller number to the right until its exponent would match the larger exponent 2. Add the significands 3. Normalize the sum, either shifting right and incrementing the exponent or shifting left and decrementing the exponent Overflow or underflow? Yes No Exception 4. Round the significand to the appropriate number of bits No Still normalized? Yes 2004 Morgan Kaufmann Publishers Done 62 Floating-Point Multiplication -- 1.110ten 1010  9.200ten 105 • Step1: Calculate the new exponent with the biased: 10 + ( - 5 ) = 5 • Step2: Next comes the multiplication of the significands: 1.110ten  9.200ten 0000 0000 2220 9990 10212000ten The result is 10.212ten 10 5 • Step3: Normalize the result: 10.212ten 10  1.0212ten 10 • Step4: We assume that the significand is only four digits long, so we must round the number: 1.0212ten 106  1.021ten 106 • Step5: The sign of the product: 5 + 1.021ten 106 6 2004 Morgan Kaufmann Publishers 63 Decimal Floating-Point Multiplication -- 0.5ten  (0.4375ten ) 1.000two  2 1 by  1.110two  2 2 • In binary, the task is multiplying • Step1: - 1 + ( - 2 ) = - 3 • Step2: Multiplying the significands: 1.000 two  1.110 two 0000 1000 1000 1000 1110000 two The result is 1.110two  2 3 • Step3: The product is normalized and there is no overflow or underflow, since: 127  3  126 • Step4: Rounding the product makes no change • Step5: Since the signs of the original operands differ, make the sign of the product negative:  1.110two  2 3 2004 Morgan Kaufmann Publishers 64 FIGURE 3.18 Floating-point multiplication. 2004 Morgan Kaufmann Publishers 65 FIGURE 3.19 MIPS floating-point architecture revealed thus far. MIPS floating-point operands Name Example Comments 32 floatingpoint registers $f0, $f1, $f2, …, $f31 MIPS floating-point registers are used in pairs for double precision numbers. 230memory Memory[0], Memory[4], …, Memory[4294967292] Accessed only by data transfer instructions in MIPS. MIPS uses byte addresses, so sequential word addresses differ by 4. Memory holds data structures, arrays, and spilled registers, such as those saved on procedure calls. words 2004 Morgan Kaufmann Publishers 66 MIPS assembly language Category Arithmetic Data transfer Conditional branch Instruction Example Meaning Comments FP add single add.s $f2, $f4, $f6 $f2 = $f4 + $f6 FP add (single precision) FP subtract single sub.s $f2, $f4, $f6 $f2 = $f4 - $f6 FP sub (single precision) FP multiply single mul.s $f2, $f4, $f6 $f2 = $f4 x $f6 FP multiply (single precision) FP divide single div.s $f2, $f4, $f6 $f2 = $f4 / $f6 FP divide (single precision) FP add double add.d $f2, $f4, $f6 $f2 = $f4 + $f6 FP add (double precision) FP subtract double sub.d $f2, $f4, $f6 $f2 = $f4 - $f6 FP sub (double precision) FP multiply double mul.d $f2, $f4, $f6 $f2 = $f4 x $f6 FP multiply (double precision) FP divide double div.d $f2, $f4, $f6 $f2 = $f4 / $f6 FP divide (double precision) load word corp. 1 lwc1 $f1, 100($s2) $f1 = Memory [$s2 + 100] 32-bit data to FP register store word corp. 1 swc1 $f1, 100 ($s2) Memory [$s2 + 100] = $f1 32-bit data to memory branch on FP true bclt 25 if (cond == 1) go to PC+4+100 PC-relative branch if FP cond. branch on FP false bclf 25 if (cond == 0) go to PC+4+100 PC-relative branch if not cond. FP compare single (eq, ne, lt, le, gt, ge) c. lt. s $f2, $f4 if ($f2 < $f4) cond =1; else cond = 0 FP compare less than single precision FP compare double (eq, ne, lt, le, gt, ge) c. lt. d $f2, $f4 if ($f2 < $f4) cond =1; else cond = 0 FP compare less than double precision 2004 Morgan Kaufmann Publishers 67 MIPS floating-point machine language Name Format Example Comments add.s R 17 16 6 4 2 0 add.s $f2, $f4, $f6 sub.s R 17 16 6 4 2 1 sub.s $f2, $f4, $f6 mul.s R 17 16 6 4 2 2 mul.s $f2, $f4, $f6 div.s R 17 16 6 4 2 3 div.s add.d R 17 17 6 4 2 0 add.d $f2, $f4, $f6 sub.d R 17 17 6 4 2 1 sub.d $f2, $f4, $f6 mul.d R 17 17 6 4 2 2 mul.d $f2, $f4, $f6 div.d R 17 17 6 4 2 3 div.d $f2, $f4, $f6 lwc1 I 49 20 2 100 lwc1 $f2, $f4, $f6 swc1 I 57 20 2 100 sec1 $f2, $f4, $f6 bc1t I 17 8 1 25 bc1t 25 bc1f I 17 8 0 25 bc1f 25 c. lt. s R 17 16 4 2 0 60 c. lt. s $f2, $f4 c. lt. d R 17 17 4 2 0 60 c. lf. d $f2, $f4 6 bits 5 bits 5 bits 5 bits 5 bits 6 bits Field size $f2, $f4, $f6 ALL MIPS instructions 32 bits 2004 Morgan Kaufmann Publishers 68 FIGURE 3.20 MIPS floating-point instruction encoding. 2004 Morgan Kaufmann Publishers 69 Floating Point Complexities • Operations are somewhat more complicated (see text) • In addition to overflow we can have “underflow” • Accuracy can be a big problem – IEEE 754 keeps two extra bits, guard and round – four rounding modes – positive divided by zero yields “infinity” – zero divide by zero yields “not a number” – other complexities • • Implementing the standard can be tricky Not using the standard can be even worse – see text for description of 80x86 and Pentium bug! 2004 Morgan Kaufmann Publishers 70 3.7 Real Stuff: Floating Point in the IA-32 2004 Morgan Kaufmann Publishers 71 FIGURE 3.21 The floating-point instructions of the IA-32. 2004 Morgan Kaufmann Publishers 72 FIGURE 3.22 The variations of operands for floating-point add in the IA-32. 2004 Morgan Kaufmann Publishers 73 3.8 Fallacies and Pitfalls 2004 Morgan Kaufmann Publishers 74 • Fallacy: Floating-Point addition is associative; that is, x + ( y + z) = ( x + y ) + z • Pitfall: The MIPS instruction add immediate unsigned addiu sign-extends its 16-bit immediate field. • Fallacy: Only theoretical mathematicians care about floating-point accuracy. 2004 Morgan Kaufmann Publishers 75 FIGURE 3.23 A sampling of newspaper and magazine articles from November 1994, including the New York Times, San Jose Mercury News, San Francisco Chronicle, and Infoworld. 2004 Morgan Kaufmann Publishers 76 3.9 Concluding Remarks 2004 Morgan Kaufmann Publishers 77 FIGURE 3.24 The MIPS instruction set covered so far. 2004 Morgan Kaufmann Publishers 78 FIGURE 3.25 Remaining MIPS-32 and “Pseudo MIPS” instruction sets. 2004 Morgan Kaufmann Publishers 79 FIGURE 3.26 The frequency of the MIPS instructions for SPEC2000 integer and floating point. 2004 Morgan Kaufmann Publishers 80 Chapter Three Summary • Computer arithmetic is constrained by limited precision • Bit patterns have no inherent meaning but standards do exist – two’s complement – IEEE 754 floating point • Computer instructions determine “meaning” of the bit patterns • Performance and accuracy are important so there are many complexities in real machines • Algorithm choice is important and may lead to hardware optimizations for both space and time (e.g., multiplication) • You may want to look back (Section 3.10 is great reading!) 2004 Morgan Kaufmann Publishers 81

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Download Lecture 7