Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Machine Instructions: • Language of the Machine • Lowest level of programming, control directly the hardware • Assembly instructions are symbolic versions of machine instructions • More primitive than higher level languages • Very restrictive • Programs are stored in the memory, one instruction is fetched and executed at a time • We’ll be working with the MIPS instruction set architecture 1998 Morgan Kaufmann Publishers 1 MIPS instruction set: • Load from memory Store in memory • Logic operations – and, or, negation, shift, ... • Arithmetic operations – addition, subtraction, ... • Branch 1998 Morgan Kaufmann Publishers 2 Instruction types: • 1 operand Jump #address Jump $register number • 2 operands Multiply $reg1, $reg2 • 3 operands Add $reg1, $reg2, $reg3 1998 Morgan Kaufmann Publishers 3 MIPS arithmetic • Instructions have 3 operands • Operand order is fixed (destination first) Example: C code: A=B+C MIPS code: add $s0, $s1, $s2 $s0, etc. are registers (associated with variables by compiler) 1998 Morgan Kaufmann Publishers 4 MIPS arithmetic • Design Principle 1: simplicity favours regularity. • Of course this complicates some things... C code: A = B + C + D; E = F - A; MIPS code: add $t0, $s1, $s2 add $s0, $t0, $s3 sub $s4, $s5, $s0 • Operands must be registers, 32 registers provided • Design Principle 2: smaller is faster. 1998 Morgan Kaufmann Publishers 5 Registers vs. Memory • Arithmetic instructions operands are registers. • Compiler associates variables with registers. • What about programs with lots of variables? Memory! Control Input Memory Datapath Processor Output I/O 1998 Morgan Kaufmann Publishers 6 Memory Organization • Viewed as a large, single-dimension array, with an address. • A memory address is an index into the array. • "Byte addressing" means that the index points to a byte of memory. 0 1 2 3 4 5 6 ... 8 bits of data 8 bits of data 8 bits of data 8 bits of data 8 bits of data 8 bits of data 8 bits of data 1998 Morgan Kaufmann Publishers 7 Memory Organization • Bytes are nice, but most data items use larger "words”. • For MIPS, a word is 32 bits or 4 bytes. 0 4 8 12 ... 32 bits of data 32 bits of data 32 bits of data Registers hold 32 bits of data. 32 bits of data • 232 bytes with byte addresses from 0 to 232-1 • 230 words with byte addresses 0, 4, 8, ... 232-4 • Words are aligned, i.e., the 2 least significant bits of a word address are equal to 0. – Not in all architectures! 1998 Morgan Kaufmann Publishers 8 Load and store instructions • Example: C code: A[8] = h + A[8]; MIPS code: lw $t0, 32($s3) add $t0, $s2, $t0 sw $t0, 32($s3) • $s3 contains the base of the array. $s2 contains h. • Word offset 8 equals byte offset 32. • Store word has destination last. • Remember arithmetic operands are registers, not memory! 1998 Morgan Kaufmann Publishers 9 So far we’ve learned: • MIPS — loading and storing words but addressing bytes — arithmetic on registers only • Instruction add $s1, $s2, $s3 sub $s1, $s2, $s3 lw $s1, 100($s2) sw $s1, 100($s2) Meaning $s1 = $s2 + $s3 $s1 = $s2 – $s3 $s1 = Memory[$s2+100] Memory[$s2+100] = $s1 1998 Morgan Kaufmann Publishers 10 Machine Language • Instructions, like registers and words of data, are also 32 bits long. • Example: add $t0, $s1, $s2 R-type instruction Format: 000000 10001 op op rs rt rd shamt funct rs 10010 rt 01000 rd 00000 100000 shamt funct opcode, basic operation 1st source reg. 2nd source reg. destination reg shift amount (in shift instructions) function, selects the specific variant of the operation 1998 Morgan Kaufmann Publishers 11 Machine Language • Introduce a new type of instruction format – I-type for data transfer instructions Example: lw $t0, 32($s2) 35 18 9 op rs rt 32 16 bit number rt = destination register address range = 215 B = 213 words new instruction format but fields 1…3 are the same • Design principle 3: Good design demands good compromises 1998 Morgan Kaufmann Publishers 12 Stored Program Concept • Instructions are groups of bits • Programs are stored in memory — to be read or written just like data Processor Memory memory for data, programs, compilers, editors, etc. • Fetch & Execute Cycle – Instructions are fetched and put into a special register – Bits in the register "control" the subsequent actions – Fetch the “next” instruction and continue 1998 Morgan Kaufmann Publishers 13 Control • Decision making instructions – alter the control flow, – i.e., change the "next" instruction to be executed • MIPS conditional branch instructions: bne $t0, $t1, Label beq $t0, $t1, Label # branch if not equal # branch if equal • Example (if): if (i==j) h = i + j; bne $s0, $s1, Label add $s3, $s0, $s1 Label: .... 1998 Morgan Kaufmann Publishers 14 Control • MIPS unconditional branch instructions: j label • Example (if - then - else): if (i!=j) h=i+j; else h=i-j; beq $s4, $s5, Label1 add $s3, $s4, $s5 j Label2 Label1: sub $s3, $s4, $s5 Label2: ... 1998 Morgan Kaufmann Publishers 15 Control • Example (loop): Loop: ---i=i+j; if(i!=h) go to Loop --- • Loop: --add $s1, $s1, $s2 #i=i+j bne $s1, $s3, Loop --- 1998 Morgan Kaufmann Publishers 16 So far: • • • Instruction Meaning add $s1,$s2,$s3 sub $s1,$s2,$s3 lw $s1,100($s2) sw $s1,100($s2) bne $s4,$s5,L beq $s4,$s5,L j Label $s1 = $s2 + $s3 $s1 = $s2 – $s3 $s1 = Memory[$s2+100] Memory[$s2+100] = $s1 Next instr. is at Label if $s4 $s5 Next instr. is at Label if $s4 = $s5 Next instr. is at Label Formats: R op rs rt rd I op rs rt 16 bit address J op shamt funct 26 bit address the 16 b and 26 b addresses are word addresses 1998 Morgan Kaufmann Publishers 17 Control Flow • We have: beq, bne, what about Branch-if-less-than? • New instruction: set on less than if $s1 < $s2 then $t0 = 1 slt $t0, $s1, $s2 else $t0 = 0 • slt and bne can be used to implement branch on less than slt $t0, $s0, $s1 bne $t0, $zero, Less • Note that the assembler needs a register to do this, there are register conventions for the MIPS assembly language • we can now build general control structures 1998 Morgan Kaufmann Publishers 18 MIPS Register Convention Name Register number $zero 0 $v0-$v1 2-3 $a0-$a3 4-7 $t0-$t7 8-15 $s0-$s7 16-23 $t8-$t9 24-25 $gp 28 $sp 29 $fp 30 $ra 31 • • • • Usage the constant value 0 values for results and expression evaluation arguments temporaries saved more temporaries global pointer stack pointer frame pointer return address $at, 1 reserved for assembler $k0, $k1, 26-27 reserved for operating system $t0…$t7, $t8, $t9 subroutine does not save $s0…$s7 subroutine saves if uses 1998 Morgan Kaufmann Publishers 19 Procedure calls • Procedures and subroutines allow reuse and structuring of code • Steps – Place parameters in a place where the procedure can access them – Transfer control to the procedure – Acquire the storage needed for the procedure – Perform the desired task – Place the results in a place where the calling program can access them – Return control to the point of origin 1998 Morgan Kaufmann Publishers 20 Register assignments for procedure calls • $a0...$a3 • $v0...$v1 • $ra four argument registers for passing parameters two return value registers return address register • use of argument and return value register: compiler • handling of control passing mechanism: machine • jump and link instruction: jal ProcAddress – saves return address (PC+4) in $ra (Program Counter holds the address of the current instruction) – loads ProcAddress in PC • return jump: jr $ra – loads return address in PC 1998 Morgan Kaufmann Publishers 21 Stack • Used if four argument registers and two return value registers are not enough or if nested subroutines (a subroutine calls another one) are used • Can also contain temporary data • The stack is a last-in-first-out structure in the memory • Stack pointer ($sp) points at the top of the stack • Push and pop instructions • MIPS stack grows from higher addresses to lower addresses 1998 Morgan Kaufmann Publishers 22 Stack and Stack Pointer elements in the stack bottom elements in the stack SP top in out stack grows 1998 Morgan Kaufmann Publishers 23 Constants • Small constants are used quite frequently e.g., A = A + 5; B = B - 1; • Solution 1: put constants in memory and load them To add a constant to a register: lw $t0, AddrConstant($zero) add $sp,$sp,$t0 • Solution 2: to avoid extra instructions keep the constant inside the instruction itself addi $29, $29, 4 # i means immediate slti $8, $18, 10 andi $29, $29, 6 • Design principle 4: Make the common case fast. 1998 Morgan Kaufmann Publishers 24 How about larger constants? • We'd like to be able to load a 32 bit constant into a register • Must use two instructions, new "load upper immediate" instruction lui $t0, 1010101010101010 1010101010101010 filled with zeros 0000000000000000 • Then must get the lower order bits right, i.e., ori $t0, $t0, 1010101010101010 ori 1010101010101010 0000000000000000 0000000000000000 1010101010101010 1010101010101010 1010101010101010 1998 Morgan Kaufmann Publishers 25 Overview of MIPS • simple instructions all 32 bits wide • very structured, no unnecessary baggage • only three instruction formats R op rs rt rd I op rs rt 16 bit address J op shamt funct 26 bit address • rely on compiler to achieve performance — what are the compiler's goals? • help compiler where we can 1998 Morgan Kaufmann Publishers 26 Addresses in Branches and Jumps • Instructions: bne $t4,$t5,Label Next instruction is at Label if $t4 $t5 beq $t4,$t5,Label Next instruction is at Label if $t4 = $t5 j Label Next instruction is at Label • Formats: I op J op rs rt 16 bit address 26 bit address • Addresses are not 32 bits 1998 Morgan Kaufmann Publishers 27 Addresses in Branches • Instructions: bne $t4,$t5,Label beq $t4,$t5,Label Next instruction is at Label if $t4$t5 Next instruction is at Label if $t4=$t5 • Format: I op rs rt 16 bit address • We need 32 bit addresses; use PC-relative addressing – add the 16-bit address (2’s complement number) to the PC; – most branches are local, so 16-bit offset or 215 word ( 128 kB) address range is usually enough 1998 Morgan Kaufmann Publishers 28 Addresses in Jumps • Instruction: j Label Next instruction is at Label • Format: J op 26 bit address • To get a 32 bit address the upper bits of the PC are concatenated with the 26-bit address • 226 word (256 MB) address range • if range is not enough, use the jr instruction (not discussed in detail) jr Register 1998 Morgan Kaufmann Publishers 29 MIPS addressing mode summary • • • • • Register addressing – operand in a register Base or displacement addressing – operand in the memory – address is the sum of a register and a constant in the instruction Immediate addressing – operand is a constant within the instruction PC-relative addressing – address is the sum of the PC and a constant in the instruction – used e.g. in branch instructions Pseudodirect addressing – jump address is the 26 bits of the instruction concatenated with the upper bits of the PC 1998 Morgan Kaufmann Publishers 30 MIPS addressing mode summary 1. Immediate addressing op rs rt Immediate 2. Register addressing op rs rt rd ... funct Registers Register 3. Base addressing op rs rt Memory Address + Register Byte Halfword Word 4. PC-relative addressing op rs rt Memory Address PC + Word 5. Pseudodirect addressing op Address PC Memory Word 1998 Morgan Kaufmann Publishers 31 Additional addressing modes • • • • Direct addressing – operand in the memory – address in the instruction Register indirect addressing – operand in the memory – address in a register Implied addressing – operand location specified by the operation code Used in other computers 1998 Morgan Kaufmann Publishers 32 To summarize: MIPS operands Name 32 registers Example Comments $s0-$s7, $t0-$t9, $zero, Fast locations for data. In MIPS, data must be in registers to perform $a0-$a3, $v0-$v1, $gp, arithmetic. MIPS register $zero always equals 0. Register $at is $fp, $sp, $ra, $at reserved for the assembler to handle large constants. Memory[0], 2 30 Accessed only by data transfer instructions. MIPS uses byte addresses, so memory Memory[4], ..., words and spilled registers, such as those saved on procedure calls. add MIPS assembly language Example Meaning add $s1, $s2, $s3 $s1 = $s2 + $s3 Three operands; data in registers subtract sub $s1, $s2, $s3 $s1 = $s2 - $s3 Three operands; data in registers $s1 = $s2 + 100 $s1 = Memory[$s2 + 100] Memory[$s2 + 100] = $s1 $s1 = Memory[$s2 + 100] Memory[$s2 + 100] = $s1 Used to add constants Category Arithmetic sequential words differ by 4. Memory holds data structures, such as arrays, Memory[4294967292] Instruction addi $s1, $s2, 100 lw $s1, 100($s2) sw $s1, 100($s2) store word lb $s1, 100($s2) load byte sb $s1, 100($s2) store byte load upper immediate lui $s1, 100 add immediate load word Data transfer Conditional branch Unconditional jump $s1 = 100 * 2 16 Comments Word from memory to register Word from register to memory Byte from memory to register Byte from register to memory Loads constant in upper 16 bits branch on equal beq $s1, $s2, 25 if ($s1 == $s2) go to PC + 4 + 100 Equal test; PC-relative branch branch on not equal bne $s1, $s2, 25 if ($s1 != $s2) go to PC + 4 + 100 Not equal test; PC-relative set on less than slt $s1, $s2, $s3 if ($s2 < $s3) $s1 = 1; else $s1 = 0 Compare less than; for beq, bne set less than immediate slti jump j jr jal jump register jump and link $s1, $s2, 100 if ($s2 < 100) $s1 = 1; Compare less than constant else $s1 = 0 2500 $ra 2500 Jump to target address go to 10000 For switch, procedure return go to $ra $ra = PC + 4; go to 10000 For procedure call 1998 Morgan Kaufmann Publishers 33 Assembly Language vs. Machine Language • Assembly provides convenient symbolic representation – much easier than writing down numbers – e.g., destination first • Machine language is the underlying reality – e.g., destination is no longer first • Assembly can provide 'pseudoinstructions' – e.g., “move $t0, $t1” exists only in Assembly – would be implemented using “add $t0,$t1,$zero” • When considering performance you should count real instructions 1998 Morgan Kaufmann Publishers 34 Alternative Architectures • Design alternative: – provide more powerful operations than found in MIPS – goal is to reduce number of instructions executed – danger is a slower cycle time and/or a higher CPI • Sometimes referred to as “RISC vs. CISC” – Reduced Instruction Set Computers – Complex Instruction Set Computers – virtually all new instruction sets since 1982 have been RISC 1998 Morgan Kaufmann Publishers 35 Reduced Instruction Set Computers • Common characteristics of all RISCs – Single cycle issue – Small number of fixed length instruction formats – Load/store architecture – Large number of registers • Additional characteristics of most RISCs – Small number of instructions – Small number of addressing modes – Fast control unit 1998 Morgan Kaufmann Publishers 36 An alternative architecture: 80x86 • 1978: The Intel 8086 is announced (16 bit architecture) • 1980: The 8087 floating point coprocessor is added • 1982: The 80286 increases address space to 24 bits, +instructions • 1985: The 80386 extends to 32 bits, new addressing modes • 1989-1995: The 80486, Pentium, Pentium Pro add a few instructions (mostly designed for higher performance) • 1997: MMX is added 1998 Morgan Kaufmann Publishers 37 An alternative architecture: 80x86 • Intel had a 16-bit microprocessor two years before its competitors’ more elegant architectures which led to the selection of the 8086 as the CPU for the IBM PC • “This history illustrates the impact of the “golden handcuffs” of compatibility” “an architecture that is difficult to explain and impossible to love” 1998 Morgan Kaufmann Publishers 38 A dominant architecture: 80x86 • See your textbook for a more detailed description • Complexity: – Instructions from 1 to 17 bytes long – one operand must act as both a source and destination – one operand can come from memory – complex addressing modes e.g., “base or scaled index with 8 or 32 bit displacement” • Saving grace: – the most frequently used architectural components are not too difficult to implement – compilers avoid the portions of the architecture that are slow 1998 Morgan Kaufmann Publishers 39 Summary • Instruction complexity is only one variable – lower instruction count vs. higher CPI / lower clock rate • Design Principles: – simplicity favours regularity – smaller is faster – good design demands good compromises – make the common case fast • Instruction set architecture – a very important abstraction indeed! 1998 Morgan Kaufmann Publishers 40