Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
COM 249 – Computer Organization and Assembly Language Chapter 2 Instructions: Language of the Computer Part I - MIPS Based on slides from www-inst.eecs.berkeley.edu/~cs152/ CS 152 L01 Introd uction & MIPS Review ( 2) Modified by S. J. Fritz 1/2005 UC Regents Fall 2004 © UCB Introduction • Words of a computer’s language are called its instructions • Its vocabulary is its instruction set. • Goal: – Find a language that makes it easy to build the hardware and the compiler, – while maximizing performance and minimizing cost Review: Instruction Set Design software instruction set hardware Which is easier to change? Stored Program Computer • Basic Principles – Use of instructions that are indistinguishable from numbers – Use of alterable memory for programs • Demands balance among number of instructions, the number of clock cycles needed by an instruction and the speed of the clock. Instruction Set Architecture: What Must be Specified? Instruction Fetch Instruction Decode Operand Fetch Execute Result Store Next Instruction ° Instruction Format or Encoding – how is it decoded? ° Location of operands and result – where other than memory? – how many explicit operands? – how are memory operands located? – which can or cannot be in memory? ° Data type and Size ° Operations – what are supported ° Successor instruction – jumps, conditions, branches - fetch-decode-execute is implicit! Overview of Design Principles 1. Simplicity favors regularity – keep all instructions a single size – require three register operands for arithmetic – keep register fields in same place in each instruction 2. Smaller is faster – the reason that MIPS has 32 registers rather than many more 3.Make the common case fast – PC-relative addressing for conditional branch – immediate addressing for constant operands 4.Good design demands good compromises – compromise between larger addresses and keeping instructions same length MIPS I Instruction set MIPS • Typical of instruction sets since 1980’s • Almost 100 million MIPS processors manufactured in 2002 • Found in products from Cisco, NEC, Nintendo, Silicon Graphics, Sony, Texas Instruments, Toshiba, and others Historically General Purpose Registers Dominate ° 1975-1995 all machines use general purpose registers ° Advantages of registers • registers are faster than memory • registers are easier for a compiler to use - e.g., (A*B) – (C*D) – (E*F) can do multiplies in any order vs. stack • registers can hold variables - memory traffic is reduced, so program is sped up (since registers are faster than memory) - code density improves (since register named with fewer bits than memory location) MIPS Architecture • MIPS – semiconductor company that built one of the first commercial RISC architectures • We will study the MIPS architecture in some detail in this class Why MIPS instead of Intel 80x86? – MIPS is simple, elegant. Don’t want to get bogged down in gritty details. – MIPS widely used in embedded apps, x86 little used in embedded, and more embedded computers than PCs Instruction Set Architecture • Early trend was to add more and more instructions to new CPUs to do elaborate operations – VAX architecture had an instruction to multiply polynomials! • RISC philosophy (Cocke IBM, Patterson, Hennessy,1980s)–Reduced Instruction Set Computing – Keep the instruction set small and simple, makes it easier to build fast hardware. – Let software do complicated operations by composing simpler ones. Operations of the Computer Hardware • The MIPS assembly language instruction add a, b, c means a = b+c • This sequence adds four variables (a=b+c+d+e) add a, b, c add a, a, d add a, a, e #the sum of b and c is placed in a #the sum of b,c, and d is now in a #the sum of b,c,d and e is now in a • Notice that it takes 3 instructions to add four variables MIPS Instructions • Design Principle 1: Simplicity favors regularity • The MIPS assembly language instruction • • • • add a, b, c means a = b+c Each line represents one instruction Each instruction has exactly 3 operands for simplicity There is one operation per MIPS instruction Instructions are related to operations (=, +, -, *, /) in C or Java Operands of the Computer Hardware • Operands of arithmetic instructions must be from a limited number of special memory locations called registers • Size of a MIPS register is 32 bits - called a word • Major difference between variables in programming language (unlimited) and registers is the limited number of registerstypically 32 in MIPS C, Java Variables vs. Registers • In C (and most High Level Languages) variables are declared first and given a type – Example: int fahr, celsius; char a, b, c, d, e; • Each variable can ONLY represent a value of the type it was declared as (cannot mix and match int and char variables). • In Assembly Language, the registers have no type; operation determines how register contents are treated MIPS I Registers • Programmable storage – 232 x bytes of memory – 31 x 32-bit General Purpose Registers GPRs (R0 = 0) r0 r1 ° ° ° r31 PC lo hi 0 Operands of the Computer Hardware • Design Principle 2: Smaller is faster – Very large number of registers may increase clock cycle time because it takes electronic signals longer to travel farther. – Using more than 32 registers would require a different instruction format. – MIPS register convention is to use two character names following a dollar sign: $s0, $s1… for variables $t0, $t1… for temporary locations $a0, $a1…for arguments MIPS Addition and Subtraction • Syntax of Instructions: 1 2,3,4 where: 1) operation by name 2) operand getting result (“destination”) 3) 1st operand for operation (“source1”) 4) 2nd operand for operation (“source2”) • Syntax is rigid: – 1 operator, 3 operands – Why? Keep Hardware simple via regularity MIPS Addition and Subtraction of Integers • Addition in Assembly – Example: add $s0,$s1,$s2 (in MIPS) Equivalent to: a = b + c (in C/Java) where MIPS registers $s0,$s1,$s2 are associated with C variables a, b, c • Subtraction in Assembly – Example: sub $s3,$s4,$s5 (in MIPS) Equivalent to: d = e - f (in C) where MIPS registers $s3,$s4,$s5 are associated with C variables d, e, f Addition and Subtraction • How would MIPS do this C/Java statement? a = b + c + d - e; • Break into multiple instructions add $t0, $s1, $s2 # temp = b + c add $t0, $t0, $s3 # temp = temp + d sub $s0, $t0, $s4 # a = temp - e • Notice: A single line of C or Java may break up into several lines of MIPS. Also everything after the hash mark- # - on each line is ignored (comments) Addition and Subtraction • How do we do this? f = (g + h) - (i + j); • Use intermediate temporary register add $t0,$s1,$s2 add $t1,$s3,$s4 sub $s0,$t0,$t1 # temp0 = g + h # temp1 = i + j # f=(g+h)-(i+j) Constants or Immediate Operands • Design Principle 3: Make the common case FAST • Constants occur frequently and by including constants in arithmetic instructions, they are faster than if the constants were loaded from memory • To add 4 to register 3 use add immediate (addi): addi $s3,$s3,4 # $s3 = $s3+4 Immediates • Immediates are numerical constants. • They appear often in code, so there are special instructions for them. • Add Immediate: addi $s0,$s1,10 (in MIPS) f = g + 10 (in C) where MIPS registers $s0,$s1 are associated with C or Java variables f, g • Syntax similar to add instruction, except that the last argument is a number instead of a register. Immediates and Subtraction • There is no Subtract Immediate in MIPS: Why? • Limit types of operations that can be done to absolute minimum – negative constants are less frequent – if an operation can be decomposed into a simpler operation, don’t include it – addi …, -X = subi …, X => no need for subi • addi $s0,$s1,-10 (in MIPS) f = g - 10 (in C) where MIPS registers $s0,$s1 are associated with C or Java variables f, g Register Zero • One particular immediate, the number zero (0), appears very often in code. • So we define register zero ($0 or $zero) to always have the value 0; for example: add $s0,$s1,$zero (in MIPS) f = g (in C) where MIPS registers $s0,$s1 are associated with C variables f, g • Defined in hardware, so an instruction add $zero,$zero,$s0 will not do anything! Summarizing... • In MIPS Assembly Language: – Registers replace C variables – One Instruction (simple operation) per line – Simpler is Better – Smaller is Faster • New Instructions: add, addi, sub • New Registers: C or Java Variables: $s0 - $s7 Temporary Variables: $t0 - $t9 Zero: $zero Memory Operands • Programming Languages have both simple variables and complex data structures. • How can we handle large data structures with just a few registers? – Data structures are kept in memory. • MIPS includes instructions to transfer data between memory and registers. – Data transfer instructions ( load, store) Memory Operands • Data transfer Instruction – load copies data from memory to register – lw - load word • Format opcode register , constant (register) memory address • Syntax lw $t0, 8 ($s3) offset base address Memory Addressing ° Since 1980 almost every machine uses addresses to level of 8-bits (byte) ° 2 questions for design of Instruction Set Architecture: • Since we could read a 32-bit word as four loads of bytes from sequential byte addresses or as one load word from a single byte address, • How do byte addresses map onto words? • Can a word be placed on any byte boundary? Addressing Objects: Alignment • Since 8-bit bites are useful, most architectures address individual bites. • Address of a word matches the address of one of the four bites in the word • Addresses of sequential words differ by 4 bytes • MIPS words must start at addresses that are multiples of 4 - called alignment restriction Addressing Objects: Endianess • Computers are grouped into those that use: – the address of the leftmost or “big end byte” as the word address – and those that use the “little end” or rightmost byte • MIPS is in the BIG Endian group Addressing Objects: Endianess and Alignment • Big Endian: address of most significant IBM 360/370, Motorola 68k, MIPS, Sparc, HP PA • Little Endian: address of least significant Intel 80x86, DEC Vax, DEC Alpha (Windows NT) little endian byte 0 3 2 1 0 msb lsb 0 0 big endian byte 0 1 2 3 Alignment: require that objects fall on address that is multiple of their size. Aligned Not Aligned 1 2 3 Addressing arrays Byte addressing also affects the array index • The offset to be added to the base register ($s3) must be 4 x 8, or 32 12 8 4 • lw $t0, 32($s3) 0 $s3 Addresses DATA Load and Store • • • • The complementary instruction to load is store Store copies data from a register into memory Format similar to load MIPS name is sw for store word – sw $t0, 48($s3) # if $s3 contains the base address # of A[0] in memory, then this # stores result into A[12] Memory and Registers • Process of putting less commonly used variables in memory is called spilling the registers. • Arithmetic instructions reads two registers, operates on them and writes the result. • Data transfer instructions only reads or writes one operand, without operating on it. • MIPS register take less time and have greater throughput than memory, thus a compiler must use registers efficiently! Representing Instructions Inside the Computer • Instructions are stored as a series of bits, or binary numbers: 000000 100001 10010 opcode rs rt 01000 rd 000000 100000 shamt funct add 17= $s1 18=$s2 8=$t0 no shift Instruction: add $t0, $s1, $s2 add=32 • There must be a way to represent register names as numbers – In MIPS there is a mapping, for example • $t0… $t7 map to registers 8 …15 • $s0…$s7 map to registers 16…23 MIPS Assembler Register Convention Name Number Usage Preserved across a call? the value 0 n/a return values no arguments no temporaries no saved yes temporaries no stack pointer yes return address yes $zero $v0-$v1 $a0-$a3 $t0-$t7 $s0-$s7 $t18-$t19 $sp $ra 0 2-3 4-7 8-15 16-23 24-25 29 31 • “caller saved” • “callee saved” • On Green Card in Column #2 at bottom Why Multiple Instruction Formats? • Design Principle 4: Good design demands good compromises – there is a need to keep instructions the same length and desire for a single format • There is a problem using previous (R-format) when an instruction needs longer fields – for example lw must specify two registers and a constant, but the constant would have only 5 bits available, so the largest value would be 25 = 32 • Solution: allow I and J formats for different instructions - but keep all the same length = 32 bits Instruction Formats • I-format: used for instructions with immediates, lw and sw (since the offset counts as an immediate), and the branches (beq and bne), – (but not the shift instructions; later) • J-format: used for j and jal(jump and link) • R-format: used for all other instructions • It will soon become clear why the instructions have been partitioned in this way. Instruction Format Names and Field Descriptions Instruction Fields Name 6 bits (31-26) 5 bits (25-21) 5 bits (20-16) (15-11) R-format (6 fields) op rs rt rd I-format (4 fields) op rs rt J-format (2 fields) op Comments 5 bits (10-6) 6 bits (5-0) All MIPS instructions 32 bits sham t funct Arithmetic instruction format Address/immediate Data Transfer, branch, immediate instruction format 5 bits Target address Jump instruction format Instruction field notes: The op and funct fields form the op-code. The rs field gives a source register and rt is also normally a source register. rd is the destination register, and shamt supplies the shift amount for logical shift operations. R-Format Instructions • Define “fields” of the following number of bits each: 6 + 5 + 5 + 5 + 5 + 6 = 32 6 5 5 5 5 6 • For simplicity, each field has a name: opcode rs rt rd shamt funct R-Format Instructions • Meaning of fields: – rs (Source Register): generally used to specify register containing first operand – rt (Target Register): generally used to specify register containing second operand (note that name is misleading) – rd (Destination Register): generally used to specify register which will receive result of computation – shamt (Shift amount) – funct ( Function) - selects specific variant of the opcode operation - sometimes called function code R-Format Example • MIPS Instruction: add $8,$9,$10 Decimal number per field representation: 0 9 10 8 0 32 Binary number per field representation: 000000 01001 01010 01000 00000 100000 hex representation: 012A 4020hex decimal representation: 19,546,144ten hex On Green Card: Format in column 1, opcodes in column 3 J-Format Instructions • Define “fields” of the following number of bits each: 6 bits 26 bits • As usual, each field has a name: opcode target address • Key Concepts – Keep opcode field identical to R-format and I-format for consistency. – Combine all other fields to make room for large target address. Arrays and Data Structures • C and Java variables map onto registers; what about large data structures like arrays? • 1 of the 5 components of a computer - the memorycontains such data structures • But MIPS arithmetic instructions only operate on registers, never directly on memory. • Data transfer instructions transfer data between registers and memory: – Memory to register – Register to memory Overview of MIPS • Simple instructions all 32 bits wide • Very structured, no unnecessary baggage • Only three instruction formats R op rs rt rd I op rs rt 16 bit address J op shamt 26 bit address rely on compiler to achieve performance — what are the compiler's goals? • help compiler where we can funct Anatomy: 5 components of any Computer Registers are in the datapath of the processor; if operands are in memory, we must transfer them to the processor to operate on them, and then transfer back to memory when done. Personal Computer Computer Processor Control (“brain”) Datapath Registers Memory Devices Input Store (to) Load (from) Output These are “data transfer” instructions… Data Transfer: Memory to Registers • To transfer a word of data, we need to specify two things: – Register: specify this by # ($0 - $31) or symbolic name ($s0,…, $t0, …) – Memory address: more difficult • Think of memory as a single onedimensional array, so we can address it simply by supplying a pointer to a memory address. • Other times, we want to be able to offset from this pointer. • Remember: “Load FROM memory” Data Transfer: Memory to Registers • To specify a memory address to copy from, specify two things: – A register containing a pointer to memory – A numerical offset (in bytes) • The desired memory address is the sum of these two values. • Example: 8($t0) – specifies the memory address pointed to by the value in $t0, plus 8 bytes Data Transfer: Memory to Register • Load Instruction Syntax: 1 2, 3(4) – where 1) operation name 2) register that will receive value 3) numerical offset in bytes 4) register containing pointer to memory • MIPS Instruction Name: – lw (meaning Load Word, so 32 bits or one word are loaded at a time) Data Transfer: Memory to Register Data flow Example:lw $t0,12($s0) This instruction will take the pointer in $s0, add 12 bytes to it, and then load the value from the memory pointed to by this calculated sum into register $t0 • Notes: – $s0 is called the base register – 12 is called the offset – offset is generally used in accessing elements of array or structure: base register points to beginning of array or structure Data Transfer: Register to Memory • Also want to store from register into memory – Store instruction syntax is identical to Load’s • MIPS Instruction Name: sw (meaning Store Word, so 32 bits or one word are loaded at a time) Data flow • Example:sw $t0,12($s0) This instruction will take the pointer in $s0, add 12 bytes to it, and then store the value from register $t0 into that memory address • Remember: “ Store INTO memory” Pointers v. Values • Key Concept: • A register can hold any 32-bit value. That value can be a (signed) int, an unsigned int, a pointer (memory address), and so on • If you write add $t2,$t1,$t0 then $t0 and $t1 must contain values • If you write lw $t2,0($t0) then $t0 must contain a pointer to memory • Don’t mix these up! Addressing: Byte vs. Word • Every word in memory has an address, similar to an index in an array • Early computers numbered words like C numbers elements of an array: – Memory[0], Memory[1], Memory[2], … Called the “address” of a word • Computers needed to access 8-bit bytes as well as words (4 bytes/word) • Today machines address memory as bytes, (i.e.,“Byte Addressed”) hence 32-bit (4 byte) word addresses differ by 4 – Memory[0], Memory[4], Memory[8], … Compilation with Memory • What offset in lw to select A[5] in C/Java? • 4x5=20 to select A[5]: byte v. word • Compile by hand using registers: g = h + A[5]; – g: $s1, h: $s2, $s3:base address of A • 1st transfer from memory to register: lw $t0,20($s3) # $t0 gets A[5] – Add 20 to $s3 to select A[5], put into $t0 • Next add it to h and place in g add $s1,$s2,$t0 # $s1 = h+A[5] Translating Assembly Language into Machine Language • Suppose $t1 has base of array A and $s2 corresponds to h in the assignment A[300] = h + A[300] • In MIPS : ( try this ) lw $t0, 1200 ($t1) # temp register $t0 gets A[300] add $t0, $s2, $t0 # temp register $t0 gets h+ A[300] sw $t0, 1200($t1) #stores h = A[300] back into A[300] MIPS Instruction Encoding Instruction Format op rs rt rd shamt funct add R 0 reg reg reg 0 n.a. sub R 0 reg reg reg 0 n.a. addi I 8 reg reg n.a. n.a. constant lw I 35 reg reg n.a. n.a. address sw I 43 reg reg n.a. n.a. address Translating Assembly Language into Machine Language lw $t0, 1200 ($t1) # temp register $t0 gets A[300] add $t0, $s2, $t0 # temp register $t0 gets h+ A[300] sw $t0, 1200($t1) #stores h = A[300] back into A[300] op rs rt 35 9 8 0 18 8 43 9 8 rd add/shamt funct 1200 8 0 32 1200 Notes about Memory • Pitfall: Forgetting that sequential word addresses in machines with byte addressing do not differ by 1. – Many an assembly language programmer has toiled over errors made by assuming that the address of the next word can be found by incrementing the address in a register by 1 instead of by the word size in bytes. – So remember that for both lw and sw, the sum of the base address and the offset must be a multiple of 4 (to be word aligned) C/Java Decisions: if Statements • 2 kinds of if statements in C – if (condition) clause – if (condition) clause1 else clause2 • Rearrange 2nd if into following: if (condition) goto L1; clause2; goto L2; L1: clause1; L2: • Not as elegant as if-else, but same meaning MIPS Decision Instructions • Decision instruction in MIPS: – beq register1, register2, L1 – beq is “Branch if (registers are) equal” Same meaning as (using C/Java): if (register1==register2) goto L1 • Complementary MIPS decision instruction – bne register1, register2, L1 – bne is “Branch if (registers are) not equal” Same meaning as (using C): if (register1!=register2) goto L1 • Called conditional branches MIPS Goto Instruction • In addition to conditional branches, MIPS has an unconditional branch: j label • Called a Jump Instruction: jump (or branch) directly to the label without satisfying any condition • Same meaning as (using C/Java): goto label • Technically, it’s the same as: beq $0,$0,label since it always satisfies the condition. Compiling C/Java if into MIPS • Compile by hand if (i == j) f=g+h; else f=g-h; • Use this mapping: f: $s0 g: $s1 h: $s2 i: $s3 j: $s4 (true) i == j f=g+h (false) i == j? i != j f=g-h Exit Compiling C/Java if into MIPS • Compile by hand if (i == j) f=g+h; else f=g-h; (true) i == j f=g+h •Final compiled MIPS code: beq sub j True: add Fin: $s3,$s4,True $s0,$s1,$s2 Fin $s0,$s1,$s2 (false) i == j? i != j f=g-h Exit # # # # branch i==j f=g-h(false) goto Fin f=g+h (true) Note: Compiler automatically creates labels to handle decisions (branches). Generally not found in HLL code. “And in Conclusion…” • Memory is byte-addressable, but lw and sw access one word at a time. • A pointer (used by lw and sw) is just a memory address, so we can add to it or subtract from it (using offset). • A Decision allows us to decide what to execute at run-time rather than compile-time. • C/Java decisions are made using conditional statements within if, while, do while, for. • MIPS Decision making instructions are the conditional branches: beq and bne. • New Instructions: lw, sw, beq, bne, j Green Card: OPCODES, BASE CONVERSION, MIPS (1) MIPS funct opcode (5:0) (31:26) (2) MIPS funct (5:0) ASCII (3) Binary Deci Hexa- ASCII -mal deci- (1) sll add.f 00 0000 0 mal 0 j srl mul.f 00 0010 2 2 STX lui sync floor.w.f 00 1111 15 f SI NUL lbu and cvt.w.f 10 0100 36 24 $ (1) opcode(31:26) == 0 (2) opcode(31:26) == 17 ten (11 hex ); if fmt(25:21)==16 ten (10 hex ) f = s (single); if fmt(25:21)==17 ten (11 hex ) f = d (double) Note: 3-in-1 - Opcodes, base conversion, ASCII! Green Card • green card /n./ [after the "IBM System/360 Reference Data" card] A summary of an assembly language, even if the color is not green. For example, "I'll go get my green card so I can check the addressing mode for that instruction." Image from Dave's Green Card Collection: www.jargon.net http://www.planetmvs.com/greencard/