Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 1 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt Part 7 Instruction.

Slides:

Advertisements

Ähnliche Präsentationen

Cadastre for the 21st Century – The German Way

Advertisements

R. Zankl – Ch. Oelschlegel – M. Schüler – M. Karg – H. Obermayer R. Gottanka – F. Rösch – P. Keidler – A. Spangler th Expert Meeting Business.

Multi electron atoms Atoms with Z>1 contain >1 electron. This changes the atomic structure considerably because in addition to the electron-nucleus interaction,

Informatik 12, TU Dortmund

Fakultät für informatik informatik 12 technische universität dortmund Optimizations Peter Marwedel TU Dortmund Informatik 12 Germany 2009/01/17 Graphics:

Fakultät für informatik informatik 12 technische universität dortmund Mapping of Applications to Platforms Peter Marwedel TU Dortmund, Informatik 12 Germany.

Fakultät für informatik informatik 12 technische universität dortmund Optimizations Peter Marwedel TU Dortmund Informatik 12 Germany 2010/01/13 Graphics:

Fakultät für informatik informatik 12 technische universität dortmund Universität Dortmund Middleware Peter Marwedel TU Dortmund, Informatik 12 Germany.

Peter Marwedel TU Dortmund, Informatik 12

Fakultät für informatik informatik 12 technische universität dortmund Hardware/Software Partitioning Peter Marwedel Informatik 12 TU Dortmund Germany Chapter.

Aufgabenbesprechung Programming Contest. Order 7 Bo Pat Jean Kevin Claude William Marybeth 6 Jim Ben Zoe Joey Frederick Annabelle 0 SET 1 Bo Jean Claude.

NUMEX – Numerical experiments for the GME Fachhochschule Bonn-Rhein-Sieg Wolfgang Joppich PFTOOL - Precipitation forecast toolbox Semi-Lagrangian Mass-Integrating.

Wozu die Autokorrelationsfunktion?

Hier wird Wissen Wirklichkeit Computer Architecture – Part 4 – page 1 of 35 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt Part 4 Fundamentals.

Hier wird Wissen Wirklichkeit Computer Architecture – Part 10 – page 1 of 31 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt Part 10 Thread and.

Hier wird Wissen Wirklichkeit Computer Architecture – Part 5 – page 1 of 25 – Prof. Dr. Uwe Brinkschulte, M.Sc. Benjamin Betting Part 5 Fundamentals in.

Thomas Herrmann Software - Ergonomie bei interaktiven Medien Step 6: Ein/ Ausgabe Instrumente (Device-based controls) Trackball. Joystick.

CCNA Exploration Network Fundamentals

Methods Fuzzy- Logic enables the modeling of rule based knowledge by the use of fuzzy criteria instead of exact measurement values or threshold values.

Institut für Umweltphysik/Fernerkundung Physik/Elektrotechnik Fachbereich 1 SADDU June 2008 S. Noël, K.Bramstedt,

Institut für Umweltphysik/Fernerkundung Physik/Elektrotechnik Fachbereich 1 Pointing Meeting Nov 2006 S. Noël IFE/IUP Elevation and Azimuth Jumps during.

Adjektive Endungen von Frau Templeton.

Die Geschichte von Rudi

Laurie Clarcq The purpose of language, used in communication, is to create a picture in the mind and/or the heart of another.

Institut AIFB, Universität Karlsruhe (TH) Forschungsuniversität gegründet 1825 Towards Automatic Composition of Processes based on Semantic.

Lehrstuhl Technische Informatik - Computer Engineering Brandenburgische Technische Universität Cottbus 1 Hierarchical Test Technology for Systems on a.

Sanjay Patil Standards Architect – SAP AG April 2008

| DC-IAP/SVC3 | © Bosch Rexroth Pneumatics GmbH This document, as well as the data, specifications and other information set forth in.

BAS5SE | Fachhochschule Hagenberg | Daniel Khan | S SPR5 MVC Plugin Development SPR6P.

3rd Review, Vienna, 16th of April 1999 SIT-MOON ESPRIT Project Nr Siemens AG Österreich Robotiker Technische Universität Wien Politecnico di Milano.

1 Ein kurzer Sprung in die tiefe Vergangenheit der Erde.

Deutsch 1 G Stunde. Donnerstag, der 25. Oktober 2012 Deutsch 1, G Stunde Heute ist ein B- Tag Unit: Family & homeFamilie & Zuhause Objectives: Phrases.

INTAKT- Interkulturelle Berufsfelderkundungen als ausbildungsbezogene Lerneinheiten in berufsqualifizierenden Auslandspraktika DE/10/LLP-LdV/TOI/

Algorithm Engineering Parallele Algorithmen Stefan Edelkamp.

Institut für Umweltphysik/Fernerkundung Physik/Elektrotechnik Fachbereich 1 K. Bramstedt, L. Amekudzi, J. Meyer IFE/IUP Tangent heights in occultation.

Verben Wiederholung Deutsch III Notizen.

Fusszeilentext – bitte in (Ansicht – Master – Folienmaster, 1. Folie oben) individuell ändern! Danach wieder zurück in Normalansicht gehen! 1 OTR Shearography.

Staatsballett Berlin Ein Verbesserungskonzept für den Social- Media Auftritt Your picture here.

Impairments in Polarization-Multiplexed DWDM Channels due to Cross- Polarization Modulation Marcus Winter Christian-Alexander Bunge Klaus Petermann Hochfrequenztechnik-Photonik.

Berner Fachhochschule Hochschule für Agrar-, Forst- und Lebensmittelwissenschaften HAFL Recent activities on ammonia emissions: Emission inventory Rindvieh.

4th Symposium on Lidar Atmospheric Applications

Ein Projekt des Technischen Jugendfreizeit- und Bildungsvereins (tjfbv) e.V. kommunizieren.de Blended Learning for people with disabilities.

Cross-Polarization Modulation in DWDM Systems

External Labels – The rules For all external labels the following rules apply (external labels are all labels which are not inside of a shape) - all labels.

Relativpronomen / Relativsätze:

© Boardworks Ltd of 8 Time Manner Place © Boardworks Ltd of 8 This icon indicates that the slide contains activities created in Flash. These.

By: Jade Bowerman. German numbers are quite a bit like our own. You start with one through ten and then you add 20, 30, 40 or 50 to them. For time you.

Alltagsleben Treffpunkt Deutsch Sixth Edition

1 (C)2006, Hermann Knoll, HTW Chur, FHO Quadratische Reste Definitionen: Quadratischer Rest Quadratwurzel Anwendungen.

Adjectiv Endungen Lite: Adjective following articles and pre-ceeding nouns. Colors and Clothes.

Relativpronomen / Relativsätze:

1 Chapter 7: Selected Algorithms 7.1 External Search 7.2 External Sorting 7.3 Text searching.

AVL-Trees (according to Adelson-Velskii & Landis, 1962) In normal search trees, the complexity of find, insert and delete operations in search.

Sentence Structure Subject and verb are always together. Subject and verb are always together. Subject and verb must agree Subject and verb must agree.

Separable Verbs Turn to page R22 in your German One Book R22 is in the back of the book There are examples at the top of the page.

1 Intern | ST-IN/PRM-EU | | © Robert Bosch GmbH Alle Rechte vorbehalten, auch bzgl. jeder Verfügung, Verwertung, Reproduktion, Bearbeitung,

Fakultät für informatik informatik 12 technische universität dortmund Memory architecture description languages - Session 20 - Peter Marwedel TU Dortmund.

1 Stevens Direct Scaling Methods and the Uniqueness Problem: Empirical Evaluation of an Axiom fundamental to Interval Scale Level.

THE PERFECT TENSE IN GERMAN

Technische Universität München 1 CADUI' June FUNDP Namur G B I The FUSE-System: an Integrated User Interface Design Environment Frank Lonczewski.

TUM in CrossGrid Role and Contribution Fakultät für Informatik der Technischen Universität München Informatik X: Rechnertechnik und Rechnerorganisation.

THE CONVERSATIONAL PAST

Institut für Nachrichtentechnik U. Reimers Technische Universität Braunschweig The MultiMedia Home Platform (MHP): Hype or Reality ?

1 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt Modalverben.

Institut für Angewandte Mikroelektronik und Datentechnik Phase 5 Architectural impact on ASIC and FPGA Nils Büscher Selected Topics in VLSI Design (Module.

1 Medienpädagogischer Forschungsverbund Südwest KIM-Studie 2014 Landesanstalt für Kommunikation Baden-Württemberg (LFK) Landeszentrale für Medien und Kommunikation.

מבוא למערכות מחשב ואסמבלי

CSL211 Computer Architecture

FURTHER MASS SPECTROMETRY

Präsentation transkript:

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 1 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt Part 7 Instruction Set Architecture (ISA) Computer Architecture Slide Sets WS 2011/2012 Prof. Dr. Uwe Brinkschulte Prof. Dr. Klaus Waldschmidt

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 2 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt Programming model The Instruction Set Architecture (ISA) is the programming model which is needed for programming a processor. All details concerning the implementation of the processor are out of focus in the ISA. Therefore the ISA can be regarded as an abstract interface between the compiler and the microarchitecture of the processor.

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 3 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt Programming model The following key questions lead us to the specification of this interface: How data is represented? Where data is stored? How data is accessed? How instructions are coded? Which instructions are available to process data?

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 4 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt Programming model Therefore, the ISA defines: machine data types address space organisation register model addressing modes machine instruction set

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 5 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt Programming model Since the programming model abstracts from implementation details it is realized either in hardware (real processors) or in software (virtual processors). For instance, if the instruction set includes an instruction for multiplication, the CPU of the processor needs a digital combinatorial circuit for multiplication. In this sense, a relation between the abstract ISA and the microarchitecture exists.

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 6 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt Machine data types A data type is a tuple of values and operations which can be performed on these values. The operations are implemented by the machine instructions. Machine data types (like data types in high level languages) are classified into structured and unstructured data types. An additional class are the primitive data types.

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 7 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt Primitive machine data types Bit:value set: 0,1 operations: AND, OR, XOR, negation, compare Byte: value set: bit pattern (8 bit) normally smallest addressable unit operations: same as for bit, additionally ADD, SUB, MUL, DIV, SHIFT, ROTATE, … Word: value set: normally a multiple of bytes largest addressable unit (in a single operation) operations: same as for byte (sometimes the following convention is used: Half-Word = 16 Bit Word = 32 Bit Double Word = 64 Bit)

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 8 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt Examples for more data types 1 - vector (bit) - BCD number (binary coded decimal) - Binary number unsigned - two complement number - floating point number - string n-1 i 0 n = 8,16, , 16 Bit 32 Bit n-1 0 n = 8, 16, 32 MSB LSB n-1 0 MSB=sign bit LSB n = 8, 16, 32 biased.expon.sfraction... n = 8, 16, 32 n-1 0 n-1 0 n (taken from MC680x0)

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 9 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt Address space organisation Physical organisation: depends on the processor n0n 8 bit processor n0n 16 bit processor n0n 32 bit processor... n: physical address, n = 2 address bus width

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 10 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt Address space organisation Locical organisation: byte oriented access for most processor types m0123m physical word on a 8 bit processor physical word on a 16 bit processor physical word on a 32 bit processor m: logical address, m = n * bit width / 8

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 11 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt Address space organisation Physical to logical mapping: n0n m-3m-2 m-1 m physical address locical address

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 12 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt Address space organisation Aligned access: the accessed word is aligned according to its length in the physical address space (logical adress mod length) = n0n... byte bytes to byte boundaries half-word half-words to half-word boundaries word words to word boundaries

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 13 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt Address space organisation n0n... half-word -word word Unaligned (misaligned) access: the accessed word is not aligned according to its length in the physical address space (logical adress mod length) 0 half- Some processors do not support unaligned access (e.g. SPARC)

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 14 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt Byte order in words 8 Bit - byte 16 Bit - word 32 Bit - word 8 Bit - byte 16 Bit - word 32 Bit - word N + 1NN + 2N + 3 N + 1N N NN + 2N + 3 N + 1N N big endian byte ordering little endian byte ordering Two different formats: N: least significant byte, N + 3: most significant byte Word address is the address of the most significant byte (used e.g. in MC680x0 or SPARC) Word address is the address of the least significant byte (used e.g. in Pentium family)

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 15 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt Byte order in words N + 1 N N + 2 N + 3 big endian byte ordering Locical (byte oriented) memory organization of a 32 bit word b b+1 b+2 b+3 byte address N + 2 N + 3 N + 1 N little endian byte ordering b b+1 b+2 b+3 byte address

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 16 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt Register model The number of registers being part of a processor varies between 20 and 200. The advantage of data storage in registers against DRAM or SRAM-memories are: faster access time register addresses could be shorter with respect to the instruction format. An ISA is called Load-Store-ISA if all machine instructions except register load and store instructions operate on the register file only.

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 17 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt Registers are classified into hidden registers and programmer visible registers. The visible registers are the workplace of the programmer and are often organized as register files. Hidden registers are supply registers needed for the internal functionality of the processing unit (CPU). Both visible and hidden registers are designed for various purpose and functionality. Register model

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 18 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt A register model defines which processor registers are visible (addressable) to the programmer. Usually these are the working registers and the state register. The state register monitors the state of the processor through conditional flags. It shows for example whether the processor operates in system or user mode. The state register is mostly read-only Commonly existing hidden registers are the instruction register and the memory interface registers. Register model

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 19 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt Register implementation clk Q0Q0 D Q D0D0 Q1Q1 D1D1 Q 31 D Q D bit register with D-Latches Asynchronous counter with D-Latches D Q Q clk D Q Q D Q Q D Q Q Q0Q0 Q2Q2 Q1Q1 Q3Q3.... clk D 31 Q 31 D1D1 D0D0 Q0Q0 Q1Q1 Symbol

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 20 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt program counter (PC) - contains the next instruction address state register (SR) - monitors the state of the processor stackpointer (SP) - stores the top of the stack accumulator (ACCU) – stores computation results (in older or simple processors) data registers (DXi) - storing operands for computations address registers (AXi) - storing operand addresses general purpose registers (GPi) - storing either operands or operand addresses Common visible registers

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 21 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt instruction register (IR) – contains the currently processed instruction instruction queue (IQ) - contains the next instructions to be processed memory address register (MAR) - buffers the address of a memory access (e.g. to save or load a general purpose register) memory data register (MDR) - buffers the content of a memory access (e.g. to save or load a general purpose register) Common hidden registers

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 22 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt Program counter register Pointer to the next instruction to be executed Normally incremented Set by a jump, jump subroutine, interrupt, return or return from interrupt instruction Program counter PC N - 4 N N + 4 M … … 31 0 Add A B Jump M …

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 23 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt Stackpointer register Addresses a location in the memory which is organized as a stack (LIFO). Elements can be pushed (write) and popped (read) only from the top of the stack. Consequence: Data are stored in a subsequent order Used e.g. for jump subroutine/return operation on PC Stackpointer N - 4 N N + 4 … … 31 0 PushX Pop Some processors distinguish between user stackpointer (e.g. for jump subroutine/return) and supervisor stackpointer (e.g. for interrupt/return from interrupt)

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 24 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt Sampe CISC register set Intel Pentium

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 25 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt Sampe RISC register set Power PC (extract) The register file of RISC processors has to be much bigger compared to CISC processors. A RISC needs more registers, because the register file is source and destination of all arithmetic or logic instructions.

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 26 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt Multiple register sets

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 27 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt Processors with multiple register sets: a step towards multithreaded processors Processor with multiple register sets: Each register set can store the program counter (PC) and the state register (SR) PC and SR exist only once => several contexts can be stored, fast context switching Multithreaded processor: multiple PCs and SRs exist instructions from several threads can be executed at the same time in the pipeline => several contexts can be processed Multiple register sets

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 28 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt The registers of a register file are grouped into blocks called windows. These overlapping windows are used by the subroutines of a program. MORS (multiple overlapping register set) Multiple overlapping register sets, register windows Overall register set Register window 1 Register window 2 Register window 3 Register window n jump subroutine return

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 29 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt Simplifies parameter passing on jumping to subroutines Each subroutine has its own working space within the register file Parameters can be directly passed with no need to copy registers or pass parameters by memory => mainly used in RISC processors Two possible approaches: Fixed size register window Variable size register window Multiple overlapping register sets, register windows

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 30 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt preceding window continued global registers r7 r0 current window succeeding window CWP restore save alternative register naming: r31 = i7 r24 = i0 r23 = I7 r16 = I0 r15 = o7 r8 = o0 r7 = g7 r0 = g0 based on SPARC architecture r31 r24 r23 r16 r15 r8 r31 r8 In i+1 Local i+1 Out i+1 In i Local i Out i Out i-1 Local i-1 In i-1 Fixed size window local register

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 31 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt In case of the SPARC architecture, a window consists of 32 registers of which the first 8 also belong to the preceding window and the last 8 also belong to the succeeding window. The registers are addressed relative to the current window pointer (CWP). A subroutine call is performed by incrementing the CWP and saving the PC. The parameters are passed through the overlapping registers of the two windows. The content of the program counter is saved (return address) into one of these registers. A time consuming save and reload of registers is omitted. In case of an overflow of the MORS the window contents have to be saved to a stack. Fixed size window

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 32 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt In Local Out Local Out preceding current window global registers local registers previous RSP current RSP r0 r1 r0 r1 Variable size window gr0 gr1 gr63 register stack pointer (RSP) r0 r1 r127 r65 r66 r64 based on AMD architecture

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 33 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt Register size of processors with 3-address architecture processor/architecture (vendor) # of general purpose registersbit width overall directly accessible register width register address immediate operands instr. Alpha (Compaq)32 64 Bit5 Bit8 Bit32 Bit Am29000 (AMD) Bit8 Bit 32 Bit ARM7TDMI (ARM)16 32 Bit4 Bit8 Bit32 Bit Crusoe TM5800 (Transmeta)64 32 Bit6 Bit-- pa-8700 (HP)32 64 Bit5 Bit11 Bit32 Bit Itanium 2 (Intel, HP) Bit7 Bit8 Bit41 Bit MC88100 (Motorola)32 32 Bit5 Bit16 Bit32 Bit MIPS65 20Kc (MIPS)32 64 Bit5 Bit16 Bit32 Bit Nemesis C (TU Berlin) Bit4 Bit1 Bit16 Bit PowerPC 970 (IBM)32 64 Bit5 Bit16 Bit32 Bit UltraSPARC III Cu (SUN) Bit5 Bit13 Bit32 Bit

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 34 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt Register size of processors with 2-address architecture processor (vendor) # of general purpose registersbit width overall directly accessible register width register address immediate operands smallest instr. Athlon (AMD X86-64)16 64 Bit4 Bit Bit8 Bit ColdFire MFC5206 (Motorola) Bit3 Bit Bit16 Bit MC680xx (Motorola) Bit3 Bit Bit16 Bit Pentium X (Intel X86)8832 Bit3 Bit Bit8 Bit

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 35 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt Addressing modes Machine instructions normally hold information about the operand addresses. This can either be a physical address, e.g. a register number or the address of a memory location, or it can be an address specification. An address specification defines how to calculate the address. Thus, the address information determines the location of the operand(s) belonging to the instruction using one of many addressing modes.

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 36 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt Addressing modes Instruction format e.g. arithmetic instruction opcode target source source operands needed for the execution defined by the opcode operand register memory address specification itself number location (dynamic address calculation) The result of the dynamic address calculation is called effective address

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 37 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt immediate: The operand is part of the instruction. memory direct and register direct: The instruction contains the operand address. register indirect: The instruction contains a register number pointing to a register holding the address of the operand. In assembler code this addressing mode is typically denoted by register name Addressing modes

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 38 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt memory indirect: A register addressed in the instruction contains the address of a memory cell which holds the operand address. register offset: The instruction contains a register number and an offset. The operand address is the sum of the registers content and the offset. implicit: The instruction implicitly targets a single register (like the ACCU) Addressing modes

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 39 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt Reasons for using dynamic address calculation: Addresses of data structure elements are composed of the first address of the data structure and the offset of the element to the beginning. Often this offset is unknown at compile time, therefore the effective address has to be calculated at runtime. Repeated execution of the same instruction, e.g. in a loop, often accesses successive memory addresses which have to be calculated at runtime. Effective address The address is calculated from several parts found in the instruction and in registers or memory cells at runtime (dynamic address calculation). The calculated address is defined as effective address.

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 40 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt An operand address often is unknown at compile time, because it is calculated during program execution. The partitioning of addresses into a base address stored in a register and an offset simplifies the handling of shift able variables and shift able program code. Effective address (cont.)

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 41 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt o p e r a n d immediate instruction register memory operand eff. address register direct memory direct Addressing modes 1 e.g. LOAD 8, r1 e.g. LOAD (2000), r1 e.g. LOAD r2, r1

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 42 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt o p e r a n d register indirect instruction register instruction register address e f f e c t i v e a d d r e s s register address memory m e m o r y a d d r e s s decrement memory - eff. address register register indirect with predecrement Addressing modes 2 e.g. LOAD (r2), r1 e.g. LOAD -(r2), r1

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 43 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt register indirect with displacement (indexed) o p e r a n d instruction register address m e m o r y a d d r e s s memory + + i n d e x register displacement scaling 1, 2 or 4 eff. address Addressing modes 3 e.g. LOAD.B 126(r3)(r2), r1

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 44 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt memory indirect o p e r a n d instruction register address m e m o r y a d d r e s s memory + + displacement1 indirect memory address displacement2eff. address memory Addressing modes 4 e.g. LOAD 28(126(r2)), r1

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 45 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt memory indirect (post indexed) o p e r a n d instruction register address m e m o r y a d d r e s s memory + + displacement1 indirect memory address displacement2 eff. address memory i n d e x scaling 1, 2 or 4 + register Addressing modes 5 e.g. LOAD.B 28(r3)(126(r2)), r1

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 46 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt memory indirect (preindexed) o p e r a n d instruction register address m e m o r y a d d r e s s memory + + displacement1 indirect memory address displacement2 eff. address memory i n d e x scaling 1, 2 or 4 + register Addressing modes 6 e.g. LOAD.B 28(126(r3)(r2)), r1

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 47 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt branch target table access through program counter relative addressing JMP disp (PC)(r n ) memory target 0 target 2 target 1 i n d e x (PC) + + displacement Access to branch target table by PC relative addressing

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 48 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt Machine instruction set The machine instruction set of a computer normally includes instructions of different formats, e.g. 0-address instructions, 1-address instructions, 2-address instructions and 3-address instructions. An instruction is divided into so called fields. The more address fields an instruction contains the smaller the number of addressable memory cells and/or the number of operations encoded in the opcode field becomes (if we assume a constant instruction length).

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 49 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt Variable length vs. constant length instruction format Variable length: (e.g Bit) mostly used in CISC architectures + flexible instruction format + high code density + long immediate and displacement values Constant length: (e.g. 32 Bit) mostly used in RISC architectures + simple and fast fetch + simple and fast decode + simplified pipelining

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 50 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt Scheme of basic operations of common processors basic operations unconditional operations conditional operations combinatorial operations control flow operations transport operations arithmetic logic operations simple branches system branches load operations store operations semaphore operations arithmetic operations logic and shift operations state and control operations subroutine branches call return

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 51 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt Instruction sets are divided into groups combining instructions with similar functionality: Typical instruction groups: transport instructions arithmetic instructions logic instructions shift and rotate instructions bitwise instructions string and array instructions branch instructions system instructions synchronization instructions Instruction classes

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 52 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt Load store architecture All instructions - except load and store instructions - address registers only. Load and store instructions are needed to transfer data to and from main memory. Mainly used in RISC ISA, combined with pipelining it allows to complete most instructions in one cycle Furthermore, the address fields of instructions becomes shorter as they only have to address a register instead of a memory address. A load store ISA accelerates a machine if there are only small caches or if the caches are completely missing and a big register file is available.

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 53 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt Example: An arithmetic instruction SUBc r3, r7, r21 binary code hexcode D54E instruction format: OP: opcode TR: target register SRn: source register c/x: set/do not set condition code Example: A store instruction STORE r24, 126(r5) binary code hexcode 3E0A007E instruction format: OP: opcode SR: source register BR: base register DP: displacement (signed) Two examples for an instruction format OPTRSR1SR2OPSRBR cxcx DP

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 54 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt State register of a RISC processor (based on SPARC-architecture) N Z V C IM IEIE PSPS S CWP SR interrupt mask interrupt enable previous S-bit supervisor/user current window pointer carry overflow zero negative conditional bits

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 55 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt Conditional codes dependent on conditional bits Z (zero), N (negative), C (carry) und V (overflow). Mnemonics according to Motorolas ColdFire MFC5206 processor. conditional value mnemonicoperationexpressionoperand type equal not equal eq ne = Z independent higher than higher than or same lower than lower than or same ht hs lo ls > < C Z C C Z unsigned greater than greater than or equal less than less than or equal gt ge lt le > < (N = V) Z (N = V) (N V) (N V) Z signed arithmetic overflow arithmetic shortfall negative positive vs vc ne pl V N signed

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 56 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt Multimedia instructions Typical SIMD instructions to process a single operation on a set of data (e.g. changing the brightness of image pixels) Operations can be on packed integers (e.g. MMX on Pentium) or packed floats (e.g. SSE2 on Pentium) Typical operations: arithmetic (saturated or overflow), logic, compare, pack, unpack Example: