# Cortex-M0+ CPU Core and ARM Instruction Set Architecture

# Microcontroller vs. Microprocessor

- Both have a CPU core to execute instructions
- Microcontroller has peripherals for embedded interfacing and control
  - Analog
  - Non-logic level signals
  - Timing
  - Clock generators
  - Communications
  - point to point
  - network
  - Reliability and safety



## Cortex-M0+ Core



# **ARM Processor Core Registers**



# **Operating Modes**



- Which SP is active depends on operating mode, and SPSEL (CONTROL register bit 1)
  - SPSEL == 0: MSP
  - SPSEL == I: PSP

# **ARM Program Status Register**



#### Three views of same register

- Application PSR (APSR)
  - Condition code flag bits Negative, Zero, oVerflow, Carry used for conditional branches, extended precision math, error detection
- Interrupt PSR (IPSR)
  - Holds exception number of currently executing ISR
- Execution PSR (EPSR)
  - Thumb state

| Mnemonic extension     | Meaning                      | Condition flags   |
|------------------------|------------------------------|-------------------|
| EQ                     | Equal                        | Z=1               |
| NE                     | Not equal                    | Z == 0            |
| CS a                   | Carry set                    | C=1               |
| CC p                   | Carry clear                  | C=0               |
| MI                     | Minus, negative              | N=1               |
| PL                     | Plus, positive or zero       | N=0               |
| VS                     | Overflow                     | V=1               |
| VC                     | No overflow                  | V=0               |
| HI                     | Unsigned higher              | C = 1 and $Z = 0$ |
| LS                     | Unsigned lower or same       | C = 0 or $Z = 1$  |
| GE                     | Signed greater than or equal | N == V            |
| LT                     | Signed less than             | N != V            |
| GT                     | Signed greater than          | Z = 0 and $N = V$ |
| LE                     | Signed less than or equal    | Z = 1 or $N = V$  |
| None (AL) <sup>d</sup> | Always (unconditional)       | Any               |

# **ARM Processor Core Registers**

- PRIMASK Exception mask register
  - Bit 0: PM Flag
    - Set to 1 to prevent activation of all exceptions with configurable priority
  - Access using CPS, MSR and MRS instructions
  - Use to prevent data race conditions with code needing atomicity
- CONTROL
  - Bit 1: SPSEL flag
    - Selects SP when in thread mode: MSP (0) or PSP (1)
  - Bit 0: nPRIV flag
    - Defines whether thread mode is privileged (0) or unprivileged (1)
  - With OS environment,
    - Threads use PSP
    - OS and exception handlers (ISRs) use MSP

# **Different Instruction Sets for Different Design Spaces?**

- ARM instructions optimized for resource-rich highperformance computing systems
  - Deeply pipelined processor, high clock rate, wide (e.g. 32bit) memory bus
  - <u>https://en.wikipedia.org/wiki/ARM\_Cortex-M#Instruction\_sets</u>
- Low-end embedded computing systems are different
  - Slower clock rates, shallow pipelines
  - Different cost factors e.g. code size matters much more
  - Bit and byte operations critical

# The Memory Wall

It has been easier to speed up the CPU than the memory

Facts of life

- Off-chip memory is slower than on-chip memory. May not want to put all memory on-chip, even if possible.
- Flash is slower to read or write than RAM.
- Fast RAM is more expensive than slow RAM. Same for flash.
- Design for high-performance CPUs
  - Use caches (small fast RAM) to make main memory (large slow RAM, flash) look faster at a low cost.
  - Put cache(s) on chip if possible.
  - Increase bandwidth by widening memory bus, improving protocol, reducing overhead, split transactions, using page mode, etc.)
- Design for low-performance CPUs
  - Put memory on-chip with CPU. RAM, flash ROM
  - Increase flash ROM bandwidth by widening memory bus, adding prefetch buffer, branch target buffer, etc.
  - Add cache
  - Change instruction set size to reduce instruction bandwidth needed



# **ARM** and Thumb Instructions

- Thumb reduces program memory size and bandwidth requirements
  - Subset of instructions re-encoded into fewer bits (most 16 bits, some 32 bits)
    - Not all 32-bit instructions available
    - Most 16-bit instructions can only access low registers (R0-R7), but a few can access high registers (R8-R15)
  - I995:Thumb-I instruction set
    - I6-bit instructions
  - 2003: Thumb-2 instruction set
    - Adds some 32 bit instructions
    - Improves speed with little memory overhead

- CPU operating state
  - CPU decodes instructions based on whether in Thumb state or ARM state - controlled by T bit
  - Thumb state indicated by program counter being odd (LSB = 1)
- Cortex-M0+ only uses Thumb instructions, is always in Thumb state
- See ARMv6-M Architecture Reference Manual for specifics per instruction (Section A.6.7)

# **Cortex-M Instruction Groups**

| Group        | lnstr<br>bits | Instructions                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   | M0,M0+,M1 | M3  | M4       | M7       | M23      | M33,M35P |
|--------------|---------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----------|-----|----------|----------|----------|----------|
| Thumb-I      | 16            | ADC, ADD, ADR, AND, ASR, B, BIC, BKPT, BLX, BX, CMN, CMP, CPS, EOR, LDM, LDR, LDRB, LDRH,<br>LDRSB, LDRSH, LSL, LSR, MOV, MUL, MVN, NOP, ORR, POP, PUSH, REV, REV16, REVSH, ROR, RSB, SBC,<br>SEV, STM, STR, STRB, STRH, SUB, SVC, SXTB, SXTH, TST, UXTB, UXTH, WFE, WFI, YIELD                                                                                                                                                                                                                                                                                                                                                                | Yes       | Yes | Yes      | Yes      | Yes      | Yes      |
| Thumb-I      | 16            | CBNZ, CBZ                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      | No        | Yes | Yes      | Yes      | Yes      | Yes      |
| Thumb-I      | 16            | IT                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                             | No        | Yes | Yes      | Yes      | No       | Yes      |
| Thumb-2      | 32            | BL, DMB, DSB, ISB, MRS, MSR                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    | Yes       | Yes | Yes      | Yes      | Yes      | Yes      |
| Thumb-2      | 32            | SDIV, UDIV                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     | No        | Yes | Yes      | Yes      | Yes      | Yes      |
| Thumb-2      | 32            | ADC, ADD, ADR, AND, ASR, B, BFC, BFI, BIC, CDP, CLREX, CLZ, CMN, CMP, DBG, EOR, LDC, LDM,<br>LDR, LDRB, LDRBT, LDRD, LDREX, LDREXB, LDREXH, LDRH, LDRHT, LDRSB, LDRSBT, LDRSH,<br>LDRSHT, LDRT, LSL, LSR, MCR, MCRR, MLA, MLS, MOV, MOVT, MRC, MRRC, MUL, MVN, NOP, ORN,<br>ORR, PLD, PLDW, PLI, POP, PUSH, RBIT, REV, REV16, REVSH, ROR, RRX, RSB, SBC, SBFX, SEV, SMLAL,<br>SMULL, SSAT, STC, STM, STR, STRB, STRBT, STRD, STREX, STREXB, STREXH, STRH, STRHT, STRT, SUB,<br>SXTB, SXTH, TBB, TBH, TEQ, TST, UBFX, UMLAL, UMULL, USAT, UXTB, UXTH, WFE, WFI, YIELD                                                                           | No        | Yes | Yes      | Yes      | No       | Yes      |
| DSP          | 32            | PKH, QADD, QADD16, QADD8, QASX, QDADD, QDSUB, QSAX, QSUB, QSUB16, QSUB8, SADD16,<br>SADD8, SASX, SEL, SHADD16, SHADD8, SHASX, SHSAX, SHSUB16, SHSUB8, SMLABB, SMLABT, SMLATB,<br>SMLATT, SMLAD, SMLALBB, SMLALBT, SMLALTB, SMLALTT, SMLALD, SMLAWB, SMLAWT, SMLSD,<br>SMLSLD, SMMLA, SMMLS, SMMUL, SMUAD, SMULBB, SMULBT, SMULTT, SMULTB, SMULWT, SMULWB,<br>SMUSD, SSAT16, SSAX, SSUB16, SSUB8, SXTAB, SXTAB16, SXTAH, SXTB16, UADD16, UADD8, UASX,<br>UHADD16, UHADD8, UHASX, UHSAX, UHSUB16, UHSUB8, UMAAL, UQADD16, UQADD8, UQASX,<br>UQSAX, UQSUB16, UQSUB8, USAD8, USADA8, USAT16, USAX, USUB16, USUB8, UXTAB, UXTAB16,<br>UXTAH, UXTB16 | No        | No  | Yes      | Yes      | No       | Optional |
| SP Float     | 32            | VABS, VADD, VCMP, VCMPE, VCVT, VCVTR, VDIV, VLDM, VLDR, VMLA, VMLS, VMOV, VMRS, VMSR,<br>VMUL, VNEG, VNMLA, VNMLS, VNMUL, VPOP, VPUSH, VSQRT, VSTM, VSTR, VSUB                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 | No        | No  | Optional | Optional | No       | Optional |
| DP Float     | 32            | VCVTA, VCVTM, VCVTN, VCVTP, VMAXNM, VMINNM, VRINTA, VRINTM, VRINTN, VRINTP, VRINTR,<br>VRINTX, VRINTZ, VSEL                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    | No        | No  | No       | Optional | No       | No       |
| TrustZone    | 16            | BLXNS, BXNS                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    | No        | No  | No       | No       | Optional | Optional |
| TrustZone    | 32            | SG, TT, TTT, TTA, TTAT                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                         | No        | No  | No       | No       | Optional | Optional |
| Co-processor | 16            | CDP, CDP2, MCR, MCR2, MCRR, MCRR2, MRC, MRC2, MRRC, MRRC2                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      | No        | No  | No       | No       | No       | Optional |

# Reference for ARM Instruction Set Architecture

#### • ARM V6-M Architecture Reference Manual, The encoding of 16-bit Thumb instructions is: Chapter A5. The Thumb Instruction Set Encoding 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 opcode 16- or 32-bit instruction? Table A5-1 shows the allocation of 16-bit instruction encodings. Bits [15:11] 32 Table A5-1 16-bit Thumb instruction encoding • 0b11101, 0b1110, 0b11111: 32-bit instruction. Page A5-91 Instruction or instruction class opcode Else 16-bit instruction. Page A5-84 Shift (immediate), add, subtract, move, and compare on page A5-85 00xxxx 010000 Data processing on page A5-86 The encoding of 32-bit Thumb instructions is: Special data instructions and branch and exchange on page A5-87 010001 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 01001x Load from Literal Pool, see LDR (literal) on page A6-141 1 1 1 op1 ор Load/store single data item on page A5-88 0101xx For 32-bit Thumb encoding, op1 != 0b00. If op1 == 0b00, a 16-bit instruction is encoded, see 16-bit Thumb 011xxx instruction encoding on page A5-84. 100xxx Table A5-9 shows the allocation of ARMv6-M Thumb encodings in this space. Generate PC-relative address, see ADR on page A6-115 10100x Table A5-9 32-bit Thumb encoding Generate SP-relative address, see ADD (SP plus immediate) on page A6-111 10101x Instruction class op Miscellaneous 16-bit instructions on page A5-89 001 1011xx Store multiple registers, see STM, STMIA, STMEA on page A6-175 11000x UNDEFINED x1 х See Branch and miscellaneous control Load multiple registers, see LDM, LDMIA, LDMFD on page A6-137 11001x 10 Conditional branch, and Supervisor Call on page A5-90 1101xx 10 0 UNDEFINED

11100x Unconditional Branch, see B on page A6-119

# Example Instruction Encoding: ADC (register)

#### ADC (register)

Noton CMOt ervalue RAIN E RAN+RM Add with Carry (register) adds a register value, the carry flag value, and an optionally-shifted register value and writes the result to the destination register. It updates the condition flags based on the result.

Encoding T1 All versions of the Thumb instruction set.

ADCS <Rdn>, <Rm>

15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 1 0 0 0 0 0 1 0 1 Rdn Rm

d = UInt(Rdn); n = UInt(Rdn); m = UInt(Rm); setflags = !InITBlock(); (snift\_t, shift\_n) = (SRType\_LSL, 0);

#### Assembler syntax

ADCS{<q>} {<Rd>,} <Rn>, <Rm>

where:

- The instruction updates the flags. S
- See Standard assembler syntax fields on page A6-98. {<q>}
- The destination register. If <Rd> is omitted, this register is the same as <Rn>. <Rd>

The register that contains the first operand. <Rn>

The register that is optionally shifted and used as the second operand. <Rm>

Operation if ConditionPassed() then EncodingSpecificOperations(); shifted = Shift(R[m], shift\_t, shift\_n, APSR.C); (result, carry, overflow) = AddWithCarry(R[n], shifted, APSR.C);R[d] = result;if setflags then APSR.N = result<31>; APSR.Z = IsZeroBit(result); APSR.C = carry;APSR.V = overflow:

#### Page A6-106 of **ARM-V6M ARM**

# Example Instruction Encoding: ADD (register)

#### ADD (register)

This instruction adds a register value and an optionally-shifted register value, and writes the result to the destination register. Encoding T1 updates the condition flags based on the result.



if (DM:Rdm) = '1101' || Rm == '1101' then SEE ADD (SP plus register); d UInt(DN:Rdn): n = d; m = UInt(Rm); setflags = FALSE; (shift\_t, shift\_n) = (SRType\_LSL, 0); if n == 15 && m == 15 then UNPREDICTABLE; if d == 15 && InITBlock() && !LastInITBlock() then UNPREDICTABLE;

#### Page A6-109 of ARM-V6M ARM

| Assemble                                                                                  | er syntax                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  |          |  |  |
|-------------------------------------------------------------------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------|--|--|
| ADD{S}{ <q>}</q>                                                                          | { <rd>,} <rn>, <rm></rm></rn></rd>                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                         |          |  |  |
| where:                                                                                    |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            |          |  |  |
| S                                                                                         | If present, specifies that the instruction updates the flags. Otherwise, the instruction does not update the flags.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                        |          |  |  |
| { <q>}</q>                                                                                | See Standard assembler syntax fields on page A6-98.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                        |          |  |  |
| <rd></rd>                                                                                 | The destination register. If <rd> is omitted, this register is the same as <rn> and encoding T2 is preferred to encoding T1 if both are available. If <rd> is specified, encoding T1 is preferred to encoding T2. If R<m> is not the PC, the PC can be used in encoding T2.</m></rd></rn></rd>                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                             |          |  |  |
| <rn></rn>                                                                                 | The register that contains the first operand. If the SP is specified for <rn>, see <i>ADD (SP plus register)</i> on page A6-113. If R<m> is not the PC, the PC can be used in encoding T2.</m></rn>                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                        |          |  |  |
| <rm> The register that is used as the second operand. The PC can be used in encoding</rm> |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            | ding T2. |  |  |
|                                                                                           | <pre>Operation if ConditionPassed() then     EncodingSpecificOperations();     shifted = Shift(R[m], shift_t, shift_n, APSR.C);     (result, carry, overflow) = AddWithCarry(R[n], shifted, '0');     if d == 15 then         ALUWritePC(result); // setflags is always FALSE here     else         R[d] = result;         if setflags then             APSR.N = result&lt;31&gt;;             APSR.Z = IsZeroBit(result);         APSR.C = carry;         APSR.V = overflow;         APSR.V = overflo</pre> |          |  |  |

# Assembler Instruction Format



- <operation> <operandl> <operand2> <operand3>
  - There may be fewer operands
  - First operand is typically destination (<Rd>)
  - Other operands are sources (<Rn>, <Rm>)
- Examples
  - ADDS <Rd>, <Rn>, <Rm>
    - Add registers: <Rd> = <Rn> + <Rm>
  - AND <Rdn>, <Rm>
    - Bitwise and: <Rdn> = <Rdn> & <Rm>
  - CMP <Rn>, <Rm>
    - Compare: Set condition flags based on result of computing <Rn> <Rm>

# Update Condition Codes in APSR?



- "S" suffix indicates the instruction updates APSR
  - ADD vs. ADDS
  - ADC vs. ADCS
  - SUB vs. SUBS
  - MOV vs. MOVS



# **USING REGISTERS**

# **AAPCS** Register Use Conventions

• Make it easier to create modular, isolated and integrated code

Scratch registers are not expected to be preserved upon returning from a called subroutine r0-r3 Preserved ("variable") registers are expected to have their original values upon returning from a called subroutine Scratch Can be Margeo r4-r8, r10-r11

# AAPCS Core Register Use

| Register | Synonym          | Special        | Role in the procedure call standard                                                                                  |                                                       |                               |  |
|----------|------------------|----------------|----------------------------------------------------------------------------------------------------------------------|-------------------------------------------------------|-------------------------------|--|
| r15      |                  | PC             | The Program Counter.                                                                                                 |                                                       |                               |  |
| r14      |                  | LR             | The Link Register.                                                                                                   | he Link Register.                                     |                               |  |
| r13      |                  | SP             | The Stack Pointer.                                                                                                   | he Stack Pointer.                                     |                               |  |
| r12      |                  | IP             | The Intra-Procedure-ca                                                                                               | The Intra-Procedure-call scratch register.            |                               |  |
| r11      | v8               |                | Variable-register 8.                                                                                                 | Variable-register 8. Must be saved, restored by call  |                               |  |
| r10      | v7               |                | Variable-register 7.                                                                                                 | /ariable-register 7. procedure if it will modify then |                               |  |
| 19       | $\left( \right)$ | V6<br>SB<br>TR | Platform register.<br>The meaning of this reg                                                                        | retain th                                             | eir value.                    |  |
| r8       | <b>v</b> 5       |                | Variable-register 5.                                                                                                 |                                                       |                               |  |
| r7       | v4               |                | Variable register 4.                                                                                                 | ariable register 4. Must be saved, restored by call   |                               |  |
| r6       | <b>v</b> 3       |                | Variable register 3.                                                                                                 | Calling s                                             | ubroutine expects these to    |  |
| r5       | v2               |                | Variable register 2.                                                                                                 | retain th                                             | eir value.                    |  |
| r4       | v1               |                | Variable register 1.                                                                                                 |                                                       |                               |  |
| r3       | a4               |                | Argument / scratch register 4.                                                                                       |                                                       |                               |  |
| r2       | a3               |                | Argument / scratch register 3.<br>Argument / result / scratch register 2.<br>Argument / result / scratch register 1. |                                                       | Don't need to be saved. May   |  |
| r1       | a2               |                |                                                                                                                      |                                                       | be used for arguments,        |  |
| rO       | a1               |                |                                                                                                                      |                                                       | results, or temporary values. |  |

# Instruction Set Summary

| Instruction Type        | Instructions                                             |  |
|-------------------------|----------------------------------------------------------|--|
| Move                    | MOV                                                      |  |
| Load/Store              | LDR, LDRB, LDRH, LDRSH, LDRSB, LDM, STR, STRB, STRH, STM |  |
| Add, Subtract, Multiply | ADD, ADDS, ADCS, ADR, SUB, SUBS, SBCS, RSBS, MULS        |  |
| Compare                 | CMP, CMN                                                 |  |
| Logical                 | ANDS, EORS, ORRS, BICS, MVNS, TST                        |  |
| Shift and Rotate        | LSLS, LSRS, ASRS, RORS                                   |  |
| Stack                   | PUSH, POP                                                |  |
| Conditional branch      | B, BL, B{cond}, BX, BLX                                  |  |
| Extend                  | SXTH, SXTB, UXTH, UXTB                                   |  |
| Reverse                 | REV, REVI6, REVSH                                        |  |
| Processor State         | SVC, CPSID, CPSIE, SETEND, BKPT                          |  |
| No Operation            | NOP                                                      |  |
| Hint                    | SEV, WFE, WFI, YIELD                                     |  |
| Barriers                | DMB, DSB, ISB                                            |  |

# Load and Store Register Instructions



#### **NC STATE UNIVERSITY**

 ARM is a load/store architecture, so must process data in registers (not memory)

- LDR: load register with word (32 bits) from memory
- LDR <Rt>, source address

 STR: store register contents (32 bits) to memory

STR <Rt>, destination address



- Source and destination addresses are specified using available addressing modes
  - Offset Addressing mode: [<Rn>, <offset>] accesses address <Rn>+<offset>
  - Base Register <Rn> can be R0-R7, SP or PC
  - <offset> is added or subtracted from base register to create effective address
    - Can be an immediate constant
    - Can be another register, used as index <Rm>
- Auto-update: Can write effective address back to base register
  - Pre-indexing: use effective address to access memory, then update base register
  - Post-indexing: use base register to access memory, then update base register

# Memory Maps For Cortex M0+ and MCU



# Memory Maps For Cortex M0+ and MCU





# **ARMv6-M Endianness**

- Instructions are always little-endian
- Loads and stores to Private Peripheral Bus are always little-endian
- Data: Depends on implementation, or from reset configuration
  - Kinetis processors are little-endian

# Loading/Storing Smaller Data Sizes

|           | Signed | Unsigned |
|-----------|--------|----------|
| Byte      | LDRSB  | LDRB     |
| Half-word | LDRSH  | LDRH     |

- Some load and store instructions can handle half-word (16 bits) and byte (8 bits)
- Store just writes to half-word or byte
  - STRH, STRB
- Loading a byte or half-word requires padding or extension: What do we put in the upper bits of the register?
  - Example: How do we extend 0x80 into a full word?
    - Unsigned? Then 0x80 = 128, so zero-pad to extend to word 0x0000\_0080 = 128
    - Signed? Then 0x80 = -128, so sign-extend to word 0xFFF\_FF80 = -128

1000 0000 Z's coupl. T = nogativeSign: 1 = nogative 0 = positive

26

# **In-Register Size Extension**

|           | Signed | Unsigned |
|-----------|--------|----------|
| Byte      | SXTB   | UXTB     |
| Half-word | SXTH   | UXTH     |

- Can also extend byte or half-word already in a register
  - Signed or unsigned (zero-pad)
- How do we extend 0x80 into a full word?
  - Unsigned? Then 0x80 = 128, so zero-pad to extend to word 0x0000\_0080 = 128
  - Signed? Then 0x80 = -128, so sign-extend to word 0xFFFF\_FF80 = -128

# Load/Store Multiple

- LDM/LDMIA: load multiple registers starting from [base register], update base register afterwards
  - LDM <Rn>!,<registers>
  - LDM <Rn>,<registers>
- STM/STMIA: store multiple registers starting at [base register], update base register after STM <Rn>!, <registers>
- LDMIA and STMIA are pseudo-instructions, translated by assembler

# Load Literal Value into Register

Assembly pseudo-instruction: LDR <rd>,

=value

- Assembler generates code to load <rd> with value
- Assembler selects best approach depending on value
  - Load immediate
  - MOV instruction provides 8-bit unsigned immediate operand
  - (0-255)
  - Load and shift immediate values
    - Can use MOV, shift, rotate, sign extend instructions
  - Load from literal pool
    - I. Place value as a 32-bit literal in the program's literal pool (table of literal values to be loaded into registers)
    - 2. Use instruction LDR <rd>, [pc,#offset] where offset indicates position of literal relative to program counter value

- Example formats for literal values (depends on compiler and toolchain used)
  - Decimal: 3909
  - Hexadecimal: 0xa7ee
  - Character:'A'
  - String:"44??"



# Move (Pseudo-)Instructions

- Copy data from one register to another without updating condition flags
  - MOV <Rd>, <Rm>
- Assembler translates pseudoinstructions into equivalent instructions (shifts, rotates)
  - Copy data from one register to another and update condition flags
    - MOVS <Rd>, <Rm>
  - Copy immediate literal value (0-255) into register and update condition flags
    - MOVS <Rd>, #<imm8>

| MOV instruction                        | Canonical form                     |
|----------------------------------------|------------------------------------|
| MOVS <rd>,<rm>,ASR #<n></n></rm></rd>  | ASRS <rd>,<rm>,#<n></n></rm></rd>  |
| MOVS <rd>,<rm>,LSL #<n></n></rm></rd>  | LSLS <rd>,<rm>,#<n></n></rm></rd>  |
| MOVS <rd>,<rm>,LSR #<n></n></rm></rd>  | LSRS <rd>,<rm>,#<n></n></rm></rd>  |
| MOVS <rd>,<rm>,ASR <rs></rs></rm></rd> | ASRS <rd>,<rm>,<rs></rs></rm></rd> |
| MOVS <rd>,<rm>,LSL <rs></rs></rm></rd> | LSLS <rd>,<rm>,<rs></rs></rm></rd> |
| MOVS <rd>,<rm>,LSR <rs></rs></rm></rd> | LSRS <rd>,<rm>,<rs></rs></rm></rd> |
| MOVS <rd>,<rm>,ROR <rs></rs></rm></rd> | RORS <rd>,<rm>,<rs></rs></rm></rd> |

# **Stack Operations**

- Push some or all of registers (R0-R7, LR) to stack
  - PUSH {<registers>}
  - Decrements SP by 4 bytes for each register saved
  - Pushing LR saves return address
  - PUSH {r1, r2, LR}
  - Always pushes registers in same order
- Pop some or all of registers (R0-R7, PC) from stack
  - POP {<registers>}
  - Increments SP by 4 bytes for each register restored
  - If PC is popped, then execution will branch to new PC value after this POP instruction (e.g. return address)
  - POP {r5, r6, r7}
  - Always pops registers in same order (opposite of pushing)

# **Add Instructions**

- Add registers, update condition flags
  - ADDS <Rd>,<Rn>,<Rm>
- Add registers and carry bit, update condition flags
  - ADCS <Rdn>,<Rm>
- Add registers
  - ADD <Rdn>,<Rm>
- Add immediate value to register
  - ADDS <Rd>,<Rn>,#<imm3>
  - ADDS <Rdn>,#<imm8>

# Add Instructions with Stack Pointer

- Add SP and immediate value
  - ADD <Rd>, SP, #<imm8>
  - ADD SP, SP, #<imm7>
- Add SP and register
  - ADD <Rdm>, SP, <Rdm>
  - ADD SP, <Rm>

# Address to Register Pseudo-Instruction

- Add immediate value to PC, write result in register
  - ADR <Rd>,<label>
- How is this used?
  - Enables storage of constant data near program counter
  - First, load register R2 with address of const\_data
  - ADR R2, const\_data
  - Second, load const\_data into R2
  - LDR R2, [R2]
- Value must be close to current PC value

## Subtract

- Subtract immediate from register, update condition flags
  - SUBS <Rd>, <Rn>, #<imm3>
  - SUBS <Rdn>,#<imm8>
- Subtract registers, update condition flags
  - SUBS <Rd>, <Rn>, <Rm>
- Subtract registers with carry, update condition flags
  - SBCS <Rdn>, <Rm>
- Subtract immediate from SP
  - SUB SP, SP, #<imm7>

# **Multiply**

- Multiply source registers, save lower word of result in destination register, update condition flags
  - MULS <Rdm>, <Rn>, <Rdm>
  - <Rdm> = <Rdm> \* <Rn>
- Signed multiply
- Note:
  - 32-bit \* 32-bit = 64-bit
  - Upper word of result is truncated

# Logical Operations

- All of these instructions update the condition flags
- Bitwise AND registers
  - ANDS <Rdn>,<Rm>
- Bitwise OR registers
  - ORRS <Rdn>,<Rm>
- Bitwise Exclusive OR registers
  - EORS <Rdn>,<Rm>
- Bitwise AND register and complement of second register
  - BICS <Rdn>,<Rm>
- Move inverse of register value to destination
  - MVNS <Rd>,<Rm>
- Bitwise AND two registers, discard result
  - TST <Rn>, <Rm>

## Compare

- Compare subtracts second value from first, updates condition flags, discards result
  - CMP <Rn>,#<imm8>
  - CMP <Rn>,<Rm>
- Compare negative adds two values, updates condition flags, discards result
  - CMN <Rn>,<Rm>

# Shift and Rotate

- Common features
  - All of these instructions update APSR condition flags
  - Shift/rotate amount (in number of bits) specified by last operand
- Logical shift left shifts in zeroes on right
  - LSLS <Rd>,<Rm>,#<imm5>
  - LSLS <Rdn>,<Rm>
- Logical shift right shifts in zeroes on left
  - LSRS <Rd>,<Rm>,#<imm5>
  - LSRS <Rdn>,<Rm>
- Arithmetic shift right shifts in copies of sign bit on left (to maintain arithmetic sign)
  - ASRS <Rd>,<Rm>,#<imm5>
- Rotate right
  - RORS <Rdn>,<Rm>

# **Reversing Bytes**

- REV reverse all bytes in word
  REV <Rd>,<Rm>
- REVI6 reverse bytes in both half-words
   REVI6 <Rd>,<Rm>
- REVSH reverse bytes in low half-word (signed) and sign-extend
   REVSH <Rd>,<Rm>







# **Changing Program Flow - Branches**

# Unconditional Branches

- B <label>
- Target address must be within 2 KB of branch instruction (-2048 B to +2046 B)
- Conditional Branches
  - B<cond> <label>
  - <cond> is condition see next page
  - B<cond> target address must be within of branch instruction
  - B target address must be within 256 B of branch instruction (-256 B to +254 B)

# **Condition Codes**

- Append to branch instruction
   (B) to make a conditional branch
- Full ARM instructions (not Thumb or Thumb-2) support conditional execution of arbitrary instructions
- Note: Carry bit = not-borrow for compares and subtractions

| Suffix   | Flags                                 | Meaning                                                  |
|----------|---------------------------------------|----------------------------------------------------------|
| EQ       | Z = 1                                 | Equal, last flag setting result was zero.                |
| NE       | Z = 0                                 | Not equal, last flag setting result was non-zero.        |
| CS or HS | C = 1                                 | Higher or same, unsigned.                                |
| CC or LO | C = 0                                 | Lower, unsigned.                                         |
| MI       | N = 1                                 | Negative.                                                |
| PL       | N = 0                                 | Positive or zero.                                        |
| VS       | V = 1                                 | Overflow.                                                |
| VC       | $\mathbf{V} = 0$                      | No overflow.                                             |
| HI       | C = 1 and $Z = 0$                     | Higher, unsigned.                                        |
| LS       | C = 0 or $Z = 1$                      | Lower or same, unsigned.                                 |
| GE       | N = V                                 | Greater than or equal, signed.                           |
| LT       | N != V                                | Less than, signed.                                       |
| GT       | Z = 0 and $N = V$                     | Greater than, signed.                                    |
| LE       | $\overline{Z} = 1 \text{ or } N != V$ | Less than or equal, signed.                              |
| AL       | Can have any value                    | Always. This is the default when no suffix is specified. |

# **Changing Program Flow - Subroutines**

# Call

- BL <label> branch with link
  - Call subroutine at <label>
  - PC-relative, range limited to PC+/-I6MB
  - Save return address in LR
- BLX <Rd> branch with link and exchange
  - Call subroutine at address in register Rd (exchange Rd with PC)
  - Supports full 4GB address range
  - LSB of target address must be set to 1 to ensure continued execution in Thumb state
  - Save return address in LR

# Return

- BX <Rd> branch and exchange
  - Branch to address specified by <Rd>
    - LSB of target address must be set to 1 to ensure continued execution in Thumb state
    - Supports full 4 GB address space
  - BX LR Return from subroutine
- POP {PC}

# **Special Register Instructions**

- Move to Register from Special Register
  - MSR <Rd>, <spec\_reg>
- Move to Special Register from Register
  - MRS <spec\_reg>, <Rd>
- Change Processor State Modify PRIMASK register
  - CPSIE Interrupt enable
  - CPSID Interrupt disable

| Special register | Contents                                                                                 |
|------------------|------------------------------------------------------------------------------------------|
| APSR             | The flags from previous instructions.                                                    |
| IAPSR            | A composite of IPSR and APSR.                                                            |
| EAPSR            | A composite of EPSR and APSR.                                                            |
| XPSR             | A composite of all three PSR registers.                                                  |
| IPSR             | The Interrupt status register.                                                           |
| EPSR             | The execution status register.b                                                          |
| IEPSR            | A composite of IPSR and EPSR.                                                            |
| MSP              | The Main Stack pointer.                                                                  |
| PSP              | The Process Stack pointer.                                                               |
| PRIMASK          | Register to mask out configurable exceptions.c                                           |
| CONTROL          | The CONTROL register, see <i>The special-purpose</i><br>CONTROL register on page B1-215. |
|                  |                                                                                          |

## Other

- No Operation does nothing!
  - NOP
- Breakpoint causes hard fault or debug halt used to implement software breakpoints
  - BKPT #<imm8>
- Wait for interrupt Pause program, enter low-power state until a WFI wake-up event occurs (e.g. an interrupt)
  - WFI
- Supervisor call generates SVC exception (#11), same as software interrupt
  - SVC #<imm>