Chapter 18: Assembler Directives
Let's go over Assembler Directives.
Directives are various instructions (these are **NOT** PowerPC instructions) that tell the Assembler about any symbols, macros, statements, etc that are present in the Source File and how to apply these to help the Assembling process.
When your Source File becomes quite large/complex, it will need Directives. You may have a chunk of code that is present multiple times in your source. Why write it out multiple times when you can just write it out once and have it applied to multiple locations? Not only that, are you running into situations of having to memorize a specific numerical value for something that is commonly known by name? Directives allow you to use Names to represent any numerical value.
Setting Symbols:
You can direct specific names to any numerical value you want. This is done via the SET directive. Anything that uses the SET directive is known as a Symbol.
Format~
.set name, expression
name = The name (label) you want to give the symbol
expression = Value/Number, but this can also be a name (more on names within names soon)
Let's say you want to set the name True for the numerical value of 1. You would write the following line anywhere in your source file~
.set True, 1
You can write it out in Hexadecimal as well
.set True, 0x1
With the name True set to the value of 1, you can now write True in your source wherever you need to value of 1 applied. Like this....
.set True, 1 #This line only needs to present once in your source lwz r3, 0 (r30) cmpwi r3, True #Use 'True' in place of numerical value 1 beq- end addi r4, r4, 4 stw r10, 0x8 (r3) end:
Some more Symbol examples~
.set Offset, 0x1340
.set Memory_Address, 0x8045ECC4
.set Error, -1
---
Let's go over more about Symbols for Memory Addresses. Having Symbols set for commonly used Memory Addresses can come in handy. Like so...
.set Memory_Address, 0x8045ECC4
Therefore, if you need to set Memory_Address into a register (let's say r12), you can do so something like this...
.set Memory_Address, 0x8045ECC4 lis r12, Memory_Address@h ori r12, r12, Memory_Address@l
Let's say we use that same address value but instead we have it called Loading_Spot, and we need to load a word value from the address. We will use r5 for the example. You would write it out like so..
.set Loading_Spot, 0x8045ECC4 lis r5, Loading_Spot@ha lwz r5, Loading_Spot@l (r5)
What is "sign-extended"?
For halfwords this means the value of bit 16 is copied to all bits of bits 0 thru 15.
Examples:
0x71A0 sign extended is still 0x71A0
0xCCCF sign extended is 0xFFFFCCCF
In other terms, any halfword value greater than 0x8000 will be prepended with 0xFFFF.
@ha stands for High Algebraic. You need to use @ha if you are ever storing/loading to/from an Address. This is just in case the lower 16 bits of your store/load Address exceed 0x7FFF. Even if the use of @ha isn't necessary for a particular store/load, you should always be applying it as a good force of habit.
Using Math, Binary, and Logical Operations within Symbols:
Instead of writing just plain jane values for your Symbols. You can incorporate basic calculations. Like this..
.set add_values, 0x1000 + 0xB00
.set subtract_values, 100 - 50
.set divide_values, 33 / 11
.set multiply_values, 0xA0 * 0x14
You can beyond basic math and utilize basic binary logical operations
.set OR_values, 0x1008 | 0x400
.set AND_values, 0x800 & 0x000
.set XOR_values, 0xF8001CFD ^ 0x01000AAB
You can bit shifting like this..
.set Shifted_left_value, 0x8 << 23
.set Shifted_right_value, 0xCCC >> 3
Names within Names:
You can use names within names for Symbols. Like this...
Example~
.set bowser, 0x1E00
.set auto_drift, 1
.set character_config, bowser | auto_drift
Another example~
.set table_address, 0x81500078
.set index_one, 4
.set index_two, index_one*2
.set index_three, index_two*2
.set table_special_data_address, table_address + index_three
As you can see, you can establish symbols that can be used directly from calculations of other symbols.
Psuedo-Ops:
There will be times when you will need to write out numerical values or text (string values) in a Source File without the use of PPC Instructions. These values will later be referenced by load instructions in your Source File. We call this "Pseudo-Ops".
Here's a list of all Pseudo-Ops~
Examples:
.long 0x80001500
.float 3.5
.asciz "/home/user/example"
There may be times where you will need your Psuedo-Op data to be aligned. All this means is that a certain amount of zero bytes are appended to the end so the block of data is divisible by a desired amount.
This is done via ".align X".
.align X Guide:
This would be needed if you have some data that comes before actual PPC instructions. All PPC instructions have to reside at an address that is divisible by 4. Example...
.asciz "Hi there!" .align 2 some_label: add r3, r4, r5
Macros:
While symbols can be handy, they are very limited. If you want your PowerPC instruction(s) to be tied to a particular Name, you will need to use what are called Macros.
Format of a Macro~
.macro Name Arg**
contents of Macro located HERE
.endm
**Arg is optional and will be discussed soon.
Let's say we have the following instructions...
lis r12, 0x8000 lwz r12, 0x1500 (r12) lwz r12, 0 (r12)
...And these instructions occur multiple times in your source. Well instead of 'manually' writing these batch of instructions for every single occurrence, you can write out the Macro for it just once, with a name tied to it, and then literally just write out the name.
Like this..
.macro Load_Pointer lis r12, 0x8000 lwz r12, 0x1500 (r12) lwz r12, 0 (r12) add r3, r4, r5 Load_Pointer #Insert batch of instructions (aka Macro) here or r3, r3, r0
IMPORTANT: You must have the macro established (via .macro) before you use the macro itself in your Source. This does *not* apply to symbols unless the Macro contains a symbol(s). If so, then the establishment of that symbol *must* be above the establishment of the Macro.
Macros w/ Args:
Now we will discuss the optional Arg (Argument) in Macros. With the earlier macro example, you were 'forced' to use r12. If you wanted to use a different register and keep the original macro as well, you would have to write out a whole new 2nd macro. Well, with Arg options, you wouldn't need to resort to that.
A Macro can be given arg(s) with custom names.
Look at the following example..
.macro Load_Pointer register lis \register, 0x8000 lwz \register, 0x1500 (\register) lwz \register, 0 (\register) .endm
As you can see, we have have the Macro equipped with 'register' as an Arg. The \register occurrences within the macro is how to use the arg.
When you use the Macro in your Source, you will need to supply the Arg. We named our Arg as register so we will know to supply a register.
add r3, r4, r5 Load_Pointer r12 #Use r12 as Arg for Macro or r3, r3, r0
If we wanted to use something other than r12, it's super simple to do...
add r3, r4, r5 Load_Pointer r9 #Use r9 as Arg for Macro or r3, r3, r0
Here's an example Macro using two args
.macro Load_Pointer register, address #NOTICE the comma separating the two Args. This is recommended but not needed lis \register, \address@ha lwz \register, \address@l (\register) lwz \register, 0 (\register) .endm add r3, r4, r5 Load_Pointer r12, 0x80001500 #NOTICE the comma separating r12 & 0x80001500 or r3, r3, r0
...and that's it for Assembler Directives. There are quite a bit more Assembler Directives. But this gives you enough insight on the most commonly used ones.