Before this commit, fixed point multiplication was implemented using an assembly routine in a separate translation unit. This commit implements this routine directly using inline assembly. By doing so, these operations can be inlined when called from ARM code. Fixed point division is implemented as well, along with various documentation and style improvements.