Overview

A collection of functions that approximate math operations using floating-point hacks. These exploit the binary representation of a floating-point number, combined with logarithmic identities, to avoid costly operations.
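As a rough illustration of the idea, the sketch below approximates a base-2 logarithm directly from the bit pattern of a float. It is not the repository's `fastLog2`; the name `approx_log2_sketch` is hypothetical, and the code assumes a 32-bit IEEE 754 `float` and a positive, normalized input.

```c
#include <stdint.h>

/* Hypothetical helper for illustration; assumes 32-bit IEEE 754 float. */
static float approx_log2_sketch(float x)
{
    union { float f; uint32_t u; } v = { .f = x };

    /* For a positive normal x = 2^e * (1 + m), the bit pattern read as an
       integer equals (e + 127 + m) * 2^23. Since log2(1 + m) is roughly m
       for m in [0, 1), log2(x) is roughly bits / 2^23 - 127. */
    return (float)v.u * (1.0f / 8388608.0f) - 127.0f;
}
```

Without any correction term, this is exact at powers of two and off by less than 0.09 elsewhere; practical versions add a tuned constant or a small polynomial to reduce the error.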

See Deriving formulas.pdf for an explanation of why these work.

Note: Not all approximations are much faster than their equivalents in other math libraries.
Most were at least a bit faster than Paul Mineiro's (which use very similar approximations) when I tested them on my machine (Ryzen 5 5500). They may be more useful in embedded or RISC environments.

See: Performance and accuracy

Based on the famous “Fast inverse square root” algorithm.
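For reference, the widely circulated form of that algorithm looks roughly like the sketch below. This is the well-known public version, not necessarily what `fastInvSqrRoot` does; the function name is hypothetical, and a 32-bit IEEE 754 `float` with a positive, normalized input is assumed.

```c
#include <stdint.h>

/* Hypothetical name; the classic "fast inverse square root" shape. */
static float inv_sqrt_sketch(float x)
{
    union { float f; uint32_t u; } v = { .f = x };

    /* Halving and negating the bit pattern approximates -0.5 * log2(x);
       the magic constant restores the exponent bias and tunes the error. */
    v.u = 0x5F3759DFu - (v.u >> 1);

    /* One Newton-Raphson step refines the initial guess. */
    v.f *= 1.5f - 0.5f * x * v.f * v.f;
    return v.f;
}
```

With the single refinement step, the relative error stays below about 0.2% for normalized inputs.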

Credit

I used Paul Mineiro's float approximation library in the fastPowAltAlt function as well as for comparison. All of his work is in the fastPaulMineiro.h file.

I also included Ger Hobbelt's fork there because his exp2 approximation is a bit faster.

Included operations

Additional

I included some math operations from SSE2 and the x87 FPU (using GCC inline asm), mostly for comparison.

Additionally, there are a handful of functions with vectorized or double variants. These are mostly included as examples. All functions can be easily ported in this way, as they don't use lookup tables, loops, or branches.

Portability

Floating-point format

All methods used only work for IEEE 754 binary types, or any format substantially similar. These approximations assume a normalized input/output, so within the sub-normal range they become less accurate the smaller the input.

Operations that are not defined for negative inputs will not return a NaN, so care should be taken to ensure valid inputs.
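If NaN behaviour matters to the caller, one option is a thin wrapper that checks the sign first. The sketch below is only an assumption about how that could look: `sqrt_sketch` is a hypothetical stand-in for a bit-level approximation (not the repository's `fastSqrRoot`), and `checked_sqrt_sketch` is a hypothetical wrapper. A 32-bit IEEE 754 `float` is assumed.

```c
#include <math.h>
#include <stdint.h>

/* Hypothetical bit-level square-root approximation, for illustration only. */
static float sqrt_sketch(float x)
{
    union { float f; uint32_t u; } v = { .f = x };
    v.u = (v.u >> 1) + 0x1FC00000u;  /* halve the exponent, restore the bias */
    return v.f;
}

/* Hypothetical wrapper that restores the NaN result of sqrtf for x < 0. */
static float checked_sqrt_sketch(float x)
{
    return (x < 0.0f) ? NAN : sqrt_sketch(x);
}
```

The extra comparison costs a branch or a select, so it is probably best kept out of hot loops where the inputs are already known to be valid.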

Other IEEE floats

One of the goals of this project was ensuring portability across differently sized floating-point types. Thus, I used the header file intsizedfloat.h so that, in the rare case that the compiler uses a 64-bit float, the integer used to hold the float when type punning will match in size. For the vast majority of compilers, a uint32_t will work without issue.
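One way such a header could pick a matching integer type is sketched below. This is not the contents of intsizedfloat.h; the type name `float_uint_sketch` and the use of `FLT_MANT_DIG` to detect the format are assumptions made for illustration, and the static assertion requires C11.

```c
#include <float.h>
#include <stdint.h>

/* Hypothetical selection of an unsigned integer as wide as float. */
#if FLT_MANT_DIG == 24
typedef uint32_t float_uint_sketch;   /* IEEE 754 binary32 */
#elif FLT_MANT_DIG == 53
typedef uint64_t float_uint_sketch;   /* float is binary64 on this target */
#else
#error "Unsupported float format for this sketch"
#endif

/* Fail at compile time if the sizes do not line up. */
_Static_assert(sizeof(float) == sizeof(float_uint_sketch),
               "integer type must match the size of float");
```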

Other floating-point types (double, quadruple, and half) should work so long as the macro constants are replaced with their associated values. fastInvSqrRoot_DBL shows an example for C's double.
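To show what swapping the constants involves, here is a sketch of the inverse square root for a 64-bit `double`. The magic constant is the commonly cited one for binary64 and may differ from the value used in `fastInvSqrRoot_DBL`; the function name is hypothetical, and a positive, normalized input is assumed.

```c
#include <stdint.h>

/* Hypothetical double-precision variant; assumes IEEE 754 binary64. */
static double inv_sqrt_double_sketch(double x)
{
    union { double d; uint64_t u; } v = { .d = x };

    /* Same shift-and-subtract trick, with a 64-bit integer and a
       constant scaled for the 11-bit exponent and 52-bit mantissa. */
    v.u = 0x5FE6EB50C7B537A9ull - (v.u >> 1);

    /* One Newton-Raphson step. */
    v.d *= 1.5 - 0.5 * x * v.d * v.d;
    return v.d;
}
```

Only the integer width and the constant change; the structure of the computation is the same as in the single-precision case.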

The 80-bit intermediate float in the x87 won't work directly as there is no easy method of type punning.

Note on type punning

In order to manipulate its binary representation, we reinterpret a float as an unsigned integer using a union. According to the C standard, this is defined behaviour so long as both types are the same size.

However, in C++ this is undefined behaviour. Instead, use C++20’s std::bit_cast or std::memcpy. On modern C/C++ compilers, memcpy() is often implemented as a compiler intrinsic, so a 32-bit or 64-bit copy typically compiles to a plain load or register move rather than a function call.
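A memcpy-based pair of helpers might look like the sketch below, which is valid in C and compiles as C++ too if `_Static_assert` is replaced with `static_assert`. The helper names are hypothetical, and a 32-bit `float` is assumed and checked at compile time (C11).

```c
#include <stdint.h>
#include <string.h>

_Static_assert(sizeof(float) == sizeof(uint32_t),
               "this sketch assumes a 32-bit float");

/* Hypothetical helper: copy the float's object representation out. */
static uint32_t float_to_bits(float f)
{
    uint32_t u;
    memcpy(&u, &f, sizeof u);
    return u;
}

/* Hypothetical helper: build a float from a raw bit pattern. */
static float bits_to_float(uint32_t u)
{
    float f;
    memcpy(&f, &u, sizeof f);
    return f;
}
```

For example, on an IEEE 754 target `float_to_bits(1.0f)` yields `0x3F800000`.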

If you port this to another language, be careful to ensure your method of type punning is well defined.

| Operation | Fast approximation | Standard equivalent |
| --- | --- | --- |
| $\log_2{}$ | `fastLog2`, `fastLog2Bits`, `fastLog2Alt` | `log2f` |
| $\ln{}$ | `fastLog` | `logf` |
| $\sqrt{n}$ | `fastSqrRoot` | `sqrtf` |
| $\frac{1}{\sqrt{n}}$ | `fastInvSqrRoot`, `fastInvSqrRoot_DBL` | `1.0f / sqrtf` |
| General power | `fastPow`, `fastPowAlt`, `fastPowAltAlt` | `powf` |
| $2^n$ | `fastExp2`, `fastExp2_alt`, `fastExp2_bits` | `exp2f` |
| $e^n$ | `fastExp` | `expf` |
| 1/n-th root | `fastInvRootApprox` | `powf(n, 1.0f / root)` |
| $\frac{1}{n}$ | `fastReciprocal`, `reciprocalSSE` | `1.0f / n` |
| $\ln{(\Gamma{(n)})}$ | `fastLogGamma` | `lgammaf` |
| Multiplication | `fastMultiply` | `a * b` |
| Division | `fastDivision` | `a / b` |

WalterKruger/Fast-floating-point-math-operations
