This is a tracking issue for expanding upon the machine code analysis part of the perf guide. ## To do - [ ] how to emit assembly from Rust code (e.g. `cargo-asm`) - [ ] how to emit IACA / `llvm-mca` markers with inline assembly - [ ] give an example on how to intepret results, in the context of one of the example benchmarks ## References - [IACA homepage](https://software.intel.com/en-us/articles/intel-architecture-code-analyzer) - "What is IACA and how to use it?" on [StackOverflow](https://stackoverflow.com/questions/26021337) - TheIronBorn's usage of [IACA with `movmsk.c`](https://gist.github.com/TheIronBorn/faf896e7b86bf2353e0b915ed5c596e7) - LLVM [MCA docs](https://llvm.org/docs/CommandGuide/llvm-mca.html)