`crc-fast`

World's fastest generic CRC calculator for all known CRC-16, CRC-32, and CRC-64 variants, as well as bring-your-own custom parameters, using SIMD intrinsics, which can exceed 100GiB/s on modern systems.

Supports acceleration on aarch64, x86_64, and x86 architectures, plus has a safe non-accelerated table-based software fallback for others.

The crc crate is ~0.5GiB/s by default, so this is up to >220X faster.

This is unique, not just because of the performance, but also because I couldn't find a single generic SIMD-accelerated implementation (in any language) which worked for all known variants, using the Rocksoft model, especially the "non-reflected" variants.

So I wrote one. :)

Other languages

Supplies a C/C++ compatible library for use with other non-Rust languages.

Implementations

AWS SDK for Rust via the aws-smithy-checksums crate.
crc-fast-php-ext PHP extension using this library.

Changes

See CHANGELOG.

Build & Install

Library

cargo build --release will obviously build the Rust library, including the C/C++ compatible dynamic and static libraries.

CLI tools

There are some command-line tools available:

checksum calculates CRC checksums from the supplied string or file
get-custom-params generates the custom CRC parameters for the supplied Rocksoft model values
arch-check checks the current architecture's hardware acceleration features (primarily for debugging)

To build them, enable the cli feature: cargo build --features cli --release.

Everything

To build the libraries and the CLI tools, use the --all-features flag: cargo build --all-features --release.

A very basic Makefile is supplied which supports make install to install the libraries, header file, and CLI binaries to the local system. Specifying the DESTDIR environment variable will allow you to customize the install location.

DESTDIR=/my/custom/path make install

Features

The library supports various feature flags for different environments:

Default Features

std - Standard library support, includes alloc
ffi - C/C++ FFI bindings for shared library (will become optional in v2.0)
panic-handler - Provides panic handler for no_std environments (disable when building binaries)

Optional Features

alloc - Heap allocation support (enables Digest trait, custom CRC params, checksum combining)
cache - Caches generated constants for custom CRC parameters (requires alloc)
cli - Enables command-line tools (checksum, arch-check, get-custom-params)

Building for `no_std`

For embedded targets without standard library:

# Minimal no_std (core CRC only, no heap)
cargo build --target thumbv7em-none-eabihf --no-default-features --lib

# With heap allocation (enables Digest, custom params)
cargo build --target thumbv7em-none-eabihf --no-default-features --features alloc --lib

# With caching (requires alloc)
cargo build --target thumbv7em-none-eabihf --no-default-features --features cache --lib

Tested on ARM Cortex-M (thumbv7em-none-eabihf, thumbv8m.main-none-eabihf) and RISC-V ( riscv32imac-unknown-none-elf).

Building for `WASM`

For WebAssembly targets:

# Minimal WASM
cargo build --target wasm32-unknown-unknown --no-default-features --lib

# With heap allocation (typical use case)
cargo build --target wasm32-unknown-unknown --no-default-features --features alloc --lib

# Using wasm-pack for browser
wasm-pack build --target web --no-default-features --features alloc

Tested on wasm32-unknown-unknown, wasm32-wasip1, and wasm32-wasip2 targets.

Usage

Add crc-fast = "1" to your Cargo.toml dependencies, which will enable every available optimization for the stable toolchain.

Fast helper functions

For the most common and popular CRC variants, there are specialized one-shot functions to make adoption easier and performance faster, particularly for smaller input sizes, since it reduces some of the overhead of the generic checksum path.

CRC-32/ISCSI

Also commonly known as crc32c in many, but not all, implementations.

use crc_fast::crc32_iscsi;

let checksum = crc32_iscsi(b"123456789");

assert_eq!(checksum, 0xe3069283);

CRC-32/ISO-HDLC

Also commonly known as crc32 in many, but not all, implementations.

use crc_fast::crc32_iso_hdlc;

let checksum = crc32_iso_hdlc(b"123456789");

assert_eq!(checksum, 0xcbf43926);

CRC-64/NVME

use crc_fast::crc64_nvme;

let checksum = crc64_nvme(b"123456789");

assert_eq!(checksum, 0xae8b14860a799888);

Digest

Implements the digest::DynDigest trait for easier integration with existing Rust code.

Creates a Digest which can be updated over time, for stream processing, intermittent workloads, etc, enabling finalizing the checksum once processing is complete.

use crc_fast::{Digest, CrcAlgorithm::Crc32IsoHdlc};

let mut digest = Digest::new(Crc32IsoHdlc);
digest.update(b"1234");
digest.update(b"56789");
let checksum = digest.finalize();

assert_eq!(checksum, 0xcbf43926);

Digest Write

Implements the std::io::Write trait for easier integration with existing Rust code.

use std::env;
use std::fs::File;
use crc_fast::{Digest, CrcAlgorithm::Crc32IsoHdlc};

// for example/test purposes only, use your own file path
let binding = env::current_dir().expect("missing working dir").join("crc-check.txt");
let file_on_disk = binding.to_str().unwrap();

// actual usage
let mut digest = Digest::new(Crc32IsoHdlc);
let mut file = File::open(file_on_disk).unwrap();
std::io::copy( & mut file, & mut digest).unwrap();
let checksum = digest.finalize();

assert_eq!(checksum, 0xcbf43926);

checksum

Checksums a string.

 use crc_fast::{checksum, CrcAlgorithm::Crc32IsoHdlc};

let checksum = checksum(Crc32IsoHdlc, b"123456789");

assert_eq!(checksum, 0xcbf43926);

checksum_combine

Combines checksums from two different sources, which can be useful for distributed or multithreaded workloads, etc.

 use crc_fast::{checksum, checksum_combine, CrcAlgorithm::Crc32IsoHdlc};

let checksum_1 = checksum(Crc32IsoHdlc, b"1234");
let checksum_2 = checksum(Crc32IsoHdlc, b"56789");
let checksum = checksum_combine(Crc32IsoHdlc, checksum_1, checksum_2, 5);

assert_eq!(checksum, 0xcbf43926);

checksum_file

Checksums a file, which will chunk through the file optimally, limiting RAM usage and maximizing throughput. Chunk size is optional.

 use crc_fast::{checksum_file, CrcAlgorithm::Crc32IsoHdlc};

// for example/test purposes only, use your own file path
let binding = env::current_dir().expect("missing working dir").join("crc-check.txt");
let file_on_disk = binding.to_str().unwrap();

let checksum = checksum_file(Crc32IsoHdlc, file_on_disk, None);

assert_eq!(checksum.unwrap(), 0xcbf43926);

Custom CRC Parameters

For cases where you need to use CRC variants not included in the predefined algorithms, you can define custom CRC parameters and use the *_with_params functions.

Digest with custom parameters

Creates a Digest with custom CRC parameters for stream processing.

use crc_fast::{Digest, CrcParams};

// Define custom CRC-32 parameters (equivalent to CRC-32/ISO-HDLC)
let custom_params = CrcParams::new(
    "CRC-32/CUSTOM",
    32,
    0x04c11db7,
    0xffffffff,
    true,
    0xffffffff,
    0xcbf43926,
);

let mut digest = Digest::new_with_params(custom_params);
digest.update(b"123456789");
let checksum = digest.finalize();

assert_eq!(checksum, 0xcbf43926);

checksum_with_params

Checksums data using custom CRC parameters.

use crc_fast::{checksum_with_params, CrcParams};

// Define custom CRC-32 parameters (equivalent to CRC-32/ISO-HDLC)
let custom_params = CrcParams::new(
    "CRC-32/CUSTOM",
    32,
    0x04c11db7,
    0xffffffff,
    true,
    0xffffffff,
    0xcbf43926,
);

let checksum = checksum_with_params(custom_params, b"123456789");

assert_eq!(checksum, 0xcbf43926);

checksum_combine_with_params

Combines checksums from two different sources using custom CRC parameters.

use crc_fast::{checksum_with_params, checksum_combine_with_params, CrcParams};

// Define custom CRC-32 parameters (equivalent to CRC-32/ISO-HDLC)
let custom_params = CrcParams::new(
    "CRC-32/CUSTOM",
    32,
    0x04c11db7,
    0xffffffff,
    true,
    0xffffffff,
    0xcbf43926,
);

let checksum_1 = checksum_with_params(custom_params, b"1234");
let checksum_2 = checksum_with_params(custom_params, b"56789");
let checksum = checksum_combine_with_params(custom_params, checksum_1, checksum_2, 5);

assert_eq!(checksum, 0xcbf43926);

checksum_file_with_params

Checksums a file using custom CRC parameters, chunking through the file optimally.

use std::env;
use crc_fast::{checksum_file_with_params, CrcParams};

// for example/test purposes only, use your own file path
let binding = env::current_dir().expect("missing working dir").join("crc-check.txt");
let file_on_disk = binding.to_str().unwrap();

// Define custom CRC-32 parameters (equivalent to CRC-32/ISO-HDLC)
let custom_params = CrcParams::new(
    "CRC-32/CUSTOM",
    32,
    0x04c11db7,
    0xffffffff,
    true,
    0xffffffff,
    0xcbf43926,
);

let checksum = checksum_file_with_params(custom_params, file_on_disk, None);

assert_eq!(checksum.unwrap(), 0xcbf43926);

C/C++ compatible library

cargo build will produce a shared library target (.so on Linux, .dll on Windows, .dylib on macOS, etc) and an auto-generated libcrc_fast.h header file for use in non-Rust projects, such as through FFI. It will also produce a static library target (.a on Linux and macOS, .lib on Windows, etc) for projects which prefer statically linking.

There is a crc-fast PHP extension using it, for example.

Background

This implementation is based on Intel's Fast CRC Computation for Generic Polynomials Using PCLMULQDQ Instruction white paper, though it folds 8-at-a-time, like other modern implementations, rather than the 4-at-a-time as in Intel's paper.

This library works on aarch64, x86_64, and x86 architectures, and is hardware-accelerated and optimized for each architecture.

Inspired by crc32fast, crc64fast, and crc64fast-nvme, each of which only accelerates a single, different CRC variant, and all of them were "reflected" variants.

In contrast, this library accelerates every known variant (and should accelerate any future variants without changes), including all the "non-reflected" variants.

Important CRC variants

While there are many variants, three stand out as being the most important and widely used (all of which are "reflected"):

CRC-32/ISCSI

Many, but not all, implementations simply call this crc32c and it's probably the 2nd most popular and widely used, after CRC-32/ISO-HDLC. It's used in iSCSI, ext4, btrfs, etc.

Both x86_64 and aarch64 have native hardware support for this CRC variant, so we can use fusion in many cases to accelerate it further by fusing SIMD CLMUL instructions with the native CRC instructions.

CRC-32/ISO-HDLC

Many, but not all, implementations simply call this crc32 and it may be the most popular and widely used. It's used in Ethernet, PKZIP, xz, etc.

Only aarch64 has native hardware support for this CRC variant, so we can use fusion on that platform, but not x86_64.

CRC-64/NVME

CRC-64/NVME comes from the NVM Express® NVM Command Set Specification (Revision 1.0d, December 2023), is AWS S3's recommended checksum option (as CRC64-NVME), and has also been implemented in the Linux kernel (where it's been called CRC-64/Rocksoft in the past).

Note that the Check value in the NVMe spec uses incorrect endianness (see Section 5.2.1.3.4, Figure 120, page 83) but all known public & private implementations agree on the correct value, which this library produces.

Acceleration targets

This library has baseline support for accelerating all known CRC-16, CRC-32, and CRC-64 variants on aarch64, x86_64, and x86 internally in pure Rust.

It uses the best available acceleration method for the detected CPU features at runtime, including:

aarch64:
- neon-pmull-sha3 (preferred, if available)
- neon-pmull
x86_64 and x86:
- avx512-vpclmulqdq (preferred, if available)
- avx512-pclmulqdq
- sse-pclmulqdq

There is a safe table-based software fallback for other architectures, or if no acceleration features are detected.

Checking your platform capabilities

There's an arch-check binary which will explain the selected target architecture.

// test it works on your system (patches welcome!)
cargo test

// examine the chosen acceleration targets
cargo run arch-check

// build for release
cargo build --release

Minimum Supported Rust Version (MSRV)

This crate targets a stable-2 policy (ie, the latest stable Rust version minus 2 minor versions) as the Minimum Supported Rust Version (MSRV).

Bumping the rust-version will be considered a MINOR version bump.

We'll try to support even older versions when possible, but given the high-performance use of SIMD intrinsics and other modern Rust features, this is merely a stated goal. We'll move up to stable-2 ASAP when a sufficiently high performance improvement or necessary feature requires it.

Performance

Modern systems can exceed 100 GiB/s for calculating CRC-32/ISCSI, and nearly 90 GiB/s for CRC-32/ISO-HDLC, CRC-64/NVME, and all other reflected variants. (Forward variants are slower, due to the extra shuffle-masking, but are still extremely fast in this library).

This is a summary of the performance for the most important and popular CRC checksums.

CRC-32/ISCSI (reflected)

AKA crc32c in many, but not all, implementations.

Arch	Brand	CPU	System	Target	1KiB (GiB/s)	1MiB (GiB/s)
x86_64	Intel	Sapphire Rapids	EC2 c7i.metal-24xl	avx512-vpclmulqdq	~61	~111
x86_64	AMD	Genoa	EC2 c7a.metal-48xl	avx512-vpclmulqdq	~26	~54
aarch64	AWS	Graviton4	EC2 c8g.metal-48xl	neon-pmull-sha3	~23	~54
aarch64	AWS	Graviton2	EC2 c6g.metal	neon-pmull	~11	~17
aarch64	Apple	M3 Ultra	Mac Studio (32 core)	neon-pmull-sha3	~60	~99
aarch64	Apple	M4 Max	MacBook Pro 16" (16 core)	neon-pmull-sha3	~56	~94

CRC-32/ISO-HDLC (reflected)

AKA crc32 in many, but not all, implementations.

Arch	Brand	CPU	System	Target	1KiB (GiB/s)	1MiB (GiB/s)
x86_64	Intel	Sapphire Rapids	EC2 c7i.metal-248xl	avx512-vpclmulqdq	~28	~88
x86_64	AMD	Genoa	EC2 c7a.metal-48xl	avx512-vpclmulqdq	~21	~55
aarch64	AWS	Graviton4	EC2 c8g.metal-48xl	neon-pmull-sha3	~23	~54
aarch64	AWS	Graviton2	EC2 c6g.metal	neon-pmull	~11	~17
aarch64	Apple	M3 Ultra	Mac Studio (32 core)	neon-pmull-sha3	~48	~98
aarch64	Apple	M4 Max	MacBook Pro 16" (16 core)	neon-pmull-sha3	~56	~94

CRC-64/NVME (reflected)

AWS S3's recommended checksum option

Arch	Brand	CPU	System	Target	1KiB (GiB/s)	1MiB (GiB/s)
x86_64	Intel	Sapphire Rapids	EC2 c7i.metal-24xl	avx512-vpclmulqdq	~28	~88
x86_64	AMD	Genoa	EC2 c7a.metal-48xl	avx512-vpclmulqdq	~22	~55
aarch64	AWS	Graviton4	EC2 c8g.metal-48xl	neon-pmull-sha3	~28	~41
aarch64	AWS	Graviton2	EC2 c6g.metal	neon-pmull	~11	~16
aarch64	Apple	M3 Ultra	Mac Studio (32 core)	neon-pmull-sha3	~58	~72
aarch64	Apple	M4 Max	MacBook Pro 16" (16 core)	neon-pmull-sha3	~52	~72

CRC-32/BZIP2 (forward)

Arch	Brand	CPU	System	Target	1KiB (GiB/s)	1MiB (GiB/s)
x86_64	Intel	Sapphire Rapids	EC2 c7i.metal-24xl	avx512-vpclmulqdq	~20	~56
x86_64	AMD	Genoa	EC2 c7a.metal-48xl	avx512-vpclmulqdq	~14	~43
aarch64	AWS	Graviton4	EC2 c8g.metal-48xl	neon-pmull-eor3	~18	~40
aarch64	AWS	Graviton2	EC2 c6g.metal	neon-pmull	~9	~14
aarch64	Apple	M3 Ultra	Mac Studio (32 core)	neon-pmull-eor3	~41	~59
aarch64	Apple	M4 Max	MacBook Pro 16" (16 core)	neon-pmull-eor3	~47	~64

CRC-64/ECMA-182 (forward)

Arch	Brand	CPU	System	Target	1KiB (GiB/s)	1MiB (GiB/s)
x86_64	Intel	Sapphire Rapids	EC2 c7i.metal-24xl	avx512-vpclmulqdq	~21	~56
x86_64	AMD	Genoa	EC2 c7a.metal-48xl	avx512-vpclmulqdq	~14	~43
aarch64	AWS	Graviton4	EC2 c8g.metal-48xl	neon-pmull-eor3	~19	~40
aarch64	AWS	Graviton2	EC2 c6g.metal	neon-pmull	~9	~14
aarch64	Apple	M3 Ultra	Mac Studio (32 core)	neon-pmull-eor3	~40	~59
aarch64	Apple	M4 Max	MacBook Pro 16" (16 core)	neon-pmull-eor3	~46	~61

Other CRC widths

There are a lot of other known CRC widths and variants, ranging from CRC-3/GSM to CRC-82/DARC, and everything in between.

Since Awesome only uses CRC-32 or CRC-64 widths in our products, this library began by supporting only those widths, including all known variants plus support for custom Rocksoft parameters.

CRC-16 has since been added, including all known variants plus support for custom parameters as well.

In theory, much of the "heavy lifting" has been done, so it should be possible to add other widths with minimal effort.

PRs welcome!

Memory Safety

Given the heavy use of hardware intrinsics, this crate uses a decent amount of unsafe code.

To help ensure memory safety and stability, this crate is validated using Miri on x86_64 as well as fuzz tested using libFuzzer over millions of iterations.

References

Catalogue of parametrised CRC algorithms
crc32-fast Original CRC-32/ISO-HDLC (crc32) implementation in Rust.
crc64-fast Original CRC-64/XZ implementation in Rust.
crc64fast-nvme Original CRC-64/NVME implementation in Rust.
Fast CRC Computation for Generic Polynomials Using PCLMULQDQ Instruction Intel's paper.
NVM Express® NVM Command Set Specification The NVMe spec, including CRC-64-NVME (with incorrect endian Check value in Section 5.2.1.3.4, Figure 120, page 83).
CRC-64/NVME The CRC-64/NVME quick definition.
A PAINLESS GUIDE TO CRC ERROR DETECTION ALGORITHMS Best description of CRC I've seen to date (and the definition of the Rocksoft model).
Linux implementation Linux implementation of CRC-64/NVME.
MASM/C++ artifacts implementation - Reference MASM/C++ implementation for generating artifacts.
Intel isa-l GH issue #88 - Additional insight into generating artifacts.
StackOverflow PCLMULQDQ CRC32 answer Insightful answer to implementation details for CRC32.
StackOverflow PCLMULQDQ CRC32 question Insightful question & answer to CRC32 implementation details.
AWS S3 announcement about CRC64-NVME support
AWS S3 docs on checking object integrity using CRC64-NVME
Vector Carry-Less Multiplication of Quadwords (VPCLMULQDQ) details
Linux kernel updates by Eric Biggers to use VPCLMULQDQ, etc
Faster CRC32-C on x86
Faster CRC32 on the Apple M1
An alternative exposition of crc32_4k_pclmulqdq
fast-crc32 - implementations of fusion for two CRC-32 variants.

License

cfc-fast is dual-licensed under

Apache 2.0 license (LICENSE-Apache or http://www.apache.org/licenses/LICENSE-2.0)
MIT license (LICENSE-MIT or https://opensource.org/licenses/MIT)

Name		Name	Last commit message	Last commit date
Latest commit History 260 Commits
.github		.github
.kiro		.kiro
benches		benches
fuzz		fuzz
src		src
tests		tests
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
LICENSE-Apache		LICENSE-Apache
LICENSE-MIT		LICENSE-MIT
Makefile		Makefile
README.md		README.md
crc-check.txt		crc-check.txt
libcrc_fast.h		libcrc_fast.h

Folders and files

Latest commit

History

Repository files navigation

crc-fast

Other languages

Implementations

Changes

Build & Install

Library

CLI tools

Everything

Features

Default Features

Optional Features

Building for no_std

Building for WASM

Usage

Fast helper functions

CRC-32/ISCSI

CRC-32/ISO-HDLC

CRC-64/NVME

Digest

Digest Write

checksum

checksum_combine

checksum_file

Custom CRC Parameters

Digest with custom parameters

checksum_with_params

checksum_combine_with_params

checksum_file_with_params

C/C++ compatible library

Background

Important CRC variants

CRC-32/ISCSI

CRC-32/ISO-HDLC

CRC-64/NVME

Acceleration targets

Checking your platform capabilities

Minimum Supported Rust Version (MSRV)

Performance

CRC-32/ISCSI (reflected)

CRC-32/ISO-HDLC (reflected)

CRC-64/NVME (reflected)

CRC-32/BZIP2 (forward)

CRC-64/ECMA-182 (forward)

Other CRC widths

Memory Safety

References

License

About

Topics

Resources

License

Licenses found

Uh oh!

Stars

Watchers

Forks

Releases 18

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

`crc-fast`

Building for `no_std`

Building for `WASM`

Packages