gil

Get In Line - A fast single-producer single-consumer queue with sync and async support.

Inspired by the ProducerConsumerQueue in Facebook's folly.

Features

Lock-free: Uses atomic operations for synchronization
Single-producer, single-consumer: Optimized for this specific use case
Thread-safe: Producer and consumer can run on different threads
Blocking and non-blocking operations: Both sync and async APIs
Batch operations: Send and receive multiple items efficiently
High performance (probably): Competitive with Facebook's folly implementation
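
To make the "lock-free" and "single-producer, single-consumer" points concrete, here is a minimal sketch of a bounded SPSC ring buffer built on atomics, using only the standard library. It is not gil's actual implementation: the `Ring` type, its methods, and `spsc_demo` are hypothetical names chosen for illustration. The producer is the only writer of `tail` and the consumer is the only writer of `head`, which is why no locks or compare-and-swap loops are needed.

```rust
use std::cell::UnsafeCell;
use std::mem::MaybeUninit;
use std::sync::atomic::{AtomicUsize, Ordering};
use std::sync::Arc;
use std::thread;

// Hypothetical illustration type; not gil's internals.
struct Ring<T> {
    slots: Box<[UnsafeCell<MaybeUninit<T>>]>,
    head: AtomicUsize, // next slot to read; written only by the consumer
    tail: AtomicUsize, // next slot to write; written only by the producer
}

// Sharing is sound only if callers respect the one-producer,
// one-consumer contract documented on the methods below.
unsafe impl<T: Send> Sync for Ring<T> {}

impl<T> Ring<T> {
    fn new(capacity: usize) -> Self {
        // One slot always stays empty so that head == tail means "empty".
        let slots = (0..capacity + 1)
            .map(|_| UnsafeCell::new(MaybeUninit::uninit()))
            .collect();
        Ring { slots, head: AtomicUsize::new(0), tail: AtomicUsize::new(0) }
    }

    /// Safety: at most one thread may call this at any time (the producer).
    unsafe fn try_push(&self, value: T) -> Result<(), T> {
        let tail = self.tail.load(Ordering::Relaxed);
        let next = (tail + 1) % self.slots.len();
        if next == self.head.load(Ordering::Acquire) {
            return Err(value); // queue is full; hand the value back
        }
        unsafe { (*self.slots[tail].get()).write(value) };
        // Release publishes the slot contents to the consumer.
        self.tail.store(next, Ordering::Release);
        Ok(())
    }

    /// Safety: at most one thread may call this at any time (the consumer).
    unsafe fn try_pop(&self) -> Option<T> {
        let head = self.head.load(Ordering::Relaxed);
        if head == self.tail.load(Ordering::Acquire) {
            return None; // queue is empty
        }
        let value = unsafe { (*self.slots[head].get()).assume_init_read() };
        // Release tells the producer the slot can be reused.
        self.head.store((head + 1) % self.slots.len(), Ordering::Release);
        Some(value)
    }
}

impl<T> Drop for Ring<T> {
    fn drop(&mut self) {
        // Drop any items still queued; &mut self guarantees exclusivity.
        while unsafe { self.try_pop() }.is_some() {}
    }
}

// Sends 0..count from a producer thread and sums them on the calling thread.
fn spsc_demo(count: u64) -> u64 {
    let ring = Arc::new(Ring::new(64));
    let producer = {
        let ring = Arc::clone(&ring);
        thread::spawn(move || {
            for i in 0..count {
                let mut item = i;
                // Spin until the consumer frees a slot.
                while let Err(back) = unsafe { ring.try_push(item) } {
                    item = back;
                    std::hint::spin_loop();
                }
            }
        })
    };
    let (mut sum, mut received) = (0, 0);
    while received < count {
        match unsafe { ring.try_pop() } {
            Some(v) => { sum += v; received += 1; }
            None => std::hint::spin_loop(),
        }
    }
    producer.join().unwrap();
    sum
}
```

Calling `spsc_demo(1000)` from a `main` function exercises both sides across two threads. A real crate would typically hide the `unsafe` contract behind separate sender and receiver handles so that the compiler enforces the single-producer, single-consumer rule.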

Safety

The code was verified using loom and miri.

Usage

The producer and consumer can run on different threads, but there can only be one producer and only one consumer. The producer (or consumer) can be moved between threads, but cannot be shared between threads. The queue has a fixed capacity that must be specified when creating the channel.

The consumer blocks until a value is available on the queue; use Receiver<T>::try_recv for a non-blocking version. Similarly, the producer blocks until a slot is free on the queue; use Sender<T>::try_send for a non-blocking version.
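
The difference between the blocking and non-blocking calls can be sketched with the standard library's bounded channel as a stand-in. This does not use gil: the README does not show the return types of gil's `try_send` and `try_recv`, so the error types below (`TrySendError`, `TryRecvError`) belong to `std::sync::mpsc`, and `non_blocking_demo` is a hypothetical helper name.

```rust
use std::sync::mpsc::{sync_channel, TryRecvError, TrySendError};

// Stand-in demo on std's bounded channel; gil's own signatures may differ.
fn non_blocking_demo() -> (bool, bool, Vec<u32>) {
    // Fixed capacity of 2, chosen when the channel is created.
    let (tx, rx) = sync_channel::<u32>(2);

    // Nothing is queued yet: a non-blocking receive reports "empty"
    // immediately instead of waiting for the producer.
    let saw_empty = matches!(rx.try_recv(), Err(TryRecvError::Empty));

    // These two sends fit in the buffer, so they return without waiting.
    tx.send(1).unwrap();
    tx.send(2).unwrap();

    // Both slots are taken: a non-blocking send hands the value back
    // instead of waiting for the consumer to free a slot.
    let saw_full = matches!(tx.try_send(3), Err(TrySendError::Full(3)));

    // Blocking receives return right away here because items are queued.
    let drained = vec![rx.recv().unwrap(), rx.recv().unwrap()];
    (saw_empty, saw_full, drained)
}
```

The non-blocking variants are useful when the calling thread has other work to do, for example polling several queues in one loop, while the blocking variants keep simple pipelines short.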

Example (sync version)

use std::thread;
use gil::channel;

fn main() {
    const COUNT: u32 = 100_000_000;

    let (mut tx, mut rx) = channel::<u32>(COUNT as usize);

    let handle = thread::spawn(move || {
        for i in 0..COUNT {
            // block until send completes
            let _ = tx.send(i);
        }
    });

    // joining before receiving only works here because the queue
    // can hold all COUNT items
    let _ = handle.join();

    for i in 0..COUNT {
        // block until recv completes
        let r = rx.recv();
        assert_eq!(r, i);
    }
}

Example (async version)

use gil::channel;

#[tokio::main]
async fn main() {
    const COUNT: u32 = 100_000_000;

    let (mut tx, mut rx) = channel::<u32>(COUNT as usize);

    let handle = tokio::spawn(async move {
        for i in 0..COUNT {
            // wait (without blocking the thread) until send completes
            let _ = tx.send_async(i).await;
        }
    });

    let _ = handle.await;

    for i in 0..COUNT {
        // wait (without blocking the thread) until recv completes
        let r = rx.recv_async().await;
        assert_eq!(r, i);
    }
}

Performance

Probably good enough. In my testing on an M3 Mac, I get about 10gb/s of throughput with 129-byte objects when batching sends and receives. I aim to improve this later.

Possible improvements

more docs
use zero-copy sends that push data directly to the L3 cache (DC CVAC on Apple Silicon, CLDEMOTE on Intel x86).
investigate a queue specialized for uints plus an arena allocator, for both higher throughput and lower latency at the same time.

License

MIT License - see LICENSE file for details.
