Latest commit: mwiewior, fix: Fixing build (#21), 19a0a7c, Dec 16, 2024 (11 commits)
Name	Last commit message	Last commit date
.github/workflows	fix: Fixing build (#21)	Dec 16, 2024
benchmark	chore: Documentation update (#20)	Dec 16, 2024
docs	chore: Documentation update (#20)	Dec 16, 2024
polars_bio	chore: Documentation update (#20)	Dec 16, 2024
src	chore: Documentation update (#20)	Dec 16, 2024
tests	chore: Documentation update (#20)	Dec 16, 2024
.gitignore	feat: Boilerplate and overlap implementation (#1)	Dec 12, 2024
.pre-commit-config.yaml	chore: Documentation update (#20)	Dec 16, 2024
.readthedocs.yaml	fix: Fixing build (#21)	Dec 16, 2024
Cargo.lock	chore: Documentation update (#20)	Dec 16, 2024
Cargo.toml	chore: Documentation update (#20)	Dec 16, 2024
LICENSE	Initial commit	Nov 26, 2024
Makefile	chore: Documentation update (#20)	Dec 16, 2024
README.md	Nearest algorithm (#18)	Dec 16, 2024
mkdocs.yml	chore: Documentation update (#20)	Dec 16, 2024
poetry.lock	chore: Documentation update (#20)	Dec 16, 2024
pyproject.toml	chore: Documentation update (#20)	Dec 16, 2024
requirements.txt	Init plugin	Nov 26, 2024
rust-toolchain.toml	feat: Boilerplate and overlap implementation (#1)	Dec 12, 2024
rustfmt.toml	Init plugin	Nov 26, 2024

polars_bio

Features

Genomic ranges operations

Features	Bioframe	polars-bio	PyRanges	Pybedtools	PyGenomics	GenomicRanges
overlap	✅	✅	✅	✅	✅	✅
nearest	✅	✅	✅
cluster	✅
merge	✅
complement	✅
select/slice	✅
coverage	✅
expand	✅
sort	✅
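
To make the first row of the table concrete, here is a minimal pure-Python sketch of what an overlap operation computes. It is not the polars-bio implementation or API: the `overlap` function, the tuple layout and the half-open `[start, end)` coordinate convention are all assumptions chosen for illustration.

```python
from typing import List, Tuple

# Hypothetical representation: (chromosome, start, end), half-open [start, end).
Interval = Tuple[str, int, int]


def overlap(left: List[Interval], right: List[Interval]) -> List[Tuple[Interval, Interval]]:
    """Return every (left, right) pair on the same chromosome that overlaps."""
    pairs = []
    for l in left:
        for r in right:
            same_chrom = l[0] == r[0]
            # Two half-open intervals overlap when each starts before the other ends.
            if same_chrom and l[1] < r[2] and r[1] < l[2]:
                pairs.append((l, r))
    return pairs


reads = [("chr1", 100, 200), ("chr1", 500, 600), ("chr2", 100, 200)]
genes = [("chr1", 150, 250), ("chr2", 300, 400)]
result = overlap(reads, genes)
print(result)
```

This naive version compares every pair and is quadratic in the input size; it only shows the semantics of the operation, not how any of the libraries in the table compute it.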

Input/Output

I/O	Bioframe	polars-bio	PyRanges
Pandas DataFrame	✅	✅	✅
Polars DataFrame		✅
Polars LazyFrame		✅
Native readers		✅

Genomic file format

I/O	Bioframe	PyRanges
BED	✅	✅
BAM
VCF

Performance

Remarks

PyRanges is multithreaded, but:

it requires the Ray backend, plus the nb_cpu parameter, documented as:

  nb_cpu: int, default 1

            How many cpus to use. Can at most use 1 per chromosome or chromosome/strand tuple.
            Will only lead to speedups on large datasets.

for nearest, PyRanges returns no rows for intervals with no overlap (polars-bio follows Bioframe, where such rows are returned with nulls)
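
The difference between the two conventions for nearest can be sketched in plain Python. This is an illustration only, not the API of polars-bio, Bioframe or PyRanges: the `nearest` and `distance` helpers, the `keep_unmatched` flag, and the definition of "no match" as "no interval on the same chromosome" are all assumptions.

```python
from typing import List, Optional, Tuple

# Hypothetical representation: (chromosome, start, end), half-open [start, end).
Interval = Tuple[str, int, int]


def distance(a: Interval, b: Interval) -> int:
    """Gap between two intervals on the same chromosome; 0 if they touch or overlap."""
    return max(0, a[1] - b[2], b[1] - a[2])


def nearest(
    left: List[Interval], right: List[Interval], keep_unmatched: bool
) -> List[Tuple[Interval, Optional[Interval]]]:
    """Pair each left interval with its closest right interval on the same chromosome."""
    rows = []
    for l in left:
        candidates = [r for r in right if r[0] == l[0]]
        if candidates:
            best = min(candidates, key=lambda r: distance(l, r))
            rows.append((l, best))
        elif keep_unmatched:
            # Null-returning convention: keep the row, with None on the right side.
            rows.append((l, None))
        # Otherwise the unmatched interval is dropped from the result.
    return rows


left = [("chr1", 100, 200), ("chr3", 10, 20)]
right = [("chr1", 300, 400), ("chr1", 1000, 1100)]

with_nulls = nearest(left, right, keep_unmatched=True)
dropped = nearest(left, right, keep_unmatched=False)
print(with_nulls)
print(dropped)
```

With `keep_unmatched=True` the output has one row per input interval, which makes results easier to align with the original data; with `keep_unmatched=False` the row count can shrink, which matters when comparing outputs across libraries.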

Releases 29

Contributors 2

biodatageeks/polars-bio
