You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Fast batched dataloading of BigWig files containing epigentic track data and corresponding sequences powered by GPU
4
7
for deep learning applications.
5
8
6
9
> ⚠️ **BREAKING CHANGE (v0.3.0+)**: The output matrix dimensionality has changed from `(n_tracks, batch_size, sequence_length)` to `(batch_size, sequence_length, n_tracks)`. This change was long overdue and eliminates the need for (potentially memory expensive) transpose operations downstream. If you're upgrading from an earlier version, please update your code accordingly (probaby you need to delete one transpose in your code).
7
10
8
11
> ✨ **NEW FEATURE (v0.3.0+)**: Full `bfloat16` support! You can now specify `dtype="bfloat16"` to get output tensors in bfloat16 format, reducing memory usage by 50%.
9
12
13
+
> ⚠️ **Cupy and bfloat16 support**
14
+
Because cupy does not support bfloat16 yet, the cupy array is typed as uint64, but the actual data behind it is in bfloat16. So when converting the array to a tensor in a framework that DOES support bfloat16 like pytorch, tensorflow or JAX should be followed by a "view" method that just changes how the underlying bytes are interpreted (and not actually casting to bfloat16, which would change the underlaying data). In the *bigwig_loader.pytorch.PytorchBigWigDataset* this has already been done for you (when you set dtype="bfloat16").
15
+
10
16
11
17
12
18
13
19
## Quickstart
14
20
15
21
### Installation with Pixi
16
22
Using [pixi](https://pixi.sh/) to install bigwig-loader is highly recommended.
17
-
Please take a look at the pixi.toml file. If you just want to use bigwig-loader, just
18
-
copy that pixi.toml, add the other libraries you need and use the "prod" environment
copy that into a pixi.toml file and add the other libraries you need.
19
68
(you don't need to clone this repo, pixi will download bigwig-loader from the
20
69
conda "dataloading" channel):
21
70
@@ -26,14 +75,16 @@ conda "dataloading" channel):
26
75
27
76
* change directory to wherever you put the pixi.toml, and:
28
77
```shell
29
-
pixi run -e prod <my_training_command>
78
+
pixi run <my_training_command>
30
79
```
31
80
32
81
82
+
The pixi.toml I included in this repository works for both the released version and for development of bigwig-loader, but assumes you cloned this repo.
83
+
84
+
33
85
### Installation with conda/mamba
34
86
35
-
Bigwig-loader mainly depends on the rapidsai kvikio library and cupy, both of which are best installed using
36
-
conda/mamba. Bigwig-loader can now also be installed using conda/mamba. To create a new environment with bigwig-loader
87
+
Alternatively, bigwig-loader can be installed using conda/mamba. To create a new environment with bigwig-loader
0 commit comments