...
Hi,
I've noticed a drastic runtime difference between cell2location versions 0.1.3 and 0.1.4. I ran both versions on the same dataset with the same parameters: a run that took ~4 hrs with 0.1.3 is taking ~40 hrs with 0.1.4. Each iteration is also much slower with 0.1.4 (14 s/it vs. 1 s/it). One thing I noticed is that when I look at GPU usage with 'nvidia-smi', the 0.1.3 process uses ~5000 MiB, while the 0.1.4 process uses only ~700 MiB.
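In case it helps anyone reproduce the memory comparison, here is a minimal sketch of how the per-GPU memory reading could be scripted rather than read off the 'nvidia-smi' table by hand. It assumes 'nvidia-smi' is on the PATH; the helper names `parse_memory_used` and `query_memory_used` are made up for illustration and are not part of cell2location.

```python
import subprocess


def parse_memory_used(csv_text):
    """Parse per-GPU memory.used values (in MiB) from nvidia-smi CSV output.

    Expects one integer per line, as produced with
    --format=csv,noheader,nounits.
    """
    return [int(line.strip()) for line in csv_text.splitlines() if line.strip()]


def query_memory_used():
    """Call nvidia-smi and return the memory used on each GPU, in MiB.

    Hypothetical helper: requires an NVIDIA driver and nvidia-smi on the PATH.
    """
    result = subprocess.run(
        ["nvidia-smi", "--query-gpu=memory.used", "--format=csv,noheader,nounits"],
        capture_output=True,
        text=True,
        check=True,
    )
    return parse_memory_used(result.stdout)


# Example usage while a training run is in progress (needs a GPU machine):
#     print(query_memory_used())
```

Calling `query_memory_used()` at intervals during each run would give numbers that can be compared directly between the two versions.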
Are there any suggestions for improving the runtime? I'm puzzled as to why the runtimes differ so much between the versions. I've included some images of the runs.
Additionally, the "use_gpu" parameter in version 0.1.3 was pretty handy, and it would be great to see it back in future versions.
Thanks!
