Skip to content
View Ekin-Kahraman's full-sized avatar
  • UEA · Huang Lab (Mount Sinai, remote)
  • Norwich, UK

Highlights

  • Pro

Block or report Ekin-Kahraman

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Ekin-Kahraman/README.md

Ekin Kahraman

Rust/Python software engineer building data and ML systems for computational biology.

I ship installable bioinformatics packages, reproducible pipelines, and clinical prototypes with CI, real-data validation, and published artefacts.

RustScenic

rustscenic (v0.4.7, PyPI, docs, Zenodo DOI): faster, lower-overhead regulatory-network analysis for single-cell and multiome data, shipped as one Python package with Rust kernels.

  • 11x to 52x faster than SCENIC+ on tested real-data core E2E rows; median speedup 27x
  • 100k-cell benchmark used 6.3 GB RAM; comparable legacy workflows have reported >40 GB
  • One install: pip install rustscenic; Python 3.10 to 3.13; Linux, macOS, and Windows wheels
  • Core install avoids Java, dask, CUDA, and Snakemake
  • Rust + PyO3 stages: GRN, AUCell, topics, cisTarget, enhancer links, eRegulons
  • Evidence: benchmarks, PyPI, docs, Zenodo DOI, branch-protected CI, committed validation artefacts
  • Built with the Kuan-lin Huang Lab at Icahn Mount Sinai

Stack

  • Core: Rust, PyO3, Python, pandas, numpy, scipy, scanpy, anndata
  • Pipelines: Nextflow DSL2, Docker, Singularity, GitHub Actions
  • ML/product: PyTorch, React, TypeScript, Supabase

Selected work

Project Stack Evidence
RustScenic airway validation case study Python, pySCENIC comparison, CI Real-atlas head-to-head on 31,602 airway cells and 59 regulons; mean per-cell Pearson r = 0.984; 27x AUCell timing difference; Zenodo DOI
External open-source contributions scverse scientific Python ecosystem 5 merged PRs to scanpy, 2 merged PRs to PyDESeq2, and open algorithmic PR on AnnData concat API
RNA-seq Nextflow pipeline Nextflow DSL2, Docker, Singularity, AWS Batch FASTQ to QC, trimming, HISAT2, featureCounts, DESeq2, and MultiQC; Seqera-ready schema; synthetic end-to-end CI
Bulk RNA-seq differential expression R, DESeq2, CI, reproducible artefacts SARS-CoV-2 nasopharyngeal RNA-seq; 1,773 DE genes in primary cohort; 99.8% concordance with larger sensitivity set; Zenodo DOI
Airway cell-type deconvolution PyTorch, single-cell references, pseudo-bulk validation Deconvolution of 484 bulk RNA-seq samples into 14 airway cell types; r = 0.954 on pseudo-bulk 5-fold CV; model metadata for reuse
Single-cell immune profiling Scanpy, Scrublet, Leiden, PAGA, CI PBMC pipeline with QC, marker annotation, trajectory inference, T-cell subclustering, full-pipeline CI smoke validation, and output checksums
SafetyNett React, TypeScript, Supabase Clinical safety-netting prototype; CI covers lint, explicit TypeScript checking, production build, and tests

Contact

evk23umu@uea.ac.uk

Pinned Loading

  1. rustscenic rustscenic Public

    Faster, lower-memory Rust rewrite of the SCENIC and SCENIC+ analysis stack: GRN, AUCell, cisTarget, topics, peak-to-gene links, and eRegulons.

    Python 11 2

  2. bulk-rnaseq-differential-expression bulk-rnaseq-differential-expression Public

    Reproducible bulk RNA-seq pipeline for SARS-CoV-2 host response in R (DESeq2, pathway enrichment, viral load and sex-interaction analyses). Zenodo DOI.

    R 1

  3. covid-airway-deconvolution covid-airway-deconvolution Public

    PyTorch deconvolution of 484 COVID-19 nasopharyngeal samples into 14 airway cell types using a tissue-matched scRNA-seq reference. Validation r = 0.954.

    Python

  4. single-cell-rnaseq-immune-profiling single-cell-rnaseq-immune-profiling Public

    End-to-end single-cell RNA-seq immune cell profiling pipeline in Python (scanpy, PBMC 3k)

    Python

  5. rnaseq-nextflow-pipeline rnaseq-nextflow-pipeline Public

    Bulk RNA-seq Nextflow pipeline: FastQC, fastp, HISAT2, featureCounts, DESeq2, MultiQC. Dockerised, tested, reproducible.

    Python

  6. safetynett safetynett Public

    AI-powered clinical safety netting for NHS primary care. Red flag detection, automated patient follow-up, GP escalation.

    TypeScript 1 1