Skip to content

Commit 687ba3d

Browse files
authored
Added Paper (#88)
1 parent a4167c8 commit 687ba3d

10 files changed

Lines changed: 120 additions & 49 deletions

_data/single-cell-transformers.yml

Lines changed: 12 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,3 +1,15 @@
1+
- model: The Complexity of Automated Cell Type Annotations with GPT-4
2+
paper:
3+
type: preprint
4+
text: '[Soumya Luthra, et al. 2024](https://www.biorxiv.org/content/10.1101/2025.02.11.637659v2)'
5+
url: https://www.biorxiv.org/content/10.1101/2025.02.11.637659v2
6+
code:
7+
type: reproducible
8+
text: "\x9F\x9B\_ï¸\x8FGithub](https://github.com/soulbio/cell_type_annotation)"
9+
url: https://github.com/soulbio/cell_type_annotation
10+
11+
12+
113
- model: BioLLM
214
paper:
315
type: preprint

_site/_data/single-cell-transformers.yml

Lines changed: 79 additions & 16 deletions
Original file line numberDiff line numberDiff line change
@@ -1,3 +1,66 @@
1+
- model: The Complexity of Automated Cell Type Annotations with GPT-4
2+
paper:
3+
type: preprint
4+
text: '[Soumya Luthra, et al. 2024](https://www.biorxiv.org/content/10.1101/2025.02.11.637659v2)'
5+
url: https://www.biorxiv.org/content/10.1101/2025.02.11.637659v2
6+
code:
7+
type: reproducible
8+
text: "\x9F\x9B\_ï¸\x8FGithub](https://github.com/soulbio/cell_type_annotation)"
9+
url: https://github.com/soulbio/cell_type_annotation
10+
11+
12+
13+
- model: BioLLM
14+
paper:
15+
type: preprint
16+
text: '[Ping Qiu, et al. 2024](https://www.biorxiv.org/content/10.1101/2024.11.22.624786v1.full.pdf)'
17+
url: https://www.biorxiv.org/content/10.1101/2024.11.22.624786v1.full.pdf
18+
code:
19+
type: reproducible
20+
text: "\x9F\x9B\_ï¸\x8FGithub](https://github.com/BGIResearch/BioLLM)"
21+
url: https://github.com/BGIResearch/BioLLM
22+
omic_modalities: '-'
23+
pre_training_dataset: '-'
24+
input_embedding: '-'
25+
architecture: '-'
26+
ssl_tasks: '-'
27+
supervised_tasks: '-'
28+
29+
30+
31+
32+
- model: scGPT-spatial
33+
paper:
34+
type: preprint
35+
text: '[Chloe Wang, et al. 2024](https://www.biorxiv.org/content/10.1101/2025.02.05.636714v1.full.pdf)'
36+
url: https://www.biorxiv.org/content/10.1101/2025.02.05.636714v1.full.pdf
37+
code:
38+
type: reproducible
39+
text: "\x9F\x9B\_ï¸\x8FGithub](https://github.com/bowang-lab/scGPT-spatial)"
40+
url: https://github.com/bowang-lab/scGPT-spatial
41+
omic_modalities: '-'
42+
pre_training_dataset: '-'
43+
input_embedding: '-'
44+
architecture: '-'
45+
ssl_tasks: '-'
46+
supervised_tasks: '-'
47+
48+
- model: scCello
49+
paper:
50+
type: peer_reviewed
51+
text: '[Yuan, Xinyu, et al. 2024](https://openreview.net/pdf?id=aeYNVtTo7o)'
52+
url: https://openreview.net/pdf?id=aeYNVtTo7o
53+
code:
54+
type: reproducible
55+
text: "\x9F\x9B\_ï¸\x8FGithub](https://github.com/DeepGraphLearning/scCello)"
56+
url: https://github.com/DeepGraphLearning/scCello
57+
omic_modalities: scRNA-seq
58+
pre_training_dataset: 23M / cross-tissue, human ([CELLxGENE](https://cellxgene.cziscience.com/))
59+
input_embedding: 'Ordering: rank-based'
60+
architecture: Encoder
61+
ssl_tasks: 'Multi-level pre-training: MLM with CE loss for gene level modeling; an ontologybased cell-type coherence loss for intra-cellular level modeling; a relational alignment loss to inject cell-type lineage from cell ontology graph for inter-cellular level modeling'
62+
supervised_tasks: 'fine-tuning tasks: cell type classification; zero-shot tasks: cell type annotation, marker gene prediction, novel cell type prediction, cancer drug prediction'
63+
164
- model: scGREAT
265
paper:
366
type: peer_reviewed
@@ -14,6 +77,22 @@
1477
ssl_tasks: '-'
1578
supervised_tasks: '-'
1679

80+
- model: MAMMAL
81+
paper:
82+
type: preprint
83+
text: '[Shoshan et al. 2024](https://arxiv.org/abs/2410.22367)'
84+
url: https://arxiv.org/abs/2410.22367
85+
code:
86+
type: reproducible
87+
text: "\x9F\x9B\_ï¸\x8FGitHub](https://github.com/BiomedSciAI/biomed-multi-alignment)"
88+
url: https://github.com/BiomedSciAI/biomed-multi-alignment
89+
omic_modalities: bulk/scRNA-seq, amino acid sequences, SMILES molecule sequences
90+
pre_training_dataset: CellXGene Human
91+
input_embedding: '-'
92+
architecture: T5 Encoder-Decoder
93+
ssl_tasks: Expression-ranked gene masking (CELLxGENE Human), Protein LM (Uniref90), Antibody LM (OAS), Antibody Denoising (OAS), Small-Molecule LM (ZINC), Protein Interaction LM (STRING)
94+
supervised_tasks: Cell type annotation (zheng68k), Cancer drug response prediction (GDSC1/2/3), Brain Blood Barrier Penetration prediction (MoleculeNet), Small-Molecule toxicity prediction (MoleculeNet), drug clinical trial result prediction (MoleculeNet), Antibody-Antigen binding prediction (HER2), Targeted antibody generation (SAbDAb), Protein-Protein delta-delta G prediction (SKEMPI v2), Drug-Target interaction prediction (PEER), TCR binding prediction (Weber et al)
95+
1796
- model: Nicheformer
1897
paper:
1998
type: peer_reviewed
@@ -145,22 +224,6 @@
145224
supervised_tasks: '-'
146225

147226

148-
- model: scCello
149-
paper:
150-
type: preprint
151-
text: '[Xinyu Yuan et al. 2024](https://github.com/theislab/single-cell-transformer-papers/issues/32)'
152-
url: https://github.com/theislab/single-cell-transformer-papers/issues/32
153-
code:
154-
type: '-'
155-
text: "\x9F\x94\x8DGitHub](https://github.com/DeepGraphLearning/scCello)"
156-
url: 'https://github.com/DeepGraphLearning/scCello'
157-
omic_modalities: '-'
158-
pre_training_dataset: '-'
159-
input_embedding: '-'
160-
architecture: '-'
161-
ssl_tasks: '-'
162-
supervised_tasks: '-'
163-
164227
- model: scGenePT
165228
paper:
166229
type: preprint

_site/_data/transformer-evaluation.yml

Lines changed: 16 additions & 20 deletions
Original file line numberDiff line numberDiff line change
@@ -26,22 +26,6 @@
2626
tasks: '-'
2727
notes: '-'
2828

29-
30-
- paper:
31-
type: preprint
32-
text: '[George Crowley et al. 2024](https://www.biorxiv.org/content/10.1101/2024.10.10.617605v1.full.pdf)'
33-
url: https://www.biorxiv.org/content/10.1101/2024.10.10.617605v1.full.pdf
34-
code:
35-
type: 'reproducible'
36-
text: '[ð\x9F\x9B\_ï¸\x8FGitHub](https://github.com/ggit12/anndictionary/)'
37-
url: 'https://github.com/ggit12/anndictionary/'
38-
omic_modalities: '-'
39-
evaluated_transformers: '-'
40-
tasks: '-'
41-
notes: '-'
42-
43-
44-
4529
- paper:
4630
type: preprint
4731
text: '[George Crowley et al. 2024](https://www.biorxiv.org/content/10.1101/2024.10.10.617605v1.full.pdf)'
@@ -77,10 +61,10 @@
7761
type: 'reproducible'
7862
text: '[ð\x9F\x9B\_ï¸\x8FGitHub](https://github.com/aaronwtr/PertEval)'
7963
url: 'https://github.com/aaronwtr/PertEval'
80-
omic_modalities: '-'
81-
evaluated_transformers: '-'
82-
tasks: '-'
83-
notes: '-'
64+
omic_modalities: 'scRNA-seq'
65+
evaluated_transformers: 'UCE, scBERT, scGPT, Geneformer, scFoundation'
66+
tasks: 'Transcriptomic perturbation prediction'
67+
notes: 'Introduces PertEval-scFM, a benchmark to assess the zero-shot utility of single-cell foundation model embeddings for transcriptomic perturbation prediction. Uses SPECTRA to generate train-test splits with increasing dissimilarity to evaluate robustness against distribution shift. Models are evaluated with MSE and AUSPC, with AUSPC reflecting robustness under distribution shift. Additional analyses include E-distance and predicted transcriptomic distributions across the top 20 DEGs. Findings suggest that single-cell foundation model embeddings capture average perturbation effects but generally lack robustness to distribution shift. Ongoing work demonstrates that the domain-specific model GEARS outperforms foundation model embeddings, indicating that masked-language modeling on gene expression data without domain-specific inductive biases is insufficient for accurate transcriptomic perturbation prediction.'
8468

8569

8670

@@ -194,3 +178,15 @@
194178
evaluated_transformers: scGPT, Geneformer, scBERT
195179
tasks: Cell type annotation
196180
notes: Focused on imbalanced cell type classification. Geneformer appears to be outperformed by scGPT and scBERT, where the two latter perform similarly.
181+
- paper:
182+
type: preprint
183+
text: '[Csendes et al. 2024](https://www.biorxiv.org/content/10.1101/2024.09.30.615843v1)'
184+
url: https://www.biorxiv.org/content/10.1101/2024.09.30.615843v1
185+
code:
186+
type: reproducible
187+
text: "\x9F\x9B\_ï¸\x8FGitHub](https://github.com/turbine-ai/PerturbSeqPredBenchmark)"
188+
url: https://github.com/turbine-ai/PerturbSeqPredBenchmark
189+
omic_modalities: scRNA-seq
190+
evaluated_transformers: scGPT
191+
tasks: Genetic perturbation effect prediction
192+
notes: Simple baseline models can outperform scGPT on perturbational downstream tasks. The most widely used benchmarking datasets contain significant biases, making them suboptimal for evaluation.

_site/_pages/implementations.html

Lines changed: 2 additions & 2 deletions
Large diffs are not rendered by default.

_site/feed.xml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1 +1 @@
1-
<?xml version="1.0" encoding="utf-8"?><feed xmlns="http://www.w3.org/2005/Atom" ><generator uri="https://jekyllrb.com/" version="4.3.4">Jekyll</generator><link href="http://localhost:4000/single-cell-transformer-papers/feed.xml" rel="self" type="application/atom+xml" /><link href="http://localhost:4000/single-cell-transformer-papers/" rel="alternate" type="text/html" /><updated>2025-01-21T19:30:38+01:00</updated><id>http://localhost:4000/single-cell-transformer-papers/feed.xml</id><title type="html">Transformers in Single-Cell Omics</title><subtitle>A curated collection of papers on transformers in single-cell analysis</subtitle></feed>
1+
<?xml version="1.0" encoding="utf-8"?><feed xmlns="http://www.w3.org/2005/Atom" ><generator uri="https://jekyllrb.com/" version="4.3.4">Jekyll</generator><link href="http://localhost:4000/single-cell-transformer-papers/feed.xml" rel="self" type="application/atom+xml" /><link href="http://localhost:4000/single-cell-transformer-papers/" rel="alternate" type="text/html" /><updated>2025-02-25T09:42:38+01:00</updated><id>http://localhost:4000/single-cell-transformer-papers/feed.xml</id><title type="html">Transformers in Single-Cell Omics</title><subtitle>A curated collection of papers on transformers in single-cell analysis</subtitle></feed>

_site/implementations.html

Lines changed: 2 additions & 2 deletions
Large diffs are not rendered by default.

_site/index.html

Lines changed: 2 additions & 2 deletions
Large diffs are not rendered by default.

_site/single-cell-transformers.html

Lines changed: 2 additions & 2 deletions
Large diffs are not rendered by default.

_site/transformer-evaluation.html

Lines changed: 2 additions & 2 deletions
Large diffs are not rendered by default.

_site/transformer-llms.html

Lines changed: 2 additions & 2 deletions
Large diffs are not rendered by default.

0 commit comments

Comments
 (0)