Extend cell cycle gene scoring (#421)
* refactor cell cycle gene handling
* add new gene sets for C. elegans and zebrafish and re-parse Tirosh genes from beginning
* add test datasets for C. elegans and zebrafish
* separate function for cell cycle gene set retrieval
* smart use of gene ID or gene name depending on data
* add gene scoring test for zebrafish and c_elegans
* simplify random gene selection for error reporting
* use Literal for organism
* add list of possible organism names to error message
* use scanpy backup_url functionality
* add assertions for gene score columns
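Several of the points above (a `Literal` type for the organism, listing the valid organism names in the error message, and choosing between gene IDs and gene names depending on the data) can be illustrated with a minimal sketch. This is not the code from the commit: the names `Organism`, `check_organism`, and `pick_identifier_column`, the three organism labels, and the overlap-based heuristic are all assumptions made for illustration.

```python
from typing import Iterable, Literal, get_args

# Hypothetical organism labels; the names accepted by the package may differ.
Organism = Literal["human", "zebrafish", "c_elegans"]


def check_organism(organism: str) -> str:
    """Validate an organism name and list the valid options on failure."""
    valid = get_args(Organism)
    if organism not in valid:
        raise ValueError(
            f"Unknown organism {organism!r}. Possible values: {', '.join(valid)}"
        )
    return organism


def pick_identifier_column(
    var_names: Iterable[str],
    gene_ids: Iterable[str],
    gene_names: Iterable[str],
) -> str:
    """Return 'gene_id' or 'gene_name', whichever overlaps more with the data.

    Assumed heuristic: count how many reference IDs and reference names occur
    in the dataset's variable names and prefer the larger overlap.
    """
    var_set = set(var_names)
    n_ids = len(var_set & set(gene_ids))
    n_names = len(var_set & set(gene_names))
    return "gene_id" if n_ids > n_names else "gene_name"
```

With a dataset indexed by Ensembl IDs, `pick_identifier_column` would select `gene_id`; with a dataset indexed by gene symbols, it would select `gene_name`. Ties fall back to `gene_name` in this sketch, which is an arbitrary choice.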
Showing 10 changed files with 559 additions and 29 deletions.
18 changes: 18 additions & 0 deletions
scib/resources/cell_cycle_genes_caenorhabditis_elegans.tsv
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,18 @@ | ||
phase modified gene_id gene_name | ||
G2/M 2024-07-04 WBGene00006974 zen-4 | ||
G2/M 2024-07-04 WBGene00000257 bmk-1 | ||
G2/M 2024-07-04 WBGene00000405 cdk-1 | ||
G2/M 2024-07-04 WBGene00000099 air-2 | ||
S 2024-07-04 WBGene00011912 T22C1.1 | ||
S 2024-07-04 WBGene00004338 rfc-2 | ||
S 2024-07-04 WBGene00004297 rad-51 | ||
S 2024-07-04 WBGene00003154 mcm-2 | ||
S 2024-07-04 WBGene00013241 ung-1 | ||
S 2024-07-04 WBGene00009372 evl-18 | ||
S 2024-07-04 WBGene00000382 cdc-6 | ||
S 2024-07-04 WBGene00003418 msh-2 | ||
S 2024-07-04 WBGene00003156 mcm-4 | ||
S 2024-07-04 WBGene00009287 psf-2 | ||
S 2024-07-04 WBGene00022141 chaf-2 | ||
S 2024-07-04 WBGene00000794 crn-1 | ||
S 2024-07-04 WBGene00022455 tyms-1 |
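A resource file with this layout can be grouped into per-phase gene lists by reading the columns by name. The sketch below assumes the file is tab-separated (suggested by the `.tsv` extension; the rendering above shows only whitespace) and uses three rows copied from the table. The function name `genes_by_phase` is hypothetical and is not taken from the commit.

```python
import csv
import io
from collections import defaultdict

# Three rows copied from the table above, joined with tabs (assumed delimiter).
TSV_TEXT = (
    "phase\tmodified\tgene_id\tgene_name\n"
    "G2/M\t2024-07-04\tWBGene00000405\tcdk-1\n"
    "S\t2024-07-04\tWBGene00003154\tmcm-2\n"
    "S\t2024-07-04\tWBGene00004297\trad-51\n"
)


def genes_by_phase(handle, column="gene_name"):
    """Group the values of `column` by cell cycle phase, keeping file order."""
    reader = csv.DictReader(handle, delimiter="\t")
    phases = defaultdict(list)
    for row in reader:
        phases[row["phase"]].append(row[column])
    return dict(phases)


gene_sets = genes_by_phase(io.StringIO(TSV_TEXT))
id_sets = genes_by_phase(io.StringIO(TSV_TEXT), column="gene_id")
```

Passing `column="gene_id"` returns the WormBase identifiers instead of the gene names, which is the kind of switch needed when a dataset is indexed by IDs.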
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,47 @@ | ||
phase modified gene_id gene_name | ||
G2/M 2018-10-19 ENSDARG00000078654 tpx2 | ||
G2/M 2018-10-19 ENSDARG00000075621 birc5a | ||
G2/M 2018-10-19 ENSDARG00000001313 g2e3 | ||
G2/M 2018-10-19 ENSDARG00000061187 cbx5 | ||
G2/M 2018-10-19 ENSDARG00000056621 ctcf | ||
G2/M 2018-10-19 ENSDARG00000041361 ttk | ||
G2/M 2018-10-19 ENSDARG00000038882 smc4 | ||
G2/M 2018-10-19 ENSDARG00000005619 nek2 | ||
G2/M 2018-10-19 ENSDARG00000055133 cenpf | ||
G2/M 2018-10-19 ENSDARG00000117089 CKS2 | ||
G2/M 2018-10-19 ENSDARG00000024488 top2a | ||
G2/M 2018-10-19 ENSDARG00000043137 cdca8 | ||
G2/M 2018-10-19 ENSDARG00000002403 nusap1 | ||
G2/M 2018-10-19 ENSDARG00000010948 kif11 | ||
G2/M 2018-10-19 ENSDARG00000054804 anp32e | ||
G2/M 2018-10-19 ENSDARG00000014013 lbr | ||
G2/M 2018-10-19 ENSDARG00000036180 ccnb2 | ||
G2/M 2018-10-19 ENSDARG00000029722 hmgb2a | ||
G2/M 2018-10-19 ENSDARG00000087554 cdk1 | ||
G2/M 2018-10-19 ENSDARG00000007971 cks1b | ||
G2/M 2018-10-19 ENSDARG00000102674 ckap5 | ||
S 2018-10-19 ENSDARG00000057683 mcm6 | ||
S 2018-10-19 ENSDARG00000043720 cdc45 | ||
S 2018-10-19 ENSDARG00000018022 msh2 | ||
S 2018-10-19 ENSDARG00000019507 mcm5 | ||
S 2018-10-19 ENSDARG00000045308 pola1 | ||
S 2018-10-19 ENSDARG00000040041 mcm4 | ||
S 2018-10-19 ENSDARG00000035957 gmnn | ||
S 2018-10-19 ENSDARG00000037188 rpa2 | ||
S 2018-10-19 ENSDARG00000057738 hells | ||
S 2018-10-19 ENSDARG00000057323 e2f8 | ||
S 2018-10-19 ENSDARG00000002304 gins2 | ||
S 2018-10-19 ENSDARG00000054155 pcna | ||
S 2018-10-19 ENSDARG00000039208 nasp | ||
S 2018-10-19 ENSDARG00000074410 brip1 | ||
S 2018-10-19 ENSDARG00000019907 dscc1 | ||
S 2018-10-19 ENSDARG00000023002 dtl | ||
S 2018-10-19 ENSDARG00000077620 cdca7a | ||
S 2018-10-19 ENSDARG00000056473 chaf1b | ||
S 2018-10-19 ENSDARG00000056414 usp1 | ||
S 2018-10-19 ENSDARG00000100558 slbp | ||
S 2018-10-19 ENSDARG00000014017 rrm1 | ||
S 2018-10-19 ENSDARG00000011404 fen1 | ||
S 2018-10-19 ENSDARG00000056832 exo1 | ||
S 2018-10-19 ENSDARG00000042894 tyms | ||
S 2018-10-19 ENSDARG00000103409 uhrf1 |
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,98 @@ | ||
gene_name gene_id phase | ||
MCM5 ENSG00000100297 S | ||
PCNA ENSG00000132646 S | ||
TYMS ENSG00000176890 S | ||
FEN1 ENSG00000168496 S | ||
MCM2 ENSG00000073111 S | ||
MCM4 ENSG00000104738 S | ||
RRM1 ENSG00000167325 S | ||
UNG ENSG00000076248 S | ||
GINS2 ENSG00000131153 S | ||
MCM6 ENSG00000076003 S | ||
CDCA7 ENSG00000144354 S | ||
DTL ENSG00000143476 S | ||
PRIM1 ENSG00000198056 S | ||
UHRF1 ENSG00000276043 S | ||
MLF1IP ENSG00000151725 S | ||
HELLS ENSG00000119969 S | ||
RFC2 ENSG00000049541 S | ||
RPA2 ENSG00000117748 S | ||
NASP ENSG00000132780 S | ||
RAD51AP1 ENSG00000111247 S | ||
GMNN ENSG00000112312 S | ||
WDR76 ENSG00000092470 S | ||
SLBP ENSG00000163950 S | ||
CCNE2 ENSG00000175305 S | ||
UBR7 ENSG00000012963 S | ||
POLD3 ENSG00000077514 S | ||
MSH2 ENSG00000095002 S | ||
ATAD2 ENSG00000156802 S | ||
RAD51 ENSG00000051180 S | ||
RRM2 ENSG00000171848 S | ||
CDC45 ENSG00000093009 S | ||
CDC6 ENSG00000094804 S | ||
EXO1 ENSG00000174371 S | ||
TIPIN ENSG00000075131 S | ||
DSCC1 ENSG00000136982 S | ||
BLM ENSG00000197299 S | ||
CASP8AP2 ENSG00000118412 S | ||
USP1 ENSG00000162607 S | ||
CLSPN ENSG00000092853 S | ||
POLA1 ENSG00000101868 S | ||
CHAF1B ENSG00000159259 S | ||
BRIP1 ENSG00000136492 S | ||
E2F8 ENSG00000129173 S | ||
HMGB2 ENSG00000164104 G2/M | ||
CDK1 ENSG00000170312 G2/M | ||
NUSAP1 ENSG00000137804 G2/M | ||
UBE2C ENSG00000175063 G2/M | ||
BIRC5 ENSG00000089685 G2/M | ||
TPX2 ENSG00000088325 G2/M | ||
TOP2A ENSG00000131747 G2/M | ||
NDC80 ENSG00000080986 G2/M | ||
CKS2 ENSG00000123975 G2/M | ||
NUF2 ENSG00000143228 G2/M | ||
CKS1B ENSG00000173207 G2/M | ||
MKI67 ENSG00000148773 G2/M | ||
TMPO ENSG00000120802 G2/M | ||
CENPF ENSG00000117724 G2/M | ||
TACC3 ENSG00000013810 G2/M | ||
FAM64A ENSG00000129195 G2/M | ||
SMC4 ENSG00000113810 G2/M | ||
CCNB2 ENSG00000157456 G2/M | ||
CKAP2L ENSG00000169607 G2/M | ||
CKAP2 ENSG00000136108 G2/M | ||
AURKB ENSG00000178999 G2/M | ||
BUB1 ENSG00000169679 G2/M | ||
KIF11 ENSG00000138160 G2/M | ||
ANP32E ENSG00000143401 G2/M | ||
TUBB4B ENSG00000188229 G2/M | ||
GTSE1 ENSG00000075218 G2/M | ||
KIF20B ENSG00000138182 G2/M | ||
HJURP ENSG00000123485 G2/M | ||
CDCA3 ENSG00000111665 G2/M | ||
HN1 ENSG00000189159 G2/M | ||
CDC20 ENSG00000117399 G2/M | ||
TTK ENSG00000112742 G2/M | ||
CDC25C ENSG00000158402 G2/M | ||
KIF2C ENSG00000142945 G2/M | ||
RANGAP1 ENSG00000100401 G2/M | ||
NCAPD2 ENSG00000010292 G2/M | ||
DLGAP5 ENSG00000126787 G2/M | ||
CDCA2 ENSG00000184661 G2/M | ||
CDCA8 ENSG00000134690 G2/M | ||
ECT2 ENSG00000114346 G2/M | ||
KIF23 ENSG00000137807 G2/M | ||
HMMR ENSG00000072571 G2/M | ||
AURKA ENSG00000087586 G2/M | ||
PSRC1 ENSG00000134222 G2/M | ||
ANLN ENSG00000011426 G2/M | ||
LBR ENSG00000143815 G2/M | ||
CKAP5 ENSG00000175216 G2/M | ||
CENPE ENSG00000138778 G2/M | ||
CTCF ENSG00000102974 G2/M | ||
NEK2 ENSG00000117650 G2/M | ||
G2E3 ENSG00000092140 G2/M | ||
GAS2L3 ENSG00000139354 G2/M | ||
CBX5 ENSG00000094916 G2/M | ||
CENPA ENSG00000115163 G2/M |
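The tables on this page use two header layouts: `phase modified gene_id gene_name` in the first two files and `gene_name gene_id phase` in the last one. A reader that selects columns by name handles both. The following is a sketch under the assumption that the files are tab-separated; `read_gene_table` and `REQUIRED` are hypothetical names, not part of the commit.

```python
import csv
import io

# Columns that both layouts on this page have in common.
REQUIRED = ("phase", "gene_id", "gene_name")


def read_gene_table(handle):
    """Read a gene table by column name, ignoring column order and extra columns."""
    reader = csv.DictReader(handle, delimiter="\t")
    missing = [c for c in REQUIRED if c not in (reader.fieldnames or [])]
    if missing:
        raise ValueError(f"Missing columns: {missing}")
    return [{c: row[c] for c in REQUIRED} for row in reader]


# One row from each layout, copied from the tables above and joined with tabs.
layout_a = (
    "phase\tmodified\tgene_id\tgene_name\n"
    "S\t2018-10-19\tENSDARG00000054155\tpcna\n"
)
layout_b = "gene_name\tgene_id\tphase\nPCNA\tENSG00000132646\tS\n"

rows_a = read_gene_table(io.StringIO(layout_a))
rows_b = read_gene_table(io.StringIO(layout_b))
```

Both calls return records with the same three keys in the same order, so downstream code does not need to know which layout a file used.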