Skip to content

Commit 67e9422

Browse files
Simplify CLAUDE.md
1 parent 462ec8b commit 67e9422

File tree

2 files changed

+27
-73
lines changed

2 files changed

+27
-73
lines changed

CLAUDE.md

Lines changed: 27 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -1,13 +1,12 @@
11
# CLAUDE.md - Complete Development Guide
22

3-
This document provides comprehensive coding standards, development practices, and technical details for AI assistants working on the GLM Experiments project.
4-
5-
**For project roadmap and feature planning**, see [PROJECT.md](PROJECT.md).
3+
This document provides comprehensive coding standards, development practices, technical details, and project roadmap for AI assistants working on the GLM Experiments project.
64

75
---
86

97
## Table of Contents
108

9+
- [Project Roadmap](#project-roadmap)
1110
- [Core Principles](#core-principles)
1211
- [Code Quality Standards](#code-quality-standards)
1312
- [Technology Stack](#technology-stack)
@@ -24,6 +23,30 @@ This document provides comprehensive coding standards, development practices, an
2423

2524
---
2625

26+
## Project Roadmap
27+
28+
This section tracks the development roadmap and feature planning for the gLM experiments project.
29+
30+
High-level goals:
31+
- Perform experiments training genomic language models (gLMs)
32+
33+
Requirements for training framework:
34+
- Reproducible (as long as you fix the seed)
35+
- bf16
36+
- torch compile
37+
- DDP
38+
- Let's skip tests for now (ignore everything below talking about testing)
39+
40+
Next steps:
41+
- Reproduce GPN-Animal-Promoter
42+
Paper: https://www.biorxiv.org/content/10.1101/2025.02.11.637758v2
43+
Code: https://github.com/songlab-cal/gpn/tree/main/analysis/gpn_animal_promoter
44+
Model: https://huggingface.co/songlab/gpn-animal-promoter
45+
Training dataset: https://huggingface.co/datasets/songlab/gpn-animal-promoter-dataset
46+
Eval dataset: https://huggingface.co/datasets/songlab/TraitGym (subset to promoter variants)
47+
48+
---
49+
2750
## Core Principles
2851

2952
### Simplify Relentlessly
@@ -184,7 +207,6 @@ glm-experiments/
184207
│ └── test_*.py # Test files
185208
186209
├── CLAUDE.md # This file
187-
├── PROJECT.md # Project roadmap
188210
└── pyproject.toml # Dependencies and configuration
189211
```
190212

@@ -402,8 +424,7 @@ This project uses a structured, issue-driven workflow with five phases.
402424
### Phase 1: Formalize Goals
403425

404426
Ensure these documents exist:
405-
- **CLAUDE.md** (this file) - Coding standards and development guide
406-
- **PROJECT.md** - Project roadmap and feature planning
427+
- **CLAUDE.md** (this file) - Coding standards, development guide, and project roadmap
407428
- **Design docs** - Feature specifications (if needed)
408429

409430
### Phase 2: Develop Issues
@@ -937,7 +958,6 @@ python glm_experiments/train.py debug=fdr # Fast debug
937958

938959
```
939960
CLAUDE.md # This file
940-
PROJECT.md # Project roadmap
941961
glm_experiments/ # Main package
942962
configs/ # Hydra configs
943963
tests/ # Test suite
@@ -948,7 +968,6 @@ pyproject.toml # Dependencies
948968
949969
## References
950970
951-
- **[PROJECT.md](PROJECT.md)** - Project roadmap and feature planning
952971
- **[PyTorch Lightning Docs](https://lightning.ai/docs/pytorch/stable/)** - Official docs
953972
- **[Hydra Docs](https://hydra.cc/)** - Official docs
954973
- **[Agentic Git Flow Blog](https://matsen.group/general/2025/11/01/agentic-git-flow.html)** - Workflow inspiration

PROJECT.md

Lines changed: 0 additions & 65 deletions
This file was deleted.

0 commit comments

Comments
 (0)