Independent researcher — AI evaluation infrastructure, narrative intelligence, LLM grader reliability.
- inspect_ai PR #4170 (merged) — grader reproducibility docs for UK AISI evaluation framework
- EU AI Act Article 6 consultation submission (DOI: 10.5281/zenodo.20605168)
- NIST AI 800-2 public comment on AI evaluation standards
- LLM-judge non-determinism — empirical reproducibility note (DOI: 10.5281/zenodo.20674090)
- Behavioral red teaming reproduction report (DOI: 10.5281/zenodo.20609109)
ORCID: 0009-0004-7635-0741 · OSF
Seeking arXiv endorsement for a cs.AI submission on LLM-as-judge grader non-determinism. Endorsement code: V6FVHF
