Historical Windows temporal memory-state research artifact for studying time-bound memory observations, validation limits, and defensive visibility.
Updated May 15, 2026 · Python
PCBench: Benchmark for Python API parameter compatibility issues
Code, data, and ontologies for FAOS research papers on ontology-powered enterprise AI agent verification (RA-3 neurosymbolic, RA-6 trust certification).
Behavioral HIDS that survives baseline poisoning. Sediment robust estimator, Linux eBPF collector, tamper-evident SHA-256 audit chain.
JSON Schema for decision events as governance evidence units in automated decision and real-time risk systems. MIT licensed.
Side-channel profiler that detects deceptive intent in LLMs by measuring the computational cost of lying.
PCART-LLM: Research artifact for LLM-based API compatibility analysis
Reproducibility package for fixed-ontology GraphRAG court-form filling experiments
Curated code and result summary for world-model inputs in Atari policy experiments.
Python library for evidence sufficiency scoring in governance assessments under delayed ground truth, drift, and decision-readiness constraints.
REQBench: Benchmark for compatible requirements inference in Python third-party library upgrades
PGP-inspired post-quantum text encryption. Features hybrid cryptography (Kyber + X25519), TPM hardware binding, and paranoid memory hygiene.
PCREQ-evaluation: Evaluation artifact for PCREQ
Python toolkit for label-free monitoring of governance evidence degradation in delayed-label risk decision systems using proxy drift monitors and response chains.
Comparative determinism experiment on a 285-vertex D6 substrate (P_95 □ K_3), part of the Zer0pa Computation portfolio. Pure-rational deterministic Rust pipeline; 31,560 byte-identical SHA-256 hashes on commodity Android.
Reference prototype and reproducibility artifact for an ML-KEM-768-based incompleteness-secured commitment framework with claim guards, benchmark scripts, and wrapper portability probes.
Structured self-reports for long-running LLM sessions: operator signals, explicit uncertainty, no claims of model feelings.
PCART-evaluation: Evaluation artifact for PCART
Benchmark dataset and evaluation harness for comparing governance evidence feasibility across rule-based, hybrid ML, streaming, and agentic AI decision systems.
Research artifact for "Duty, Defect, and Disclosure: Reassessing Developer Liability for LLM Chatbots in Suicidal Crises under Swiss and European Law" (UZH FS26, AI: Technology and Law)