Skip to content

Latest commit

 

History

History
49 lines (36 loc) · 1.6 KB

File metadata and controls

49 lines (36 loc) · 1.6 KB

Changelog

All notable changes to Free Transformer will be documented in this file.

The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.

Added

  • Comprehensive documentation with MkDocs
  • GitHub Pages documentation deployment
  • PyPI packaging and release automation
  • Version bumping scripts
  • Enhanced README with badges and links

Changed

  • Improved project structure and organization
  • Enhanced Makefile with documentation and publishing commands

0.1.0 - 2024-11-02

Added

  • Initial release of Free Transformer
  • Core Free Transformer architecture with latent planning
  • Baseline Transformer for comparison
  • Conditional VAE training with free bits regularization
  • Multi-GPU training support with FSDP
  • Synthetic data generation for prototyping
  • Docker support for easy deployment
  • Comprehensive test suite
  • Code quality tools (Black, Ruff, MyPy)
  • Example scripts and configurations
  • Basic documentation

Features

  • Architecture: Llama-style backbone with RMSNorm, SwiGLU, RoPE, GQA
  • Latent Planning: Explicit binary plan variable with differentiable sampling
  • Training: Conditional VAE loss with reconstruction and KL divergence
  • Scaling: FSDP support for multi-GPU training
  • Development: Full development environment with quality checks
  • Usability: Modular API with YAML configuration