Skip to content

merge main into dev testing branch#195

Merged
avecplezir merged 73 commits intotestingfrom
main
May 8, 2025
Merged

merge main into dev testing branch#195
avecplezir merged 73 commits intotestingfrom
main

Conversation

@avecplezir
Copy link
Copy Markdown
Collaborator

No description provided.

jacobthebanana and others added 30 commits March 21, 2025 20:11
…omplexData-MILA/AIF-Gen into 112-dataset-generation-with-openai-api
…with-openai-api

Dataset Generation Finalization and Debug
* Transmute

* CLI Typos
filter cli feature implementation and testing
EMZEDI and others added 29 commits April 20, 2025 19:56
…edding-diversity

Text embedding diversity metric
* Add spellcheck hooks

* Fix spelling
* Hook up mkdocs

* Remove docs from ci
* Add rtd config

* CLI is public
* Consolidate benchmark dependancies

* Pin accelerate==0.34.2 and datasets>=3.2.0

* Pin deepspeed==0.16.3
* Fix similarity

* Add dataset idx to future

* Add basic test
* Add cppo starter files
Based on ppo_continual implementation and trlx's cppo implementation

* Add CPPO loss logic to benchmarks
Incorporate trlx CPPO loss into the CPPO trainer, using the detect track function and existing PPO iterative structure

* Update naming convention in README

* Update function docstring to pass linting

* Lint CPPO trainer file with ruff

* Implement feedback from PR review

* Implement minor updates from CPPO testing

* Fix sweep config

* Run formatter

* Fix ref_policy variable deletion

* Specialize detect_track based on ablation type

* Detect track to standalone function

* Update docstrings

* Fix cppo loss

* Fix cloning of old logprobs and rewards

* Add knowledge retention regularization coefficient

* Update benchmark sync

* Push old logprobs/rewards out to trainer

* Revert value loss without alpha

* Use ppo approx kl logs

* Remove redundant gc

* CPPO first successful run

* cppo unnecessary list indexing on mask

* mask duplicate variable debug in CPPO

* Dead code remove

---------

Co-authored-by: Jacob-Chmura <jacobpaul.chmura@gmail.com>
Co-authored-by: Shahrad Mohammadzadeh <shahrad_m@icloud.com>
* Deprecate transmute

* Deprecate filter
* Clean up utils

* Spelling
* Cleanup mappers

* Fix docs ref

* Fix merge

* WIP
* WIP

* Fix tests

* update continual dataset

* Pipe instead of union

* Annotate
* Seperate validation module

* Update docs

* Deprecate token entropy

* Update embedding diversity

* Update llm judge
* Preference swap clean

* Split is a transform

* Seperate transform module

* Revert accidentl file push

* Fix tests
* Preference swap clean

* Split is a transform

* Seperate transform module

* Revert accidentl file push

* Fix tests

* WIp

* Update docs
* Update service

* Share retry logic

* Consolidate preference axes sample gen

* Update docs

* Consolidate judge prompt
* upload logo

* Upload log

* wip

* wip

* upd

* wip

* WIP

* WIP

* Upd

* Upd

* upd

* upd

* Upd

* wip

* upd

* wip

* WIp
* wip

* wip

* wip

* wip

* wip

* wip

* WIP

* wip

* Wip

* wip

* wip

* wip

* Remove old img ref
* WIP: update readme

* wip

* wip

* Check install

* wip

* wip

* wip

* wip

* wip

* WIP

* wip

* qip

* wip

* wip

* wip

* wip
* Bump torch==2.6.0

* Relax torch upper bound

* No need to pin huggingface_hub in benchmarks group
* Generate sample completions during log call

* Clean config
@avecplezir avecplezir merged commit 3b67d2d into testing May 8, 2025
4 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants