Skip to content

Latest commit

 

History

History
12 lines (9 loc) · 937 Bytes

KTO_Model_Alignment_as_Prospect_Theoretic_Optimization.md

File metadata and controls

12 lines (9 loc) · 937 Bytes

KTO: Model Alignment as Prospect Theoretic Optimization

This week's paper is KTO: Model Alignment as Prospect Theoretic Optimization. This explores removing the constraint of needing preference pairs in PPO.

Further Reading: