Contrastive Learning from AI Correction (CLAIR): A Novel Approach to Address Underspecification of AI Model Alignment via Anchor Preference Optimization (APO)Mark Tech Post
Contrastive Learning from AI Correction (CLAIR): A Novel Approach to Address Underspecification of AI Model Alignment via Anchor Preference Optimization (APO)Mark Tech Post