Model Spec Midtraining: Improving How Alignment Training Generalizes

(alignment.anthropic.com)

2 points | by bearseascape 7 days ago

No comments yet.