Skip to main content
Constitutional AI: Harmlessness from AI Feedback

Constitutional AI: Harmlessness from AI Feedback

Yuntao Bai, et al.

00
2022-12-15
alignmentsafety

Abstract

This paper introduces and evaluates the idea described in “Constitutional AI: Harmlessness from AI Feedback”, and reports empirical results that helped shape subsequent work in alignment, safety.