Skip to main content
Adaptive LLMs via Self-Training

Adaptive LLMs via Self-Training

C. Patel, J. Tremblay, G. Hassan, A. Smith

01
2022-12-15
alignmentevaluation

Abstract

This paper proposes a method that improves quality, reliability, and efficiency for modern AI systems. We evaluate on standard benchmarks and provide ablations and analyses. Results indicate consistent gains with minimal overhead.