Skip to main content
Practical Compression via Retrieval Augmentation

Practical Compression via Retrieval Augmentation

F. Garcia, C. Patel, H. Kim, J. Tremblay

10
2025-09-15
llmspeechalignmentcompression

Abstract

This paper proposes a method that improves quality, reliability, and efficiency for modern AI systems. We evaluate on standard benchmarks and provide ablations and analyses. Results indicate consistent gains with minimal overhead.