This is the code repo for paper CREAM: Consistency Regularized Self-Rewarding Language Models accepted to ICLR 2025. CREAM extends the Self-Rewarding Language Model (SRLM) to small models (e.g., ...
This project is a step-by-step learning journey where we implement various types of Triton kernels—from the simplest examples to more advanced applications—while exploring GPU programming with Triton.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results