4,262 stars | 553 forks | Python
Star
THUDM /
slime
slime is an LLM post-training framework for RL Scaling.
What it does
slime is a powerful LLM post-training framework designed for reinforcement learning scaling, enabling efficient training and flexible data generation. It supports various models and facilitates advanced research in AI.
Why it matters: Unlock the potential of AI with slime, the ultimate framework for high-performance reinforcement learning training!
Want to create content about this repo? Use Nemati AI tools to generate articles, tutorials, and social posts.




