Extreme infrastructure and specialized custom kernels for large-scale reinforcement learning.
We are an open-source collective dedicated to breaking the memory and compute walls in RLHF and GRPO training pipelines. Our mission is to provide end-to-end, high-performance CUDA/Triton operators that seamlessly integrate into your existing training and rollout frameworks.
- RL-Kernel: Our flagship project. A high-performance, memory-efficient kernel library featuring prefix_shared_attention and fused_logp for GRPO workloads.
Whether you are a kernel hacker, an AI Infra engineer, or an LLM researcher, you are welcome here!
- Discord: Join our server for real-time technical discussions.
- Contact: Reach out to us at team@rl-align.org for collaborations.