You can reach me by email: linkai0508@gmail.com
🤓
CMU ECE | RL & MLsys
Highlights
- Pro
Pinned Loading
-
rl-bandits-lab/BOFormer
rl-bandits-lab/BOFormer Public[ICLR 2025] BOFormer: Learning to Solve Multi-Objective Bayesian Optimization via Non-Markovian RL
Jupyter Notebook 11
-
Trajectory-Transformer-for-Quatitative-Trading
Trajectory-Transformer-for-Quatitative-Trading PublicNYCU Intro2AI Final Project
-
-
RL-Align/RL-Kernel
RL-Align/RL-Kernel PublicModern RL Post-training Infrastructure: Optimized for NVIDIA/AMD GPUs with a focus on vLLM and DeepSpeed integration, CUDA/ROCm/Triton kernels, and transparent hardware-aware scaling.
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.



