Shangming Cai
Add ApplyRoPE and RMSNorm kernels written in OpenAI Triton.
af64202