EthanZero2Hero

2 followers · 3 following

Popular repositories

vllm Public

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python

llvm-project Public

Forked from llvm/llvm-project

The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.

LLVM

cuLA Public

Forked from inclusionAI/cuLA

CUDA kernels for linear attention variants, written in CuTe DSL and CUTLASS C++.

Python

RL-Kernel Public

Forked from RL-Align/RL-Kernel

High-performance RL post-training infrastructure. Designed to achieve bitwise operator-level train-inference consistency across heterogeneous engines and extreme memory efficiency for GRPO, PPO, etc.

Python