Yechan Kim yechank-nvidia

AI DevTech @ NVIDIA

3 followers · 0 following

NVIDIA

Pinned

TensorRT-LLM (Public)

Forked from NVIDIA/TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

Python

smg (Public)

Forked from lightseekorg/smg

Engine-agnostic LLM gateway in Rust. Full OpenAI & Anthropic API compatibility across SGLang, vLLM, TRT-LLM, OpenAI, Gemini & more. Industry-first gRPC pipeline, KV cache-aware routing, chat histor…

Rust

tokenspeed (Public)

Forked from lightseekorg/tokenspeed

TokenSpeed is a speed-of-light LLM inference engine.

Python