Skip to content
View WoosukKwon's full-sized avatar

Highlights

  • Pro
Block or Report

Block or report WoosukKwon

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. vllm-project/vllm vllm-project/vllm Public

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python 23.2k 3.3k

  2. skypilot-org/skypilot skypilot-org/skypilot Public

    SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution—all with a simple interface.

    Python 6.3k 433

  3. retraining-free-pruning retraining-free-pruning Public

    [NeurIPS 2022] A Fast Post-Training Pruning Framework for Transformers

    Python 153 24