Skip to content
Change the repository type filter

All

    Repositories list

    • tinygrad.c
      Python
      3.6k101Updated Sep 18, 2025Sep 18, 2025
    • spectrum

      Public
      Python
      2513552Updated Aug 20, 2025Aug 20, 2025
    • 916151Updated Aug 8, 2025Aug 8, 2025
    • AutoAWQ

      Public
      AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:
      Python
      289000Updated Jul 31, 2025Jul 31, 2025
    • MoE-Quant

      Public
      Code for data-aware compression of DeepSeek models
      Python
      8000Updated Jul 29, 2025Jul 29, 2025
    • Python
      21400Updated Jul 27, 2025Jul 27, 2025
    • Python
      0000Updated Jul 27, 2025Jul 27, 2025
    • minichat

      Public
      HTML
      0100Updated Jul 25, 2025Jul 25, 2025
    • HTML
      0100Updated Jul 25, 2025Jul 25, 2025
    • qaitop

      Public
      An interactive GPU process viewer and beyond, the one-stop solution for GPU process management.
      Python
      190001Updated Jul 21, 2025Jul 21, 2025
    • quixi-cli

      Public
      An open-source AI agent that brings the power of Gemini directly into your terminal.
      TypeScript
      8k200Updated Jul 17, 2025Jul 17, 2025
    • LLM training in simple, raw C/CUDA
      Cuda
      3.2k000Updated Jun 26, 2025Jun 26, 2025
    • nano-vllm

      Public
      Nano vLLM
      Python
      831000Updated Jun 24, 2025Jun 24, 2025
    • JavaScript
      2501Updated Jun 21, 2025Jun 21, 2025
    • Python
      1615701Updated Jun 21, 2025Jun 21, 2025
    • Python
      910410Updated Jun 14, 2025Jun 14, 2025
    • PyQuest

      Public
      0000Updated Jun 12, 2025Jun 12, 2025
    • qwen2to3

      Public
      Python
      0300Updated Jun 10, 2025Jun 10, 2025
    • axolotl

      Public
      Go ahead and axolotl questions
      Python
      1.1k200Updated Jun 8, 2025Jun 8, 2025
    • Python
      42100Updated Jun 8, 2025Jun 8, 2025
    • Python
      0100Updated Jun 7, 2025Jun 7, 2025
    • pytorch

      Public
      Tensors and Dynamic neural networks in Python with strong GPU acceleration
      Python
      25k240Updated Jun 2, 2025Jun 2, 2025
    • Rust
      0000Updated May 31, 2025May 31, 2025
    • 72201Updated May 30, 2025May 30, 2025
    • 0000Updated May 23, 2025May 23, 2025
    • Python
      55512134Updated May 21, 2025May 21, 2025
    • Python
      1600Updated May 6, 2025May 6, 2025
    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      10k000Updated May 3, 2025May 3, 2025
    • CSS
      0100Updated Apr 24, 2025Apr 24, 2025
    • collatz

      Public
      Lean
      0100Updated Apr 21, 2025Apr 21, 2025