FlagOS
Popular repositories Loading
-
FlagAttention
FlagAttention PublicA collection of memory efficient attention operators implemented in the Triton language.
Repositories
- flagtree Public
FlagTree is a unified compiler for multiple AI chips, which is forked from triton-lang/triton.
flagos-ai/flagtree’s past year of commit activity - FlagCX Public
flagos-ai/FlagCX’s past year of commit activity - vllm-plugin-FL Public
flagos-ai/vllm-plugin-FL’s past year of commit activity - Megatron-LM-FL Public Forked from NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
flagos-ai/Megatron-LM-FL’s past year of commit activity - vllm-FL Public Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
flagos-ai/vllm-FL’s past year of commit activity - TransformerEngine-FL Public Forked from NVIDIA/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory utilization in both training and inference.
flagos-ai/TransformerEngine-FL’s past year of commit activity
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Top languages
Loading…
Most used topics
Loading…