Commit 1f02d53
committed
Merge changes from user/venky/auto-assign-pr-test to origin-user/venky/auto-assign-pr-test
Signed-off-by: Venky Ganesh <[email protected]>File tree
312 files changed
+11037
-5425
lines changed- .devcontainer
- .github
- scripts
- tests
- workflows
- benchmarks/cpp
- cpp
- cmake/modules
- include/tensorrt_llm
- batch_manager
- common
- executor
- kernels
- runtime
- kernels/fmha_v2/src/fmha
- tensorrt_llm
- batch_manager
- common
- cutlass_extensions/include/cutlass_extensions
- deep_ep
- executor
- cache_transmission
- kernels
- contextFusedMultiHeadAttention
- cutlass_kernels
- allreduce_gemm/kernel
- fp4_gemm
- fp8_blockscale_gemm
- fp8_rowwise_gemm
- fused_gated_gemm
- include
- low_latency_gemm
- moe_gemm
- launchers
- python
- decoderMaskedMultiheadAttention
- decoderXQAImplJIT/nvrtcWrapper/src
- flashMLA
- fusedLayernormKernels
- llama4MinLatencyKernels
- trtllmGenKernels
- batchedGemm
- trtllmGen_bmm_export
- blockScaleMoe
- gemmGatedAct
- trtllmGen_gatedAct_export
- gemm
- trtllmGen_gemm_export
- userbuffers
- plugins
- pybind
- batch_manager
- executor
- runtime
- testing
- runtime
- testing
- thop
- tests
- batch_manager
- unit_tests
- batch_manager
- executor
- kernels
- routing
- docker
- common
- docs/source
- blogs
- tech_blog
- scripts/disaggregated
- examples
- auto_deploy
- .vscode
- disaggregated
- llm-eval/lm-eval-harness
- models
- contrib
- baichuan
- bloom
- chatglm-6b
- chatglm2-6b
- chatglm3-6b-32k
- deepseek_v1
- deepseek_v2
- falcon
- gptj
- gptneox
- grok
- internlm
- jais
- mpt
- opt
- skywork
- smaug
- core
- commandr
- deepseek_v3
- gemma
- glm-4-9b
- gpt
- internlm2
- llama
- mamba
- nemotron
- phi
- qwen
- recurrentgemma
- whisper
- pytorch
- scaffolding
- contrib
- AsyncGeneration
- Dynasor
- wide_ep
- ep_load_balancer
- slurm_scripts
- jenkins
- scripts
- tensorrt_llm
- _torch
- attention_backend
- auto_deploy
- models
- shim
- transformations
- custom_ops
- models
- modules
- fused_moe
- pyexecutor
- speculative
- bench
- benchmark
- utils
- dataclasses
- executor
- inputs
- llmapi
- models
- gemma
- scaffolding
- contrib
- AsyncGeneration
- Dynasor
- serve
- scripts
- tests
- integration
- defs
- accuracy
- references
- cpp
- disaggregated
- test_configs
- examples
- perf
- test_lists
- qa
- test-db
- unittest
- _torch
- auto_deploy
- _utils_test
- unit
- multigpu/custom_ops
- singlegpu
- shim
- modeling
- modules
- speculative
- thop
- api_stability
- references_committed
- references
- llmapi
- apps
Some content is hidden
Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
312 files changed
+11037
-5425
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
180 | 180 | | |
181 | 181 | | |
182 | 182 | | |
183 | | - | |
184 | 183 | | |
185 | 184 | | |
| 185 | + | |
| 186 | + | |
| 187 | + | |
186 | 188 | | |
187 | 189 | | |
188 | 190 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1 | 1 | | |
| 2 | + | |
2 | 3 | | |
3 | 4 | | |
4 | | - | |
| 5 | + | |
5 | 6 | | |
6 | 7 | | |
7 | 8 | | |
| |||
0 commit comments