feat: Add TensorRT-LLM recipes for G4 instances #52

tohaowu · 2025-11-25T00:51:47Z

Adds single-host TensorRT-LLM benchmark recipes for the following models on G4 instances:

- Qwen3-30B-A3B
- Qwen3-4B
- Qwen3-8B
- Qwen3-32B
- Llama3.1-70B
- DeepSeek-R1

Each recipe includes steps for VM creation, TensorRT-LLM setup, model quantization (if needed), and running benchmarks.

tohaowu requested review from Chris113113, gangji and jyj0w0 November 25, 2025 00:55
