This document provides instructions on how to run benchmarks using the vllm library. You can run benchmarks through Python scripts or shell scripts provided in this repository.
- Install vllm: To install the vllm library, use pip:

  pip install vllm
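
  As a quick sanity check that the installation works, you can load a small model and generate a single completion with vllm's Python API. This is a minimal sketch; the model name facebook/opt-125m is just an example of a small model and is not required by the benchmarks:

  ```python
  # Minimal sanity check: load a small model and generate one completion.
  # "facebook/opt-125m" is only an example model, chosen for its small size.
  from vllm import LLM, SamplingParams

  llm = LLM(model="facebook/opt-125m")
  params = SamplingParams(max_tokens=16)
  outputs = llm.generate(["Hello, my name is"], params)
  print(outputs[0].outputs[0].text)
  ```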
- Running benchmarks using the Python script: Once vllm is installed, you can run a benchmark by executing the Python script directly. Here's an example command:

  python benchmark_throughput.py --backend vllm --model "meta-llama/Meta-Llama-3-8B-Instruct" --input-len=128 --output-len=128 --gpu-memory-utilization=0.98 --num-prompts=1
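
  If you want to compare several input/output lengths, a small wrapper around the same command saves retyping. The helper below is hypothetical (not part of this repository); it simply shells out to benchmark_throughput.py with the flags shown above:

  ```python
  # Hypothetical convenience wrapper (not part of this repository): runs
  # benchmark_throughput.py once per (input_len, output_len) pair, reusing
  # the flags from the example command above.
  import subprocess

  def run_throughput(model: str, input_len: int, output_len: int,
                     num_prompts: int = 1) -> None:
      subprocess.run([
          "python", "benchmark_throughput.py",
          "--backend", "vllm",
          "--model", model,
          f"--input-len={input_len}",
          f"--output-len={output_len}",
          "--gpu-memory-utilization=0.98",
          f"--num-prompts={num_prompts}",
      ], check=True)

  for in_len, out_len in [(128, 128), (128, 512)]:
      run_throughput("meta-llama/Meta-Llama-3-8B-Instruct", in_len, out_len)
  ```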
- Running benchmarks using shell scripts: Alternatively, you can use the provided shell scripts. Before running them, grant execute permissions using chmod:

  chmod +x benchmark_throughput.sh
  chmod +x benchmark_latency.sh
  chmod +x benchmark_all.sh

  Then run the script for the benchmark you want:

  For throughput benchmarking: ./benchmark_throughput.sh
  For latency benchmarking: ./benchmark_latency.sh
  To run both throughput and latency benchmarks: ./benchmark_all.sh
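
  If you prefer to drive everything from Python instead of the shell, the same sequence can be scripted. This sketch assumes benchmark_all.sh is equivalent to running the two scripts back to back, which you should confirm against the script itself:

  ```python
  # Sketch only: run both benchmark scripts in sequence from Python,
  # assuming benchmark_all.sh does nothing more than invoke them in order.
  import subprocess

  for script in ["./benchmark_throughput.sh", "./benchmark_latency.sh"]:
      print(f"Running {script} ...")
      subprocess.run([script], check=True)
  ```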
Make sure to replace the example parameters with your own where needed. The shell scripts and Python script are designed to run in a Unix-like environment. For further customization or additional parameters, refer to the script documentation or source code.