Skip to content

Commit 82fc6f0

Browse files
committed
readme updates
1 parent eff6275 commit 82fc6f0

File tree

5 files changed

+25
-1
lines changed

5 files changed

+25
-1
lines changed

README.md

Lines changed: 5 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -83,6 +83,11 @@ Configuration files are available in the [configs](./src/fmbench/configs) folder
8383

8484
## New in this release
8585

86+
## 2.1.2
87+
88+
1. Deepseek prompt updates.
89+
1. Handle case for < 1 txn/minute.
90+
8691
## 2.1.1
8792

8893
1. Optimized prompt templates and config files for DeepSeek-R1 and Amazon Nova for `ConvFinQA` and `LongBench` datasets.

docs/deepseek.md

Lines changed: 17 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -15,3 +15,20 @@ python main.py --config-file configs/deepseek/deepseek-convfinqa.yml
1515
```
1616
Change the `--config-file` parameter to [`configs/deepseek/deepseek-longbench.yml`](https://github.com/aws-samples/fmbench-orchestrator/blob/main/configs/deepseek/deepseek-longbench.yml) or [`configs/deepseek/deepseek-openorca.yml`](https://github.com/aws-samples/fmbench-orchestrator/blob/main/configs/deepseek/deepseek-openorca.yml) to use other datasets for benchmarking. These orchestrator files test various Deepseek-R1 distilled models on `g6e` instances, edit this file as per your requirements.
1717

18+
## Benchmark Deepseek-R1 quantized models on Amazon EC2
19+
20+
👉 Make sure your account has enough service quota for vCPUs to run this benchmark. We would be using `g6e.12xlarge` instance for this test.
21+
22+
23+
1. Create a `g6e.12xlarge` instance and run the `DeepSeek-R1 1.58b quantized` model on this instance by following the steps 1 through 8 described [here](https://github.com/aarora79/deepseek-r1-ec2?tab=readme-ov-file#quantized-models).
24+
25+
1. Follow steps 1 through 5 [here](https://aws-samples.github.io/foundation-model-benchmarking-tool/benchmarking_on_ec2.html#benchmarking-on-an-instance-type-with-nvidia-gpus-or-aws-chips) to setup `FMBench` on this instance.
26+
27+
1. Next run the following command to benchmark LongBench
28+
29+
```{.bashrc}
30+
TMP_DIR=/tmp
31+
fmbench --config-file $TMP_DIR/fmbench-read/configs/deepseek/config-deepseek-r1-quant1.58-longbench-byoe.yml --local-mode yes --write-bucket placeholder --tmp-dir $TMP_DIR > fmbench.log 2>&1
32+
```
33+
34+
1. Once the run completes you should see the benchmarking results in a folder called `results-DeepSeek-R1-quant-1.58bit-g6e.12xl` present in your current directory.

docs/manifest.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -37,6 +37,7 @@ Here is a listing of the various configuration files available out-of-the-box wi
3737
[└── byoe/config-model-byo-sagemaker-endpoint.yml](configs/byoe/config-model-byo-sagemaker-endpoint.yml)
3838
**deepseek**
3939
[├── deepseek/config-deepseek-r1-ollama.yml](configs/deepseek/config-deepseek-r1-ollama.yml)
40+
[├── deepseek/config-deepseek-r1-quant1.58-longbench-byoe.yml](configs/deepseek/config-deepseek-r1-quant1.58-longbench-byoe.yml)
4041
[├── deepseek/config-deepseek-r1-vllm-convfinqa.yml](configs/deepseek/config-deepseek-r1-vllm-convfinqa.yml)
4142
[├── deepseek/config-deepseek-r1-vllm-longbench.yml](configs/deepseek/config-deepseek-r1-vllm-longbench.yml)
4243
[└── deepseek/config-deepseek-r1-vllm-openorca.yml](configs/deepseek/config-deepseek-r1-vllm-openorca.yml)

manifest.txt

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -30,6 +30,7 @@ configs/bedrock/config-nova-all-models.yml
3030
configs/bert/config-distilbert-base-uncased.yml
3131
configs/byoe/config-model-byo-sagemaker-endpoint.yml
3232
configs/deepseek/config-deepseek-r1-ollama.yml
33+
configs/deepseek/config-deepseek-r1-quant1.58-longbench-byoe.yml
3334
configs/deepseek/config-deepseek-r1-vllm-convfinqa.yml
3435
configs/deepseek/config-deepseek-r1-vllm-longbench.yml
3536
configs/deepseek/config-deepseek-r1-vllm-openorca.yml

pyproject.toml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@ build-backend = "setuptools.build_meta"
88

99
[project]
1010
name = "fmbench"
11-
version = "2.1.1"
11+
version = "2.1.2"
1212
description = "Benchmark performance of **any Foundation Model (FM)** deployed on **any AWS Generative AI service**, be it **Amazon SageMaker**, **Amazon Bedrock**, **Amazon EKS**, or **Amazon EC2**. The FMs could be deployed on these platforms either directly through `FMbench`, or, if they are already deployed then also they could be benchmarked through the **Bring your own endpoint** mode supported by `FMBench`."
1313
authors = [
1414
{ name = "Amit Arora", email = "[email protected]" },

0 commit comments

Comments
 (0)