Skip to content

Commit f0c1874

Browse files
edits
1 parent 492604b commit f0c1874

File tree

3 files changed

+1
-121
lines changed

3 files changed

+1
-121
lines changed

tests/functional_tests/test_cases/gpt/gpt_dynamic_inference_tp1_pp1_583m_cuda_graphs_logitsmatch_decode_graphs_only/model_config.yaml.tmp

Lines changed: 0 additions & 62 deletions
This file was deleted.

tests/functional_tests/test_cases/gpt/gpt_dynamic_inference_tp1_pp1_dp8_583m_logitsmatch_zmq/model_config.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -51,7 +51,7 @@ MODEL_ARGS:
5151
--prompts: "Time travel to 2008, and go to a bar or a club or one of the myriad disco-basements on the Lower East Side that does not quite know which of those it is. Dance awkwardly in a room full of other glittered-up nerds, and wait for something to happen, buoyed on the feeling that this is the big swollen heart of life, that this is New York like the movies."
5252
--incoming-requests-per-step: 32
5353
--use-flashinfer-fused-rope: true
54-
--use-inference-optimized-layers: tru
54+
--use-inference-optimized-layers: true
5555

5656
METRICS:
5757
- "generated_tokens"

tests/functional_tests/test_cases/gpt/gpt_dynamic_inference_tp1_pp1_dp8_583m_logitsmatch_zmq/model_config.yaml.tmp

Lines changed: 0 additions & 58 deletions
This file was deleted.

0 commit comments

Comments
 (0)