-
Notifications
You must be signed in to change notification settings - Fork 48
Add useful internal vllm test #200
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
pytest will automatically run every file that is named test_*.py , since this is a manual script example of how to run quantization, maybe add it under examples folder and rename the file name.
As well this repo is used by customers, if we add the script here we will need to maintain it constantly to be aligned with latest vllm interface. (@linoybu , @ulivne, what is your opinion?)
| os.environ['EXPERIMENTAL_WEIGHT_SHARING'] = "0" | ||
| os.environ['PT_HPU_ENABLE_LAZY_COLLECTIVES'] = "true" | ||
| os.environ['PT_HPU_LAZY_MODE'] = "1" | ||
| #PT_HPUGRAPH_DISABLE_TENSOR_CACHE=true |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Do we want to remain with all the lines the are commented?
Maybe add some description of the available options in each argument and leave only one default LLM creation and the script user will edit it for himself.
| end = time.time() | ||
| if is_fp8: | ||
| #if is_measure: | ||
| # llm.llm_engine.model_executor.shutdown_inc() |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
delete?
| @@ -0,0 +1,15 @@ | |||
| { | |||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
we should ship internal tests
No description provided.