LLM demos adjustments for Windows #2940
Some comments.
windows_create_package.bat
Outdated
copy %cd%\bazel-out\x64_windows-opt\bin\src\python39.dll dist\windows\ovms
if !errorlevel! neq 0 exit /b !errorlevel!
xcopy C:\opt\ovms-python-3.9.6-embed dist\windows\ovms\python /E /I /H
if %errorlevel% neq 0 (
Use `!errorlevel!` here, as in the check two lines above.
setupvars.bat
Outdated
:: limitations under the License.
::
@echo off
setlocal EnableExtensions EnableDelayedExpansion
Remove `setlocal` and `endlocal`; otherwise the settings will not propagate to the calling terminal.
demos/continuous_batching/README.md
Outdated
pip3 install -U -r demos/common/export_models/requirements.txt
Download export script, install it's dependencies and create directory for the models:
```console
curl https://raw.githubusercontent.com/openvinotoolkit/model_server/refs/heads/main/demos/common/export_models/export_model.py
This command only prints the file content to stdout; it does not save the file.
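One possible fix, sketched under assumptions: pass `-o` so curl writes the response to a file. The `download` helper name and the `-f`/`-L` flags are illustrative additions, not part of the PR; the target file name `export_model.py` is taken from the export commands used elsewhere in the demos.

```shell
# Hypothetical helper (not part of the PR): save a URL to a named local file.
#   -f  fail on HTTP errors instead of saving an error page
#   -L  follow redirects
#   -o  write the response body to the given file instead of stdout
download() {
    curl -fL "$1" -o "$2"
}

# Example call for the export script referenced in the README:
# download "https://raw.githubusercontent.com/openvinotoolkit/model_server/refs/heads/main/demos/common/export_models/export_model.py" export_model.py
```

In the README itself, the equivalent one-line change would be to append `-o export_model.py` to the existing `curl` command.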
demos/continuous_batching/README.md
Outdated
**CPU**
```console
python demos/common/export_models/export_model.py text_generation --source_model meta-llama/Meta-Llama-3-8B-Instruct --weight-format fp16 --kv_cache_precision u8 --config_file_path models/config.json --model_repository_path models --overwrite_models
Across the demos, some commands use `python demos/common/export_models/export_model.py` and others use `python export_model.py`. Maybe unify them?
demos/continuous_batching/README.md
Outdated
git clone --branch v0.6.0 --depth 1 https://github.com/vllm-project/vllm
cd vllm
pip3 install -r requirements-cpu.txt --extra-index-url https://download.pytorch.org/whl/cpu
cd benchmarks
wget https://huggingface.co/datasets/anon8231489123/ShareGPT_Vicuna_unfiltered/resolve/main/ShareGPT_V3_unfiltered_cleaned_split.json # sample dataset
curl https://huggingface.co/datasets/anon8231489123/ShareGPT_Vicuna_unfiltered/resolve/main/ShareGPT_V3_unfiltered_cleaned_split.json # sample dataset
Add `-o` with an output file name; without it, curl prints the dataset to stdout instead of saving it.
No description provided.