Commit f417256 ("update"), 1 parent: ae14a13

File tree

  • llama-index-integrations/llms/llama-index-llms-ipex-llm/examples

1 file changed (+6, -0 lines)

llama-index-integrations/llms/llama-index-llms-ipex-llm/examples/README.md

Lines changed: 6 additions & 0 deletions
````diff
@@ -44,6 +44,12 @@ The example [low_bit.py](./low_bit.py) shows how to save and load low_bit model
 python low_bit.py -m <path_to_model> -d <cpu_or_xpu> -q <query_to_LLM> -s <save_low_bit_dir>
 ```
 
+> Please note that this example uses the [HuggingFaceH4/zephyr-7b-alpha](https://huggingface.co/HuggingFaceH4/zephyr-7b-alpha) model for demonstration. It requires updating the `transformers` and `tokenizers` packages:
+>
+> ```bash
+> pip install -U transformers==4.37.0 tokenizers==0.15.2
+> ```
+
 ### More Data Types Example
 
 By default, `IpexLLM` loads the model in int4 format. To load a model in a different data format such as `sym_int5` or `sym_int8`, use the `load_in_low_bit` option in `IpexLLM`. To load the model on a different device such as `cpu` or `xpu`, use the `device_map` option in `IpexLLM`.
````
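As a rough illustration of combining those two options, the sketch below builds an `IpexLLM` instance with a non-default data format and device. It is a hypothetical helper, not code from the commit: it assumes the `llama-index-llms-ipex-llm` package is installed and that `IpexLLM.from_model_id` accepts the `load_in_low_bit` and `device_map` options named above; the import is deferred inside the function because constructing the LLM downloads and quantizes the model.

```python
def build_llm(device: str = "cpu", low_bit: str = "sym_int5"):
    """Hypothetical helper: load zephyr-7b-alpha via IpexLLM in a chosen
    low-bit format (`load_in_low_bit`) on a chosen device (`device_map`).

    The import is deferred so merely defining this function is cheap;
    calling it triggers the model download and quantization.
    """
    from llama_index.llms.ipex_llm import IpexLLM

    return IpexLLM.from_model_id(
        model_name="HuggingFaceH4/zephyr-7b-alpha",
        tokenizer_name="HuggingFaceH4/zephyr-7b-alpha",
        load_in_low_bit=low_bit,   # e.g. "sym_int5" or "sym_int8" instead of the int4 default
        device_map=device,         # "cpu" or "xpu"
        max_new_tokens=64,
    )


if __name__ == "__main__":
    # Heavyweight: downloads the model, then runs a single completion.
    llm = build_llm(device="cpu", low_bit="sym_int8")
    print(llm.complete("What is IPEX-LLM?"))
```

The same pattern applies to any model id; only `model_name`/`tokenizer_name` and the two options change.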

0 commit comments