Commit 104d4bc

docs: add the Hugging Face secret to readme

Signed-off-by: Kay Yan <[email protected]>
Parent: f34e7f6

1 file changed: pkg/README.md (+10 −3 lines)
````diff
@@ -7,12 +7,19 @@ The current manifests rely on Envoy Gateway [v1.2.1](https://gateway.envoyproxy.

 1. **Deploy Sample vLLM Application**

-   A sample vLLM deployment with the proper protocol to work with LLM Instance Gateway can be found [here](https://github.com/kubernetes-sigs/llm-instance-gateway/tree/main/examples/poc/manifests/vllm/vllm-lora-deployment.yaml#L18).
+   Create a Hugging Face secret to download the model [meta-llama/Llama-2-7b-hf](https://huggingface.co/meta-llama/Llama-2-7b-hf). Ensure that the token grants access to this model.
+   Deploy a sample vLLM deployment with the proper protocol to work with LLM Instance Gateway.
+   ```bash
+   kubectl create secret generic hf-token --from-literal=token=$HF_TOKEN # Your Hugging Face token with access to Llama2
+   kubectl apply -f ../examples/poc/manifests/vllm/vllm-lora-deployment.yaml
+   ```

 1. **Deploy InferenceModel and InferencePool**

-   You can find a sample InferenceModel and InferencePool configuration, based on the vLLM deployments mentioned above, [here](https://github.com/kubernetes-sigs/llm-instance-gateway/tree/main/examples/poc/manifests/inferencepool-with-model.yaml).
-
+   Deploy a sample InferenceModel and InferencePool configuration, based on the vLLM deployments mentioned above.
+   ```bash
+   kubectl apply -f ../examples/poc/manifests/inferencepool-with-model.yaml
+   ```

 1. **Update Envoy Gateway Config to enable Patch Policy**
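As context for the `kubectl create secret generic --from-literal` step above: Kubernetes stores Secret values base64-encoded in the Secret object, so reading the token back involves decoding. A minimal sketch of that round trip, using a hypothetical placeholder value rather than a real token:

```shell
# Hypothetical placeholder; a real Hugging Face token typically starts with "hf_".
token="hf_example_token"

# kubectl create secret base64-encodes the literal, equivalent to:
encoded=$(printf '%s' "$token" | base64)

# Reading the value back (e.g. via `kubectl get secret -o jsonpath`) requires decoding:
decoded=$(printf '%s' "$encoded" | base64 -d)
echo "$decoded"
```

This is why inspecting a Secret with `kubectl get secret hf-token -o yaml` shows an encoded string rather than the plain token.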
