备注:
按照官方文档,排除官方文档不清楚的步骤,安装到最后一步了,只缺 HF_TOKEN了,因为我的kubernetes集群无法访问HF.
root@bastion quickstart\]# cat /etc/redhat-release
Rocky Linux release 9.5 (Blue Onyx)
\[root@bastion quickstart\]#
\[root@bastion quickstart\]# kubectl get nodes
NAME STATUS ROLES AGE VERSION
master01.kcloudonline.com Ready control-plane 46h v1.31.0
worker01.kcloudonline.com Ready \
Saving to: '/usr/local/bin/yq'
/usr/local/bin/yq 100%[=====================================================================================>] 10.95M 1002KB/s in 7.1s
2025-09-26 08:34:22 (1.55 MB/s) - '/usr/local/bin/yq' saved [11477176/11477176]
root@bastion quickstart\]# sudo chmod +x /usr/local/bin/yq \[root@bastion quickstart\]# yq --version yq (https://github.com/mikefarah/yq/) version v4.47.2 \[root@bastion quickstart\]# \[root@bastion llm-d\]# curl -s "https://raw.githubusercontent.com/kubernetes-sigs/kustomize/master/hack/install_kustomize.sh" \| bash v5.7.1 kustomize installed to /software/llm-d/kustomize \[root@bastion llm-d\]# ls kustomize llm-d-deployer \[root@bastion llm-d\]# cp kustomize /usr/local/bin/ \[root@bastion llm-d\]# kustomize version v5.7.1 \[root@bastion llm-d\]# \[root@bastion quickstart\]# ./install-deps.sh Rocky Linux 9 - BaseOS 2.5 kB/s \| 4.1 kB 00:01 Rocky Linux 9 - AppStream 5.0 kB/s \| 4.5 kB 00:00 Rocky Linux 9 - Extras 631 B/s \| 2.9 kB 00:04 Dependencies resolved. ========================================================================================================================================================================= Package Architecture Version Repository Size =========================================================================================================================================================================Installing: make x86_64 1:4.3-8.el9 baseos 529 k Transaction Summary =========================================================================================================================================================================Install 1 Package Total download size: 529 k Installed size: 1.6 M Downloading Packages: make-4.3-8.el9.x86_64.rpm 301 kB/s \| 529 kB 00:01 -------------------------------------------------------------------------------------------------------------------------------------------------------------------------Total 212 kB/s \| 529 kB 00:02 Running transaction check Transaction check succeeded. Running transaction test Transaction test succeeded. Running transaction Preparing : 1/1 Installing : make-1:4.3-8.el9.x86_64 1/1 Running scriptlet: make-1:4.3-8.el9.x86_64 1/1 Verifying : make-1:4.3-8.el9.x86_64 1/1 Installed: make-1:4.3-8.el9.x86_64 Complete! Installing yq... \[root@bastion quickstart\]# ### 要求的凭证和配置 (Required credentials and configuration) llm-d-deployer GitHub repo -- clone here([https://github.com/llm-d/llm-d-deployer.git](https://github.com/llm-d/llm-d-deployer.git "https://github.com/llm-d/llm-d-deployer.git")) HuggingFace HF_TOKEN (https://huggingface.co/docs/hub/en/security-tokens) with download access for the model you want to use. By default the sample application will use meta-llama/Llama-3.2-3B-Instruct. ⚠️ Your Hugging Face account must have access to the model you want to use. You may need to visit Hugging Face meta-llama/Llama-3.2-3B-Instruct and accept the usage terms if you have not already done so. ### 目标平台 (Target Platforms) Since the llm-d-deployer is based on helm charts, llm-d can be deployed on a variety of Kubernetes platforms. ### 安装llm-d (llm-d Installation) Only a single installation of llm-d on a cluster is currently supported. In the future, multiple model services will be supported. Until then, uninstall llm-d before reinstalling. The llm-d-deployer contains all the helm charts necessary to deploy llm-d. To facilitate the installation of the helm charts, the llmd-installer.sh script is provided. This script will populate the necessary manifests in the manifests directory. After this, it will apply all the manifests in order to bring up the cluster. The llmd-installer.sh script aims to simplify the installation of llm-d using the llm-d-deployer as it's main function. It scripts as many of the steps as possible to make the installation process more streamlined. This includes: Installing the GAIE infrastructure Creating the namespace with any special configurations Creating the pull secret to download the images Creating the model service CRDs Applying the helm charts Deploying the sample app (model service) It also supports uninstalling the llm-d infrastructure and the sample app. Before proceeding with the installation, ensure you have completed the prerequisites and are able to issue kubectl or oc commands to your cluster by configuring your \~/.kube/config file or by using the oc login command. #### Usage The installer needs to be run from the llm-d-deployer/quickstart directory as a cluster admin with CLI access to the cluster. ./llmd-installer.sh \[OPTIONS
Flags
案例(Examples)
在Kubernetes 安装 (Install llm-d on an Existing Kubernetes Cluster)
export HF_TOKEN="your-token"
./llmd-installer.sh
root@bastion quickstart\]# ./llmd-installer.sh
📂 Setting up script environment...
kubectl can reach to a running Kubernetes cluster.
❌ HF_TOKEN not set; Run: export HF_TOKEN=\