You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
2.**Run the Conversion Script:** Execute the following command that downloads the specified Hugging Face model and converts its weights into the MaxText format. The conversion script only supports official versions of models from Hugging Face. To see the specific models and versions currently supported for conversion, please refer to the `HF_IDS` dictionary in the MaxText utility file [here](https://github.com/AI-Hypercomputer/maxtext/blob/main/src/MaxText/utils/ckpt_conversion/utils/utils.py).
85
85
86
86
```sh
87
-
pip install torch # Ensure torch is installed for the conversion script
87
+
python3 -m pip install torch --index-url https://download.pytorch.org/whl/cpu# Ensure torch is installed for the conversion script
Copy file name to clipboardExpand all lines: docs/tutorials/sft_on_multi_host.md
+7-12Lines changed: 7 additions & 12 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -50,7 +50,7 @@ The `docker_upload_runner.sh` script uploads your Docker image to Artifact Regis
50
50
Install XPK by following the instructions in the [official documentation](https://github.com/AI-Hypercomputer/xpk?tab=readme-ov-file#installation-via-pip).
51
51
52
52
## 3. Create GKE cluster
53
-
Use a pathways ready GKE cluster as described [here](https://docs.cloud.google.com/ai-hypercomputer/docs/workloads/pathways-on-cloud/create-gke-cluster)
53
+
Use a pathways ready GKE cluster as described [here](https://docs.cloud.google.com/ai-hypercomputer/docs/workloads/pathways-on-cloud/create-gke-cluster).
54
54
55
55
## 4. Environment configuration
56
56
```bash
@@ -60,7 +60,7 @@ export CLUSTER_NAME=<Name of GKE Cluster>
60
60
export ZONE=<GKE Cluster Zone>
61
61
62
62
# -- Workload Configuration --
63
-
export WORKLOAD_NAME=<Name of Workload># e.g., $(date +%Y-%m-%d-%H-%M-%S)
63
+
export WORKLOAD_NAME=<Name of Workload># e.g., sft-$(date +%s)
2.**Run the Conversion Script:** Execute the following command that downloads the specified Hugging Face model and converts its weights into the MaxText format. The conversion script only supports official versions of models from Hugging Face. To see the specific models and versions currently supported for conversion, please refer to the `HF_IDS` dictionary in the MaxText utility file [here](https://github.com/AI-Hypercomputer/maxtext/blob/main/src/MaxText/utils/ckpt_conversion/utils/utils.py).
105
+
2.**Run the Conversion Script:** Execute the following commands on a CPU machine that downloads the specified HuggingFace model and converts its weights into the MaxText format. This command will download the HuggingFace model and convert it to the MaxText format, saving it to the specified GCS bucket. The conversion script only supports official versions of models from HuggingFace. To see the specific models and versions currently supported for conversion, please refer to the `HF_IDS` dictionary in the MaxText utility file [here](https://github.com/AI-Hypercomputer/maxtext/blob/main/src/MaxText/utils/ckpt_conversion/utils/utils.py).
106
106
107
107
```bash
108
108
USE_ZARR3=<Flag to use zarr3># True to run SFT with McJAX, False to run SFT with Pathways
109
109
USE_OCDBT=<Flag to use ocdbt># True to run SFT with McJAX, False to run SFT with Pathways
0 commit comments