GPT-2 scripts for MUSA, CPU, and CUDA.
Run a script directly; it automatically downloads the dataset from Hugging Face:
- python start_clm_wiki2.py # for wiki2 dataset
- python start_clm_wiki103.py # for wiki103 dataset
A few changes are needed to run on a MUSA card.
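
One change of this kind is usually device selection. The sketch below is illustrative only and is not taken from these scripts: `pick_device` is a hypothetical helper, and it assumes the `torch_musa` extension is installed for MUSA cards and exposes `torch.musa.is_available()` once imported. It falls back to CUDA, then to CPU.

```python
import importlib.util


def pick_device(prefer=("musa", "cuda")):
    """Return the first available backend name from `prefer`, else "cpu".

    Hypothetical helper for illustration; not part of the scripts above.
    """
    if importlib.util.find_spec("torch") is None:
        return "cpu"  # PyTorch is not installed, so there is nothing to probe
    import torch

    for name in prefer:
        if name == "musa":
            # Assumption: importing torch_musa registers the `torch.musa` backend.
            if importlib.util.find_spec("torch_musa") is None:
                continue
            import torch_musa  # noqa: F401
        backend = getattr(torch, name, None)
        if backend is not None and backend.is_available():
            return name
    return "cpu"
```

The returned string can be passed to `torch.device(...)` or to `model.to(...)`, for example `model.to(pick_device())`.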