Implementing QLoRA and best-model checkpointing (2 issues) #47
🚀 Key Feature 1: QLoRA for Efficient Fine-Tuning
💾 Key Feature 2: Best-Model Checkpointing
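As a rough illustration of what best-model checkpointing generally means, here is a minimal sketch. It is not the code from this PR: the `BestCheckpointTracker` class, the `best_model` directory name, the JSON payload, and the loss values are all hypothetical, and it assumes "best" means lowest validation loss. A real run would save the LoRA adapter weights where this sketch writes JSON.

```python
import json
import math
import tempfile
from pathlib import Path


class BestCheckpointTracker:
    """Keep only the checkpoint with the lowest validation loss seen so far."""

    def __init__(self, output_dir):
        # Hypothetical layout: the best checkpoint lives in <output_dir>/best_model.
        self.best_dir = Path(output_dir) / "best_model"
        self.best_loss = math.inf

    def update(self, step, val_loss, state):
        """Save `state` if `val_loss` beats the best so far; return True if saved."""
        # `not <` also rejects NaN losses, which never compare as smaller.
        if not val_loss < self.best_loss:
            return False
        self.best_loss = val_loss
        self.best_dir.mkdir(parents=True, exist_ok=True)
        # Stand-in for saving adapter weights: overwrite a small JSON file.
        payload = {"step": step, "val_loss": val_loss, "state": state}
        (self.best_dir / "checkpoint.json").write_text(json.dumps(payload))
        return True


def demo():
    """Feed made-up validation losses through the tracker."""
    with tempfile.TemporaryDirectory() as tmp:
        tracker = BestCheckpointTracker(tmp)
        losses = [0.9, 0.7, 0.8, 0.65]  # illustrative values only
        saved = [
            tracker.update(step, loss, {"dummy": step})
            for step, loss in enumerate(losses)
        ]
        best = json.loads((tracker.best_dir / "checkpoint.json").read_text())
        return saved, best
```

Calling `demo()` shows the intended behavior: the checkpoint is overwritten only on steps where the validation loss improves, so the file left at the end corresponds to the best step rather than the last one.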
📈 Other Improvements:
.gitignore: The wandb/ directory has been added to .gitignore to prevent local log files from being committed.

I was able to run it on Google Colab: https://colab.research.google.com/drive/1H4cLS-HotsaS_40Bfo_7d_fZVwqBcrLW#scrollTo=yiE6lvpvdmio. I was able to save the final LoRA adapters in the output along with the best models. You will have to add your own Hugging Face token there.
These changes make the project significantly more practical and user-friendly, especially for those without access to high-end industrial hardware. Looking forward to my first commit. Feedback is really appreciated, and I will try my best.