Refactor Tokenizer -> BaseTokenizer #1333
base: main
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -3,3 +3,4 @@ pytest==7.3.2 | |
pytest-cov | ||
pre-commit | ||
tomli-w >= 1.1.0 | ||
transformers | ||
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -2,7 +2,6 @@ torchdata >= 0.8.0 | |
datasets >= 3.6.0 | ||
tomli >= 1.1.0 ; python_version < "3.11" | ||
tensorboard | ||
tiktoken | ||
Review comment: Please update https://github.com/pytorch/torchtitan/blob/main/pyproject.toml as well. |
||
blobfile | ||
tabulate | ||
wandb | ||
|
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -14,7 +14,7 @@ We actively welcome your pull requests. | |
2. If you've added code that should be tested, add tests. | ||
3. If you've changed APIs, update the documentation. | ||
4. Ensure the test suite passes. | ||
- 5. Make sure your code lints (`pre-commit run --all-files`). | |
+ 5. Make sure your code lints (`pre-commit run --from-ref origin/main --to-ref HEAD`). | |
Review comment: IIUC this restricts linting to the changes between the current main and the latest commit. Can I ask why? |
||
6. If you haven't already, complete the Contributor License Agreement ("CLA"). | ||
|
||
### Contributor License Agreement ("CLA") | ||
|
Review comment: Is this for running the unit tests?