allow using gloo from flag #239

Merged
merged 1 commit into from
Jul 22, 2025

@tushar00jain (Contributor) commented Jul 22, 2025

Summary:
add an env var that enables using gloo for the sample training script


Stack created with Sapling. Best reviewed with ReviewStack.

@facebook-github-bot added the CLA Signed label Jul 22, 2025

@d4l3k (Member) left a comment

LGTM

@H-Huang (Member) left a comment

LGTM!

@@ -14,6 +14,7 @@
os.environ["NCCL_HOSTID"] = str(REPLICA_GROUP_ID)

USE_STREAMING = os.getenv("USE_STREAMING", "False") == "True"
USE_NCCL = os.getenv("USE_NCCL", "False") == "True"

Member

so we use gloo by default?

Contributor Author

yeah since nccl doesn't work very well right now
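
For reference, the flag semantics discussed in this thread can be sketched as below. This is a minimal illustration and not the code from the PR: `env_flag` and `select_backend` are hypothetical helper names, and mapping the flag to the backend strings "nccl" and "gloo" is an assumption based on the reply above. Only the `os.getenv("USE_NCCL", "False") == "True"` comparison comes from the diff.

```python
import os


def env_flag(name, default="False", environ=None):
    """Return True only when the variable equals the exact string "True".

    Mirrors the comparison in the diff: os.getenv(name, "False") == "True".
    `environ` is a hypothetical parameter added so the helper can be tested
    without touching the real process environment.
    """
    environ = os.environ if environ is None else environ
    return environ.get(name, default) == "True"


def select_backend(environ=None):
    """Hypothetical helper: pick a backend name from the USE_NCCL flag.

    Assumption: NCCL is opt-in and gloo is the fallback, as the review
    thread suggests. The sample training script itself may wire this
    differently.
    """
    return "nccl" if env_flag("USE_NCCL", environ=environ) else "gloo"


if __name__ == "__main__":
    # With USE_NCCL unset, the sketch falls back to gloo.
    print(select_backend())
```

Because the comparison is an exact string match, values such as `true` or `1` leave the sketch on gloo; only `USE_NCCL=True` switches it to NCCL.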

@tushar00jain tushar00jain merged commit be3e833 into pytorch:main Jul 22, 2025
8 checks passed
@tushar00jain tushar00jain deleted the pr239 branch July 22, 2025 21:11
Labels
CLA Signed

4 participants