@yeshsurya
Contributor

Summary

This PR adds support for GPT-OSS models.

Testing Done:

Inference Benchmark Execution
Total inference runs: 18 executions
12 successful benchmark measurements
6 pre-run warmup iterations
Configurations tested: 3 scenarios with 2 runs each
All scenarios passed with consistent, measurable improvements
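
The run counts above are consistent with one possible layout: 3 scenarios, each timed in 2 variants, with 1 untimed warmup and 2 measured runs per variant. The sketch below illustrates that warmup-then-measure pattern under this assumption. It is not the harness used for this PR; `benchmark`, `run_all`, `placeholder_workload`, and the `baseline`/`patched` variant names are hypothetical.

```python
import time
from typing import Callable, Dict, List, Tuple


def benchmark(fn: Callable[[], None], warmup: int = 1, runs: int = 2) -> List[float]:
    """Call fn `warmup` times untimed, then time `runs` further calls (seconds)."""
    for _ in range(warmup):
        fn()  # warmup: the result and timing are discarded
    timings = []
    for _ in range(runs):
        start = time.perf_counter()
        fn()
        timings.append(time.perf_counter() - start)
    return timings


def run_all(
    scenarios: Dict[str, Dict[str, Callable[[], None]]],
    warmup: int = 1,
    runs: int = 2,
) -> Tuple[Dict[Tuple[str, str], List[float]], int]:
    """Benchmark every variant of every scenario and count all executions."""
    results: Dict[Tuple[str, str], List[float]] = {}
    total_calls = 0
    for name, variants in scenarios.items():
        for variant, fn in variants.items():
            results[(name, variant)] = benchmark(fn, warmup, runs)
            total_calls += warmup + runs
    return results, total_calls


def placeholder_workload() -> None:
    """Hypothetical stand-in for a single inference call."""
    sum(i * i for i in range(1000))


# Assumed layout: 3 scenarios x 2 variants x (1 warmup + 2 measured runs).
scenarios = {
    f"scenario_{i}": {"baseline": placeholder_workload, "patched": placeholder_workload}
    for i in range(1, 4)
}
results, total_calls = run_all(scenarios)
measured = sum(len(t) for t in results.values())
print(f"total={total_calls} measured={measured} warmup={total_calls - measured}")
```

When timing GPU work, the clock should only be read after `torch.cuda.synchronize()`, since CUDA kernels launch asynchronously.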

  • Hardware Type: RTX A6000
  • run `make test` to ensure correctness
  • run `make checkstyle` to ensure code style
  • run `make test-convergence` to ensure convergence

@yeshsurya yeshsurya changed the title from "Yeshwanth/gpt oss" to "[feat]: Add support for gpt-oss" Nov 22, 2025
@yeshsurya yeshsurya requested a review from kashif November 24, 2025 09:53
@yeshsurya yeshsurya requested a review from kashif November 29, 2025 06:36
yeshsurya and others added 6 commits December 2, 2025 22:16
Added GPT-OSS to the supported models table in README.md with its supported operations (RoPE, RMSNorm, CrossEntropyLoss, FusedLinearCrossEntropy).

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <[email protected]>
@yeshsurya
Contributor Author

@Tcc0403, please help review.

@yeshsurya
Contributor Author

@shimizust, please help review.

Tcc0403 previously approved these changes Dec 3, 2025
Collaborator

@Tcc0403 Tcc0403 left a comment

Thank you, lgtm!

@yeshsurya yeshsurya requested a review from Tcc0403 December 6, 2025 09:36
@yeshsurya
Contributor Author

> Thank you, lgtm!

Please help merge this. It looks like the GPU tests are being skipped, since no cloud secrets are configured for them?

@Tcc0403 Tcc0403 merged commit 720cc68 into linkedin:main Dec 6, 2025
3 of 7 checks passed
@ParagEkbote ParagEkbote mentioned this pull request Dec 6, 2025

3 participants