[QUESTION] why the time of one iter in nsys longer than that in the ouput log? #1174
Unanswered
hanwen-sun
asked this question in
Q&A
Replies: 1 comment
-
|
Marking as stale. No activity in 60 days. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
I want to compare the speed of training llama2-7b between libai(https://github.com/Oneflow-Inc/libai) and Megatron-LM in NVIDIA A800-SXM4-80G.
But I find the time of one iter in nsys is longer than the output in log when using Megatron-LM;
and I can't find many gap in the cuda stream.
Can anyone explain this to me?
Beta Was this translation helpful? Give feedback.
All reactions