-
I noticed that while developing my IBM zDNN backend, the Thinking out loud, is the operation still used? Or has it been deprecated? Or is it model-dependent where only a specific few models will utilise the operation? |
Beta Was this translation helpful? Give feedback.
Answered by
slaren
Jul 23, 2025
Replies: 1 comment
-
It's not important for inference, but during training it's used in the backwards pass of |
Beta Was this translation helpful? Give feedback.
0 replies
Answer selected by
taronaeo
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
It's not important for inference, but during training it's used in the backwards pass of
GGML_OP_MUL_MAT
.