Skip to content

Conversation

@xt2357
Copy link

@xt2357 xt2357 commented Aug 7, 2018

Subtracting max_y from all y makes the output of softmax harder to overflow.

Subtracting max_y from all y makes the output of softmax harder to overflow.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant