[fix]: correct condition for restoring layer embeddings in TransformerDecorator/AdaptiveLayerLoss (#3560)
* fix: correct condition for restoring layer embeddings in TransformerDecorator/AdaptiveLayerLoss
When training with AdaptiveLayerLoss, `all_layer_embeddings` is erroneously deleted from the output when `output_hidden_states` is True, i.e. exactly when the caller asked for it.
* fix: include all layer embeddings in call_use_cache if requested
* AdaptiveLayerLoss simplification
We can rely on the original 'features' some more, as @emapco also proposed in his initial commit
* refactor: remove unused call_idx in TransformerDecorator
* Simplify call method in TransformerDecorator even further
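
For illustration only, a minimal sketch of the corrected restore condition from the first item. The function name `restore_output` and the plain-dict output are hypothetical stand-ins, not the actual TransformerDecorator code; the only assumption taken from the message above is that `all_layer_embeddings` must survive when `output_hidden_states` was originally True.

```python
from typing import Any, Dict


def restore_output(output: Dict[str, Any], originally_requested: bool) -> Dict[str, Any]:
    """Hypothetical helper: undo the temporary forcing of hidden states.

    `originally_requested` stands in for the model's original
    `output_hidden_states` setting before the decorator forced it on.
    """
    # Buggy variant (as described above): deleting when the flag is True
    # removes the embeddings precisely when the caller wanted them.
    # Corrected variant: delete only when the caller did NOT request them.
    if not originally_requested:
        output.pop("all_layer_embeddings", None)
    return output


kept = restore_output({"token_embeddings": [0.1], "all_layer_embeddings": [[0.1]]}, True)
dropped = restore_output({"token_embeddings": [0.1], "all_layer_embeddings": [[0.1]]}, False)
```

Here `kept` still contains `all_layer_embeddings`, while `dropped` does not.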
---------
Co-authored-by: Tom Aarsen <[email protected]>