Concatenated dim #336

jlamypoirier
Collaborator

✨ Description

Another round of prerequisites for TP Mamba, plus miscellaneous improvements.

  • Add ConcatenatedTensorDim to support complex parallel dimensions (e.g. the SSM inner projection). Clean up the TensorDim interface and move some functionality from Tensor to TensorDim for smoother integration.
  • Add an Initializer class to allow for more detailed initialization strategies. Add requires_global_initialization for initializers that won't work on local slices (ex. init_A).
  • Add init_uniform_centered
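
To make the initializer changes above concrete, here is a minimal sketch of how a `requires_global_initialization` flag could steer initialization under tensor parallelism. Everything in it is an assumption for illustration: the class bodies, the `Ramp` initializer (a position-dependent pattern only loosely analogous to init_A), the `initialize_local` helper, and the reading of `init_uniform_centered` as sampling uniformly on [mean - scale, mean + scale]. It uses plain Python lists rather than tensors and is not the Fast-LLM implementation.

```python
import random
from abc import ABC, abstractmethod


class Initializer(ABC):
    """Hypothetical stand-in for the PR's Initializer class."""

    # True when values depend on the position in the full (global) tensor,
    # so a rank cannot initialize its local slice independently.
    requires_global_initialization: bool = False

    @abstractmethod
    def __call__(self, size: int, rng: random.Random) -> list[float]:
        ...


class UniformCentered(Initializer):
    """Uniform samples on [mean - scale, mean + scale] (assumed semantics)."""

    def __init__(self, scale: float, mean: float = 0.0):
        self.scale = scale
        self.mean = mean

    def __call__(self, size: int, rng: random.Random) -> list[float]:
        low, high = self.mean - self.scale, self.mean + self.scale
        return [rng.uniform(low, high) for _ in range(size)]


class Ramp(Initializer):
    """Position-dependent values 1, 2, 3, ...; only correct on the global tensor."""

    requires_global_initialization = True

    def __call__(self, size: int, rng: random.Random) -> list[float]:
        return [float(i + 1) for i in range(size)]


def init_uniform_centered(scale: float, mean: float = 0.0) -> Initializer:
    return UniformCentered(scale, mean)


def initialize_local(initializer, global_size, rank, world_size, seed=0):
    """Return the slice of the parameter owned by `rank` (hypothetical helper)."""
    local_size = global_size // world_size
    if initializer.requires_global_initialization:
        # Build the full tensor, then keep only this rank's slice.
        full = initializer(global_size, random.Random(seed))
        return full[rank * local_size:(rank + 1) * local_size]
    # Otherwise each rank can initialize its own slice directly.
    return initializer(local_size, random.Random(seed + rank))
```

With this sketch, `initialize_local(Ramp(), 8, rank=1, world_size=2)` yields the second half of the global ramp, whereas initializing the local slice directly would restart the ramp at 1 on every rank.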

@jlamypoirier jlamypoirier marked this pull request as ready for review July 28, 2025 22:14
@property
def global_expanded_shape(self) -> tuple[int, ...]:
    return sum((dim.global_expanded_shape for dim in self._dims), ())

def local_to_global_partial(
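
For readers of the snippet above: `global_expanded_shape` concatenates the shape tuples of the component dimensions by calling `sum()` with an empty tuple as the start value. A small self-contained sketch of that idiom follows; `SimpleDim` and `ConcatenatedDim` are hypothetical stand-ins, not the classes from this PR.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SimpleDim:
    """Hypothetical stand-in for a tensor dimension."""

    name: str
    global_expanded_shape: tuple[int, ...]


class ConcatenatedDim:
    """Hypothetical stand-in mirroring only the property shown above."""

    def __init__(self, dims):
        self._dims = tuple(dims)

    @property
    def global_expanded_shape(self) -> tuple[int, ...]:
        # sum() adds each tuple to the running total, starting from (),
        # which concatenates the per-dimension shapes in order.
        return sum((dim.global_expanded_shape for dim in self._dims), ())


dims = ConcatenatedDim([SimpleDim("x", (4,)), SimpleDim("z", (2, 3))])
shape = dims.global_expanded_shape
```

Here `shape` is the concatenation `(4, 2, 3)`, and an empty list of dimensions gives `()`.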
Contributor

Is it possible to add function signatures, at least to the newly introduced functions, along with a short description of what each new function does? This would simplify reviewing a lot.

Collaborator Author

@jlamypoirier jlamypoirier Aug 12, 2025

I'll add some documentation. Where is the signature missing? See also the descriptions in the associated TensorMeta methods: https://github.com/ServiceNow/Fast-LLM/pull/336/files#diff-3517f1a1541e286fce5d2a79d1eb3efab238c23f888696fc44e37d85a1692671R195

Base automatically changed from debug_mamba to main August 12, 2025 16:18
@jlamypoirier jlamypoirier requested a review from oleksost August 12, 2025 18:23
2 participants