Skip to content

[Task][Megatron] 3.3: Extend DeviceMesh beyond 2D #260

Description

@lvyufeng

Parent Issue

Part of #206 — Megatron Roadmap, Phase 3 (Megatron Core Execution MVP): Task 3.3

Description

Extend DeviceMesh to support >2 dimensions, needed for realistic TP × PP × DP × EP × CP layouts.

Requirements

  • 3D DeviceMesh (TP × PP × DP)
  • 4D DeviceMesh (TP × PP × DP × EP)
  • 5D DeviceMesh (TP × PP × DP × EP × CP)
  • Submesh extraction for each parallel dimension
  • Process group creation from mesh dimensions
  • Compatible with Megatron's initialize_model_parallel

Key Files

  • src/candle/distributed/ (DeviceMesh)

Blocked By

Blocks

  • Multi-dimensional parallelism

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions