Generalization of the merging operation #121

@toyot-li

Hi @zhengkw18 @jt-zhang, thanks for your great efforts!

Regarding turbodiffusion/scripts/merge_models.py: can it be used as a general way to merge step-distillation weights (e.g., DMD, discrete/continuous CD, adversarial distillation) with sparse-attention weights (e.g., VSA, STA, radial attention), in order to combine their strengths?

Or is this merging just a nice property specific to rCM combined with SLA? Thanks in advance.
