Temporal Misalignment problem of 16kHz model #118

@RuizhePang

When I use the 16kHz model to reconstruct signals, the output waveform consistently exhibits a slight temporal shift relative to the ground-truth signal. This misalignment significantly degrades SI-SDR.
However, when I first resample the signals to 44.1kHz and process them with the 44.1kHz model, the reconstructed waveform aligns much more closely with the reference signal, giving a substantially more reasonable SI-SDR.
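
A minimal sketch of how the shift could be measured and its effect on SI-SDR checked, assuming the reference and the reconstruction are available as 1-D NumPy arrays of equal length. The helper names (`si_sdr`, `estimate_lag`, `align`) and the `max_lag` search window are hypothetical and not part of the model's codebase; this is only a diagnostic, not a fix for the model.

```python
import numpy as np

def si_sdr(reference, estimate, eps=1e-8):
    """Scale-invariant SDR in dB for two 1-D signals of equal length."""
    reference = reference - reference.mean()
    estimate = estimate - estimate.mean()
    # Project the estimate onto the reference to get the scaled target.
    scale = np.dot(estimate, reference) / (np.dot(reference, reference) + eps)
    target = scale * reference
    noise = estimate - target
    return 10.0 * np.log10((np.dot(target, target) + eps) / (np.dot(noise, noise) + eps))

def estimate_lag(reference, estimate, max_lag=256):
    """Lag in samples by which `estimate` trails `reference` (positive = delayed).

    Brute-force cross-correlation over [-max_lag, max_lag]; max_lag is an
    assumed search window and may need adjusting.
    """
    n = len(reference)
    lags = np.arange(-max_lag, max_lag + 1)
    scores = []
    for lag in lags:
        if lag >= 0:
            scores.append(np.dot(reference[:n - lag], estimate[lag:]))
        else:
            scores.append(np.dot(reference[-lag:], estimate[:n + lag]))
    return int(lags[int(np.argmax(scores))])

def align(reference, estimate, lag):
    """Trim both signals to their overlapping region after compensating `lag`."""
    if lag >= 0:
        return reference[:len(reference) - lag], estimate[lag:]
    return reference[-lag:], estimate[:len(estimate) + lag]
```

Comparing `si_sdr(reference, estimate)` with `si_sdr(*align(reference, estimate, estimate_lag(reference, estimate)))` shows how much of the SI-SDR drop is explained by a pure integer-sample delay. If the two values are close, the degradation comes from something other than a constant shift.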
