This repository was archived by the owner on Jul 30, 2025. It is now read-only.

inconsistent size when I use the huggingface model #196

@mathetian


size mismatch for model.layers.0.self_attn.k_proj.weight: copying a param with shape torch.Size([256, 2048]) from checkpoint, the shape in current model is torch.Size([2048, 2048]).

I get the error above when I use

model = AutoModelForCausalLM.from_pretrained(path)

to load https://huggingface.co/TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T

Can anyone help me?
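
For reference, here is a rough sketch of where the two shapes in the error message could come from. It assumes the checkpoint uses grouped-query attention with 32 attention heads and 4 key/value heads; those two numbers are assumptions and should be checked against the checkpoint's config.json. Only the hidden size of 2048 comes from the error itself. The helper kv_proj_shape is hypothetical and is not part of transformers.

```python
def kv_proj_shape(hidden_size, num_attention_heads, num_key_value_heads):
    """Return (out_features, in_features) for a k_proj or v_proj weight."""
    # Each head works on an equal slice of the hidden dimension.
    head_dim = hidden_size // num_attention_heads
    # Keys and values are projected once per key/value head.
    return (num_key_value_heads * head_dim, hidden_size)


HIDDEN_SIZE = 2048          # from the error message
NUM_ATTENTION_HEADS = 32    # assumption: verify in config.json
NUM_KEY_VALUE_HEADS = 4     # assumption: verify in config.json

# Shape if the key/value head count is honoured (grouped-query attention).
gqa_shape = kv_proj_shape(HIDDEN_SIZE, NUM_ATTENTION_HEADS, NUM_KEY_VALUE_HEADS)

# Shape if the model is built with one key/value head per attention head.
mha_shape = kv_proj_shape(HIDDEN_SIZE, NUM_ATTENTION_HEADS, NUM_ATTENTION_HEADS)

print(gqa_shape)  # (256, 2048)
print(mha_shape)  # (2048, 2048)
```

Under those assumptions the checkpoint shape [256, 2048] matches the grouped-query case and the model shape [2048, 2048] matches the case where the key/value head count is not applied. If that is what is happening, the installed transformers version may predate grouped-query support for Llama models, so upgrading it may be worth trying.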
