Skip to content

Add two our MLLM papers #7

@sccbhxc

Description

@sccbhxc

We have submit pull request as the template of README.
BradyFU/Awesome-Multimodal-Large-Language-Models#249

{
"title": Learning Compact Vision Tokens for Efficient Large Multimodal Models,
"url": paper URL,
"venue": arXiv,
"category": Visual Understanding,
"code": https://github.com/visresearch/LLaVA-STF,
"collections": https://huggingface.co/visresearch/LLaVA-STF/tree/main
}

{
"title": Diversity-Guided MLP Reduction for Efficient Large Vision Transformers,
"url": https://arxiv.org/abs/2506.08591,
"venue": arXiv,
"category": Visual Understanding,
"code": https://github.com/visresearch/DGMR,
"collections": https://huggingface.co/visresearch/DGMR/tree/main
}

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions