Add two our MLLM papers

We have submit pull request as the template of README.
https://github.com/BradyFU/Awesome-Multimodal-Large-Language-Models/pull/249

{
    "title": Learning Compact Vision Tokens for Efficient Large Multimodal Models,
    "url": [paper URL](https://arxiv.org/abs/2506.07138),
    "venue": arXiv,
    "category": Visual Understanding,
    "code": https://github.com/visresearch/LLaVA-STF,
    "collections": https://huggingface.co/visresearch/LLaVA-STF/tree/main
}

{
    "title": Diversity-Guided MLP Reduction for Efficient Large Vision Transformers,
    "url": https://arxiv.org/abs/2506.08591,
    "venue": arXiv,
    "category": Visual Understanding,
    "code": https://github.com/visresearch/DGMR,
    "collections": https://huggingface.co/visresearch/DGMR/tree/main
}



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add two our MLLM papers #7

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Add two our MLLM papers #7

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions