Skip to content

Conversation

@zhangshaolei1998
Copy link

LLaVA-Mini is a unified large multimodal model that can support the understanding of images, high-resolution images, and videos in an efficient manner.

Paper: https://arxiv.org/abs/2501.03895
Code & Demo: https://github.com/ictnlp/LLaVA-Mini

LLaVA-Mini is a unified large multimodal model that can support the understanding of images, high-resolution images, and videos in an efficient manner.
Paper: https://arxiv.org/abs/2501.03895
Code & Demo: https://github.com/ictnlp/LLaVA-Mini
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant