🎯
Focusing

Organizations

@dvlab-research

Pinned

  1. dvlab-research/MGM Public

    Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

    Python 3.3k 279

  2. dvlab-research/LLaMA-VID Public

    LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)

    Python 849 53

  3. dvlab-research/MGM-Omni Public

    MGM-Omni: Scaling Omni LLMs to Personalized Long-Horizon Speech

    Python 256 17

  4. dvlab-research/Lyra Public

    [ICCV 2025] Official Implementation for "Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition"

    Python 302 29