🎯 Focusing
PhD student @dvlab-research, CSE@CUHK, Multimodal Intelligence
- The Chinese University of Hong Kong
- Hong Kong SAR
- https://wcy1122.github.io/
- https://scholar.google.com/citations?user=1pZcoqgAAAAJ&hl=en
Pinned
- dvlab-research/MGM: Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
- dvlab-research/LLaMA-VID: LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)
- dvlab-research/MGM-Omni: MGM-Omni: Scaling Omni LLMs to Personalized Long-Horizon Speech
- dvlab-research/Lyra: [ICCV 2025] Official Implementation for "Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition"



