@@ -22,6 +22,7 @@ The following tables detail the models supported by LMDeploy's TurboMind engine
2222| Qwen2<sup >\[ 2\] </sup > | 0.5B - 72B | LLM | Yes | Yes\* | Yes\* | Yes |
2323| Qwen2-MoE | 57BA14B | LLM | Yes | Yes | Yes | Yes |
2424| Qwen2.5<sup >\[ 2\] </sup > | 0.5B - 72B | LLM | Yes | Yes\* | Yes\* | Yes |
25+ | Qwen3 | 0.6B-235B | LLM | Yes | Yes | Yes\* | Yes\* |
2526| Mistral<sup >\[ 1\] </sup > | 7B | LLM | Yes | Yes | Yes | No |
2627| Mixtral | 8x7B, 8x22B | LLM | Yes | Yes | Yes | Yes |
2728| DeepSeek-V2 | 16B, 236B | LLM | Yes | Yes | Yes | No |
@@ -36,6 +37,7 @@ The following tables detail the models supported by LMDeploy's TurboMind engine
3637| InternVL | v1.1 - v1.5 | MLLM | Yes | Yes | Yes | Yes |
3738| InternVL2<sup >\[ 2\] </sup > | 1 - 2B, 8B - 76B | MLLM | Yes | Yes\* | Yes\* | Yes |
3839| InternVL2.5(MPO)<sup >\[ 2\] </sup > | 1 - 78B | MLLM | Yes | Yes\* | Yes\* | Yes |
40+ | InternVL3<sup >\[ 2\] </sup > | 1 - 78B | MLLM | Yes | Yes\* | Yes\* | Yes |
3941| ChemVLM | 8B - 26B | MLLM | Yes | Yes | Yes | Yes |
4042| MiniCPM-Llama3-V-2_5 | - | MLLM | Yes | Yes | Yes | Yes |
4143| MiniCPM-V-2_6 | - | MLLM | Yes | Yes | Yes | Yes |
@@ -76,6 +78,7 @@ The following tables detail the models supported by LMDeploy's TurboMind engine
7678| QWen1.5-MoE | A2.7B | LLM | Yes | Yes | Yes | No | No |
7779| QWen2 | 0.5B - 72B | LLM | Yes | Yes | No | Yes | Yes |
7880| Qwen2.5 | 0.5B - 72B | LLM | Yes | Yes | No | Yes | Yes |
81+ | Qwen3 | 0.6B - 235B | LLM | Yes | Yes | Yes\* | - | Yes\* |
7982| QWen2-VL | 2B, 7B | MLLM | Yes | Yes | No | No | Yes |
8083| QWen2.5-VL | 3B - 72B | MLLM | Yes | No | No | No | No |
8184| DeepSeek-MoE | 16B | LLM | Yes | No | No | No | No |
@@ -95,6 +98,7 @@ The following tables detail the models supported by LMDeploy's TurboMind engine
9598| InternVL(v1.5) | 2B-26B | MLLM | Yes | Yes | Yes | No | Yes |
9699| InternVL2 | 1B-76B | MLLM | Yes | Yes | Yes | - | - |
97100| InternVL2.5(MPO) | 1B-78B | MLLM | Yes | Yes | Yes | - | - |
101+ | InternVL3 | 1B-78B | MLLM | Yes | Yes | Yes | - | - |
98102| Mono-InternVL<sup >\[ 1\] </sup > | 2B | MLLM | Yes | Yes | Yes | - | - |
99103| ChemVLM | 8B-26B | MLLM | Yes | Yes | No | - | - |
100104| Gemma2 | 9B-27B | LLM | Yes | Yes | Yes | - | - |
@@ -114,20 +118,25 @@ The following tables detail the models supported by LMDeploy's TurboMind engine
114118
115119## PyTorchEngine on Huawei Ascend Platform
116120
117- | Model | Size | Type | FP16/BF16(eager) | FP16/BF16(graph) | W4A16(eager) |
118- | :------------: | :------: | :--: | :--------------: | :--------------: | :----------: |
119- | Llama2 | 7B - 70B | LLM | Yes | Yes | Yes |
120- | Llama3 | 8B | LLM | Yes | Yes | Yes |
121- | Llama3.1 | 8B | LLM | Yes | Yes | Yes |
122- | InternLM2 | 7B - 20B | LLM | Yes | Yes | Yes |
123- | InternLM2.5 | 7B - 20B | LLM | Yes | Yes | Yes |
124- | InternLM3 | 8B | LLM | Yes | Yes | Yes |
125- | Mixtral | 8x7B | LLM | Yes | Yes | No |
126- | QWen1.5-MoE | A2.7B | LLM | Yes | - | No |
127- | QWen2(.5) | 7B | LLM | Yes | Yes | No |
128- | QWen2-MoE | A14.57B | LLM | Yes | - | No |
129- | DeepSeek-V2 | 16B | LLM | No | Yes | No |
130- | InternVL(v1.5) | 2B-26B | MLLM | Yes | - | Yes |
131- | InternVL2 | 1B-40B | MLLM | Yes | Yes | Yes |
132- | CogVLM2-chat | 19B | MLLM | Yes | No | - |
133- | GLM4V | 9B | MLLM | Yes | No | - |
121+ | Model | Size | Type | FP16/BF16(eager) | FP16/BF16(graph) | W8A8(graph) | W4A16(eager) |
122+ | :------------: | :-------: | :--: | :--------------: | :--------------: | :---------: | :----------: |
123+ | Llama2 | 7B - 70B | LLM | Yes | Yes | Yes | Yes |
124+ | Llama3 | 8B | LLM | Yes | Yes | Yes | Yes |
125+ | Llama3.1 | 8B | LLM | Yes | Yes | Yes | Yes |
126+ | InternLM2 | 7B - 20B | LLM | Yes | Yes | Yes | Yes |
127+ | InternLM2.5 | 7B - 20B | LLM | Yes | Yes | Yes | Yes |
128+ | InternLM3 | 8B | LLM | Yes | Yes | Yes | Yes |
129+ | Mixtral | 8x7B | LLM | Yes | Yes | No | No |
130+ | QWen1.5-MoE | A2.7B | LLM | Yes | - | No | No |
131+ | QWen2(.5) | 7B | LLM | Yes | Yes | Yes | Yes |
132+ | QWen2-VL | 2B, 7B | MLLM | Yes | Yes | - | - |
133+ | QWen2.5-VL | 3B - 72B | MLLM | Yes | Yes | - | - |
134+ | QWen2-MoE | A14.57B | LLM | Yes | - | No | No |
135+ | QWen3 | 0.6B-235B | LLM | Yes | Yes | No | No |
136+ | DeepSeek-V2 | 16B | LLM | No | Yes | No | No |
137+ | InternVL(v1.5) | 2B-26B | MLLM | Yes | - | Yes | Yes |
138+ | InternVL2 | 1B-40B | MLLM | Yes | Yes | Yes | Yes |
139+ | InternVL2.5 | 1B-78B | MLLM | Yes | Yes | Yes | Yes |
140+ | InternVL3 | 1B-78B | MLLM | Yes | Yes | Yes | Yes |
141+ | CogVLM2-chat | 19B | MLLM | Yes | No | - | - |
142+ | GLM4V | 9B | MLLM | Yes | No | - | - |
0 commit comments