Commit 4a015e3
* [None][infra] Pin the version for triton to 3.3.1 (NVIDIA#6508)
Signed-off-by: qqiao <[email protected]>
* [None][infra] Pin the version for triton to 3.3.1 (NVIDIA#6508) (NVIDIA#6519) (NVIDIA#6549)
Signed-off-by: Yanchao Lu <[email protected]>
* [fix]: use safeInitRowMax instead of fp32_lowest to avoid NaN (NVIDIA#7087)
Signed-off-by: Yao Yao <[email protected]>
* [None][fix] Fix a numerical stability issue for XQA with spec dec
Signed-off-by: Yao Yao <[email protected]>
* fix typo
Signed-off-by: Jhao-Ting Chen <[email protected]>
* fix precompiled multi_query_token kernel not having is_fp8_out hash key (NVIDIA#6279)
Signed-off-by: Jhao-Ting Chen <[email protected]>
* [fix] Fix missing fields in xqa kernel cache key (NVIDIA#6282)
Signed-off-by: Yao Yao <[email protected]>
---------
Signed-off-by: qqiao <[email protected]>
Signed-off-by: Yanchao Lu <[email protected]>
Signed-off-by: Yao Yao <[email protected]>
Signed-off-by: Jhao-Ting Chen <[email protected]>
Co-authored-by: Emma Qiao <[email protected]>
Co-authored-by: Yanchao Lu <[email protected]>
Co-authored-by: Yao Yao <[email protected]>
1 parent 9270041 commit 4a015e3
File tree
7 files changed
+47
-14
lines changed- cpp
- kernels/xqa
- tensorrt_llm/kernels/decoderMaskedMultiheadAttention
- decoderXQAImplJIT
7 files changed
+47
-14
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
630 | 630 | | |
631 | 631 | | |
632 | 632 | | |
| 633 | + | |
| 634 | + | |
633 | 635 | | |
634 | 636 | | |
635 | 637 | | |
| |||
999 | 1001 | | |
1000 | 1002 | | |
1001 | 1003 | | |
1002 | | - | |
| 1004 | + | |
1003 | 1005 | | |
1004 | 1006 | | |
1005 | 1007 | | |
| |||
1075 | 1077 | | |
1076 | 1078 | | |
1077 | 1079 | | |
| 1080 | + | |
| 1081 | + | |
| 1082 | + | |
| 1083 | + | |
| 1084 | + | |
| 1085 | + | |
| 1086 | + | |
| 1087 | + | |
| 1088 | + | |
| 1089 | + | |
| 1090 | + | |
| 1091 | + | |
| 1092 | + | |
| 1093 | + | |
| 1094 | + | |
| 1095 | + | |
| 1096 | + | |
1078 | 1097 | | |
1079 | 1098 | | |
1080 | 1099 | | |
| |||
1887 | 1906 | | |
1888 | 1907 | | |
1889 | 1908 | | |
1890 | | - | |
| 1909 | + | |
1891 | 1910 | | |
1892 | 1911 | | |
1893 | 1912 | | |
1894 | 1913 | | |
1895 | 1914 | | |
1896 | 1915 | | |
1897 | 1916 | | |
1898 | | - | |
| 1917 | + | |
1899 | 1918 | | |
1900 | 1919 | | |
1901 | 1920 | | |
| |||
2009 | 2028 | | |
2010 | 2029 | | |
2011 | 2030 | | |
2012 | | - | |
| 2031 | + | |
2013 | 2032 | | |
2014 | 2033 | | |
2015 | 2034 | | |
| |||
2302 | 2321 | | |
2303 | 2322 | | |
2304 | 2323 | | |
2305 | | - | |
| 2324 | + | |
2306 | 2325 | | |
2307 | | - | |
| 2326 | + | |
2308 | 2327 | | |
2309 | 2328 | | |
2310 | 2329 | | |
| |||
2332 | 2351 | | |
2333 | 2352 | | |
2334 | 2353 | | |
2335 | | - | |
| 2354 | + | |
2336 | 2355 | | |
2337 | 2356 | | |
2338 | 2357 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
30 | 30 | | |
31 | 31 | | |
32 | 32 | | |
33 | | - | |
| 33 | + | |
| 34 | + | |
| 35 | + | |
| 36 | + | |
| 37 | + | |
| 38 | + | |
| 39 | + | |
34 | 40 | | |
35 | 41 | | |
36 | 42 | | |
| |||
Lines changed: 3 additions & 1 deletion
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
52 | 52 | | |
53 | 53 | | |
54 | 54 | | |
| 55 | + | |
55 | 56 | | |
56 | 57 | | |
57 | | - | |
| 58 | + | |
| 59 | + | |
58 | 60 | | |
59 | 61 | | |
60 | 62 | | |
Lines changed: 4 additions & 1 deletion
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
67 | 67 | | |
68 | 68 | | |
69 | 69 | | |
| 70 | + | |
70 | 71 | | |
71 | 72 | | |
72 | 73 | | |
73 | 74 | | |
74 | 75 | | |
75 | 76 | | |
76 | 77 | | |
77 | | - | |
| 78 | + | |
78 | 79 | | |
79 | 80 | | |
80 | 81 | | |
| |||
103 | 104 | | |
104 | 105 | | |
105 | 106 | | |
| 107 | + | |
| 108 | + | |
106 | 109 | | |
107 | 110 | | |
108 | 111 | | |
| |||
Lines changed: 2 additions & 2 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
37 | 37 | | |
38 | 38 | | |
39 | 39 | | |
40 | | - | |
41 | | - | |
| 40 | + | |
| 41 | + | |
42 | 42 | | |
43 | 43 | | |
44 | 44 | | |
| |||
Lines changed: 4 additions & 2 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
97 | 97 | | |
98 | 98 | | |
99 | 99 | | |
100 | | - | |
| 100 | + | |
101 | 101 | | |
102 | 102 | | |
103 | 103 | | |
| |||
124 | 124 | | |
125 | 125 | | |
126 | 126 | | |
| 127 | + | |
127 | 128 | | |
128 | 129 | | |
129 | 130 | | |
130 | | - | |
| 131 | + | |
| 132 | + | |
131 | 133 | | |
132 | 134 | | |
133 | 135 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
59 | 59 | | |
60 | 60 | | |
61 | 61 | | |
| 62 | + | |
0 commit comments