Commit 871ace6
refactor: weight refactor, including norm, mm, quantization and embedding (#1193)
Co-authored-by: sufubao <[email protected]>
Co-authored-by: wangzaijun <[email protected]>1 parent 91644a4 commit 871ace6
File tree
161 files changed
+3356
-4769
lines changed- docs
- CN/source
- models
- tutorial
- EN/source
- models
- tutorial
- lightllm
- common
- basemodel
- attention/fa3
- layer_weights
- meta_weights
- fused_moe
- impl
- mm_weight
- triton_kernel
- fused_moe
- norm
- quantization
- quantization
- triton_quant
- triton_utils/autotune_kernel_configs/triton_3.5.1
- NVIDIA_GeForce_RTX_4090_D/scaled_mm_per_token:v1
- NVIDIA_GeForce_RTX_5090/scaled_mm_per_token:v1
- distributed
- models
- bloom
- layer_infer
- layer_weights
- deepseek2
- layer_infer
- layer_weights
- deepseek_mtp
- layer_infer
- layer_weights
- gemma3
- layer_infer
- layer_weights
- gemma_2b
- layer_infer
- layer_weights
- gpt_oss
- layer_weights
- internlm2_reward/layer_weights
- internlm2/layer_weights
- llama
- layer_infer
- layer_weights
- mistral_mtp
- layer_infer
- layer_weights
- mixtral/layer_weights
- qwen2/layer_weights
- qwen3_moe_mtp
- layer_weights
- qwen3_moe
- layer_infer
- layer_weights
- qwen3_vl_moe
- layer_infer
- layer_weights
- qwen3_vl/layer_infer
- qwen3
- layer_infer
- layer_weights
- qwen/layer_weights
- stablelm
- layer_infer
- layer_weights
- starcoder2
- layer_infer
- layer_weights
- starcoder
- layer_infer
- layer_weights
- vit
- layer_infer
- layer_weights
- server
- core/objs
- router/model_infer/mode_backend
- utils
- test/start_scripts
- multi_pd_master
- single_pd_master
- unit_tests/common
- fused_moe
- quantization
Some content is hidden
Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
161 files changed
+3356
-4769
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
162 | 162 | | |
163 | 163 | | |
164 | 164 | | |
165 | | - | |
166 | | - | |
167 | | - | |
168 | | - | |
169 | | - | |
170 | | - | |
171 | | - | |
172 | | - | |
173 | | - | |
174 | | - | |
175 | | - | |
176 | | - | |
177 | | - | |
178 | 165 | | |
179 | 166 | | |
180 | 167 | | |
| |||
204 | 191 | | |
205 | 192 | | |
206 | 193 | | |
207 | | - | |
208 | | - | |
209 | | - | |
210 | | - | |
211 | | - | |
212 | | - | |
213 | | - | |
214 | | - | |
215 | | - | |
216 | | - | |
217 | | - | |
218 | | - | |
219 | | - | |
220 | | - | |
221 | | - | |
222 | | - | |
223 | | - | |
224 | | - | |
225 | | - | |
226 | | - | |
227 | | - | |
228 | | - | |
229 | | - | |
230 | | - | |
231 | 194 | | |
232 | 195 | | |
233 | 196 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
367 | 367 | | |
368 | 368 | | |
369 | 369 | | |
370 | | - | |
371 | | - | |
372 | | - | |
373 | | - | |
374 | | - | |
375 | | - | |
376 | | - | |
| 370 | + | |
377 | 371 | | |
378 | 372 | | |
379 | 373 | | |
| 374 | + | |
380 | 375 | | |
| 376 | + | |
| 377 | + | |
381 | 378 | | |
382 | 379 | | |
383 | 380 | | |
| |||
389 | 386 | | |
390 | 387 | | |
391 | 388 | | |
392 | | - | |
393 | | - | |
394 | | - | |
395 | | - | |
396 | | - | |
397 | | - | |
398 | | - | |
| 389 | + | |
399 | 390 | | |
400 | 391 | | |
401 | 392 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
49 | 49 | | |
50 | 50 | | |
51 | 51 | | |
52 | | - | |
| 52 | + | |
53 | 53 | | |
54 | 54 | | |
55 | | - | |
| 55 | + | |
| 56 | + | |
56 | 57 | | |
57 | 58 | | |
58 | | - | |
| 59 | + | |
59 | 60 | | |
60 | 61 | | |
61 | 62 | | |
| |||
119 | 120 | | |
120 | 121 | | |
121 | 122 | | |
122 | | - | |
| 123 | + | |
123 | 124 | | |
124 | 125 | | |
125 | 126 | | |
126 | 127 | | |
127 | 128 | | |
128 | 129 | | |
129 | | - | |
| 130 | + | |
130 | 131 | | |
131 | 132 | | |
132 | 133 | | |
| |||
135 | 136 | | |
136 | 137 | | |
137 | 138 | | |
138 | | - | |
| 139 | + | |
139 | 140 | | |
140 | 141 | | |
141 | 142 | | |
142 | 143 | | |
143 | 144 | | |
144 | 145 | | |
145 | | - | |
| 146 | + | |
146 | 147 | | |
147 | 148 | | |
148 | 149 | | |
| |||
179 | 180 | | |
180 | 181 | | |
181 | 182 | | |
182 | | - | |
| 183 | + | |
183 | 184 | | |
184 | 185 | | |
185 | 186 | | |
| |||
189 | 190 | | |
190 | 191 | | |
191 | 192 | | |
192 | | - | |
| 193 | + | |
| 194 | + | |
193 | 195 | | |
194 | 196 | | |
195 | 197 | | |
| |||
202 | 204 | | |
203 | 205 | | |
204 | 206 | | |
205 | | - | |
| 207 | + | |
206 | 208 | | |
207 | 209 | | |
208 | 210 | | |
| |||
212 | 214 | | |
213 | 215 | | |
214 | 216 | | |
215 | | - | |
| 217 | + | |
| 218 | + | |
216 | 219 | | |
217 | 220 | | |
218 | 221 | | |
| |||
269 | 272 | | |
270 | 273 | | |
271 | 274 | | |
272 | | - | |
| 275 | + | |
273 | 276 | | |
274 | 277 | | |
275 | 278 | | |
| |||
279 | 282 | | |
280 | 283 | | |
281 | 284 | | |
282 | | - | |
| 285 | + | |
| 286 | + | |
283 | 287 | | |
284 | 288 | | |
285 | 289 | | |
286 | 290 | | |
287 | 291 | | |
288 | 292 | | |
289 | 293 | | |
290 | | - | |
| 294 | + | |
291 | 295 | | |
292 | 296 | | |
293 | 297 | | |
| |||
296 | 300 | | |
297 | 301 | | |
298 | 302 | | |
299 | | - | |
| 303 | + | |
| 304 | + | |
300 | 305 | | |
301 | 306 | | |
302 | 307 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
162 | 162 | | |
163 | 163 | | |
164 | 164 | | |
165 | | - | |
166 | | - | |
167 | | - | |
168 | | - | |
169 | | - | |
170 | | - | |
171 | | - | |
172 | | - | |
173 | | - | |
174 | | - | |
175 | | - | |
176 | | - | |
177 | 165 | | |
178 | 166 | | |
179 | 167 | | |
| |||
204 | 192 | | |
205 | 193 | | |
206 | 194 | | |
207 | | - | |
208 | | - | |
209 | | - | |
210 | | - | |
211 | | - | |
212 | | - | |
213 | | - | |
214 | | - | |
215 | | - | |
216 | | - | |
217 | | - | |
218 | | - | |
219 | | - | |
220 | | - | |
221 | | - | |
222 | | - | |
223 | | - | |
224 | | - | |
225 | | - | |
226 | | - | |
227 | | - | |
228 | | - | |
229 | | - | |
230 | | - | |
231 | 195 | | |
232 | 196 | | |
233 | 197 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
359 | 359 | | |
360 | 360 | | |
361 | 361 | | |
362 | | - | |
363 | | - | |
364 | | - | |
365 | | - | |
366 | | - | |
367 | | - | |
368 | | - | |
| 362 | + | |
369 | 363 | | |
370 | 364 | | |
371 | 365 | | |
| 366 | + | |
372 | 367 | | |
| 368 | + | |
| 369 | + | |
373 | 370 | | |
374 | 371 | | |
375 | 372 | | |
| |||
381 | 378 | | |
382 | 379 | | |
383 | 380 | | |
384 | | - | |
385 | | - | |
386 | | - | |
387 | | - | |
388 | | - | |
389 | | - | |
390 | | - | |
| 381 | + | |
391 | 382 | | |
392 | 383 | | |
393 | 384 | | |
| |||
0 commit comments