Skip to content

Commit 04d6d92

Browse files
authored
Extends quantization predicate with config (#476)
Adds config parameter to quantization predicate Enables fine-grained quantization control Supports per-parameter quantization strategies Improves flexibility in model quantization configuration
1 parent 38dc092 commit 04d6d92

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

mlx_lm/utils.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -496,7 +496,7 @@ def wrapped_predicate(path, module):
496496
return False
497497
bool_or_params = True
498498
if quant_predicate is not None:
499-
bool_or_params = quant_predicate(path, module)
499+
bool_or_params = quant_predicate(path, module, config)
500500
if isinstance(bool_or_params, dict):
501501
quantized_config["quantization"][path] = bool_or_params
502502
elif fine_grained_config and bool_or_params:

0 commit comments

Comments
 (0)