vllm.model_executor.layers.quantization.kernels.mixed_precision.conch
_CONCH_SUPPORTED_WEIGHT_TYPES
module-attribute
¶
ConchLinearKernel
¶
Bases: MPLinearKernel
Source code in vllm/model_executor/layers/quantization/kernels/mixed_precision/conch.py
apply_weights
¶
Source code in vllm/model_executor/layers/quantization/kernels/mixed_precision/conch.py
can_implement
classmethod
¶
can_implement(
c: MPLinearLayerConfig,
) -> tuple[bool, Optional[str]]
Source code in vllm/model_executor/layers/quantization/kernels/mixed_precision/conch.py
process_weights_after_loading
¶
process_weights_after_loading(layer: Module) -> None