vllm.pooling_params
PoolingParams
Bases: Struct
API parameters for pooling models.
Attributes:
| Name | Type | Description |
|---|---|---|
| `dimensions` | `Optional[int]` | Reduce the dimensions of embeddings if the model supports Matryoshka representation. |
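To make the effect of `dimensions` concrete, here is a minimal conceptual sketch of reducing a Matryoshka-style embedding. It is not vLLM's implementation: the helper name `reduce_dimensions` is hypothetical, and the re-normalization step is an assumption based on how truncated Matryoshka embeddings are commonly used.

```python
import math
from typing import List, Optional


def reduce_dimensions(embedding: List[float], dimensions: Optional[int]) -> List[float]:
    """Keep the first `dimensions` components and rescale to unit length.

    Hypothetical helper for illustration only; not part of vLLM.
    """
    if dimensions is None:
        # No reduction requested: return an unchanged copy.
        return list(embedding)
    if not 1 <= dimensions <= len(embedding):
        raise ValueError("dimensions must be between 1 and len(embedding)")

    # Matryoshka-trained models place the most useful information in the
    # leading components, so truncation keeps a prefix of the vector.
    truncated = embedding[:dimensions]

    # Assumption: re-normalize so cosine similarity stays well behaved.
    norm = math.sqrt(sum(x * x for x in truncated))
    if norm == 0.0:
        return truncated
    return [x / norm for x in truncated]


reduced = reduce_dimensions([3.0, 4.0, 12.0], dimensions=2)
```

In this sketch, passing `None` leaves the vector untouched, which mirrors the `Optional[int]` type of the attribute.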
Source code in vllm/pooling_params.py
logits_processing_needs_token_ids
class-attribute
instance-attribute
logits_processing_needs_token_ids: bool = False
Internal use only.
use_cross_encoder
class-attribute
instance-attribute
use_cross_encoder: bool = False
Internal use only.
__post_init__
clone
clone() -> PoolingParams
Returns a deep copy of the PoolingParams instance.
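The deep-copy behavior can be illustrated with a small stand-in class. This is a sketch under stated assumptions: `PoolingParamsSketch` and its `extra` field are hypothetical and exist only to show that changes to a clone do not affect the original. The real `PoolingParams` is a `Struct`, and its `clone` may be implemented differently.

```python
import copy
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class PoolingParamsSketch:
    """Hypothetical stand-in for PoolingParams, for illustration only."""

    dimensions: Optional[int] = None
    # Hypothetical mutable field, included to show deep-copy semantics.
    extra: List[int] = field(default_factory=list)

    def clone(self) -> "PoolingParamsSketch":
        # A deep copy duplicates nested mutable objects as well.
        return copy.deepcopy(self)


original = PoolingParamsSketch(dimensions=256, extra=[1, 2])
duplicate = original.clone()

# Mutating the clone leaves the original untouched.
duplicate.dimensions = 64
duplicate.extra.append(3)
```

A shallow copy would have shared the `extra` list between both objects; the deep copy keeps them independent.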
verify
verify(model_config: ModelConfig) -> None