pixano_inference.models.llm
LLM (Large Language Model) base class and I/O types (stub).
LLMInput(**data)
Bases: BaseModel
Input for LLM text generation.
Attributes:
| Name | Type | Description |
|---|---|---|
prompt |
str | list[dict[str, Any]]
|
Text prompt or chat messages. |
max_new_tokens |
int
|
Maximum number of new tokens to generate. |
temperature |
float
|
Sampling temperature. |
Source code in pydantic/main.py
LLMModel(config)
Bases: InferenceModel
Base class for large language models (stub).
Source code in pixano_inference/models/base.py
predict(input)
abstractmethod
LLMOutput(**data)
Bases: BaseModel
Output for LLM text generation.
Attributes:
| Name | Type | Description |
|---|---|---|
generated_text |
str
|
Generated text. |
usage |
UsageInfo
|
Usage metadata. |
generation_config |
dict[str, Any]
|
Configuration used for the generation. |