pixano_inference.pydantic.tasks.multimodal.conditional_generation
Pydantic models for text-image conditional generation task.
TextImageConditionalGenerationInput
Bases: BaseModel
Input for text-image conditional generation.
Attributes:
Name | Type | Description |
---|---|---|
prompt |
str | list[dict[str, Any]]
|
Prompt for the generation. Can be a string or a list of dictionaries to apply a chat template. |
images |
list[str | Path] | None
|
Images for the generation. Can be None if images are passed in the prompt. |
max_new_tokens |
int
|
Maximum number of new tokens to generate. |
temperature |
float
|
Temperature for the generation. |
TextImageConditionalGenerationOutput
TextImageConditionalGenerationRequest
Bases: BaseRequest
, TextImageConditionalGenerationInput
Request for text-image conditional generation.
TextImageConditionalGenerationResponse
Bases: BaseResponse
Response for text-image conditional generation.
Attributes:
Name | Type | Description |
---|---|---|
data |
TextImageConditionalGenerationOutput
|
Output of the generation. |