pixano.data.dataset.dataset_item
DatasetItem(**data)
Bases: BaseModel
DatasetItem
Attributes:
Name | Type | Description |
---|---|---|
id |
str
|
Item ID |
original_id |
str
|
Item original ID |
split |
str
|
Item split |
features |
dict[str, ItemFeature]
|
Item features |
views |
dict[str, ItemView]
|
Item views |
objects |
dict[str, ItemObject]
|
Item objects |
embeddings |
dict[str, ItemEmbedding]
|
Item embeddings |
Raises ValidationError
if the input data cannot be
validated to form a valid model.
self
is explicitly positional-only to allow self
as a field name.
Source code in pydantic/main.py
delete_objects(ds_tables)
Delete remove objects from dataset item
Parameters:
Name | Type | Description | Default |
---|---|---|---|
ds_tables
|
dict[str, dict[str, LanceTable]]
|
Dataset tables |
required |
Source code in pixano/data/dataset/dataset_item.py
from_pyarrow(pyarrow_item, info, media_dir, media_features=False, model_id=None)
staticmethod
Format PyArrow item
Parameters:
Name | Type | Description | Default |
---|---|---|---|
pyarrow_item
|
dict[str, dict[str, Table]]
|
PyArrow item |
required |
info
|
DatasetInfo
|
Dataset info |
required |
media_dir
|
Path
|
Dataset media directory |
required |
media_features
|
bool
|
Load media features like image width and height (slow for large item batches) |
False
|
model_id
|
str
|
Model ID (ONNX file path) of embeddings to load. Defaults to None. |
None
|
Returns:
Type | Description |
---|---|
DatasetItem
|
Formatted item |
Source code in pixano/data/dataset/dataset_item.py
to_pyarrow()
Return DatasetItem in PyArrow format
Returns:
Type | Description |
---|---|
dict[str, Any]
|
Item in PyArrow format |
Source code in pixano/data/dataset/dataset_item.py
update(ds_table)
Update dataset item
Parameters:
Name | Type | Description | Default |
---|---|---|---|
ds_table
|
LanceTable
|
Item table |
required |