pixano_inference.ray.deployment
Ray actor wrapper for InferenceModel subclasses.
create_model_deployment(model_class, config)
Wrap an InferenceModel subclass as a Ray remote actor.
Creates a Ray actor class with:
- predict(input_data) method forwarding to the model
- get_metadata() method
- get_stats() method (request count, avg time)
- unload() method
The actor's __init__ instantiates the model class and calls
load_model(). Ray actor options (GPU/CPU/memory) come from
config.resources.
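The wrapper pattern described above can be sketched in plain Python. This is only an illustration, not the code in `deployment.py`: `EchoModel`, `make_wrapper`, the dictionary config, and the keys returned by `get_metadata()` and `get_stats()` are hypothetical, and the Ray step is left out so the example runs without a cluster.

```python
import time
from typing import Any


class EchoModel:
    """Hypothetical stand-in for an InferenceModel subclass."""

    def __init__(self, config: dict[str, Any]) -> None:
        self.config = config
        self.loaded = False

    def load_model(self) -> None:
        self.loaded = True

    def predict(self, input_data: Any) -> Any:
        return {"echo": input_data}

    def unload(self) -> None:
        self.loaded = False


def make_wrapper(model_class: type) -> type:
    """Build a class exposing predict, get_metadata, get_stats and unload."""

    class ModelWrapper:
        def __init__(self, config: dict[str, Any]) -> None:
            # Instantiate the model and load it eagerly in the constructor.
            self._model = model_class(config)
            self._model.load_model()
            self._request_count = 0
            self._total_time = 0.0

        def predict(self, input_data: Any) -> Any:
            # Forward to the model and record how long the call took.
            start = time.perf_counter()
            result = self._model.predict(input_data)
            self._total_time += time.perf_counter() - start
            self._request_count += 1
            return result

        def get_metadata(self) -> dict[str, Any]:
            # Assumed shape: the real metadata fields are not documented here.
            return {"model_class": model_class.__name__}

        def get_stats(self) -> dict[str, Any]:
            # Average is 0.0 until at least one request has completed.
            count = self._request_count
            avg = self._total_time / count if count else 0.0
            return {"request_count": count, "avg_time": avg}

        def unload(self) -> None:
            self._model.unload()

    return ModelWrapper


wrapper = make_wrapper(EchoModel)({"name": "echo"})
result = wrapper.predict([1, 2, 3])
stats = wrapper.get_stats()
```

With Ray, a class like this would typically be wrapped with `ray.remote` and started with `.options(...).remote(config)`, with resource options such as `num_gpus`, `num_cpus`, and `memory` taken from `config.resources`. The exact field names on `config.resources` are not shown in this page, so that mapping is an assumption.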
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
| `model_class` | `type[InferenceModel]` | An InferenceModel subclass to deploy. | required |
| `config` | `ModelDeploymentConfig` | Deployment configuration. | required |
Returns:
| Type | Description |
|---|---|
| `Any` | A Ray remote actor handle (already created and running). |
Source code in pixano_inference/ray/deployment.py