Skip to main content

Available infrastructures

This page lists the available compute infrastructures for your deployments.

Deployment environment

All deployments runs on a Linux-based environment, with the versions of Python and Ikomia API defined by the workflow.

Serverless deployments

These deployments runs on serverless functions (CPU only), they are usually the smallest and cheapest deployments available. Serverless deployments come with auto-scaling capability, making them a good choice for applications that need to adapt to traffic loads. Conversely, your workflow should not require heavy computing power.

Such deployments may have cold starts, which means that the first request to your deployment may take more time to respond.

When you run these deployments, you are charged based on the workflow execution time.

SizevCPURAMAvailable regions
AWS
S34GBFrance, Germany, Ireland
M46GBFrance, Germany, Ireland
L58GBFrance, Germany, Ireland
XL610GBFrance, Germany, Ireland

Instance GPU deployments

These deployments run on dedicated instances with GPU acceleration. They are generally the most powerful deployments, and are a good choice for workflows running deep learning models or other GPU intensive tasks.

When you run these deployments, you are charged based on the time the instance is running, not depending on the number of requests.

SizevCPURAMGPUAvailable regions
AWS
XS416GBNVIDIA Tesla T4 (16GB)France, Germany, Ireland
S832GBNVIDIA Tesla T4 (16GB)France, Germany, Ireland
M416GBNVIDIA A10 (24GB)Germany, Ireland
L832GBNVIDIA A10 (24GB)Germany, Ireland
XL1664GBNVIDIA A10 (24GB)Germany, Ireland
Google Cloud
XS416GBNVIDIA L4 (24GB)Netherlands, United States (Central)
S832GBNVIDIA L4 (24GB)Netherlands, United States (Central)
M1664GBNVIDIA L4 (24GB)Netherlands, United States (Central)
L1285GBNVIDIA A100 (40GB)Netherlands, United States (Central)
XL12170GBNVIDIA A100 (80GB)Netherlands, United States (Central)
Scaleway
S816GBNVIDIA RTX 3070 (8GB)France
M1042GBNVIDIA Tesla P100 (16GB)France

Instance CPU deployments

These deployments run on dedicated CPU instances. Once they are running, they respond instantly and are generally more powerful than serverless deployments. Conversely, such deployment does not include auto-scaling capability.

When you run these deployments, you are charged based on the time the instance is running, not depending on the number of requests.

SizevCPURAMAvailable regions
AWS
XS28GBFrance, Germany, Ireland
S48GBFrance, Germany, Ireland
M816GBFrance, Germany, Ireland
L1632GBFrance, Germany, Ireland
XL3264GBFrance, Germany, Ireland
Google Cloud
XS28GBNetherlands, United States (Central)
S416GBNetherlands, United States (Central)
M832GBNetherlands, United States (Central)
L1664GBNetherlands, United States (Central)
XL32128GBNetherlands, United States (Central)
Scaleway
S34GBFrance