Contact Us
Sign up for free
Sign up for free

Home

Documentation

Reseller Documentation

API Reference

Home
Gcore Status

Overview

Introduction
Quick Access

IAM

CDN

Managed DNS

Cloud

- GET
  List inference API keys
- POST
  Create inference API key
- GET
  Get inference API key
- DEL
  Delete inference API key
- PATCH
  Update inference API key
- GET
  List inference deployments
- POST
  Create inference deployment
- GET
  Get inference deployment
- DEL
  Delete inference deployment
- PATCH
  Update inference deployment
- GET
  Get inference deployment logs
- POST
  Start inference deployment
- POST
  Stop inference deployment
- GET
  List inference registry credentials
- POST
  Create inference registry credential
- GET
  Get inference registry credential
- PUT
  Replace inference registry credential
- DEL
  Delete inference registry credential
- GET
  List inference secrets
- POST
  Create inference secret
- GET
  Get inference secret
- PUT
  Replace inference secret
- DEL
  Delete Inference Secret
- GET
  Get inference capacity by region
- GET
  List inference flavors
- GET
  Get inference flavor
- GET
  List models from catalog
- GET
  Get model from catalog
- POST
  Preview inference deployment price
- GET
  Get inference deployment API key
  deprecated

DDoS Protection

FastEdge

WAAP

Streaming

Object Storage

Resellers

Everywhere Inference

Start inference deployment

This operation initializes an inference deployment after it was stopped, making it available to handle inference requests again. The instance will launch with the minimum number of replicas defined in the scaling settings.

If the minimum replicas are set to 0, the instance will initially start with 0 replicas.
It will automatically scale up when it receives requests or SQS messages, according to the configured scaling rules.

Was this page helpful?

Get inference deployment logs Stop inference deployment

⌘I

instagram youtube x linkedin