gcloud ai endpoints - manage Vertex AI endpoints
gcloud ai endpoints COMMAND [GCLOUD_WIDE_FLAG ...]
An endpoint contains one or more deployed models, all of which must have the same interface but may come from different models. An endpoint is to obtain online prediction and explanation from one of its deployed models.
When you communicate with Vertex AI services, you identify a specific endpoint that is deployed in the cloud using a combination of the current project, the region, and the endpoint.
These flags are available to all commands: --help.
Run $ gcloud help for details.
COMMAND is one of the following:
- create
Create a new Vertex AI endpoint.
- delete
Delete an existing Vertex AI endpoint.
- deploy-model
Deploy a model to an existing Vertex AI endpoint.
- describe
Describe an existing Vertex AI endpoint.
- explain
Request an online explanation from an Vertex AI endpoint.
- list
List existing Vertex AI endpoints.
- predict
Run Vertex AI online prediction.
- raw-predict
Run Vertex AI online raw prediction.
- undeploy-model
Undeploy a model from an existing Vertex AI endpoint.
- update
Update an existing Vertex AI endpoint.
These variants are also available:
$ gcloud alpha ai endpoints
$ gcloud beta ai endpoints