NAME

gcloud ai endpoints - manage Vertex AI endpoints

SYNOPSIS

gcloud ai endpoints COMMAND [GCLOUD_WIDE_FLAG ...]

DESCRIPTION

An endpoint contains one or more deployed models, all of which must have the same interface but may come from different models. An endpoint is used to obtain online predictions and explanations from one of its deployed models.

When you communicate with Vertex AI services, you identify a specific deployed endpoint by a combination of the current project, the region, and the endpoint ID.
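
As an illustration of how project, region, and endpoint combine, the sketch below composes the fully qualified resource name that the Vertex AI API uses for an endpoint. The function name endpoint_resource_name and the sample project, region, and endpoint ID are hypothetical; substitute your own values.

```shell
# Sketch: build the fully qualified resource name of an endpoint.
# The function name and the sample values are hypothetical.
# Arguments: PROJECT REGION ENDPOINT_ID
endpoint_resource_name() {
  printf 'projects/%s/locations/%s/endpoints/%s\n' "$1" "$2" "$3"
}

endpoint_resource_name my-project us-central1 1234567890
# prints: projects/my-project/locations/us-central1/endpoints/1234567890
```

In most commands of this group it should be enough to pass the endpoint ID together with --region, with the project taken from the current gcloud configuration.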

GCLOUD WIDE FLAGS

These flags are available to all commands: --help.

Run $ gcloud help for details.

COMMANDS

COMMAND is one of the following:

create

Create a new Vertex AI endpoint.

delete

Delete an existing Vertex AI endpoint.

deploy-model

Deploy a model to an existing Vertex AI endpoint.

describe

Describe an existing Vertex AI endpoint.

explain

Request an online explanation from a Vertex AI endpoint.

list

List existing Vertex AI endpoints.

predict

Run Vertex AI online prediction.

raw-predict

Run Vertex AI online raw prediction.

undeploy-model

Undeploy a model from an existing Vertex AI endpoint.

update

Update an existing Vertex AI endpoint.
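
The commands above are typically combined into a lifecycle: create an endpoint, deploy a model to it, request predictions, then undeploy and delete. The following is a minimal sketch of that flow, not a definitive script. The wrapper function names, the default region, and the GCLOUD override variable are assumptions made for illustration; the flags shown are believed to match the corresponding commands, but check each command's own help page before relying on them.

```shell
# Sketch only: REGION and every ID passed to these helpers are hypothetical.
REGION="${REGION:-us-central1}"
# Assumption for illustration: set GCLOUD=echo to print the arguments
# instead of calling the real CLI.
GCLOUD="${GCLOUD:-gcloud}"

# create_endpoint DISPLAY_NAME
create_endpoint() {
  "$GCLOUD" ai endpoints create --display-name="$1" --region="$REGION"
}

# deploy_model ENDPOINT_ID MODEL_ID DEPLOYED_MODEL_DISPLAY_NAME
deploy_model() {
  "$GCLOUD" ai endpoints deploy-model "$1" --model="$2" \
    --display-name="$3" --region="$REGION"
}

# predict ENDPOINT_ID JSON_REQUEST_FILE
predict() {
  "$GCLOUD" ai endpoints predict "$1" --json-request="$2" --region="$REGION"
}

# undeploy_model ENDPOINT_ID DEPLOYED_MODEL_ID
undeploy_model() {
  "$GCLOUD" ai endpoints undeploy-model "$1" --deployed-model-id="$2" \
    --region="$REGION"
}

# delete_endpoint ENDPOINT_ID
delete_endpoint() {
  "$GCLOUD" ai endpoints delete "$1" --region="$REGION"
}

# Example calls (commented out so that nothing runs by accident):
#   create_endpoint my-endpoint
#   deploy_model 1234567890 9876543210 my-deployed-model
#   predict 1234567890 request.json
```

Wrapping each command in a function keeps the region in one place and makes it possible to preview the exact arguments before any resource is created or removed.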

NOTES

These variants are also available:

$ gcloud alpha ai endpoints

$ gcloud beta ai endpoints