NAME

gcloud alpha ai endpoints create - create a new Vertex AI endpoint

SYNOPSIS

gcloud alpha ai endpoints create --display-name=DISPLAY_NAME [--description=DESCRIPTION] [--encryption-kms-key-name=ENCRYPTION_KMS_KEY_NAME] [--endpoint-id=ENDPOINT_ID] [--labels=[KEY=VALUE,...]] [--network=NETWORK] [--region=REGION] [--request-response-logging-rate=REQUEST_RESPONSE_LOGGING_RATE --request-response-logging-table=REQUEST_RESPONSE_LOGGING_TABLE] [GCLOUD_WIDE_FLAG ...]

EXAMPLES

To create an endpoint under project example in region us-central1, run:

$ gcloud alpha ai endpoints create --project=example \ --region=us-central1 --display-name=my_endpoint

REQUIRED FLAGS

--display-name=DISPLAY_NAME

Display name of the endpoint.

OPTIONAL FLAGS

--description=DESCRIPTION

Description of the endpoint.

--encryption-kms-key-name=ENCRYPTION_KMS_KEY_NAME

The Cloud KMS resource identifier of the customer managed encryption key used to protect a resource. Has the form: projects/my-project/locations/my-region/keyRings/my-kr/cryptoKeys/my-key.

The key needs to be in the same region as where the compute resource is created.

--endpoint-id=ENDPOINT_ID

User-specified ID of the endpoint.

--labels=[KEY=VALUE,...]

List of label KEY=VALUE pairs to add.

Keys must start with a lowercase character and contain only hyphens (-), underscores (_), lowercase characters, and numbers. Values must contain only hyphens (-), underscores (_), lowercase characters, and numbers.

--network=NETWORK

The full name of the Google Compute Engine network to which the endpoint should be peered.

Region resource - Cloud region to create endpoint. This represents a Cloud

resource. (NOTE) Some attributes are not given arguments in this group but can be set in other ways. To set the project attribute:

provide the argument --region on the command line with a fully specified name;

set the property ai/region with a fully specified name;

choose one from the prompted list of available regions with a fully specified name;

provide the argument --project on the command line;

set the property core/project.

--region=REGION

ID of the region or fully qualified identifier for the region. To set the region attribute:

  • provide the argument --region on the command line;

  • set the property ai/region;

  • choose one from the prompted list of available regions.

--request-response-logging-rate=REQUEST_RESPONSE_LOGGING_RATE

Prediction request & response sampling rate for logging to BigQuery table.

--request-response-logging-table=REQUEST_RESPONSE_LOGGING_TABLE

BigQuery table uri for prediction request & response logging.

GCLOUD WIDE FLAGS

These flags are available to all commands: --access-token-file, --account, --billing-project, --configuration, --flags-file, --flatten, --format, --help, --impersonate-service-account, --log-http, --project, --quiet, --trace-token, --user-output-enabled, --verbosity.

Run $ gcloud help for details.

NOTES

This command is currently in alpha and might change without notice. If this command fails with API permission errors despite specifying the correct project, you might be trying to access an API with an invitation-only early access allowlist. These variants are also available:

$ gcloud ai endpoints create

$ gcloud beta ai endpoints create