gcloud ai endpoints create - create a new Vertex AI endpoint
gcloud ai endpoints create --display-name=DISPLAY_NAME [--description=DESCRIPTION] [--encryption-kms-key-name=ENCRYPTION_KMS_KEY_NAME] [--endpoint-id=ENDPOINT_ID] [--labels=[KEY=VALUE,...]] [--network=NETWORK] [--region=REGION] [--request-response-logging-rate=REQUEST_RESPONSE_LOGGING_RATE --request-response-logging-table=REQUEST_RESPONSE_LOGGING_TABLE] [GCLOUD_WIDE_FLAG ...]
To create an endpoint under project example in region us-central1, run:
$ gcloud ai endpoints create --project=example \ --region=us-central1 --display-name=my_endpoint
- --display-name=DISPLAY_NAME
Display name of the endpoint.
- --description=DESCRIPTION
Description of the endpoint.
- --encryption-kms-key-name=ENCRYPTION_KMS_KEY_NAME
The Cloud KMS resource identifier of the customer managed encryption key used to protect a resource. Has the form: projects/my-project/locations/my-region/keyRings/my-kr/cryptoKeys/my-key.
The key needs to be in the same region as where the compute resource is created.
- --endpoint-id=ENDPOINT_ID
User-specified ID of the endpoint.
- --labels=[KEY=VALUE,...]
List of label KEY=VALUE pairs to add.
Keys must start with a lowercase character and contain only hyphens (-), underscores (_), lowercase characters, and numbers. Values must contain only hyphens (-), underscores (_), lowercase characters, and numbers.
- --network=NETWORK
The full name of the Google Compute Engine network to which the endpoint should be peered.
- Region resource - Cloud region to create endpoint. This represents a Cloud
resource. (NOTE) Some attributes are not given arguments in this group but can be set in other ways. To set the project attribute:
- —
provide the argument --region on the command line with a fully specified name;
- —
set the property ai/region with a fully specified name;
- —
choose one from the prompted list of available regions with a fully specified name;
- —
provide the argument --project on the command line;
- —
set the property core/project.
- --region=REGION
ID of the region or fully qualified identifier for the region. To set the region attribute:
provide the argument --region on the command line;
set the property ai/region;
choose one from the prompted list of available regions.
- --request-response-logging-rate=REQUEST_RESPONSE_LOGGING_RATE
Prediction request & response sampling rate for logging to BigQuery table.
- --request-response-logging-table=REQUEST_RESPONSE_LOGGING_TABLE
BigQuery table uri for prediction request & response logging.
These flags are available to all commands: --access-token-file, --account, --billing-project, --configuration, --flags-file, --flatten, --format, --help, --impersonate-service-account, --log-http, --project, --quiet, --trace-token, --user-output-enabled, --verbosity.
Run $ gcloud help for details.
These variants are also available:
$ gcloud alpha ai endpoints create
$ gcloud beta ai endpoints create