gcloud beta ml language analyze-syntax - use Google Cloud Natural Language API to identify linguistic information
gcloud beta ml language analyze-syntax (--content=CONTENT | --content-file=CONTENT_FILE) [--content-type=CONTENT_TYPE; default="plain-text"] [--encoding-type=ENCODING_TYPE; default="utf8"] [--language=LANGUAGE] [GCLOUD_WIDE_FLAG ...]
(BETA) Syntactic Analysis extracts linguistic information, breaking up the given text into a series of sentences and tokens (generally, word boundaries), providing further analysis on those tokens.
Currently English, Spanish, Japanese, Chinese (Simplified and Traditional), French, German, Italian, Korean, and Portuguese are supported.
To analyze syntax in raw content 'They drink.':
$ gcloud beta ml language analyze-syntax --content='They drink'
To analyze syntax in file 'myfile.txt':
$ gcloud beta ml language analyze-syntax --content-file='myfile.txt'
To analyze syntax in a remote file 'gs://bucket_name/mycontent.html' for Japanese language using UTF-8 encoding:
$ gcloud beta ml language analyze-syntax \ --content-file='gs://bucket_name/mycontent.html' \ --content-type=HTML --encoding-type=utf8 --language=ja-JP
- Exactly one of these must be specified:
- --content=CONTENT
Specify input text on the command line. Useful for experiments, or for extremely short text.
- --content-file=CONTENT_FILE
Specify a local file or Google Cloud Storage (format gs://bucket/object) file path containing the text to be analyzed. More useful for longer text or data output from another system.
- --content-type=CONTENT_TYPE; default="plain-text"
Specify the format of the input text. CONTENT_TYPE must be one of: html, plain-text.
- --encoding-type=ENCODING_TYPE; default="utf8"
The encoding type used by the API to calculate offsets. If set to none, encoding-dependent offsets will be set at -1. This is an optional flag only used for the entity mentions in results, and does not affect how the input is read or analyzed. ENCODING_TYPE must be one of: none, utf16, utf32, utf8.
- --language=LANGUAGE
Specify the language of the input text. If omitted, the server will attempt to auto-detect. Both ISO (such as en or es) and BCP-47 (such as en-US or ja-JP) language codes are accepted.
These flags are available to all commands: --access-token-file, --account, --billing-project, --configuration, --flags-file, --flatten, --format, --help, --impersonate-service-account, --log-http, --project, --quiet, --trace-token, --user-output-enabled, --verbosity.
Run $ gcloud help for details.
This command uses the language/v1beta2 API. The full documentation for this API can be found at: https://cloud.google.com/natural-language/
This command is currently in beta and might change without notice. These variants are also available:
$ gcloud ml language analyze-syntax
$ gcloud alpha ml language analyze-syntax