gcloud ml language analyze-syntax - use Google Cloud Natural Language API to identify linguistic information
gcloud ml language analyze-syntax (--content=CONTENT | --content-file=CONTENT_FILE) [--content-type=CONTENT_TYPE; default="plain-text"] [--encoding-type=ENCODING_TYPE; default="utf8"] [--language=LANGUAGE] [GCLOUD_WIDE_FLAG ...]
Syntactic Analysis extracts linguistic information, breaking up the given text into a series of sentences and tokens (generally, word boundaries), providing further analysis on those tokens.
Currently English, Spanish, and Japanese are supported.
To analyze syntax in raw content 'They drink.':
$ gcloud ml language analyze-syntax --content='They drink'
To analyze syntax in file 'myfile.txt':
$ gcloud ml language analyze-syntax --content-file='myfile.txt'
To analyze syntax in a remote file 'gs://bucket_name/mycontent.html' for Japanese language using UTF-8 encoding:
$ gcloud ml language analyze-syntax \ --content-file='gs://bucket_name/mycontent.html' \ --content-type=HTML --encoding-type=utf8 --language=ja-JP
- Exactly one of these must be specified:
- --content=CONTENT
Specify input text on the command line. Useful for experiments, or for extremely short text.
- --content-file=CONTENT_FILE
Specify a local file or Google Cloud Storage (format gs://bucket/object) file path containing the text to be analyzed. More useful for longer text or data output from another system.
- --content-type=CONTENT_TYPE; default="plain-text"
Specify the format of the input text. CONTENT_TYPE must be one of: html, plain-text.
- --encoding-type=ENCODING_TYPE; default="utf8"
The encoding type used by the API to calculate offsets. If set to none, encoding-dependent offsets will be set at -1. This is an optional flag only used for the entity mentions in results, and does not affect how the input is read or analyzed. ENCODING_TYPE must be one of: none, utf16, utf32, utf8.
- --language=LANGUAGE
Specify the language of the input text. If omitted, the server will attempt to auto-detect. Both ISO (such as en or es) and BCP-47 (such as en-US or ja-JP) language codes are accepted.
These flags are available to all commands: --access-token-file, --account, --billing-project, --configuration, --flags-file, --flatten, --format, --help, --impersonate-service-account, --log-http, --project, --quiet, --trace-token, --user-output-enabled, --verbosity.
Run $ gcloud help for details.
This command uses the language/v1 API. The full documentation for this API can be found at: https://cloud.google.com/natural-language/
These variants are also available:
$ gcloud alpha ml language analyze-syntax
$ gcloud beta ml language analyze-syntax