gcloud alpha healthcare datasets deidentify - create a new Cloud Healthcare API dataset containing de-identified data from the source dataset
gcloud alpha healthcare datasets deidentify (DATASET : --location=LOCATION) --destination-dataset=DESTINATION_DATASET [--annotation-store=ANNOTATION_STORE] [--async] [--default-fhir-config] [--dicom-filter-tags=[DICOM_FILTER_TAGS,...]] [--text-redaction-mode=TEXT_REDACTION_MODE] [GCLOUD_WIDE_FLAG ...]
(ALPHA) Create a new Cloud Healthcare API dataset containing de-identified data from the source dataset.
To generate a de-identified version of the dataset 'test-dataset', run the command below. To skip the FHIR stores, omit the --default-fhir-config flag, and to skip DICOM stores, omit the --dicom-filter-tags flag.
$ gcloud alpha healthcare datasets deidentify test-dataset \ --destination-dataset=projects/{projectId}/locations/\ us-central1/datasets/test-deid-dataset --default-fhir-config \ --dicom-filter-tags=MediaStorageSOPClassUID,SeriesInstanceUID,\ StudyInstanceUID
- Dataset resource - Source Cloud Healthcare API dataset to deidentify. The
arguments in this group can be used to specify the attributes of this resource. (NOTE) Some attributes are not given arguments in this group but can be set in other ways. To set the project attribute:
- —
provide the argument dataset on the command line with a fully specified name;
- —
provide the argument --project on the command line;
- —
set the property core/project.
This must be specified.
- DATASET
ID of the dataset or fully qualified identifier for the dataset. To set the dataset attribute:
provide the argument dataset on the command line.
This positional argument must be specified if any of the other arguments in this group are specified.
- --location=LOCATION
Google Cloud location. To set the location attribute:
provide the argument dataset on the command line with a fully specified name;
provide the argument --location on the command line;
set the property healthcare/location.
- --destination-dataset=DESTINATION_DATASET
The name of the dataset resource to which the redacted data should be written (e.g., projects/{projectId}/locations/{locationId}/datasets/{datasetId}). The new dataset must not exist, or the request will fail.
- --annotation-store=ANNOTATION_STORE
The name of the annotation store to output the location of sensitive data (e.g., projects/{projectId}/locations/{locationId}/datasets/{datasetId}/annotationStores/{annotationStoreId}). The annotation store, if not in the destination dataset, must already exist. The generated annotations will contain the original values of the de-identified data in the 'quote' field. See https://cloud.google.com/healthcare/docs/how-tos/annotations-deidentify for more details.
- --async
Return immediately, without waiting for the operation in progress to complete.
- --default-fhir-config
Deidentify FHIR data with default configurations.
- --dicom-filter-tags=[DICOM_FILTER_TAGS,...]
Tags to be filtered. Tags must be DICOM Data Elements, File Meta Elements, or Directory Structuring Elements, as defined at: http://dicom.nema.org/medical/dicom/current/output/html/part06.html#table_6-1,. They may be provided by "Keyword" or "Tag". For example "PatientID", "0010,0010".
- --text-redaction-mode=TEXT_REDACTION_MODE
Determines how to redact text from image. TEXT_REDACTION_MODE must be (currently only one value is supported):
- all
Redact all text.
These flags are available to all commands: --access-token-file, --account, --billing-project, --configuration, --flags-file, --flatten, --format, --help, --impersonate-service-account, --log-http, --project, --quiet, --trace-token, --user-output-enabled, --verbosity.
Run $ gcloud help for details.
This command uses the healthcare/v1alpha2 API. The full documentation for this API can be found at: https://cloud.google.com/healthcare
This command is currently in alpha and might change without notice. If this command fails with API permission errors despite specifying the correct project, you might be trying to access an API with an invitation-only early access allowlist. These variants are also available:
$ gcloud healthcare datasets deidentify
$ gcloud beta healthcare datasets deidentify