組織では、従業員情報をBigQueryデータセットに保存しています。人事(HR)管理チームはデータへのフルアクセスを必要としていますが、HRアナリストチームは個人を特定できる情報(PII)にアクセスすることなく給与分析を行う必要があります。Dataplexで管理される役割に応じて、ユーザーに適切なレベルのアクセス権限を付与しつつ、データの重複を削減したいと考えています。どうすればよいでしょうか?
正解:B
Comprehensive and Detailed Explanation:
The requirements are role-based access, masking specific columns (PII) for one role, and avoiding data duplication.
Option B is the correct answer because it directly addresses all requirements using Google Cloud's modern data governance tools. You use Data Catalog (governed by Dataplex) to create a taxonomy and policy tags (e.
g., a "PII" tag). You apply this tag to the sensitive columns in your BigQuery table. Then, using IAM, you can grant the HR Analyst role access to the table but apply a data masking policy to the "PII" tag for that role.
When an analyst queries the table, BigQuery dynamically masks the tagged columns for them at query time.
The HR Admin role is granted full access without the masking policy. This provides fine-grained security without creating any copies of the data.
Option A is a valid but older approach. An authorized view could work, but it requires manually maintaining the view's SQL definition. Policy tags are more scalable and manageable, especially as the number of tables and policies grows.
Options C and D are incorrect because they both involve creating a new, separate dataset with masked data.
This explicitly violates the requirement to "reduce data duplication."
Reference (Google Cloud Documentation Concepts):
This is the primary use case for column-level security in BigQuery. The official documentation shows how to
"Restrict access with BigQuery column-level security" by creating a taxonomy and policy tags in Data Catalog. You then grant IAM permissions on these policy tags (roles/datacatalog.categoryFineGrainedReader) to different groups of users. For users who should see masked data, you do not grant them this role on the sensitive tags, and a data masking rule (e.g., nullification, hashing) can be applied. This entire governance framework can be managed centrally via Dataplex.