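You are designing the folder structure for an Azure Data Lake Storage Gen2 container. Users will query the data by using a variety of services, including Azure Databricks and Azure Synapse Analytics serverless SQL pools. The data will be secured by subject area. Most queries will include data from the current year or the current month. Which folder structure should you recommend to support fast queries and simplified folder security?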
Correct answer: D
There is an important reason to put the date at the end of the directory structure: it lets you lock down specific regions or subject matters to users or groups with POSIX permissions applied at the region or subject-matter level. With the date structure in front, restricting a security group to just the UK data or certain planes would instead require a separate permission on numerous directories under every hour directory. Having the date structure in front would also greatly increase the number of directories as time went on. Keeping the year and month as folder levels below the subject still lets queries for the current year or current month read only the matching folders.
Note: In IoT workloads, a great deal of data can land in the data store, spanning numerous products, devices, organizations, and customers. It's important to pre-plan the directory layout for organization, security, and efficient processing of the data by downstream consumers.
A general template to consider is the following layout: {Region}/{SubjectMatter(s)}/{yyyy}/{mm}/{dd}/{hh}/
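To make the permission argument concrete, here is a minimal Python sketch that builds paths for both layouts and counts how many directories would each need their own permission entry to isolate one region. It does not call any Azure API; the function names (`subject_first_path`, `date_first_path`, `region_directories`) and the sample values "UK" and "Planes" are hypothetical and used only for illustration.

```python
from datetime import datetime, timedelta


def subject_first_path(region: str, subject: str, ts: datetime) -> str:
    """Build {Region}/{SubjectMatter}/{yyyy}/{mm}/{dd}/{hh}/ for one hour of data."""
    return f"{region}/{subject}/{ts:%Y}/{ts:%m}/{ts:%d}/{ts:%H}/"


def date_first_path(region: str, subject: str, ts: datetime) -> str:
    """Build the alternative {yyyy}/{mm}/{dd}/{hh}/{Region}/{SubjectMatter}/ layout."""
    return f"{ts:%Y}/{ts:%m}/{ts:%d}/{ts:%H}/{region}/{subject}/"


def region_directories(paths: list[str], region: str) -> set[str]:
    """Collect the distinct directories named `region` that appear in `paths`.

    Each one is a directory that would need its own permission entry
    to restrict access to that region's data.
    """
    roots = set()
    for path in paths:
        parts = path.strip("/").split("/")
        if region in parts:
            idx = parts.index(region)
            roots.add("/".join(parts[: idx + 1]) + "/")
    return roots


# Two days of hourly folders for a hypothetical UK / Planes feed.
start = datetime(2024, 1, 1)
hours = [start + timedelta(hours=h) for h in range(48)]

subject_first = [subject_first_path("UK", "Planes", ts) for ts in hours]
date_first = [date_first_path("UK", "Planes", ts) for ts in hours]

print(len(region_directories(subject_first, "UK")))  # one directory: UK/
print(len(region_directories(date_first, "UK")))     # one directory per hour
```

With the subject-first layout the count stays at one no matter how many hours of data accumulate, whereas with the date-first layout it grows by one for every new hour folder.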