Your AI training pipeline involves a pre-processing step that reads data from a large HDF5 file. You notice significant delays during this step. You suspect the HDF5 file structure might be contributing to the slow read times. What optimization technique is MOST likely to improve read performance from this HDF5 file?
正解:C
Reorganizing the HDF5 file (option C) to improve data contiguity and chunking is the most effective optimization. HDF5 performance is highly dependent on how the data is laid out within the file. Contiguous data and optimal chunk sizes allow for more efficient 1/0 operations. Converting to CSV (A) loses the hierarchical structure of HDF5. Storing on NFS (B) adds network overhead. Compression (D) can reduce storage space but increases decompression overhead. Encryption (E) adds overhead without improving read performance.