Consider a scenario where you are setting up a high-performance computing cluster with several GPU-accelerated nodes using Slurm as the resource manager. You want to ensure that jobs requesting GPUs are only scheduled on nodes with the appropriate NVIDIA drivers and CUDA toolkit installed. How can you achieve this within Slurm?
正解:C
Using Slurm's node features is the most straightforward and recommended approach for tagging nodes with specific capabilities. The '-constraint' option allows jobs to request nodes with particular features. GresTypeS can be used, but node features provide more flexibility and control. Installing drivers dynamically is impractical and inefficient. DCGM is primarily for monitoring, not core scheduling requirements.