Setting up Kueue resources
If you are using the Red Hat Developer Sandbox, you can skip this step and move on to the next section, Create a Workbench.
If you do not intend to complete the Distributing training jobs with Training operator section of this workshop, you can skip this step and move on to the next section, Create a Workbench.
In this section, you prepare your workshop environment for the Distributing training jobs with Training operator section.
Later in this workshop, you implement a distributed training job that uses Kueue to manage job resources. With Kueue, you can define cluster resource quotas and control how different workloads consume them.
- In the OpenShift AI dashboard, click the application launcher icon and then select the OpenShift Console option.
- In the OpenShift console, click + in the top navigation bar.
- Select your project from the list of projects.
- Verify that you selected the correct project.
- Copy the following code and paste it into the Import YAML editor.
---
apiVersion: kueue.x-k8s.io/v1beta1
kind: ResourceFlavor
metadata:
  name: default-flavor
---
apiVersion: kueue.x-k8s.io/v1beta1
kind: ClusterQueue
metadata:
  name: cluster-queue
spec:
  namespaceSelector: {}
  resourceGroups:
  - coveredResources: ["cpu", "memory"]
    flavors:
    - name: "default-flavor"
      resources:
      - name: "cpu"
        nominalQuota: 4
      - name: "memory"
        nominalQuota: 8Gi
---
apiVersion: kueue.x-k8s.io/v1beta1
kind: LocalQueue
metadata:
  name: local-queue
spec:
  clusterQueue: cluster-queue
- Click Create.
- In the OpenShift console, you should see a "Resources successfully created" message and the following resources listed:
  - default-flavor
  - cluster-queue
  - local-queue
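To get a feel for what the cpu and memory quotas in cluster-queue mean in practice, the following Python sketch models quota-based admission in a deliberately simplified way. It is not Kueue's implementation: the QuotaPool class, its submit method, and the job names are hypothetical, and real Kueue features such as cohorts, borrowing, priorities, and preemption are left out. The only values taken from this section are the nominal quotas of 4 CPUs and 8Gi of memory.

```python
# Simplified illustration of quota-based admission.
# This is NOT Kueue's actual algorithm; names here are hypothetical.
from dataclasses import dataclass, field

GI = 1024 ** 3  # bytes in one gibibyte


@dataclass
class QuotaPool:
    """Stand-in for the quota defined in the cluster-queue resource."""
    cpu_quota: float = 4.0        # nominalQuota for "cpu"
    memory_quota: int = 8 * GI    # nominalQuota for "memory" (8Gi)
    cpu_used: float = 0.0
    memory_used: int = 0
    pending: list = field(default_factory=list)

    def submit(self, name: str, cpu: float, memory: int) -> bool:
        """Admit the workload if it fits in the unused quota, else hold it."""
        fits = (
            self.cpu_used + cpu <= self.cpu_quota
            and self.memory_used + memory <= self.memory_quota
        )
        if fits:
            self.cpu_used += cpu
            self.memory_used += memory
        else:
            self.pending.append(name)
        return fits


pool = QuotaPool()
results = [
    pool.submit("job-a", cpu=2, memory=4 * GI),  # fits: 2 of 4 CPUs used
    pool.submit("job-b", cpu=2, memory=2 * GI),  # fits: 4 of 4 CPUs used
    pool.submit("job-c", cpu=1, memory=1 * GI),  # CPU quota exhausted
]
print(results)       # [True, True, False]
print(pool.pending)  # ['job-c']
```

In this model, the third job is held back because the CPU quota is already fully consumed, even though memory is still available. Each covered resource is checked independently, so a workload has to fit within every quota to be admitted.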