Setting up Kueue resources

If you are using the Red Hat Developer Sandbox you can skip this step and move on to the next section, Create a Workbench.
If you do not intend to complete the Distributing training jobs with Training operator section of this workshop you can skip this step and move on to the next section, Create a Workbench.

In this section, you prepare your workshop environment so that you can use Distributing training with Training operator.

Later in this workshop, you implement a Distributed training job using Kueue for managing job resources. With Kueue, you can manage cluster resource quotas and how different workloads consume them.

Procedure
  1. In the OpenShift AI dashboard, click the application launcher icon and then select the OpenShift Console option.

    OpenShift Console Link
  2. In the OpenShift console, click + in the top navigation bar.

    Add resources Icon
  3. Select your project from the list of projects.

    Select a project
  4. Verify that you selected the correct project.

    Selected project
  5. Copy the following code and paste it into the Import YAML editor.

    ---
    apiVersion: kueue.x-k8s.io/v1beta1
    kind: ResourceFlavor
    metadata:
      name: default-flavor
    ---
    apiVersion: kueue.x-k8s.io/v1beta1
    kind: ClusterQueue
    metadata:
      name: cluster-queue
    spec:
      namespaceSelector: {}
      resourceGroups:
      - coveredResources: ["cpu", "memory"]
        flavors:
        - name: "default-flavor"
          resources:
          - name: "cpu"
            nominalQuota: 4
          - name: "memory"
            nominalQuota: 8Gi
    ---
    apiVersion: kueue.x-k8s.io/v1beta1
    kind: LocalQueue
    metadata:
      name: local-queue
    spec:
      clusterQueue: cluster-queue
  6. Click Create.

Verification
  1. In the OpenShift console, you should see a "Resources successfully created" message and the following resources listed:

    • default-flavor

    • cluster-queue

    • local-queue