Nvidia GPU Metrics/Alerting
My OpenShift Blog
by Keith Calligan
8h ago
In the last post, I covered how to enable the Nvidia GPU in RHOAI (Openshift AI). Today, we will cover metrics that can be exposed through the the Openshift Web Console. This Prometheus Telemetry/metrics data can then be used to develop custom queries to send alerts through AlertManager. Enabling GPU Monitoring and Metrics in Openshift The steps I that are being followed are based on the following documentation: Enabling the GPU Monitoring Dashboard — NVIDIA GPU Operator on Red Hat OpenShift Container Platform 23.9.2 documentation The following commands will be run from the OC CLI. # Dow ..read more
Visit website
Installing OpenShift Data Science (ODS) in Self-Managed Environment
My OpenShift Blog
by Keith Calligan
8M ago
Late last year, Red Hat Openshift Data Science (ODS) became generally available (GA).  At that time, this service was only available on cloud environments such as AWS and Google (GCP). Earlier this year, ODS became available in self-managed environments to install as an operator. ODS allows data scientists to analyze data and provides a cloud-based instance of Jupyter Notebook.  OpenVINO (model-serving runtime) and Openshift Pipelines are supported as well.  This is just a small snippet of what ODS can do.  Here is some more detail from Red Hat's official site. In this sho ..read more
Visit website
Openshift and Event Driven Ansible (Part 1)
My OpenShift Blog
by Keith Calligan
9M ago
In this series of posts, I will be covering the basics of setting up Ansible Automation Platform and the EDA (Event Driven Ansible) custom-resource.  An Ansible rulebook and an associated playbook will be run to take an alert from OCP via AlertManager.  This alert is received by the Rulebook via a webhook and takes the event variables as some sort of action in a playbook. Eventually, this code will be used to automatically resize a persistent volume claim but that part is still in the works.  Right now, let's just cover the basics for part 1 of this series. Note: This project i ..read more
Visit website
Query OCP API from Pod
My OpenShift Blog
by Keith Calligan
9M ago
I've been sort of busy with projects lately but here is a quick article to keep things going. Recently, I was asked for a way to query the Kubernetes/OCP API to get some information.  The requirements were to have a pod (based off of the minimal ubi8 container) that runs once and exits to query the Kubernetes/OCP API to get the current number of worker nodes in the cluster.  This is by no means the only way to get this information but it highlights different constructs inside of OCP/Kubernetes such as: Service Accounts Cluster-Roles Role Bindings Config-Maps Deployments So in this a ..read more
Visit website
Managing OCP Infrastructures Using GitOps (Part 3)
My OpenShift Blog
by Keith Calligan
1y ago
In the first part of this series, I showed you how the to install the Assisted Service/CIM (Central Infrastructure Management) and how a cluster can be deployed in a declarative manner using approximately 9 YAML files. In the second part of this series, we could see how that YAML files could be condensed into a single SiteConfig file.  The directory structure of the ztp-site-generator container was used to create the following GitHub repository. GitHub - kcalliga/ztp-example at v4.11.3-9 ZTP Example. Contribute to kcalliga/ztp-example development by creating an account on GitHub. GitHub ..read more
Visit website
ACM Add-Ons (Submariner)
My OpenShift Blog
by Keith Calligan
1y ago
After writing the article on VolSync, I wanted to follow-up with another add-on that is available in Advanced Cluster Management (ACM).  This is Submariner.  Here is a great description of Submariner right from its website Submariner Description This article will serve a few purposes.  The install process will be shown first and then a change will be made to the VolSync configuration that was discussed in my previous article.  The specific change being made to VolSync is in regards to the fact that I had to install MetalLB Operator with the previous use-case and enable a Lo ..read more
Visit website
Managing OCP Infrastructures Using GitOps (Part 2)
My OpenShift Blog
by Keith Calligan
1y ago
In the first part of this series, I covered how to take the raw YAML definitions (IE: infraenv, agentclusterinstall, etc) and applying these objects on the command-line to discover a bare-metal host which eventually would instantiate a SNO cluster. For this article, I will show you how you can compose a single site definition YAML to generate these separate object definitions.  It is a lot easier to deal with a siteconfig YAML which is more concise and takes some of the guess-work out of crafting each of these YAML definitions on your own. The ability of taking this single-config YAML def ..read more
Visit website
Advanced Cluster Management Updates (Part 2 of 2)
My OpenShift Blog
by Keith Calligan
1y ago
In the first part of this series, I covered how to generate ACM policies using PolicyGenerator and how to source a policy from a GitHub repo. In this second part, I'm going to rehash some of the information I covered in the following article in regards to creating a hub SNO cluster and enabling CIM/Assisted-Service. Managing OCP Infrastructures Using GitOps (Part 1) This multi-part series will describe the process of using declarative YAML definitions contained in a GitHub repository to describe the desired state of a fleet of OCP clusters. The walkthrough will only include building a managi ..read more
Visit website
Advanced Cluster Management Updates (Part 1 of 2)
My OpenShift Blog
by Keith Calligan
1y ago
It's been almost a year and half since I did the series on Advanced Cluster Management (ACM).  This consisted of four parts Overview of Advanced Cluster Management for Kubernetes (Part 1) This article is based on version 2.3.2 of ACM. ACM provides the mechanism to create and manage Openshift clusters at scale. In some cases, even non-Openshift clusters can be managed. I am going to provide a walkthrough on some of the menus in the Multicluster hub object in My OpenShift BlogKeith Calligan Applications in Advanced Cluster Management for Kubernetes (Part 2) Welcome to part 2 of the AC ..read more
Visit website
Managing OCP Infrastructures Using GitOps (Part 1)
My OpenShift Blog
by Keith Calligan
1y ago
This multi-part series will describe the process of using declarative YAML definitions contained in a GitHub repository to describe the desired state of a fleet of OCP clusters. The walkthrough will only include building a managing cluster (which will include ACM) and adding a SNO (single-node) cluster as a managed cluster but this process can be adapted to managing a large number of clusters.  LibVirt/KVM virtualization will be used in these examples due to limitations in my lab environment. The term ZTP (Zero-Touch Provisioning) has been used very often to describe these processes, but ..read more
Visit website

Follow My OpenShift Blog on FeedSpot

Continue with Google
Continue with Apple
OR