We are building a high-performance, multi-tenant OpenShift cluster on bare metal in our AI-optimized data center. Our goal is to offer OpenShift as a Service to private AI-focused clients who wish to host their compute-intensive workloads in a scalable, secure, and isolated environment.
We’re looking for a hands-on Red Hat OpenShift Engineer who can design, architect, and implement this platform from scratch using industry best practices. Once the cluster is built, this role will also lead the migration of existing on-premises and Azure workloads to the new OpenShift environment.
Key Responsibilities
Design, architect, and build a multi-tenant OpenShift cluster on bare metal
Configure and maintain all aspects of the OpenShift platform for high availability, scalability, and security
Define and implement networking, storage, ingress, monitoring, and logging
Develop detailed architecture documents, blueprints, and SOPs
Lead and execute the migration of existing workloads from on-premises and Azure environments to OpenShift
Ensure smooth onboarding for multiple AI-focused client tenants with isolated environments
Support the DevOps team with advanced Linux/OpenShift troubleshooting
Implement and enforce RBAC, tenant isolation, resource quotas, and compliance controls
Optimize performance for AI-heavy workloads running on GPU-enabled infrastructure
Own operational stability, platform upgrades, and monitoring
Required Skills & Experience
5+ years of deep hands-on experience with Red Hat OpenShift and Kubernetes
Proven experience designing, building, and managing bare metal OpenShift clusters
Solid understanding of Linux internals, container runtimes, networking, and troubleshooting
Experience migrating applications from both on-premises and Azure environments into OpenShift
Strong experience with multi-tenancy architecture, including workload isolation and security
Familiarity with storage (CSI), networking (CNI), and service mesh implementations
Proficiency in monitoring and observability tools (e.g., Prometheus, Grafana, ELK)
Experience with Infrastructure as Code (Ansible, Terraform) and CI/CD automation
Strong documentation and communication skills
Certifications (Required)
Red Hat Certified Specialist in OpenShift Administration
Red Hat Certified Engineer (RHCE) or equivalent Linux certification
Nice to Have
Familiarity with AI/ML compute environments (e.g., GPU workloads, NVIDIA operators)
Experience with hybrid cloud or edge computing models
Exposure to enterprise-grade security, compliance, and policy enforcement