Job
Description
About the Job
Red Hat is looking for a Principal Software Maintenance Engineer to help a new team aimed at improving the long-term product experience of our Red Hat OpenShift AI(RHOAI) customers. In this role, you will work closely with Product Engineering to deliver extended product maintenance work, including handling bugs and CVEs in extended life streams of Red Hat OpenShift AI. You'll be actively driving all aspects of Red Hat OpenShift maintenance for a subset of product components focused on software development that goes beyond the normal maintenance phase of Red Hat OpenShift AI minor releases. You will contribute to the patch review process, the design and development and execution of backports, test automation and test plans, and the creation of automation tools to improve the efficiency, flexibility, and quality of Red Hats patch delivery to customers. You will provide technical leadership for engineers within the team. What you will do
Contribute to the development of new Red Hat OpenShift AI product maintenance offerings, aiming to make Red Hat a global leader in long-term product experience.
Explore and experiment with emerging AI technologies relevant to software development, proactively identifying opportunities to incorporate new AI capabilities into existing workflows and tooling.
Collaborate with cross-functional teams to identify opportunities for AI integration within the software development lifecycle, driving continuous improvement and innovation in engineering practices; share use cases for successful experiments with stakeholders for broader use.
Provide technical vision and leadership on critical and high-impact projects
Ensure special focus on non-functional requirements including security, resiliency, and maintainability are met
Write complex integration tests and work with quality engineers to ensure product quality
Use CI/CD best practices to deliver solutions as productization efforts into RHOAI
Contribute to a culture of continuous improvement by sharing recommendations and technical knowledge with team members
Collaborate with product management, other engineering, and cross-functional teams to analyze and clarify business requirements
Communicate effectively to stakeholders and team members to ensure proper visibility of development efforts
Give thoughtful and prompt code reviews
Diagnose the root cause of failures
Maintain clear and comprehensive documentation for tests, test cases, and testing procedures.
Participate in code reviews to ensure test coverage and quality standards.
Stay updated on emerging testing tools, technologies, and best practices.
What will you bring Development expertise in Golang and/or Python,
Advanced experience testing applications in Golang and/or Python
Advanced experience in Kubernetes, OpenShift, or other cloud-native technologies
Advanced experience in React, Typescript
Strong understanding of CI/CD practices and experience working with tools like Jenkins, GitHub Actions, GitLab CI, or similar.
Ability to quickly learn and guide others on using new tools and technologies
Autonomous work ethic, thriving in a dynamic, fast-paced environment.
Experience with source code management tools such as Git
Hands on experience in release pipeline preparation & Builds creation
Hands on experience in Builds validation & smoke testing, including upgrade testing
Technical leadership acumen in a global team environment
Good Problem solving and troubleshooting skills with Root Cause Analysis experience
Ability to work independently in a fast-paced, ever-changing environment
Excellent written and verbal communication skills
Experience with Agile/Scrum methodologies.
The following will be considered a plusWhile a Bachelors degree or higher in computer science or a related discipline is valued, we prioritize practical experience and technical prowess
Knowledge of the AI and machine learning landscape, including basic MLOps concepts
Understanding of how Open Source and Free Software communities work
Experience with public cloud services (AWS, GCE, Azure)
Experience working with or deploying MLOps platforms