Site Reliability Engineer
Red Hat Software | Sacramento, CA | 4 days ago
About the job:
The Red Hat Engineering team is looking for a Site Reliability Engineer to develop, scale, and operate our OpenShift managed cloud services. Red Hat OpenShift Container Platform is our enterprise Kubernetes distribution. In this role, you will contribute to running OpenShift at scale by enabling customer self-service, making our monitoring system more sustainable, and eliminating work through automation. As part of the Site Reliability Engineering (SRE) team, you will have the opportunity to tackle the complex challenges of scale that are unique to Red Hat managed cloud services, while using your skills in coding, operations, and large-scale distributed system design.

Red Hat relies on teamwork and openness for its success. We are a global team and strive to cultivate a transparent environment that makes room for different voices. We learn from our failures in a blameless environment to support the continuous improvement of the team. At Red Hat, your individual contributions have more visibility than at most large companies, and visibility means career opportunities and growth. Successful applicants must reside in a state where Red Hat is registered to do business.
What you will do:

  • Work with live systems and write automation code

  • Contribute code to increase the scalability and reliability of the service

  • Contribute software tests and participate in peer review to increase the quality of our codebase

  • Help develop peers’ capabilities through knowledge sharing, mentoring, and collaboration

  • Participate in a regular on-call schedule, including occasional paid weekends and holidays

  • Practice sustainable incident response and blameless postmortems

  • Resolve customer issues escalated from the Red Hat Global Support team

  • Work within a small agile team to develop and improve SRE software, support your peers, plan, and self-improve

What you will bring:

  • Bachelor's degree in computer science or a related technical field involving software or systems engineering; direct experience that demonstrates your ability and interest in SRE will also be considered

  • 3+ years of software engineering experience using object-oriented languages, preferably Golang

  • 3+ years of experience managing Linux-based systems in a public cloud like Amazon Web Services (AWS), Google Cloud Platform (GCP), or Microsoft Azure

  • 3+ years of experience with enterprise systems monitoring; knowledge of Prometheus is a plus

  • 1+ year(s) of experience delivering hosted cloud services

  • 1+ year(s) of experience with Kubernetes

  • 1+ year(s) of experience with containers on Linux

  • Ability to collaboratively troubleshoot and solve problems in a team environment

  • Experience troubleshooting an anything-as-a-service (XaaS) offering and some experience working with complex distributed systems

  • Demonstrated ability to debug, optimize code, and automate routine tasks

  • Basic understanding of Unix or Linux operating systems

  • Excellent communication skills in a global team environment

  • Demonstrated ability to quickly and accurately troubleshoot systems issues

  • Solid understanding of standard TCP/IP networking and common protocols like Domain Name System (DNS) and HTTP

  • Direct experience with Kubernetes or Red Hat OpenShift Container Platform is a plus


About Red Hat:
Red Hat is the world’s leading provider of enterprise open source software solutions, using a community-powered approach to deliver reliable and high-performing Linux, hybrid cloud, container, and Kubernetes technologies. Red Hat helps customers integrate new and existing IT applications, develop cloud-native applications, standardize on our industry-leading operating system, and automate, secure, and manage complex environments. Award-winning support, training, and consulting services make Red Hat a trusted adviser to the Fortune 500. As a strategic partner to cloud providers, system integrators, application vendors, customers, and open source communities, Red Hat can help organizations prepare for the digital future.

Benefits:


  • Comprehensive medical, dental, and vision coverage

  • Flexible Spending Account - healthcare and dependent care

  • Health Savings Account - high deductible medical plan

  • Retirement 401(k) with employer match

  • Paid time off and holidays

  • Paid parental leave plans for all new parents

  • Leave benefits including disability, paid family medical leave, and paid military leave

  • Additional benefits including employee stock purchase plan, family planning reimbursement, tuition reimbursement, transportation expense account, employee assistance program, and more!

Note: These benefits are only applicable to full time, permanent associates at Red Hat located in the United States.
