Site Reliability Engineering (SRE) Team Lead Job at OneMain Financial, Irving, TX

dEVKQWZ5S25uU0RjbjJ3Z2VneWFjUVZEUXc9PQ==
  • OneMain Financial
  • Irving, TX

Job Description

We are looking for a highly skilled and experienced Site Reliability Engineering Team Lead to guide our SRE team, foster best practices, and ensure operational excellence across our infrastructure. **Position Overview** As the SRE Team Lead, you will be responsible for the technical leadership of a talented team of site reliability engineers dedicated to maintaining and improving the reliability, scalability, and performance of our critical systems and services. You will serve as a technical leader and mentor, driving strategic initiatives around automation, incident management, observability and system design while collaborating closely with engineering, operations, and product teams. **Key Responsibilities** - Lead, mentor, and grow a team of site reliability engineers, promoting a culture of reliability, automation, and continuous improvement. - Drive the design, implementation, and maintenance of scalable and fault-tolerant infrastructure to support high-availability services. - Oversee incident management processes, including triage, root cause analysis, and postmortems to improve system reliability and prevent recurrence. - Collaborate cross-functionally with software engineering, product, and operations teams to integrate reliability best practices into the software development lifecycle. - Define and implement operational metrics, SLIs/SLOs, and dashboards to monitor system health and drive proactive improvements. - Manage and assess the observability of critical environments proactively addressing gaps that may arise. - Oversee the release management processes, artifacts and tools that drive a repeatable software delivery lifecycle. - Champion automation efforts to reduce manual intervention, improve deployment pipelines, and optimize infrastructure management. - Lead capacity planning, disaster recovery, and performance tuning efforts. - Ensure security and compliance standards are upheld across infrastructure and operations. **Qualifications** - BA/BS in Computer Science, Engineering, related field, or equivalent experience. - 7+ years of experience in site reliability engineering, systems engineering, or related roles, with at least 2 years in a leadership position. - Proven experience leading and scaling high-performing engineering teams. - Deep expertise in cloud platforms (AWS, GCP, Azure) and container orchestration (Kubernetes, Docker). - Strong skills in infrastructure as code tools (Terraform, Ansible, CloudFormation) and CI/CD pipelines. - Proficiency with monitoring and alerting systems (Prometheus, Grafana, ELK, Datadog). - Solid programming and scripting skills (Python, Go, Bash, or similar). - Strong understanding of distributed systems, networking, security, and databases. - Excellent leadership, communication, and collaboration skills. - Experience managing incident response and on-call rotations. **Preferred** - Experience working with microservices and event-driven architectures. - Familiarity with compliance frameworks such as GDPR, PCI, SOX, or SOC 2. - Background in DevOps practices and tooling **Who we Are** OneMain Financial (NYSE: OMF) is the leader in offering nonprime customers responsible access to credit and is dedicated to improving the financial well-being of hardworking Americans. Since 1912, we've looked beyond credit scores to help people get the money they need today and reach their goals for tomorrow. Our growing suite of personal loans, credit cards and other products help people borrow better and work toward a brighter future. Driven collaborators and innovators, our team thrives on transformative digital thinking, customer-first energy and flexible work arrangements that grow lives, careers and our company. At every level, we're committed to an inclusive culture, career development and impacting the communities where we live and work. Getting people to a better place has made us a better company for over a century. There's never been a better time to shine with OneMain. Because team members at their best means OneMain at our best, we provide opportunities and benefits that make their health and careers a priority. That's why we've packed our comprehensive benefits package for full- and some part-timers with: + Health and wellbeing options including medical, prescription, dental, vision, hearing, accident, hospital indemnity, and life insurances + Up to 4% matching 401(k)   + Employee Stock Purchase Plan (10% share discount)   + Tuition reimbursement   + Paid time off (15 days' vacation per year, plus 2 personal days, prorated based on start date) + Paid sick leave as determined by state or local ordinance, prorated based on start date + Paid holidays (7 days per year, based on start date) + Paid volunteer time (3 days per year, prorated based on start date) OneMain Holdings, Inc. is an Equal Employment Opportunity (EEO) and Affirmative Action (AA) employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender perception or identify, national origin, age, marital status, protected veteran status, or disability status.

Job Tags

Local area, Flexible hours,

Similar Jobs

CCMI

Merchandiser/Auditor Position Available - Marshall MN Job at CCMI

- CLICK on JOB opportunities to complete your registration Merchandising/Audits available. See all information pertaining to rate of pay and tasks to be completed on the CCMI website (link below) This is not a daily job, nor will it lead to Full Time. These are...

Openkyber

IAM Governance Job at Openkyber

 ...drive the overall SRE architecture, standards, best practices, and governance. Design highly scalable, resilient, and fault-tolerant...  ...Collaboration & Leadership: Work with application, security, data, and infrastructure teams to ensure reliability is built into... 

American Red Cross

Blood Collection CDL Driver Job at American Red Cross

 ...communication skills are required. A current, valid driver's license with Class A or B Commercial Drivers License (CDL) and a good driving record is required. Experience driving large vehicles is strongly preferred. DOT certification is required, you must pass... 

Decagon AI, Inc.

Technical Recruiter Job at Decagon AI, Inc.

 ...in building a worldclass organization. About the Role Decagon is rapidly scaling our team and looking for highly capable Recruiters to fuel this growth. As a Technical Recruiter, youll have ownership and autonomy in partnering closely with company leaders and... 

THRIFT TOWN STORES, LLC

Assistant Store Manager Job at THRIFT TOWN STORES, LLC

 ...Mid-West Textile, LLC in El Paso, TX. Along with our Thrift Town Store to service our customers. Join us in making Earth a better...  ...the push and pull of merchandise on the sales floor. Analyze retail floor patterns and make recommendations to the team members to move...