Site Reliability Engineer (SRE) Job at Openkyber, Georgia

czBCRWRpS2lteWZkbUdNamZRaVJkUVJEUVE9PQ==
  • Openkyber
  • Georgia

Job Description

TEKsystems is hiring for a fully remote, Level 5 SRE for one of our clients. The role can sit in any US state and any timezone.

This is a short-term contract role with funding till end of January 2026 but may extend beyond.

Our client, a digital asset exchange platform where users can buy, sell, and store cryptocurrencies, is seeking a high-level, Senior SRE to join their AI Infrastructure team.

The following experience is REQUIRED :

  • Site Reliability Engineering (SRE) background
  • AI infrastructure familiarity (nice-to-have, not mandatory)
  • Strong Go and Python scripting skills
  • Terraform for infrastructure as code
  • GCP or AWS Cloud Infra (logging, observability, pub/sub, cloud syncs)
  • Vector.dev and Datadog for observability pipeline
  • Security risk assessment and remediation
  • Ability to own projects end-to-end with minimal supervision

Description

We are looking for a Site Reliability Engineer (SRE) to join the IT AI Infrastructure team to deploy, manage, and optimize AI-powered productivity tools and in-house AI solutions that enhance employee efficiency at scale. A successful candidate will have demonstrated success in similar roles within high-growth, security-conscious environments, bringing deep expertise in public cloud infrastructure (AWS/GCP), backend development (Python, Go, or Java), and automation tooling. The right person is passionate about building scalable and reliable AI infrastructure, driving automation, and collaborating across disciplines to integrate AI systems while maintaining strong security and compliance standards.

  • Deployment and Management of AI Tools: Deploy, configure, and manage AI-powered employee productivity tools and in-house AI built solutions
  • Reliability and Performance: Ensure high availability, reliability, and optimal performance of AI platforms and services. Implement monitoring, alerting, and incident response procedures.
  • Scalability and Infrastructure: Design and implement scalable infrastructure to support the growing demands of AI tools and user base. Optimize resource utilization and manage capacity planning.
  • Automation and Tooling: Develop and maintain automation scripts and tools to streamline deployment, monitoring, and maintenance tasks. Contribute to the experimental sandbox environments for testing new AI solutions.
  • Collaboration and Support: Collaborate with cross-functional teams (Machine-Learning, HR, Security, Data Science, Developer Experience) to support the development and integration of AI solutions. Provide technical support and troubleshooting for AI-related issues.
  • Security and Compliance: Adhere to security and privacy policies while deploying and managing AI tools. Ensure compliance with regulatory requirements.
  • Monitoring and Metrics: Implement comprehensive monitoring and metrics to track the performance and health of AI systems. Analyze data to identify areas for improvement and optimization.
  • Incident Response: Participate in incident response and troubleshooting for AI-related outages or performance issues. Develop and maintain incident response plans.
  • Backend Development: Contribute to backend development tasks to support the integration and functionality of AI tools.
  • Public Cloud Management: Deploy and manage AI solutions on public cloud platforms (AWS/GCP), leveraging cloud-native services and best practices.
  • Written and Verbal Communication: Excellent communication skills and experience presenting technical information to non-technical audiences, including senior leadership.

Skills

Proven experience as a Site Reliability Engineer (SRE) or similar role. Strong understanding of AI technologies and platforms. Experience with deploying and managing applications in a cloud environment (AWS/GCP). Solid backend development experience with programming languages such as Python, Java, or Go. Strong proficiency in managing and configuring public cloud services (AWS/GCP) for scalability and reliability.

Experience with automation tools and scripting (e.g., Ansible, Terraform, Bash, Python). Excellent troubleshooting and problem-solving skills. Strong communication and collaboration skills. Strong security and compliance understanding. Experience working in a highly regulated environment Experience in a fast-paced, high-growth company

Education

Proven experience as a Site Reliability Engineer (SRE) or similar role. Strong understanding of AI technologies and platforms. Experience with deploying and managing applications in a cloud environment (AWS/GCP). Solid backend development experience with programming languages such as Python, Java, or Go. Strong proficiency in managing and configuring public cloud services (AWS/GCP) for scalability and reliability.

Experience with automation tools and scripting (e.g., Ansible, Terraform, Bash, Python). Excellent troubleshooting and problem-solving skills. Strong communication and collaboration skills. Strong security and compliance understanding. Experience working in a highly regulated environment. Experience in a fast-paced, high-growth company

Additional Skills & Qualifications

Role: AI Site Reliability Engineer (Contractor, IC5 level)

Team: IT EMPA (Employee Productivity & Automation)

Duration: Open until end of January (possible extension)

Location: Remote

Responsibilities:

  • Manage and enhance AI-driven employee productivity tools (e.g., Glean, Google Workspace, Slack AI)
  • Implement observability solutions (logging, metrics, dashboards)
  • Automate infrastructure tasks using Terraform
  • Assess and mitigate security risks in AI systems
  • Build scaffolding APIs for unsupported Glean features
  • Collaborate with engineering teams to deliver production-ready solutions quickly

Job Type & Location

This is a Contract position based out of Oakland, CA.

Pay and Benefits

The pay range for this position is $90.00 - $100.00/hr.

Eligibility requirements apply to some benefits and may depend on your job classification and length of employment. Benefits are subject to change and may be subject to specific elections, plan, or program terms. If eligible, the benefits available for this temporary role may include the following:

  • Medical, dental & vision
  • Critical Illness, Accident, and Hospital
  • 401(k) Retirement Plan Pre-tax and Roth post-tax contributions available
  • Life Insurance (Voluntary Life & AD&D for the employee and dependents)
  • Short and long-term disability
  • Health Spending Account (HSA)
  • Transportation benefits
  • Employee Assistance Program
  • Time Off/Leave (PTO, Vacation or Sick Leave)

Workplace Type

This is a fully remote position.

Application Deadline

This position is anticipated to close on Dec 5, 2025.

About TEKsystems:

We're partners in transformation. We help clients activate ideas and solutions to take advantage of a new world of opportunity. We are a team of 80,000 strong, working with over 6,000 clients, including 80% of the Fortune 500, across North America, Europe and Asia. As an industry leader in Full-Stack Technology Services, Talent Services, and real-world application, we work with progressive leaders to drive change. That's the power of true partnership. TEKsystems is an Allegis Group company.

The company is an equal opportunity employer and will consider all applications without regards to race, sex, age, color, religion, national origin, veteran status, disability, sexual orientation, gender identity, genetic information or any characteristic protected by law.

About TEKsystems and TEKsystems Global Services

We're a leading provider of business and technology services. We accelerate business transformation for our customers. Our expertise in strategy, design, execution and operations unlocks business value through a range of solutions. We're a team of 80,000 strong, working with over 6,000 customers, including 80% of the Fortune 500 across North America, Europe and Asia, who partner with us for our scale, full-stack capabilities and speed. We're strategic thinkers, hands-on collaborators, helping customers capitalize on change and master the momentum of technology. We're building tomorrow by delivering business outcomes and making positive impacts in our global communities. TEKsystems and TEKsystems Global Services are Allegis Group companies. Learn more at TEKsystems.com.

The company is an equal opportunity employer and will consider all applications without regard to race, sex, age, color, religion, national origin, veteran status, disability, sexual orientation, gender identity, genetic information or any characteristic protected by law.

Job Tags

Contract work, Temporary work, For contractors, Remote work,

Similar Jobs

Vital Tech Solutions

Software Test Analyst Job at Vital Tech Solutions

 ...Write and maintain reusable test plans, test strategies, and testing scripts. Perform functional, regression, performance, accessibility, integration, and user acceptance testing. Generate and maintain test reports, metrics, and test data. Conduct accessibility... 

Westinghouse

Engineer, Principal - Nuc Job at Westinghouse

Nuclear EnergyOpportunity Overview:Engineer, Principal Nuc (Heat Exchanger Program Engineer) based in Juno Beach, FL but you can work remote.This is a 1 year. contract assignment.(W-2)You will perform engineering activities that support projects and operations across... 

Perchwell, Inc.

Director of Product, Growth & Platform Strategy Job at Perchwell, Inc.

 ...leading real estate technology firm in New York is seeking a Director of Product to drive product strategy and growth initiatives. Ideal candidates will have 6+ years in SaaS product management and a proven track record in both B2B and B2C environments. The role involves... 

University of Cincinnati

Post Doc Fellow, Biology, Lander Lab, College of Arts and Sciences Job at University of Cincinnati

 ...editing and cell signaling in T. cruzi ( This position offers an excellent opportunity to learn and apply cellular and molecular biology techniques. Qualified candidate should be able to work independently as well as in a group environment. The post-doc will train in... 

Commonwealth Medical Services

Neonatal Nurse Practitioner / Physician Assistant (NP/PA) - South Dakota Job at Commonwealth Medical Services

 ...Neonatal Nurse Practitioner / Physician Assistant (NP/PA) Position Summary The Neonatal Nurse Practitioner or Physician Assistant provides comprehensive medical care to neonates in the neonatal intensive care unit (NICU) and other newborn care settings. Working collaboratively...