Job summary
Title: Site Reliability Engineer (SRE)
Location: CST OR EST ONLY - Please include your city, state on resume
Salary: Base: $110,000 – $150,000
No sponsorship / Must be U.S. Citizen or Permanent Resident (Green Card)…
Job description
Title: Site Reliability Engineer (SRE)
Location: CST OR EST ONLY - Please include your city, state on resume
Salary: Base: $110,000 – $150,000
No sponsorship / Must be U.S. Citizen or Permanent Resident (Green Card)
Benefits: Health, Medical, Dental, Vision, 401(k) with match, Stock Options, PTO, and additional perks
Overview
We are hiring a Site Reliability Engineer to design, secure, and operate highly available infrastructure supporting an AI-driven platform serving U.S. customers, including organizations within regulated industries. This role owns U.S.-based platform operations while collaborating with a global engineering organization in a fast-paced, high-growth environment. Must have GCP and Snowflake.
What You’ll Do
• Design, implement, and operate scalable, fault-tolerant infrastructure primarily on GCP with future multi-cloud expansion
• Lead Infrastructure-as-Code initiatives using Terraform with strong security and governance practices
• Build and maintain CI/CD and DevSecOps pipelines supporting large-scale engineering and AI workloads
• Implement observability and monitoring using Prometheus, Grafana, ELK, and similar tools
• Define SLOs/SLIs, manage error budgets, and lead incident response with blameless postmortems
• Support compliance requirements within regulated U.S. industries
• Automate operational workflows using Python, Go, or Bash
• Collaborate with global teams while owning U.S. platform operations and incidents
What We’re Looking For
• Bachelor’s degree in Computer Science, Engineering, or equivalent experience
• 2+ years of experience in SRE, DevOps, or Systems Engineering
• Strong Terraform and Infrastructure-as-Code experience
• Proficiency with Python and scripting languages
• Experience with CI/CD tools (GitHub Actions, GitLab CI, Jenkins, ArgoCD, etc.)
• Cloud experience (GCP preferred; AWS/Azure a plus)
• Kubernetes and Docker experience
• Experience in regulated environments (Aerospace & Defense, Finance, Healthcare preferred)
• Strong communication skills and security-first mindset
Nice to Have
• Hyper-growth startup experience
• AI safety, MLOps, or AI/ML infrastructure security experience
Source & verification
Source:
External source
Verified listings are reviewed or posted directly by trusted sources.
Imported: Jan 27, 2026 23:32
Applicant notice
Jobs Malawi does not ask for payment or sensitive credentials. Never share OTPs or passwords.
If something looks suspicious, report this job.
Last updated: Jan 31, 2026 06:37