Senior Software Engineer - Site Reliability Job at Abnormal Security, Remote

L01ubWRxb3NNUjZSNGh4OXh5MHBjK0tuelE9PQ==
  • Abnormal Security
  • Remote

Job Description

About the Role

Abnormal Security is looking for a Senior Software Engineer - Site Reliability to join our Infrastructure team. In this role, you will be responsible for the reliability, scalability, and operational excellence of our systems and services. You will lead initiatives to improve the operational maturity of both SRE-managed services and critical product systems, driving change across the organization in support of stable operations.

As a senior member of the team, you will independently define and execute quarterly goals, create forward-looking roadmaps, and own cross-functional projects aligned with company-level objectives. You will serve as a key advocate for reliability, providing technical leadership, deep analysis, and mentorship while embedding with product teams as needed to improve service ownership and incident response practices.

The ideal candidate:

  • Has strong technical depth in distributed systems and operational excellence
  • Possesses a product-focused mindset with the ability to translate business needs into reliability goals
  • Is a strong communicator and mentor, able to influence both within the SRE team and across engineering
  • Has demonstrated experience leading broad technical initiatives across teams and systems

What You Will Do

  • Own the operational maturity of services in the SRE software stack, driving architectural and tooling improvements
  • Proactively partner with product teams to embed SRE best practices and support services with operational challenges
  • Independently define and drive quarterly goals for the SRE team with measurable impact on system reliability and developer productivity
  • Design and maintain systems that promote observability, automated recovery, scalability, and resilience
  • Lead incident reviews and root cause analyses; ensure follow-up actions are implemented and shared across teams
  • Collaborate with engineering leadership to shape the team roadmap and contribute to company-wide reliability goals
  • Mentor other engineers and drive adoption of SRE principles throughout the engineering organization

Must Have

  • 8+ years of experience in infrastructure, DevOps, or Site Reliability Engineering roles
  • Deep knowledge of production-grade distributed systems and cloud-native architectures
  • Demonstrated experience managing service availability, latency, and incident response in production environments
  • Strong programming skills in Python, Go, or similar languages
  • Experience with Kubernetes, Terraform, and observability tools (e.g., Prometheus, Grafana, Datadog)
  • Proven ability to lead complex, multi-team initiatives and influence system design for reliability

Nice To Have

  • Prior experience embedding with product engineering teams to support operational goals
  • Familiarity with AWS and multi-cloud environments (e.g., Azure, GCP)
  • Experience in regulated environments or with FedRAMP-compliant systems
  • Contributions to open-source SRE tooling or community knowledge sharing

#LI-NT1


At Abnormal AI, certain roles are eligible for a bonus, restricted stock units (RSUs), and benefits. Individual compensation packages are based on factors unique to each candidate, including their skills, experience, qualifications and other job-related reasons. We know that benefits are also an important piece of your total compensation package. Learn more about our Compensation and Equity Philosophy on our page.

Base pay range:

$176,000—$207,050 USD

San Francisco/New York Base pay range:

$195,000—$230,000 USD


Abnormal AI is an equal opportunity employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, disability, protected veteran status or other characteristics protected by law. For our EEO policy statement please click here . If you would like more information on your EEO rights under the law, please .

Job Tags

Remote job,

Similar Jobs

Crockett County Schools

Proofreading/Copy Editor Job at Crockett County Schools

 ...knowledge and skills to succeed in all post-secondary endeavors.. Your primary duties will include following a content strategy, proofreading and editing, and collaborating with co-workers to deliver quality projects on time. To excel in this role, you will have... 

Accertify

Assistant Global Controller Job at Accertify

 ...reporting into cost centers and product lines. Ensure all balance sheets accounts are reconciled monthly. Coordinate closely with the FP&A team in understanding financial results, providing input and insights into the balance sheet, cash forecast, and P&L rolling 12-... 

Healthwaze

Neurology - Physician Opportunity only Job at Healthwaze

 ...commensurate with experience Apply today to be considered for the Neurologist Opportunity Near Tampa, Florida. Requirements Board Certified Neurology Active Florida License Benefits Full Benefits Health Insurance Paid Malpractice Relocation Allowance 401(k) with Company Match

Offsite Professionals LLC

General Virtual Assistant Job at Offsite Professionals LLC

Starting rate is $5-$6 per hour for a 20-hour work week. Kindly read through the qualifications and job scope before submitting your application. Qualifications: Advanced proficiency in English communication, both written and verbal. Strong proficiency in Excel...

Get It - Executive

Customer Service Agents - Remote Job at Get It - Executive

 ...candidates About the Role We are seeking experienced travel and customer service agents who are reliable, motivated, and able to work from home. As a Remote Customer Service Agent, you'll provide exceptional support in the travel industry while assisting customers with...