Senior Site Reliability Engineer I

🇺🇸 Boston, Massachusetts
$2K - $2K Annual
Posted 1 month ago
Expires June 9, 2026
Full TimeHybridEngineering

Axon is seeking a Senior Site Reliability Engineer I to join our Observability team within the Site Reliability organization. This team is responsible for managing metrics, logging, tracing, and alerting infrastructure across Axon's global environments, ensuring our complex distributed systems are observable and reliable.

In this role, you will own and evolve Axon's distributed tracing infrastructure, including Jaeger and OpenTelemetry-based instrumentation, driving adoption across our service-oriented architecture. You will build and operate our log aggregation platform (Grafana Loki + Alloy), expand its use cases beyond Kubernetes event logs, and reduce organizational dependency on third-party log tools. Additionally, you will maintain and improve our metrics infrastructure (Cortex, Prometheus, Grafana), which serves as the foundation for alerting, dashboards, and SLO tracking across all environments.

The ideal candidate will have a Bachelor's Degree in Computer Science, Engineering, or a related technical field, along with 7+ years of experience in SRE, platform engineering, or infrastructure engineering. Strong Linux systems fundamentals and comfort working in Kubernetes-based environments are essential. Hands-on experience with components of the LGTM stack—Loki, Grafana, Tempo/Jaeger, or Mimir/Cortex—is required, as well as proficiency with infrastructure as code tools like Terraform. Experience with Golang, Python, or Java is also necessary. U.S. citizenship is required to gain CJIS clearance for full U.S. production access.

This position is based in our Boston, MA office and follows a hybrid schedule, with in-person collaboration from Tuesday through Friday and remote work on Mondays. Axon offers competitive salary and 401k with employer match, discretionary time off, and paid parental leave for all employees.

More Jobs at AXON