Senior Site Reliability Engineer, Observability

🇦🇷 Remote, Argentina
Posted Just posted
Expires July 11, 2026
Full Time, Remote, Engineering, Operations

Webflow is seeking a Senior Site Reliability Engineer specializing in Observability to improve the reliability and stability of its customer-facing production infrastructure, which serves millions of page views per hour. With more than 2 million users across 190 countries, Webflow empowers teams to design, launch, and optimize websites without barriers. This role is pivotal in ensuring the platform's security and scalability as tens of thousands of projects are launched each month.

The engineer will join the newly formed Observability team, responsible for providing engineers across Webflow with the tools, data, and practices necessary to understand the health and performance of the Webflow application and hosting services. Key responsibilities include owning and evolving Webflow's observability stack, including OpenTelemetry and Datadog, to deliver reliable, actionable metrics, traces, and logs across services. The role also involves driving the adoption of Service Level Objectives (SLOs), distributed tracing, and structured logging throughout engineering, as well as building and maintaining AI-powered agents and automation to surface insights faster and reduce alert fatigue.

Candidates should have a background as a software engineer with enthusiasm for observability, infrastructure, and reliability, or as an infrastructure or production engineer with a passion for coding. A minimum of 5 years of experience in building, maintaining, and debugging distributed systems in customer-facing environments with minimal downtime is required. Hands-on experience with observability platforms and tools such as Datadog, Grafana, Prometheus, or Elasticsearch is essential, along with experience in OpenTelemetry or similar instrumentation frameworks. Proficiency in defining and operationalizing SLOs/SLIs at scale, navigating multi-tier cloud environments on AWS or GCP, and working with container-centric architectures using Docker and Kubernetes is also expected.

Webflow offers a remote-first work environment, and this position is based in Argentina. The company provides a comprehensive benefits package, including equity ownership (RSUs); 100% employer-paid healthcare, vision, and dental insurance coverage for full-time employees and their dependents; flexible paid time off; and access to mental wellness and professional coaching services. Employees also have opportunities for professional growth and development within a collaborative and innovative company culture.
