Subscribe to the latest remote jobs:

Lead SRE Engineer

🇲🇽 Mexico

Management

Python

Azure

Jenkins

Terraform

Finance

Salesforce

GitHub

Design

Sales

Devops

Lead SRE Engineer

from 🇲🇽 Mexico

Role Overview

We are seeking a highly experienced Senior SRE Lead to lead reliability engineering and observability initiatives for critical platforms supporting GM Financials’ ecosystem, with a primary focus on Salesforce and Microsoft Azure environments. This role will be responsible for establishing and scaling SRE practices, driving operational excellence, and ensuring high availability, performance, and resilience of business-critical applications. The ideal candidate will bring deep expertise in cloud-native architecture, observability frameworks, and enterprise-scale production support, along with strong leadership capabilities in a global delivery model.

Key Responsibilities - SRE Leadership & Strategy

  • Lead the SRE function for Salesforce and Azure platforms, defining the roadmap and maturity model

  • Establish and drive SRE best practices, including SLIs, SLOs, and error budgets

  • Build and mentor a high-performing SRE team across onshore and offshore locations

  • Collaborate with GM Financial stakeholders, product teams, and engineering leadership Platform Reliability (Salesforce & Azure)

  • Ensure high availability, performance, and scalability of Salesforce applications and Azure-hosted services

  • Lead major incident management (P1/P2), including triage, stakeholder communication, and resolution

  • Drive root cause analysis (RCA) and implement preventive measures

  • Manage production stability across integrations between Salesforce and Azure services Observability & Monitoring

  • Design and implement end-to-end observability across Salesforce and Azure ecosystems

  • Establish unified monitoring across logs, metrics, and traces

  • Implement and optimize tools such as Azure Monitor, Application Insights, Splunk, Datadog, or similar

  • Define dashboards, alerting strategies, and actionable insights for proactive issue detection Automation & DevOps

  • Drive automation across incident response, remediation, and operational workflows

  • Implement Infrastructure as Code (IaC) practices using tools such as Terraform, ARM templates, or similar

  • Enhance CI/CD pipelines for Salesforce and Azure deployments

  • Enable self-healing systems and reduce manual intervention Cloud & Integration Engineering

  • Optimize Azure infrastructure for performance, resilience, and cost efficiency

  • Support Salesforce platform stability, including integrations, APIs, and middleware components

  • Work closely with integration teams to ensure reliable data flows and system interactions Governance, Risk & Compliance

  • Ensure adherence to GM Financial’s security, compliance, and regulatory requirements (including SOX)

  • Maintain audit-ready processes, documentation, and operational controls

  • Participate in governance forums, audits, and compliance reviews

Required Skills & Qualifications Technical Expertise

  • Strong experience in SRE, DevOps, or Production Engineering roles

  • Hands-on experience with Microsoft Azure (mandatory)

  • Experience supporting Salesforce platforms (Sales Cloud, Service Cloud, integrations)

  • Expertise in observability tools (Azure Monitor, Application Insights, Splunk, Datadog, etc.)

  • Strong scripting/programming skills (Python, PowerShell, or similar)

  • Experience with CI/CD tools (Azure DevOps, GitHub Actions, Jenkins)

  • Familiarity with Infrastructure as Code (Terraform, ARM, Bicep) Operational Excellence

  • Proven experience in managing high-availability production environments

  • Strong understanding of incident management, RCA, and problem management

  • Experience defining and managing SLIs, SLOs, and error budgets Leadership & Stakeholder Management

  • Experience leading distributed/global teams

  • Strong communication and stakeholder management skills

  • Ability to operate in a fast-paced, high-impact environment

  • Strong decision-making and problem-solving capabilities

by @maxrusakovic