Observability Strategy & Governance Lead

Date: Mar 19, 2026

Location: Hyderabad, Telangana, TG, IN, 500081

Company: Hubbell Incorporated

Job Overview

The Observability Strategy & Governance Lead defines and governs the enterprise observability strategy to deliver a unified, proactive view of IT service health, performance, and availability. This role establishes telemetry and instrumentation standards (covering metrics, logs, traces, and synthetic monitoring) and ensures consistent implementation across applications and infrastructure. The person in this role should have broad knowledge of observability tools, frameworks, and techniques. The role defines the toolset, strategy, and direction needed for a holistic view of service status, providing insight into the cause and effect of service disruption or degradation. It works with IT operations and business partners to define metrics that reflect real-world business impact, enabling teams to understand the business implications of events. While not directly implementing instrumentation, the role enforces governance standards and provides architectural guidance to ensure telemetry is actionable, aligned with technical and business goals, and supportive of continuous improvement.

 

A Day In The Life

In this role, your day centers on shaping how the organization sees and understands the health of its technology services. You work with IT operations, application teams, infrastructure partners, and business stakeholders to define what “good” observability looks like and ensure teams are building toward a shared vision.
 
You may start the day reviewing the effectiveness of existing observability patterns: alert quality, noise levels, and whether dashboards and metrics clearly reflect service and business impact. You provide guidance to teams integrating observability platforms, helping them align instrumentation, event correlation, and synthetic monitoring with enterprise standards.
 
Throughout the day, you collaborate with business partners to identify the metrics that matter most, translating service behavior into insights that help leaders understand risk, impact, and performance. You also govern tool usage and configuration decisions, balancing value, cost efficiency, and consistency across a hybrid and multi-cloud environment.
 
When incidents or service degradations occur, your work shows up in high-impact moments: well-defined telemetry, actionable alerts, and clear cause-and-effect visibility that support faster diagnosis and better decision-making. You continuously refine standards, templates, dashboards, and implementation guidance to improve observability maturity across the enterprise.
 
  • Define and drive an observability roadmap aligned with operations objectives, business objectives, and reliability goals.
  • Govern tool selection and configuration to ensure delivery of maximum value and cost efficiency.
  • Establish and enforce enterprise-wide observability standards, including specifications for metrics, logs, traces, and synthetic checks by system and service tier.
  • Collaborate with business partners to identify key metrics tied to business outcomes.
  • Provide architectural guidance and policy enforcement to application and infrastructure teams for platform integration, event correlation rules, and synthetic monitoring.
  • Govern the configuration and integration of observability platforms with major incident tools (e.g., PagerDuty) to ensure high-fidelity, actionable alerting.
  • Create and socialize implementation guides, templates, and operational dashboards, with a focus on enabling single-pane-of-glass views that cut across the enterprise and its services.

What will help you thrive in this role?

  • 7+ years in IT operations, monitoring architecture, observability, or site reliability engineering.
  • Strong expertise in observability domains: telemetry design (metrics, logs, traces), synthetic monitoring, and event correlation.
  • Familiarity with common APM and observability platforms (e.g., Dynatrace, LogicMonitor, New Relic, Application Insights, Datadog, SAP ALM).
  • Proven ability to understand system architectures, define effective monitoring strategies, and guide implementation across hybrid and multi-cloud environments.
  • Experience developing observability frameworks or standards that support decentralized implementation and enable single-pane-of-glass visibility.
  • Strong understanding of ITIL Monitoring & Event Management practices; experience aligning tooling with service tier models and operational objectives.
  • Proven ability to lead cross-functional governance initiatives and translate technical performance into business impact for product and executive teams.


Job Segment: ERP, SAP, Technology