Senior Site Reliability Engineer (SRE) – Datadog Observability
Jade Global
Date: 12 hours ago
City: Chandīgarh, Chandigarh
Contract type: Full time
Job Title: Senior Site Reliability Engineer (SRE) – Datadog Observability
Experience Required: 8+ years overall in SRE and Infrastructure Operations, including at least 3 years of hands-on experience with Datadog
Location: Hyderabad preferred; Pune and remote also considered
Job Summary
We are seeking an experienced Site Reliability Engineer (SRE) to lead end-to-end SRE implementation initiatives with a strong focus on Datadog Observability. The ideal candidate will bring deep technical expertise in building reliable, scalable, and observable systems, with hands-on experience in integrating enterprise applications and middleware.
Key Responsibilities
- Drive end-to-end SRE implementation, ensuring system reliability, scalability, and performance.
- Design, configure, and manage Datadog dashboards, monitors, alerts, and APM for proactive issue detection and resolution.
- Utilize the Datadog Roles API to create and manage user roles, global permissions, and access controls for various teams.
- Collaborate with product managers, engineering teams, and business stakeholders to identify observability gaps and design solutions using Datadog.
- Implement automation for alerting, incident response, and ticket creation to improve operational efficiency.
- Work closely with business and IT teams to support critical Financial Month-End, Quarter-End, and Year-End closures.
- Leverage Datadog AI
- Provide technical leadership in observability, reliability, and performance engineering practices.
Required Qualifications
- 8+ years of experience in Site Reliability Engineering and Observability.
- At least 3 years of hands-on experience with Datadog (dashboards, APM, alerting, log management, Roles API, and monitoring setup).
- Proven experience implementing SRE best practices: incident management, postmortems, automation, and reliability metrics.
- Excellent stakeholder management and communication skills; experience collaborating with business and IT teams.
- Strong problem-solving mindset and ability to work in high-pressure production support environments.
Preferred Qualifications
- Certification in Datadog or related observability platforms.
- Knowledge of CI/CD tools and automation frameworks.
- Experience in cloud platforms (AWS, Azure, or OCI).
- Exposure to ITIL-based production support processes.