Lead I - Production Support
UST Global
-
ID: 59108
5 - 7 Years
1 Opening
Trivandrum
Role description
Job Title: Production Support Engineer - AI-Enabled Operations
Location: [Pune/Kochi/Trivandrum]
Experience: [5-7 Years]
Employment Type: Full-time
Role Overview
We are seeking a highly motivated Production Support Engineer with strong exposure to modern cloud-native ecosystems and a keen interest in AI-driven support automation. The role involves managing and supporting complex enterprise systems spanning infrastructure, workflows, monitoring platforms, and data pipelines, while also exploring intelligent automation to enhance operational efficiency.
Key Responsibilities
Production Support & Operations
- Monitor and support distributed systems across:
- Azure Virtual Machines (VMs) and Azure Kubernetes Service (AKS)
- Temporal workflows and batch orchestration platforms
- Manage and respond to s from DataDog and Azure monitoring tools
- Perform incident triaging, prioritization, and escalation handling
- Investigate production issues across:
- APIs and ETL data pipelines
- SQL systems and data reconciliation processes
- FTP/file transfer dependencies
- Conduct Root Cause Analysis (RCA) and drive resolution closure
- Work closely with engineering teams to ensure system stability and reliability
Monitoring & Observability
- Leverage tools such as:
- Azure logging, diagnostics, and event monitoring
- DataDog dashboards and ing systems
- Identify trends, anomalies, and performance bottlenecks
- Improve observability by enhancing s, dashboards, and runbooks
AI-Driven Support & Automation
Contribute to the adoption and implementation of intelligent operations by:
- Enabling correlation and noise reduction using AI techniques
- Supporting automated incident triaging and prioritization
- Utilizing log analysis and anomaly detection tools
- Assisting in AI-driven RCA generation
- Building or leveraging support copilots / knowledge assistants
- Driving workflow remediation and self-healing automation initiatives
- Supporting predictive monitoring and operational analytics
Required Skills & Qualifications
- Hands-on experience in production support / SRE / operations roles
- Strong exposure to Azure cloud ecosystem (VMs, AKS, monitoring tools)
- Experience with monitoring tools like DataDog or equivalent
- Understanding of:
- API architectures and ETL pipelines
- SQL databases and data validation/reconciliation
- Distributed workflows (e.g., Temporal or similar frameworks)
- Knowledge of incident management and RCA processes
- Familiarity with logging, diagnostics, and observability practices
- Basic scripting skills (Python, Shell, etc.) for automation
Preferred Qualifications
- Experience or exposure to AI/ML-based IT operations (AIOps)
- Knowledge of:
- Anomaly detection techniques
- Intelligent ing systems
- Automation frameworks for self-healing systems
- Experience building or using AI copilots or support assistants
- Understanding of predictive analytics in operations
Soft Skills
- Strong analytical and problem-solving skills
- Ability to work under pressure in a 24/7 support environment (if applicable)
- Effective communication and stakeholder management
- Continuous learning mindset with interest in AI innovation in operations
Skills
api,sql,production support,azure virtual machines,azure kubernetes service,
About UST
UST is a global digital transformation solutions provider. For more than 20 years, UST has worked side by side with the world’s best companies to make a real impact through transformation. Powered by technology, inspired by people and led by purpose, UST partners with their clients from design to operation. With deep domain expertise and a future-proof philosophy, UST embeds innovation and agility into their clients’ organizations. With over 30,000 employees in 30 countries, UST builds for boundless impact—touching billions of lives in the process.How to apply
To apply for this job you need to authorize on our website. If you don't have an account yet, please register.
Post a resumeSimilar jobs
Internship Trainee
Lead II - Data Engineering
eCommerce Solutions Consultant