Site Reliability Engineer

Ather Energy


Date: 2 weeks ago
City: Pune, Maharashtra
Contract type: Full time
You’ll be our: Site Reliability Engineer

You’ll be based at: Pune Zonal Office

You’ll be aligned with: Cloud and Data Platform Lead / Cloud Architect

You’ll be a member of: Cloud and Data Platform Team

Ather's fleet of smart scooters is growing rapidly, and so is the volume of data they generate. Our Vehicle Data Platform (VDP) is the core of this ecosystem, and its stability and scalability are critical to our success. We are looking for a foundational Site Reliability Engineer to join our VDP team, taking full ownership of our data infrastructure and building a robust reliability practice to support our rapid growth.

What You’ll Do at Ather:

  • Run and own the production environment by managing alerts, leading incident response, conducting root cause analysis (RCA), and implementing permanent fixes.
  • Take full ownership of our ClickHouse database clusters as we move off a managed service and bring their performance, reliability, and scaling in-house.
  • Build and maintain our core infrastructure using Infrastructure-as-Code principles (Terraform).
  • Perform critical, periodic maintenance and upgrades for our infrastructure, with a strong focus on Kubernetes, Cloud SQL, and data workloads like Kafka.
  • Partner with the Data Engineering team to support the underlying infrastructure for our new Databricks platform, ensuring robust and efficient data ingestion pipelines.
  • Enhance observability by building and refining our monitoring, logging, and tracing systems to proactively identify performance bottlenecks.
  • Lead capacity planning and forecasting for all cloud workloads, ensuring our platform can scale effectively for the next 6-12 months.
  • Drive cloud cost optimization by monitoring spending, identifying and implementing savings opportunities, and ensuring resource governance.

Here’s What We Are Looking For:

  • Our ideal candidate is a strong software engineer at heart with deep expertise in cloud-native infrastructure.
  • The main focus areas for this role are:
      • Significant Coding Experience: You must have a strong software engineering background with significant coding experience in a language like Python, Go, or Java, focusing on writing clean, scalable, and automated solutions.
      • Deep Cloud Proficiency: You need deep, hands-on experience with at least one major cloud provider (GCP, AWS, or Azure). A strong background in GCP is highly preferred.
      • Production Kubernetes Expertise: You must have proven, hands-on experience designing, running, and troubleshooting applications on Kubernetes in a production environment.
  • Other key qualifications include:
      • Hands-on experience with infrastructure automation tools like Terraform or Ansible.
      • Strong expertise in building and managing CI/CD pipelines.
      • Experience administering, monitoring, and scaling ClickHouse clusters is highly desirable.
      • Familiarity with data platforms like Databricks and their infrastructure requirements.
      • Experience with messaging queues like Kafka.
      • Strong Linux administration, system internals, and network troubleshooting skills.

You Bring to Ather:

  • A Bachelor’s or Master’s degree in Computer Science or a related engineering field.
  • 3 to 6 years of relevant experience as a Site Reliability Engineer, DevOps Engineer, or Software Engineer with a focus on infrastructure.
