DevOps Engineer – AI Image Editing SaaS Platform (Kubernetes + GPU + Cloud)
SuperDNA 3D Lab
Date: 1 week ago
City: Chandīgarh, Chandigarh
Contract type: Part time
Remote

We’re seeking a DevOps Engineer with hands-on experience managing GPU infrastructure, Kubernetes, and hybrid cloud environments (bare-metal, AWS, GCP). You’ll work closely with AI researchers and full-stack developers to build and scale the infrastructure that powers our image-processing microservices.
Responsibilities
Manage and optimize Kubernetes clusters across bare-metal servers, AWS (EKS), and GCP (GKE)
Deploy and maintain GPU-enabled workloads for AI inference and training (NVIDIA drivers, nvidia-docker, MIG configs)
Create and maintain CI/CD pipelines (GitHub Actions, ArgoCD, etc.) to automate deployments and model rollouts
Implement scalable, fault-tolerant infrastructure for AI microservices, using Celery, Redis, and FastAPI
Monitor system performance, resource utilization (CPU/GPU), and model latency
Set up and manage persistent storage (MinIO, S3), secrets, and config maps securely
Develop monitoring and alerting systems for both infrastructure and AI pipelines
Collaborate with AI engineers to support experimentation, benchmarking, and model updates
Required Skills
Solid experience with Kubernetes, particularly in GPU scheduling and resource management
Experience deploying and tuning AI/ML workloads on GPUs (NVIDIA Docker, CUDA stack, drivers)
Comfortable managing hybrid cloud infrastructure: bare-metal servers, AWS, and GCP
Deep knowledge of Docker, Helm,
Strong scripting skills (Bash, Python) for automation and tooling
Experience with Redis, Celery, and handling message queues or background job systems
Tech Stack
Infra: Docker, Kubernetes, Helm, Terraform, GitHub Actions
Cloud: AWS (EKS, EC2, S3), GCP (GKE, Compute), Bare-Metal Servers
AI Ops: NVIDIA Docker, CUDA, Celery, Redis, FastAPI
Storage: MinIO, AWS S3, Persistent Volumes
Responsibilities
Manage and optimize Kubernetes clusters across bare-metal servers, AWS (EKS), and GCP (GKE)
Deploy and maintain GPU-enabled workloads for AI inference and training (NVIDIA drivers, nvidia-docker, MIG configs)
Create and maintain CI/CD pipelines (GitHub Actions, ArgoCD, etc.) to automate deployments and model rollouts
Implement scalable, fault-tolerant infrastructure for AI microservices, using Celery, Redis, and FastAPI
Monitor system performance, resource utilization (CPU/GPU), and model latency
Set up and manage persistent storage (MinIO, S3), secrets, and config maps securely
Develop monitoring and alerting systems for both infrastructure and AI pipelines
Collaborate with AI engineers to support experimentation, benchmarking, and model updates
Required Skills
Solid experience with Kubernetes, particularly in GPU scheduling and resource management
Experience deploying and tuning AI/ML workloads on GPUs (NVIDIA Docker, CUDA stack, drivers)
Comfortable managing hybrid cloud infrastructure: bare-metal servers, AWS, and GCP
Deep knowledge of Docker, Helm,
Strong scripting skills (Bash, Python) for automation and tooling
Experience with Redis, Celery, and handling message queues or background job systems
Tech Stack
Infra: Docker, Kubernetes, Helm, Terraform, GitHub Actions
Cloud: AWS (EKS, EC2, S3), GCP (GKE, Compute), Bare-Metal Servers
AI Ops: NVIDIA Docker, CUDA, Celery, Redis, FastAPI
Storage: MinIO, AWS S3, Persistent Volumes
How to apply
To apply for this job you need to authorize on our website. If you don't have an account yet, please register.
Post a resumeSimilar jobs
Law/Legal Internship
IndisJob,
Chandīgarh, Chandigarh
17 hours ago
Job Overview: Law/Legal Internship role at KMG Legal in S.A.S. Nagar . Job Overview:KMG Legal is seeking a Law/Legal Intern to join our team. This position will provide valuable hands-on experience in various areas of law, including litigation, corporate law, intellectual property, and more. The ideal candidate will have a strong academic background and a passion for the legal field.Key...

Practice Development Manager, Jaipur, Delhi & Chandigarh
Align Technology,
Chandīgarh, Chandigarh
1 week ago
Department: SalesLocation: APAC-IndiaDescriptionThe PDM for Restorative will be responsible for our market entry and sales of Invisalign to non-Invisalign segment in India. The individual will manage the business in order to achieve Operational Plan in line with company and country strategic objectives through detailed planning that includes but is not limited to; targeting plan, KOL development, establishing account relationships with...

Executive Housekeeper
Hyatt,
Chandīgarh, Chandigarh
3 weeks ago
You will be responsible for the efficient running of the department in line with Hyatt International's Corporate Strategies and brand standards, whilst meeting employee, guest and owner expectations. The Housekeeping Manager is responsible to manage all functions related to the cleanliness of the hotel, including guest rooms, public areas, and back-of-house non-kitchen areas, as well as the laundry and dry-cleaning...
