Skip to main content
jobs.devopsengineer.title
Kubernetes • Cloud Infrastructure
Remote
Permanent
Infrastructure

About the Role

Build and maintain the infrastructure that powers Swift Compute's global GPU cloud. You'll work with Kubernetes, cloud platforms, and cutting-edge GPU hardware to ensure 99.9% uptime for our customers' critical AI workloads.

What You'll Do

  • Manage Kubernetes clusters across multiple cloud providers
  • Build and maintain CI/CD pipelines for rapid deployment
  • Implement comprehensive monitoring and alerting systems
  • Automate infrastructure provisioning with Terraform
  • Optimize network performance and security configurations
  • Develop disaster recovery and backup strategies

What We're Looking For

Required:

  • 3+ years of DevOps or SRE experience
  • Expert knowledge of Kubernetes and Docker
  • Experience with cloud platforms (AWS, GCP, Azure)
  • Proficiency with Infrastructure as Code (Terraform)
  • Strong scripting skills (Python, Bash, Go)
  • Experience with monitoring tools (Prometheus, Grafana)

Bonus:

  • Experience with GPU workloads and NVIDIA drivers
  • Knowledge of service mesh technologies (Istio, Linkerd)
  • Experience with GitOps workflows (ArgoCD, Flux)
  • Background in high-performance computing
  • Certifications in cloud platforms
  • Experience with security compliance frameworks

What We Offer

  • Competitive salary and equity compensation
  • Premium health benefits and wellness programs
  • Flexible remote work with home office stipend
  • Professional development and certification budget
  • Access to latest DevOps tools and platforms
  • Opportunity to work with cutting-edge GPU infrastructure