jobs.devopsengineer.title
Kubernetes • Cloud Infrastructure
Remote
Permanent
Infrastructure
About the Role
Build and maintain the infrastructure that powers Swift Compute's global GPU cloud. You'll work with Kubernetes, cloud platforms, and cutting-edge GPU hardware to ensure 99.9% uptime for our customers' critical AI workloads.
What You'll Do
- • Manage Kubernetes clusters across multiple cloud providers
- • Build and maintain CI/CD pipelines for rapid deployment
- • Implement comprehensive monitoring and alerting systems
- • Automate infrastructure provisioning with Terraform
- • Optimize network performance and security configurations
- • Develop disaster recovery and backup strategies
What We're Looking For
Required:
- • 3+ years of DevOps or SRE experience
- • Expert knowledge of Kubernetes and Docker
- • Experience with cloud platforms (AWS, GCP, Azure)
- • Proficiency with Infrastructure as Code (Terraform)
- • Strong scripting skills (Python, Bash, Go)
- • Experience with monitoring tools (Prometheus, Grafana)
Bonus:
- • Experience with GPU workloads and NVIDIA drivers
- • Knowledge of service mesh technologies (Istio, Linkerd)
- • Experience with GitOps workflows (ArgoCD, Flux)
- • Background in high-performance computing
- • Certifications in cloud platforms
- • Experience with security compliance frameworks
What We Offer
- • Competitive salary and equity compensation
- • Premium health benefits and wellness programs
- • Flexible remote work with home office stipend
- • Professional development and certification budget
- • Access to latest DevOps tools and platforms
- • Opportunity to work with cutting-edge GPU infrastructure