- Company Name
- Cosine
- Job Title
- Software Engineer - Cloud Infrastructure
- Job Description
-
**Job Title:** Software Engineer – Cloud Infrastructure
**Role Summary:**
Design, build, and operate the core cloud infrastructure that powers AI products, including Kubernetes clusters, deployment pipelines, networking, and secure, scalable platform services for enterprise and on‑premises deployments.
**Expactations:**
- Minimum 5 years of hands‑on experience developing or operating production‑grade infrastructure.
- Proven ability to run and scale large Kubernetes environments.
- Experience building abstractions and tooling on major cloud platforms (AWS, GCP, Azure).
- Strong emphasis on reliability, security, and compliance in regulated industries.
- Comfortable with fast‑moving priorities and on‑call incident response.
**Key Responsibilities:**
- Architect and implement development and production platforms that ensure reliability, security, and scalability.
- Scale infrastructure to support order‑of‑magnitude growth in users and data volumes.
- Develop tooling, abstractions, and automation for deployment, monitoring, and observability.
- Participate in on‑call rotation, diagnosing, triaging, and resolving critical incidents.
- Collaborate with product, research, and design teams to align infrastructure with product goals.
- Advocate for diversity, rigorous thinking, and open communication culture.
**Required Skills:**
- Kubernetes cluster design, management, and operations at scale.
- Cloud platform expertise (AWS, GCP, Azure) – compute, networking, IAM, security controls.
- Infrastructure as Code (Terraform, Pulumi, CloudFormation) and CI/CD pipelines (GitHub Actions, ArgoCD, Jenkins).
- Containerization (Docker), networking (CNI, service meshes), and observability (Prometheus, Grafana, ELK).
- Strong scripting (Python, Bash) and automation skills.
- Security best practices: encryption, secrets management, compliance frameworks (HIPAA, SOC2, ISO27001).
- Incident response, root‑cause analysis, and post‑mortem documentation.
**Required Education & Certifications:**
- Bachelor’s degree in Computer Science, Engineering, or related field (or equivalent practical experience).
- Professional cloud certifications preferred (e.g., AWS Certified Solutions Architect, GCP Professional Cloud Architect, Azure Solutions Architect).
- Kubernetes certifications (CKA/CKAD) are a plus.