- Company Name
- Maarut Inc
- Job Title
- Site Reliability Engineer 5+ years of experience
- Job Description
-
**Job Title**
Site Reliability Engineer
**Role Summary**
Senior SRE responsible for designing, deploying, and operating scalable API infrastructure on Google Cloud Platform. Leads production systems that integrate Apigee Hybrid, Kubernetes, and networking services to ensure high availability, security, and performance.
**Expectations**
- Scaffold and maintain mission‑critical API services in a multi‑team environment.
- Drive continuous improvement of reliability, observability, and automation.
- Mentor and influence junior engineers across cross‑functional teams.
**Key Responsibilities**
- Operate and evolve Apigee Hybrid at scale, including lifecycle management, scaling, and performance tuning.
- Design, deploy, and maintain GKE clusters and Kubernetes workloads, ensuring compliance with security, networking, and scalability best practices.
- Build and maintain CI/CD pipelines (GitOps, Cloud Build, Terraform) for infrastructure and application updates.
- Implement API security controls: OAuth, JWT, mTLS, quotas, rate limiting, and governance frameworks.
- Design enterprise‑grade networking: DNS, firewalls, VPC, load balancers, routing and mTLS across cloud services.
- Analyze and optimize system performance, capacity, and cost.
- Participate in incident response, root‑cause analysis, and post‑mortem documentation.
- Evaluate and recommend enhancements, including migration to Apigee X, service mesh (Istio/Anthos), and alternative API gateways.
**Required Skills**
- 5+ years SRE or equivalent experience in production environments.
- Expert knowledge of Apigee Hybrid, GCP services, GKE, and Kubernetes (networks, workloads, security, scaling).
- Deep understanding of enterprise networking: DNS, firewalls, VPC, load balancers, routing, mTLS.
- Proven experience with CI/CD, GitOps, Terraform, Cloud Build, or similar tooling.
- Strong grasp of API security technologies (OAuth, JWT, mTLS), quotas, rate limiting, and governance.
- Excellent communication, leadership, and cross‑team collaboration skills.
**Nice to Have**
- Apigee X experience or migration project knowledge.
- Service mesh exposure (Istio or Anthos).
- Benchmarking knowledge of alternative API gateways.
**Required Education & Certifications**
- Bachelor’s degree in Computer Science, Engineering, or related field (or equivalent professional experience).
- Relevant GCP certifications (e.g., Professional Cloud Architect, Professional Cloud DevOps Engineer) are highly desirable.