cover image
CATHEXIS

CATHEXIS

www.cathexiscorp.com

1 Job

170 Employees

About the Company

Founded in 2006, CATHEXIS is a trusted mid-tier government contractor advancing the missions of U.S. federal agencies through innovative consulting and technology solutions. By combining deep expertise with data-driven strategies, we're empowering federal agencies to achieve their objectives with precision, agility, and impact.

At CATHEXIS, we take a people-first approach in everything we do and developed a culture on collaboration, inclusivity, and continuous growth. From our work with government agencies to the communities we serve, we support initiatives that create healthier, more equitable opportunities and make a lasting, positive impact.

Listed Jobs

Company background Company brand
Company Name
CATHEXIS
Job Title
Site Reliability Engineer (req-174)
Job Description
**Job Title:** Site Reliability Engineer **Role Summary** Design, deploy, and maintain the reliability, scalability, and security of Kubernetes clusters and cloud infrastructure supporting AI‑driven solutions. Manage CI/CD pipelines, automation, monitoring, and incident response across AWS, Azure, and GCP environments. **Expectations** - Deliver 99.95% uptime and rapid incident resolution. - Continuously improve infrastructure performance and cost efficiency. - Automate provisioning, scaling, and monitoring using IaC tools. - Ensure compliance with security standards and regulatory requirements. - Collaborate closely with development, service, and operations teams. **Key Responsibilities** - Deploy, monitor, and scale applications on Kubernetes clusters; maintain Helm charts and cluster resources. - Provision and configure cloud infrastructure (AWS, Azure, GCP) with Terraform, CloudFormation or equivalent IaC. - Implement and tune monitoring, alerting, and logging for Kubernetes, CI/CD, and infrastructure components. - Lead incident response: diagnose root causes, implement fixes, and improve post‑mortem processes. - Automate infrastructure workflows with Terraform, Ansible, or similar tools. - Enforce security best practices—RBAC, encryption, vulnerability scanning, and compliance checks. - Collaborate with cross‑functional teams to integrate application development and infrastructure delivery. - Identify and remediate performance bottlenecks and reliability gaps proactively. **Required Skills** - Kubernetes cluster administration, Helm, and resource management. - Cloud platform expertise: AWS, Azure, and/or GCP. - Infrastructure as Code: Terraform, CloudFormation, or similar. - Monitoring tools: Prometheus, Grafana, ELK, or equivalent. - Programming: Python, Java, C/C++, Ruby, or JavaScript (structured/OOP). - Distributed storage knowledge: NFS, HDFS, Ceph, Amazon S3. - Experience with CI/CD systems (e.g., Jenkins, GitLab CI). - Agile/Scrum workflow experience. - Strong problem‑identification, troubleshooting, and performance tuning skills. - Security & compliance fundamentals (RBAC, encryption, vulnerability scanning). **Required Education & Certifications** - Bachelor’s degree or equivalent in Computer Science, Engineering, or related field. - Active Secret Clearance (required). - 2+ years of experience managing on‑premise and cloud environments. - Certifications in Kubernetes (CKA/CKAD) and/or cloud platforms (AWS Certified Solutions Architect, Azure Administrator, GCP Professional Cloud Architect) are a plus.
Hawaii, United states
On site
14-11-2025