- Company Name
- VeeRteq Solutions LLC
- Job Title
- Principal Platform Engineer & Team Lead
- Job Description
-
**Job Title**
Principal Platform Engineer & Team Lead
**Role Summary**
Hands‑on technical leader responsible for building and operating a full‑stack infrastructure platform from the ground up. Owns infrastructure architecture, IaC implementation, Kubernetes and bare‑metal environments, networking, security, and automation to enable self‑service provisioning. Partners with software engineering leadership while mentoring a team of infrastructure engineers.
**Expectations**
- 10+ years of infrastructure/platform engineering experience, with at least 4 years in senior, staff, or principal roles.
- Proven track record delivering production‑grade infrastructure end‑to‑end.
- Ability to set technical standards, drive adoption of best practices, and mentor engineers.
- Strong problem‑solving skills across Linux, networking, virtualization, and cloud environments.
**Key Responsibilities**
- Design and own the platform’s core infrastructure (compute, storage, networking, IAM).
- Lead IaC development using Terraform/OpenTofu, including module design, state management, and automation patterns.
- Define golden‑path workflows for provisioning, scaling, lifecycle management, and decommissioning.
- Architect Kubernetes clusters, CNI, storage classes, RBAC, and bare‑metal provisioning (PXE, hardware lifecycle).
- Design VPCs, security groups, load balancers, DNS, and multi‑region networking.
- Establish and enforce platform engineering standards (IaC, DevOps, security hardening, operational procedures).
- Build observability pipelines (metrics, logs, alerts, dashboards).
- Mentor and grow the infrastructure engineering team, providing technical direction and reviews.
- Evaluate emerging technologies, identify gaps, and drive continuous improvement of platform capabilities.
**Required Skills**
- Expert Terraform/OpenTofu proficiency (module design, state handling).
- Deep Kubernetes expertise (cluster ops, CNI, storage, RBAC, troubleshooting).
- Virtualization experience (hypervisor configuration, VM lifecycle).
- Bare‑metal provisioning experience (PXE boot, hardware automation).
- Advanced Linux system knowledge (kernel, filesystems, process management).
- Strong networking skills (VPC design, security groups, load balancing, DNS, firewalls).
- Hands‑on experience with AWS, GCP, and/or Azure core services.
- Configuration management (Ansible, Chef, or Puppet) and automation patterns.
- Infrastructure observability (metrics, logging, alerting, dashboards).
- Ability to mentor engineers and define infrastructure standards.
**Required Education & Certifications**
- Bachelor’s degree in Computer Science, Computer Engineering, Information Technology, or related field (or equivalent experience).
- Cloud certifications (e.g., AWS Certified Solutions Architect, Google Cloud Professional Architect, Azure Solutions Architect) and/or security certifications are preferred but not mandatory.