DevOps Engineer – AI & HPC Infrastructure

We're looking for a DevOps Engineer to help build, scale, and operate cutting-edge AI and High-Performance Computing (HPC) infrastructure.

This role is ideal for someone who thrives in production environments, enjoys owning infrastructure end-to-end, and wants to work at the intersection of AI, cloud, and high-performance systems.

What You'll Do

- Operate and support production-grade HPC environments
- Design, own, and continuously improve CI/CD pipelines (including GitHub Actions)
- Build automation and operational tooling using Python and Bash
- Manage and optimize Linux systems (administration, performance tuning, troubleshooting)
- Architect and maintain AWS-based multi-environment infrastructure
- Manage containerized and distributed systems
- Establish and drive operational best practices
- Partner with engineering and AI/ML teams to deliver reliable, scalable infrastructure

What We're Looking For

- Extensive hands-on experience in DevOps or Platform Engineering
- Proven track record supporting HPC environments in production
- Deep expertise in Linux systems
- Strong proficiency in Python and Bash
- Experience designing and maintaining CI/CD pipelines
- Strong cloud experience, particularly AWS
- Experience managing containerized and distributed systems
- Strong problem-solving skills with a production-first mindset
- Ability to take full ownership of infrastructure and drive improvements

Nice to Have

- Experience managing GPU-based systems (NVIDIA drivers, CUDA)
- Strong MLOps knowledge (model lifecycle, deployment, monitoring)
- Experience bridging AI/ML teams and production infrastructure
- Hands-on experience with Infrastructure as Code and system automation