Ph3Infrastructure / DevOps Engineer /h3pEuropean Tech Recruit are working closely with a market leading 3D scanning company, based in Bressanone, who are looking for a talented Infrastructure / DevOps Engineer to join their team. /ppIn this role you will join a company that leverage state-of-the-art Computer Vision and Machine Learning algorithms to scan high quality, relightable 3D models of objects and products at scale. /ppYou will build and maintain the foundation of their compute infrastructure. You'll work on hardware provisioning, networking, container orchestration, and deployment pipelines across cloud and on-premise environments. This role focuses on making their multi-GPU clusters reliable, their deployments reproducible, and their developers productive. /ph3Responsibilities /h3ulliProvision, configure, and maintain heterogeneous compute clusters (CPU/GPU) across multiple physical locations. /liliImplement dynamic compute and storage provisioning based on workload demands. /liliDesign storage solutions at both hardware and software level (NAS, distributed filesystems, storage tiering). /liliImplement and manage container orchestration systems (Kubernetes, Docker) for development and production workloads. /liliDesign and maintain infrastructure as code using tools like Terraform and Ansible. /liliBuild and optimize job scheduling and resource allocation systems (Slurm, Kubernetes). /liliSet up monitoring, alerting, and observability infrastructure (Prometheus, Grafana, IPMI). /liliProfile and optimize system-level performance: GPU utilization, memory bandwidth, I/O throughput, network latency. /liliManage networking, VPNs, and secure access across distributed systems. /liliHandle reliability concerns: hardware failure detection, job checkpointing, disaster recovery. /li /ulh3Requirements /h3ulliStrong Linux system administration knowledge. /liliExperience with containerization (Docker) and orchestration (Kubernetes). /liliKnowledge of infrastructure as code (Terraform, Ansible). /liliExperience with HPC clusters and job scheduling (Slurm). /liliFamiliarity with monitoring solutions (Prometheus, Grafana). /liliUnderstanding of networking principles and implementation. /liliExperience with hardware infrastructure management (IPMI, BMC, server maintenance). /liliKnowledge of storage systems design (NFS, Ceph, distributed filesystems). /li /ulh3Desirable Experience /h3ulliExperience with cloud services (AWS, or others). /liliFamiliarity with bare-metal provisioning (MaaS). /li /ulpIf this role is of any interest please apply directly on LinkedIn or send a copy of your CV to /ppBy applying to this role you understand that we may collect your personal data and store and process it on our systems. For more information please see our Privacy Notice ( /p /p #J-18808-Ljbffr