Are you an AI Engineer with expertise in Large Language Models and Generative AI?
Reply is a company that specialises in Consulting, Systems Integration and Digital Services with a focus on the conception, design and implementation of solutions based on the new communication channels and digital media. Reply partners with key industrial groups in defining and developing business models made possible by the new technological and communication paradigms such as Artificial Intelligence, Big Data, Cloud Computing, Digital Communication, the Internet of Things and Mobile and Social Networking.
You will design, build, and industrialize enterprise-grade Generative AI solutions based on Large Language Models, with a strong focus on the Mistral ecosystem. You will develop end-to-end GenAI pipelines, from experimentation to production deployment across cloud and on-prem environments.
Technologies
You will work primarily with Python and Large Language Models, leveraging the Mistral AI ecosystem. You will work with cloud platforms including AWS, Azure, and GCP, and apply MLOps practices for deploying and maintaining production systems. You may also use tools and frameworks for distributed systems, containerization, and API development, including Java and Spring Boot in some contexts.
You will join a cross-functional team of AI engineers, software engineers, and cloud specialists working on cutting-edge Generative AI solutions for enterprise clients. You will collaborate closely with architects, data scientists, and business stakeholders to translate requirements into scalable AI systems. You will work in an environment that values experimentation, engineering excellence, and rapid iteration, with a strong focus on delivering production-ready and secure AI solutions.
Bachelor's or Master's Degree in Informatics, Computer Engineering, Telecommunication Engineering, Electronics, Automation, or Robotics Engineering.
Valuable expertise
You have at least 2 years of experience in backend development and system integration, with exposure to scalable digital solutions. You have experience with Python and Large Language Models, going beyond simple usage into real implementation and production scenarios. You are comfortable working with cloud platforms such as AWS, Azure, or GCP and understand distributed and cloud-native architectures. Knowledge of Java and Spring Boot is appreciated.
Exposure to MLOps tools and practices (e.g. model monitoring, CI/CD for ML, orchestration frameworks). Familiarity with vector databases, retrieval-augmented generation (RAG), and agent-based systems.
Reply is committed to embracing diversity and creating an inclusive work environment by valuing the uniqueness of people regardless of age, gender, sexual orientation, religion, nationality, or disabilities as protected by Italian Law (L. […]). Furthermore, Reply is committed to ensuring a fair and accessible selection process: to help you during the recruitment process, please let us know of any kind of support you may need.