Job Description
We are looking for an innovative AI/LLM Engineer to drive the advancement of our conversational AI capabilities. You'll join our Tech&Product team as an AI/LLM Engineer, focusing on prompt engineering, multi-agent orchestration, automated testing, and deploying integrations in production with LLM systems. You'll work closely with Backend Engineers to deliver world-class AI experiences to end users.
What will you do?
Core Development
* Design, optimize, and version prompts for production voice and chat LLM applications
* Architect and orchestrate multi-agent systems for complex conversations
* Build automated testing and validation frameworks for LLM outputs
* Implement prompt versioning, storage, and retrieval systems
System Integration & Deployment
* Collaborate with Backend Engineers to deploy and scale LLM-based systems
* Integrate LLMs with communication APIs (Twilio, WhatsApp, ElevenLabs)
* Implement RAG (Retrieval-Augmented Generation) solutions and vector search for multilingual environments
* Monitor performance metrics and conversation quality
Research & Innovation
* Research and prototype multi-agent frameworks (open-source and commercial)
* Experiment with cutting-edge conversational AI and real-time speech processing techniques
* Contribute to evolving the team's LLMOps best practices
* Continuously improve conversational quality, RAG pipelines, and reduce latency
Qualifications
Must have
* 2+ years hands-on experience with LLMs (OpenAI or similar, open-source models)
* Strong knowledge in prompt engineering and LLM optimization strategies
* Experience in evaluating LLMs, designing and running evaluation frameworks, creating test datasets, and defining success metrics
* Familiarity with automated testing pipelines, building CI/CD-integrated eval systems that run on every prompt change
* Experience in multi-agent architecture, from design to development of orchestration of complex LLM systems
* Good understanding of transformer architectures and proficiency in LLM frameworks such as LangChain, LlamaIndex, or similar frameworks
* Proficiency in Python
* Experience with RAG pipelines and vector databases
* Experience in cross-functional teams, ability to work in fast-moving environments where you own outcomes, not just tasks. You're comfortable with ambiguity and excited by the challenge of figuring things out.
Nice to have
* Experience in healthcare industries
* LLM integration with voice platforms (Twilio, ElevenLabs)
* Background in conversational AI, chatbots, voice assistants
* Knowledge of real-time speech processing and multi-modal systems
* Functional programming principles and advanced NLP
* Exposure to OOP stacks (.NET, PHP)
* Understanding of security and privacy in conversational AI
Additional Information
✨ What we offer:
We value a healthy work-life balance and long-term growth. Benefits vary by location, but here’s what you can expect:
Shared benefits
* 100% remote work, with the option to join our offices in Bologna or Barcelona
* Stock options plan after 6 months
* One extra day off for your birthday
* Access to iFeel – our mental wellbeing platform
Italy-specific
* ️ €8/day meal vouchers – lunch is covered if you're in the Bologna office
* Private health coverage via Metasalute
Spain-specific
* ❤️ Comprehensive private health insurance with Adeslas
* Flexoh – flexible compensation platform
* Wellhub – gym & wellness network membership
* Language courses
How does the recruitment process work?
1. HR interview – a friendly chat to get to know you, your motivations, and tell you more about Tuotempo, our culture, and the team.
2. Technical interview – a deep-dive with our Tech Managers, including practical discussions or small exercises focused on LLMs, multi-agent systems, prompt design, and evaluation workflows.
3. Functional interview — a conversation with our Product Managers to understand how you collaborate cross-functionally and to align on the AI product domain.