Senior engineer – ai model compression research

Axelera

Pubblicato il 6 agosto

Descrizione

PbCompany Overview /b /ppAxelera is a European, high-growth Series B startup revolutionizing the AI landscape with our in-memory computing platform. We specialize in creating AI hardware and software optimized for high-performance inference, catering to cutting-edge use cases across high-end edge computing, embodied AI, and server-side AI deployments. We are looking for passionate, innovative research engineers to join our team and help drive the future of AI. /ppbRole Overview /b /ppWe are looking for an AI Research Engineer with a strong focus on model compression to join our dynamic team. This role will be responsible for developing cutting-edge compression techniques that make Generative AI models more efficient for real-time inference across a variety of environments, from high-end edge systems to large-scale server-side deployments. You will be key in ensuring that our models are optimized for memory usage, computational efficiency, and performance, while maintaining or improving model accuracy. /ppThis is an exciting opportunity to work at the intersection of advanced machine learning, in-memory computing, and high-performance AI inference on cutting-edge hardware architectures. /ppbResponsibilities : /b /ppuModel Compression : /u Design and implement advanced model compression techniques such as pruning, quantization, weight sharing, and knowledge distillation to make models more memory-efficient and computationally optimized. /ppuPerformance Tuning : /u Optimize compressed models to achieve high-throughput and low-latency inference, specifically tailored to our in-memory computing platform. /ppuCollaboration : /u Work closely with AI researchers, software engineers, and hardware engineers to integrate your model optimizations into our AI platform, ensuring that models work effectively across edge and server-side deployments. /ppuInnovation : /u Stay on top of the latest developments in the AI and model compression research space, pushing the envelope on novel techniques for reducing model size without sacrificing performance. /ppDeployment Testing : Implement best practices for model testing, deployment, and continuous improvement to ensure models scale effectively in production environments. /ppbRequirements : /b /ppuExperience : /u Proven experience (for all levels) working on model compression, including techniques like pruning, quantization, low-rank factorization, and knowledge distillation. /ppTechnical Skills : /ppExpertise in deep learning frameworks such as TensorFlow, PyTorch, or JAX. /ppExperience optimizing models for resource-constrained environments, such as edge devices or embedded systems. /ppFamiliarity with distributed systems, in-memory computing, or high-performance computing environments. /ppA strong understanding of deep learning algorithms, neural networks, and the trade-offs involved in model compression. /ppuKnowledge : /u A strong understanding of the latest advancements in AI / ML research, particularly in compression and distillation of generative models (e.g. transformers and diffusion models). /ppCollaboration Communication : Ability to work in a highly collaborative, fast-paced startup environment and communicate complex technical concepts clearly. /ppbPreferred Qualifications : /b /ppPhD or advanced degree in Computer Science, Machine Learning, AI, or related fields. /ppb5+ years of post-graduation relevant work experience. /b /ppResearch experience in model compression, efficient inference, or deploying AI models to resource-constrained devices. /ppFamiliarity with model deployment frameworks like TensorRT, ONNX, or similar. /ppA passion for solving real-world challenges with AI in dynamic, high-performance environments. /ppbLocation /b /ppThis position is based in Italy we support relocation to Bologna, Florence or Milan for talent based abroad and interested in this role. /ppbWhy Join Us? /b /ppuImpact : /u Work on groundbreaking technology that will power the next wave of AI applications, from edge computing to embodied AI systems. /ppuCulture : /u Join a diverse, driven team that values innovation, collaboration, and continuous learning. /ppuGrowth : /u As a Series B startup, you’ll have significant growth opportunities, including the chance to shape the direction of the product and AI strategy. /ppuCompensation : /u Competitive salary, equity options, and benefits package. /ppbHow to Apply? /b /ppPlease submit your resume and a brief cover letter explaining why you're excited about this opportunity, and how your experience aligns with our model compression goals. /ppAt Axelera AI, we wholeheartedly embrace equal opportunity and hold diversity in the highest regard. Our steadfast commitment is to cultivate a warm and inclusive environment that empowers and celebrates every member of our team. We welcome applicants from all backgrounds to join us in shaping the future of AI. /ppJ-18808-Ljbffr /p #J-18808-Ljbffr

Rispondere all'offerta

Crea una notifica

Salva