About Us
At Amini, we're reimagining AI for the Global South. By combining sovereign data, local
infrastructure, and cutting-edge AI, we're helping 4 billion people connect to the digital
economy and unlock new opportunities for growth. Join us in shaping the world's most
inclusive AI revolution.
Our core values of Collaboration, Innovation, Trust, Integrity, Humility, and Passion are at
the heart of everything we do.
Our team embodies inclusivity, diversity, agility, and dynamism. Comprising highly skilled
experts, we recognize the transformational nature of our work while remaining humble, ego
free, and non-hierarchical. We believe in fully utilizing our skills and experiences, sharing ideas,
and making a collective impact to drive change and positively influence the AI industry and
billions of lives.
About the Role
Our Senior Data Engineer will be instrumental in building and maintaining data pipelines from
multimodal ingestion to fusion and processing for some our top customers.
Responsibilities
• Develop and implement data management strategies to ensure data quality,
consistency, and accessibility.
• Design and implement data pipelines, data warehouses and ETL integrating
multiple data sources for efficient data processing and management.
• Developing and maintaining ETL workflows, integrating data from various sources,
and ensuring data integrity throughout the data pipeline.
• Data & Retrieval Infrastructure
Design pipelines for data ingestion, pre-processing, and transformation.
Implement entity extraction, linking, and ontology design for knowledge
management systems.
• Develop and maintain documentation related to data pipeline architecture,
development processes
• Produce scalable, replicable code and engineering solutions that help automate
repetitive data management tasks.
Technical skills and knowledge
• Minimum of 5 years of hands-on experience as a Data Engineer, with a proven
track record of delivering production-grade data solutions in complex, large-scale
environments. Advanced proficiency in SQL, with strong experience extracting,
transforming, and manipulating data within relational databases.
• Advanced proficiency in Python (mandatory), with expertise in data manipulation
libraries such as Pandas, NumPy, and PySpark. Experience with R is considered a
plus, but not a substitute.
• Advanced experience with workflow orchestration (preferably Apache Airflow),
including building complex DAGs, managing dependencies and SLAs, integrating
with cloud data services, and maintaining reliable production workflows.
• Proven expertise in designing and implementing robust ETL pipelines, including
large-scale data extraction, complex transformations, data cleaning and
validation, schema design, dataset integration, and performance optimization
across multiple sources and formats Proficiency with version control systems
(Git) and experience working with integrated CI/CD pipelines using tools such as
GitHub Actions, GitLab, Bitbucket, Azure DevOps, or equivalent.
• Strong ability to document data processes and workflows using collaborative
platforms such as Notion, Confluence, or similar tools. Proficiency in building and
maintaining automated data integrations using APIs, ensuring secure, scalable,
and reliable data exchange across multiple platforms.
• Strong experience with cloud-based data services, particularly in AWS (e.g., EMR,
SageMaker, Lambda, S3, Athena, Redshift), or equivalent tools in GCP or Azure.
• Familiarity with advanced data visualization and mapping tools is a plus,
particularly the ability to prepare and structure data for effective visualization and
downstream analytics.
Professional Competencies
• Strong problem-solving mindset, with the ability to quickly learn new tools and
techniques and troubleshoot independently.
• Excellent communication and interpersonal skills, with active listening and clear
information sharing.
• Proven ability to work effectively in teams, both in leadership and supporting roles.
• Flexibility and willingness to travel to client sites as needed.
If you're a collaborative, resourceful professional looking to work alongside exceptional
individuals, we invite you to apply and join us in shaping the future of AI