Get AI-powered advice on this job and more exclusive features.
This range is provided by UNGUESS. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more.
Base pay range
Location: Remote-first
Job type: Full-time
About the job
Can you imagine a world where business and digital solutions will be truly seamless and where users will help companies to co‑create them? Do you want to help us shape this human‑centred world? Welcome to UNGUESS. UNGUESS is the crowdsourcing platform for effective testing and real insights that enable tech, digital and business leaders to make smarter decisions, faster. By unleashing the power of the crowd—a community of highly engaged people all over the world—UNGUESS brings end‑customer insights into the design, development, and testing phases of a product.
Why work at UNGUESS
At UNGUESS, you’ll have the chance to make an immediate impact in a fast‑paced and dynamic environment. We’re growing rapidly and strengthening our market position. Joining us now means stepping into an exciting challenge that won’t always be easy, but will undoubtedly be among the most rewarding and fulfilling experiences of your career. You’ll constantly learn, grow, and apply your full skill set across diverse and stimulating projects.
Your mission
As our first dedicated Data Engineer, you will be the architect of the infrastructure that makes this vision possible. You’ll own the design, implementation, and scalability of our data stack, working closely with the product and development teams. We are a rapidly growing tech company with the ambition of building an LLM‑queryable Knowledge Base by leveraging existing but currently unstructured data sources. This is the first data hire, so you will have full ownership over architecture, implementation, and scalability.
Responsibilities
* Design and implement data ingestion and normalization pipelines from heterogeneous sources (APIs, files, databases, streams).
* Build a data lake on AWS (S3, Glue, Athena) and orchestrate data flows using CDK.
* Implement RAG (Retrieval‑Augmented Generation) systems using vector databases and LLM models (Bedrock, OpenAI, LangChain).
* Model metadata and define chunking strategies for NLU‑queryable documents.
* Ensure data security, governance, monitoring, and cost optimization.
* Collaborate with the Product team to integrate the knowledge base into the existing platform.
Requirements
* Hands‑on experience with RAG systems in production, embedding models (OpenAI, Cohere, Amazon Titan), and vector databases (OpenSearch, Pinecone, pgvector).
* Strong grasp of chunking strategies, retrieval optimization (precision/recall/reranking).
* Proven expertise with AWS CDK, data services (S3, Glue, Athena, Lambda, Step Functions), and ML/AI workloads (Bedrock, SageMaker). Solid understanding of IAM, KMS, VPC for security/compliance.
* Has a builder's mindset and enjoys designing robust, scalable solutions.
Nice to have
* Hands‑on with serverless architectures and cost‑optimized scaling strategies.
* Experience in cloud‑native environments and CI/CD (AWS).
* Familiarity with monitoring and alerting (CloudWatch, X‑Ray).
Compensation and benefits
* Compensation: €45,000 to €50,000/year gross salary and competitive MBO bonus – this range is a guideline; we’re first and foremost looking for the right person, the final offer will be shaped around you and reflect your skills and experience.
* Remote work lovers.
* Fast‑track growth opportunities.
* Access to group and personal training programs.
Please note that this job advertisement is open to applicants of all genders, in accordance with Laws 903/77 and 125/91.
Referrals increase your chances of interviewing at UNGUESS by 2x.
Seniority level
Not Applicable
Employment type
Full‑time
Job function
Information Technology
Industries
IT Services and IT Consulting
#J-18808-Ljbffr