About GoDiverse
At GoDiverse, we're on a mission to reshape UK public procurement for the better. We are an early-stage tech company building an AI-powered platform to help public bodies discover and engage with Small and Medium-sized Enterprises (SMEs), driven by the new Procurement Act 2023. Our goal is to level the playing field for the 99.9% of UK businesses that are SMEs, ensuring they get fair access to the annual public spend.
The Role
A Senior Data Engineer who also leads MVP delivery (60% data engineering, 40% execution), owning pipelines, schemas, and reliable data flows that power SME discovery and tender NLP for a pilot-ready MVP.
What You'll Do:
- Build data pipelines: Ingest and transform public procurement data (e.g., Find a Tender) with versioning, lineage, monitoring, and recoverability for daily refreshes.
- Design schemas: Model tenders, suppliers, CPV/NACE/SIC, features, and model outputs; optimize storage and indexing for performance and cost.
- Ship feature layer: Create reusable features for supplier matching and barrier-language detection; document for reproducible ML iteration.
- Enforce data quality: Implement validation, deduplication, entity resolution, audit logs, and access controls suitable for public-sector contexts.
- Serve data and APIs: Provide clear data contracts and performant endpoints to AI and UI; ensure reliable dependencies for inference.
- Lead MVP cadence: Own data roadmap, align demo milestones, run lightweight sprints, and surface risks/trade-offs early to founders.
What We're Looking For:
- A 4+ years data engineering in 0→1 settings; strong Python/SQL, orchestration (Airflow/Prefect/Dagster), containers/CI, warehouse/lake patterns.
- Practical data modeling for analytics/ML; familiarity with CPV/NACE/SIC preferred.
- Proven data quality, lineage, and governance practices with clear documentation and reproducibility.
- Nice to have: procurement/GovTech exposure, MLOps basics (feature stores, experiment tracking), Azure/AWS/Supabase.
Why Join GoDiverse?
- Build and lead: Own the data backbone and delivery rhythm from raw data to pilot-ready demos for an AI SME discovery product.
- Meaningful impact: Advance SME inclusion aligned with the Procurement Act 2023 through data-driven procurement tooling.
- Foundational role: Early influence on product and culture, with scope for leadership and potential equity as we scale.