About the role AI · Claude
Own the data pipeline architecture and ETL infrastructure powering Metatech's AI product line and Komdigi 117/2024–regulated client engagements. You'll design schemas, optimize query performance for real-time inference, manage data quality checkpoints, and deliver dashboards that inform product decisions. Measurable outcomes: sub-200ms query latency, <0.5% null rates in production pipelines, and on-time data delivery for compliance audits.
What we're filtering for
- Production experience with Apache Spark, Airflow, or Prefect on 500GB+ datasets
- SQL optimization: query plans, indexing, materialized views; demonstrated latency wins
- Schema design and data modeling for OLAP or real-time serving (Snowflake, BigQuery, etc.)
- End-to-end pipeline ownership: ingestion, transformation, monitoring, alerting
- Experience shipping data infra in regulated environments or audit-heavy contexts