Sr. Data Engineer

OBSOLETE, please refer to:

overview
We are an industry-leading startup developing AI for consumer brands. Our solutions leverage machine learning, generative AI, agent-based systems, and graph technologies to get our customers to insights in seconds and to business impact in minutes using our products.
We are looking for a Founding Senior Data Engineer to design and maintain the cloud‑native data platform that powers our analytics and ML products.
role
The Founding Senior Data Engineer owns the end‑to‑end data pipeline lifecycle—from source‑system ingestion and transformation to governance, privacy, and serving optimized data models. You will work closely with Backend Engineers, ML Engineers, and Data Scientists to deliver clean, trusted, and business‑ready data sets that accelerate product innovation.
responsibilities
Cloud Data Pipeline Design & Development:
Build scalable, cost‑effective ETL/ELT pipelines with AWS Glue, Athena, Redshift, S3, and Lambda (Python).
Integrate data from diverse source systems (REST, streaming, databases, SaaS) and standardize into business‑specific target models (dimensional, lakehouse, or graph schemas).
Leverage orchestration frameworks (Apache Airflow, AWS Step Functions) and versioned SQL/Python transformations (dbt or similar) to enable repeatable deployments.
Data Governance, Lineage & Privacy:
Implement data lineage and catalog solutions (DataHub, Amundsen, or OpenMetadata) that ensure end‑to‑end traceability.
Enforce data governance and control‑plane policies using AWS Lake Formation or equivalent, aligning with SOC 2, ISO 27001, GDPR, and CCPA requirements.
Champion data‑quality SLAs, automated testing, and monitoring (Great Expectations, Monte Carlo, or similar).
Collaboration & Enablement:
Partner with ML Engineers and Data Scientists to provision feature stores and training datasets.
Work with Backend Engineers to design efficient APIs and streaming interfaces for real‑time data access.
Document data models, publish best practices, and mentor junior engineers on modern data‑engineering standards.
all about you
Proven track record of deploying AI products into customer environments.
Strong grasp of core machine learning, generative AI, agent, and graph concepts and best practices.
Experience building a customer success team
Demonstrated self-motivation, hacking mentality, and creative problem-solving abilities.
Experience contributing thought leadership and driving innovation for AI within a customer context.
location
Hybrid role based in New York City; open to remote U.S. candidates willing to travel monthly to our NYC office.
compensation
$150K-$170K base salary + performance bonus + excellent benefits and perks + equity
equal opportunity employer
We are an equal opportunity employer and consider applicants without regard to gender, gender identity, sexual orientation, race, ethnicity, disability, veteran status, or any other characteristic protected by law. We actively encourage diversity, inclusion, and equitable hiring practices.
If you require accommodations during the hiring process, please reach out to our recruitment team at join@sciemo.ai.