Data Engineer
Data Engineer
About the role
About The Meridiem
The Meridiem is a fast-growing technology media platform on a mission to deliver timely, high-quality journalism covering AI, startups, venture capital, cybersecurity, and enterprise tech. Our platform — built on Next.js 16 and deployed globally on Cloudflare Workers — serves hundreds of thousands of technology professionals who depend on us for authoritative morning briefings and in-depth evening analysis.
Role Overview
We are looking for a Data Engineer to build and scale the data infrastructure that powers content analytics, reader behavior tracking, editorial insights, and business intelligence at The Meridiem. You will design robust data pipelines, manage our data warehouse, and create the foundational systems that enable the entire organization — editorial, product, and growth — to make data-informed decisions. This is a foundational hire that will shape how a media company uses data at every level.
Key Responsibilities
- Design, build, and maintain scalable data pipelines (ETL/ELT) that ingest data from our Next.js application, Strapi CMS, Cloudflare analytics, newsletter platform, and third-party sources.
- Architect and manage the data warehouse, selecting and optimizing the right storage layer for content analytics, reader behavior, and business metrics.
- Implement real-time and batch event tracking infrastructure to capture reader interactions — page views, scroll depth, read time, clicks, shares, and engagement signals.
- Build reliable data ingestion from Cloudflare Workers logs, R2 storage metrics, and edge performance data to support platform observability.
- Create and maintain data models that connect editorial content metadata from Strapi with reader behavior and business outcomes.
- Develop data quality monitoring, alerting, and validation frameworks to ensure pipeline reliability and data accuracy.
- Optimize query performance and storage costs as data volumes scale with growing readership.
- Collaborate with the analytics and ML teams to ensure clean, well-documented, and accessible data for downstream analysis and modeling.
- Implement privacy-compliant data collection and processing in line with GDPR, CCPA, and emerging regulations.
- Build and maintain infrastructure-as-code for all data systems, ensuring reproducibility and disaster recovery.
- Document data architecture, schemas, and pipeline logic so the team can onboard quickly and operate independently.
Requirements
- 4+ years of professional data engineering experience building production data pipelines and warehouse infrastructure.
- Strong proficiency in Python and SQL, with experience writing performant, maintainable transformation code.
- Hands-on experience with modern data stack tools — cloud data warehouses (BigQuery, Snowflake, or Redshift), orchestration (Airflow, Dagster, or Prefect), and transformation (dbt or similar).
- Experience designing event tracking systems and working with clickstream or behavioral data at scale.
- Solid understanding of data modeling patterns — star schemas, slowly changing dimensions, and event-sourced models.
- Familiarity with streaming or real-time data processing (Kafka, Kinesis, or Pub/Sub) for near-real-time analytics.
- Knowledge of cloud infrastructure (AWS, GCP, or Cloudflare) and infrastructure-as-code practices.
- Strong understanding of data governance, privacy requirements, and secure data handling practices.
- Ability to work autonomously in a fast-paced startup environment and communicate technical decisions clearly to non-technical stakeholders.
Nice-to-Have
- Experience with Cloudflare Workers, Workers Analytics Engine, or R2 storage.
- Background in media, publishing, or content platform data infrastructure.
- Familiarity with Strapi CMS data models and REST/GraphQL API integration.
- Experience with Beehiiv or newsletter platform analytics data.
- Knowledge of Next.js application telemetry and web performance metrics.
- Experience implementing consent management and privacy-preserving analytics.
- Contributions to open-source data tools or community projects.
What We Offer
- Competitive salary ($130,000 - $165,000) with meaningful equity in a Series A startup.
- Fully remote work environment with flexible hours and async-first communication.
- Annual learning and development budget of $3,000 for conferences, certifications, and courses.
- Premium health, dental, and vision insurance coverage.
- Generous PTO policy with company-wide recharge weeks.
- The opportunity to build data infrastructure from the ground up at a company where data directly shapes editorial and product strategy.
- Latest hardware and any tools or cloud resources you need to do your best work.
How to Apply
Send your resume, a brief description of a data pipeline or infrastructure project you are proud of, and a note on what interests you about data engineering for a media platform to support@themeridiem.com with the subject line "Data Engineer Application." We review applications on a rolling basis and aim to respond within one week.
