hero

Job Opportunities

Senior Data Engineer

Alembic

Alembic

Data Science
San Francisco, CA, USA
Posted on Mar 12, 2025

About Alembic

Alembic is a fast-growing Series A software startup focused on building cutting-edge solutions that transform how businesses harness and leverage data. We are a team of innovators, engineers, and product leaders passionate about solving complex problems with scalable, data-driven technology. At Alembic, we believe that great software is built by great people, and we are looking for a Data Engineer who thrives in a fast-paced, high-impact environment.

About the Role

As a Data Engineer at Alembic, you will be at the core of our data platform, building scalable and reliable data pipelines, optimizing storage solutions, and enabling real-time and batch analytics. You will work closely with data scientists, software engineers, and product leaders to design and implement robust data architectures.

Key Responsibilities

  • Design, develop, and maintain scalable ETL pipelines that ingest, process, and transform large volumes of structured and unstructured data.

  • Optimize data storage solutions using modern data lakehouse architectures and best practices for cost, performance, and reliability.

  • Collaborate with data scientists and engineers to integrate machine learning models and analytical workloads into production environments.

  • Ensure data integrity, quality, and security by implementing monitoring, alerting, and governance best practices.

  • Work with cloud-based data warehouses and distributed data processing frameworks.

  • Continuously evaluate and implement new technologies to improve data infrastructure and operational efficiency.

What We’re Looking For

  • 10+ years of experience in data engineering, software engineering, or a related field.

  • Strong expertise in SQL and Python for data processing.

  • Experience with modern data warehousing and lakehouse solutions (i.e. Iceberg or similar).

  • Proficiency in working with distributed systems and big data technologies (Apache Spark, Hadoop, Kafka, Flink).

  • Hands-on experience with cloud platforms (AWS, GCP, Azure) and related data services.

  • Deep understanding of data modeling, database design, and performance optimization.

  • Familiarity with CI/CD pipelines, containerization (Docker, Kubernetes), and infrastructure-as-code (Terraform, CloudFormation) for data pipelines.

  • Strong problem-solving skills, with a passion for building reliable, scalable, and maintainable data systems.

  • Excellent communication skills and the ability to collaborate in a cross-functional team.

Nice to Have

  • Experience with Graph Databases, NoSQL, or Time-Series Databases.

  • Familiarity with data privacy, governance, and compliance (GDPR, HIPAA, SOC 2).

  • Experience with machine learning pipelines and MLOps.

Why you might be excited about Alembic:

  • You want to build something that is both technologically challenging and solves a real customer need. You want a role with major upside that tackles a massive market opportunity.

  • You are a serial startup builder or want to learn more before becoming a founder yourself. Our team holds deep experience building and selling B2B marketing solutions that work.

  • You want to work where you can take a big swing at building something big while maximizing your personal growth.

Why you might not be excited:

  • If you only want to tell people

  • You prefer company practices with 100% built out process for every little detail.

  • You prefer static over dynamic. Projects, priorities, and roles will adapt to your skill set and your goals. Though we have a playbook for growth, we proudly remain an early stage startup.