Posted May 25, 2026
Data Scientist
San Francisco, CA
After quickly ramping up on our clinical data domain, your goal is to be the person who owns clinical intelligence within Metriport's stack — turning massive volumes of clinical records into predictions, insights, and automated decisions. Day to day, this looks like:
- Applying AI/ML to Clinical Data: Building and deploying models to predict patient outcomes, identify gaps in care, or surface anomalies across our data warehouse.
- Normalizing Clinical Data at Scale: Using NLP, LLMs, or rule-based systems to transform messy, unstructured clinical records into structured, searchable, trustworthy data.
- Owning Analytics When It Matters: You'll share ownership of our analytics stack and data quality alongside the team. When a customer needs an accurate report or the team needs a reliable metric, you're just as accountable as anyone.
- Productizing Intelligence: Designing and shipping data-science-powered features as core parts of the Metriport platform: not just internal experiments, but things customers use.
- Building the Data Foundation: Contributing to data modeling, warehouse design, and tooling (dbt, DWHs, PostHog, etc.) as we scale our data infrastructure. Science without solid foundations is noise.
- Team alignment: Participating in a daily 30-minute remote standup at 7:30 AM PST Mon–Fri (our only regular mandatory meeting).

###
- 4+ years of experience in a data science role, ideally at a high-growth or startup company where you wore multiple hats.
- SQL mastery: You can write complex, performant queries in your sleep.
- ML/statistical modeling: Practical experience building and deploying models (classification, regression, clustering, NLP), not just prototyping.
- Coding proficiency: Strong in Python (pandas, scikit-learn, and ideally some experience with LLM APIs or frameworks). TypeScript proficiency is a plus, since our stack is TypeScript-heavy.
- Analytical chops: You're comfortable owning dashboards, data quality, and ad-hoc analysis. You see this as part of the job, not beneath it.
- Location: San Francisco / Bay Area (or willing to relocate).

### Nice to Have
- Experience with healthcare data (FHIR, HL7, or other clinical data) is strongly preferred. Understanding how a patient moves through the healthcare system is the core of what we do.
- Experience with data modeling tools (dbt or similar) and product analytics platforms (PostHog, Mixpanel, Amplitude).
- Experience integrating models into backend services or APIs (not just notebooks).

### Benefits
Competitive equity + compensation package 🚀
Full family Platinum health insurance, dental, and vision coverage 🦷
401(k) retirement plan + matching 💰
Flexible work from home or in-office 🏢
Healthy lunches are complimentary when working in-office (and breakfast + dinners as needed) 🍏
Quarterly company off-sites with the team ⛷️
MacBook provided by us 💻
Unlimited PTO (we work hard, but trust you to take the time you need to be at your best) 🧘‍♂️
Our data lives in PostgreSQL, DynamoDB, S3, Snowflake, and a FHIR server. We use dbt for transformations and PostHog for product analytics. Our infrastructure is managed via AWS CDK, and our core platform is written in TypeScript and Python. We are looking for a generalist who can jump into any part of this stack to extract value.

Metriport provides equal employment opportunities (EEO) to all employees and applicants for employment without regard to race, color, religion, sex, national origin, age, disability, genetics, sexual orientation, gender identity, or gender expression. We are committed to a diverse and inclusive workforce and welcome people from all backgrounds, experiences, perspectives, and abilities.