Duolingo Data Scientist at a Glance
Interview Rounds
6 rounds
Difficulty
Duolingo's revenue grew roughly 41% year-over-year to around $748M in 2024, yet the data science org stays lean. Each DS ends up owning a surface area that would be carved across multiple people at a comparably sized tech company, which is exactly why the interview process skews so heavily toward product ownership and causal inference rather than pure modeling chops.
Duolingo Data Scientist Role
Primary Focus
Skill Profile
Math & Stats
Expert · Deep understanding and application of advanced statistical methods, including causal inference, structural modeling, behavioral modeling, and experiment design (A/B testing). An advanced degree in a quantitative field such as Data Science, Economics, or Statistics is required, with a focus on scientific rigor.
Software Eng
High · Strong programming skills in Python and SQL for data manipulation, analysis, and building robust, production-ready data solutions and algorithms. Comfort working across the data stack.
Data & SQL
High · Experience collaborating with data engineering and business intelligence teams to create and maintain robust data pipelines, dashboards, and algorithms. Experience setting up metrics and data pipelines from scratch is preferred.
Machine Learning
High · Strong practical experience applying machine learning techniques to deeply understand learner engagement and build impactful data products. Expertise in applied ML is required.
Applied AI
Low · Not explicitly mentioned as a core requirement for this role; the focus is on traditional machine learning and advanced statistical methods rather than modern generative AI.
Infra & Cloud
Low · Familiarity with 'big data' tooling like Redshift, BigQuery, and Hive is preferred, indicating experience with cloud-based data warehouses but not direct responsibility for cloud infrastructure deployment or management.
Business
Expert · Exceptional ability to partner with product and learning leads to identify and prioritize impactful questions, translate data insights into smarter product decisions, and apply data science to complex, real-world product problems in a fast-paced environment. This is a high-stakes, high-visibility role.
Viz & Comms
High · Exceptional clarity of thinking and communication, with the ability to make complex ideas simple and actionable for both technical and non-technical stakeholders. Experience building metrics and attribution models and collaborating on dashboards.
What You Need
- Data Science
- Causal Inference
- Behavioral Modeling
- Experiment Design (A/B testing)
- Applied Machine Learning
- Metrics Development
- Attribution Modeling
- End-to-end Data Solution Development
- Tech Leadership
- Management (building and nurturing talent)
- Scientific Rigor
- Product Data Science (6+ years experience)
- Strategic Problem Solving
- Cross-functional Collaboration
Nice to Have
- Setting up metrics, pipelines, and tests from scratch (0->2 environments)
- Passion for learning, teaching, and making knowledge joyful
- Strong Duolingo user engagement (a long streak or high XP)
Languages
Tools & Technologies
You're embedded inside a product squad (monetization, growth, engagement, or new subjects like music and math) rather than a centralized analytics function. Your loop runs from proposing an analysis, through designing the experiment and writing the queries in BigQuery, to producing a written memo with a ship-or-kill recommendation for product leadership. Success after year one means you've driven multiple experiments from hypothesis to launch decision and created at least one metric definition the team now tracks weekly.
A Typical Week
A Week in the Life of a Duolingo Data Scientist
Typical L5 workweek · Duolingo
Weekly time split
Culture notes
- Duolingo runs at a fast but disciplined pace — the 'Test It First' culture means you're always either designing, running, or reading out experiments, and the expectation is rigorous analysis over gut feel, but hours are generally reasonable with most people offline by 6 PM.
- The company operates on a hybrid schedule requiring three days per week in the Pittsburgh HQ, and the data science team is deeply embedded within product squads rather than siloed in a central org.
The split that catches most candidates off guard is how much of the week goes to written documents. Duolingo's "Test It First" culture means every experiment gets a formal write-up following an internal template: hypothesis, methodology, caveats, and a concrete recommendation. If you don't enjoy writing clearly for non-technical readers, this role will wear you down fast.
Projects & Impact Areas
Subscription monetization carries the highest stakes: you'll model Super Duolingo conversion funnels, run pricing experiments, and wrestle with revenue attribution when users sit in multiple overlapping A/B tests. New subject expansion (music, math) is a different beast entirely, because you're defining success metrics from zero for learning modalities that behave nothing like language courses. Connecting those two worlds is the causal inference work of disentangling interaction effects across simultaneous experiments, a recurring challenge that touches nearly every product area.
Skills & What's Expected
Expert-level statistics and business acumen are the two dimensions that actually gate your candidacy, but don't underestimate the applied ML expectation. The role requires strong practical ML experience for behavioral modeling and engagement prediction, even though you won't be deploying production model services. What's genuinely underrated in prep: SQL and software engineering quality. DS here are expected to write production-grade queries, contribute to data pipelines (including building metrics and pipelines from scratch in new product areas), and review teammates' code. What you can safely deprioritize: GenAI and modern AI architectures, which aren't part of this role's core requirements.
Levels & Career Growth
Duolingo posts new-grad DS and staff DS as distinct roles with very different scope expectations. The jump from mid-level to senior hinges on owning experiment design end-to-end without your manager reviewing methodology, while the jump to staff means owning the entire analytical narrative for a product area (the staff financial forecasting role in monetization, for instance, shapes investor-facing metrics directly). What blocks promotion most often, from what employees report, is breadth of influence: technical brilliance alone won't get you there if you're not proactively shaping the product roadmap with your analyses.
Work Culture
The hybrid schedule requires time in the Pittsburgh HQ each week, and the culture notes from the team describe most people being offline by 6 PM, so the pace is disciplined rather than grueling. Duolingo's operating principles ("be direct," "ship it") play out concretely in things like the experiment review council, where DS present launch decisions and are expected to flag flawed designs in the room, not days later over Slack. One honest tradeoff: the team is small enough that context-switching between experiment readouts, ad-hoc queries, and pipeline fixes is constant, and long uninterrupted blocks for deep analysis are rare.
Duolingo Data Scientist Compensation
Duolingo's RSUs vest over four years with a one-year cliff, so your equity component carries real risk if you're unsure about staying. Because DUOL is a post-IPO stock with a relatively small float, the market value of your grant at vesting can differ meaningfully from the number on your offer letter. Factor that uncertainty into how you weight equity versus guaranteed cash when evaluating the package.
Base salary and the initial RSU grant size are where candidates report the most flexibility. If you're comparing against offers from larger tech companies, remember that Duolingo's Pittsburgh headquarters means your dollar goes further on housing, taxes, and daily expenses. Don't just match raw numbers; instead, quantify that purchasing power gap and use it to push for a stronger equity grant, which is the component most likely to compound if Duolingo's subscriber growth continues its trajectory.
Duolingo Data Scientist Interview Process
6 rounds · ~5 weeks end to end
Initial Screen
1 round · Recruiter Screen
This initial conversation with a recruiter will cover your background, experience, and career aspirations. You'll discuss your interest in Duolingo, the Data Scientist role, and your general qualifications to ensure alignment with the company's needs. Expect to briefly touch upon your technical skills and availability.
Tips for this round
- Research Duolingo's mission, products, and recent news to demonstrate genuine interest.
- Be prepared to articulate your experience with data analysis, experimentation, and product impact concisely.
- Have a clear understanding of your salary expectations and visa sponsorship needs, if any.
- Prepare a few questions to ask the recruiter about the role, team, or company culture.
- Highlight any experience working with consumer digital products or educational technology.
Technical Assessment
2 rounds · Coding & Algorithms
You'll face a live coding challenge focused on SQL and potentially a basic data structures or algorithm problem using Python or R. This round assesses your ability to manipulate data efficiently, solve problems programmatically, and write clean, functional code. The interviewer will evaluate your problem-solving approach and coding proficiency.
Tips for this round
- Practice advanced SQL queries, including joins, aggregations, window functions, and subqueries.
- Brush up on fundamental data structures (arrays, lists, dictionaries) and common algorithms (sorting, searching).
- Be prepared to explain your thought process out loud as you code, discussing trade-offs and edge cases.
- Ensure you are comfortable coding in either Python or R, as these are the primary languages for data science at Duolingo.
- Consider problems involving data cleaning, transformation, and feature engineering using code.
Machine Learning & Modeling
This round delves into your theoretical and practical understanding of machine learning, statistics, and experimental design. You'll discuss various ML algorithms, their applications, and how to evaluate their performance. Expect questions on A/B testing methodologies, statistical significance, and causal inference techniques relevant to product decisions.
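Experiment-sizing questions come up often in this round. A minimal normal-approximation sample-size calculation for a two-proportion A/B test can be sketched as follows (the function name and all numbers are illustrative, not Duolingo's actual tooling):

```python
from statistics import NormalDist

def sample_size_per_arm(p0, mde, alpha=0.05, power=0.80):
    """Normal-approximation sample size per arm for a two-proportion A/B test.
    p0 = baseline conversion rate, mde = absolute lift to detect."""
    z = NormalDist()
    z_alpha = z.inv_cdf(1 - alpha / 2)   # two-sided significance threshold
    z_beta = z.inv_cdf(power)            # desired power
    p1 = p0 + mde
    p_bar = (p0 + p1) / 2
    numerator = (z_alpha * (2 * p_bar * (1 - p_bar)) ** 0.5
                 + z_beta * (p0 * (1 - p0) + p1 * (1 - p1)) ** 0.5) ** 2
    return int(numerator / mde ** 2) + 1

# Example: detecting a 0.5pp absolute lift on a 4% trial-to-paid baseline
n = sample_size_per_arm(0.04, 0.005)
```

Detecting a half-point lift on a 4% baseline needs on the order of 25,000 users per arm, which is exactly why variance-reduction techniques come up in these interviews for smaller product surfaces.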
Onsite
3 rounds · Product Sense & Metrics
You'll be given a business problem related to Duolingo's product and asked to analyze it from a data perspective. This round assesses your ability to define key metrics, propose experiments to test hypotheses, and interpret results to drive product improvements. Expect to demonstrate a strong understanding of user behavior and how data can inform product strategy.
Tips for this round
- Think critically about Duolingo's product features, user engagement, and monetization strategies.
- Practice defining success metrics for new features or product changes, considering both short-term and long-term impacts.
- Be prepared to design an A/B test from scratch, including hypothesis formulation, experiment setup, and interpretation of results.
- Work through guesstimate problems to demonstrate your ability to break down complex problems and make reasonable assumptions.
- Showcase your ability to connect data insights directly to product decisions and user experience.
Case Study
This round often involves a more comprehensive data science case study, potentially combining elements of modeling, analysis, and product thinking. You might be asked to walk through a past project or solve a new, multi-faceted problem, demonstrating your end-to-end data science workflow. The interviewer will be looking for your ability to structure a problem, choose appropriate methods, and communicate your findings effectively.
Behavioral
This interview focuses on your past experiences, how you've handled challenges, collaborated with teams, and your motivations. Interviewers will assess your communication skills, problem-solving approach, and cultural fit within Duolingo's collaborative and mission-driven environment. You'll be asked about your strengths, weaknesses, and how you learn from mistakes.
Tips to Stand Out
- Master SQL and Python/R. Duolingo heavily relies on these for data analysis and modeling. Practice complex queries, data manipulation, and scripting for data cleaning and feature engineering.
- Deeply understand A/B Testing and Causal Inference. As a product-driven company, Duolingo uses experimentation extensively. Be prepared to design, analyze, and interpret A/B tests, and discuss causal inference methods.
- Develop strong Product Sense. Connect your data insights directly to user experience and business outcomes. Think about how your analysis can inform product decisions and improve Duolingo's learning platform.
- Practice explaining complex concepts clearly. You'll need to communicate technical findings to both technical and non-technical audiences. Focus on clarity, conciseness, and impact.
- Showcase your collaborative spirit. Duolingo values engineers and data scientists who can work cross-functionally. Prepare examples of successful teamwork and how you contribute to a positive team environment.
- Be prepared for a hybrid interview approach. While you interview for a specific role, Duolingo considers candidates for other open roles if the initial target is filled. Demonstrate adaptability and broad technical skills.
- Research Duolingo's product and mission. Understand how their app works, their user base, and their educational goals. This will help you tailor your answers and show genuine interest.
Common Reasons Candidates Don't Pass
- ✗Weak SQL or coding skills. Many candidates struggle with the depth of SQL required or fail to write efficient, bug-free code under pressure, which is a fundamental requirement.
- ✗Lack of strong product intuition. Failing to connect data analysis to real-world product implications or struggling to define meaningful metrics for product features is a common pitfall.
- ✗Inadequate understanding of A/B testing. Not being able to design a robust experiment, identify potential biases, or correctly interpret results for product decisions can lead to rejection.
- ✗Poor communication of technical concepts. Candidates who cannot clearly articulate their thought process, assumptions, or the implications of their analysis often struggle.
- ✗Limited experience with relevant ML techniques. While Duolingo values a broad understanding, a lack of depth in applying ML to user behavior or educational data can be a concern.
- ✗Inability to handle ambiguity in case studies. Struggling to structure an approach, ask clarifying questions, or make reasonable assumptions when faced with an open-ended problem.
Offer & Negotiation
Duolingo's compensation packages typically include a competitive base salary, annual performance bonus, and Restricted Stock Units (RSUs) that vest over a four-year period, often with a one-year cliff. Base salary and RSU grants are generally the most negotiable components. Candidates should research market rates for Data Scientists at similar-stage tech companies and be prepared to articulate their value based on experience and unique skills. Consider total compensation, including the long-term value of equity, when evaluating an offer. It's advisable to have competing offers if possible to strengthen your negotiation position.
The whole loop takes about five weeks from recruiter call to offer. Six rounds is a lot for a company with fewer than 1,000 employees, and the back half (Product Sense, Case Study, Behavioral) all land during the onsite, so that final week is dense. Block real prep time for each round separately, because they test genuinely different muscles.
The most common rejection pattern is weak coding combined with poor product intuition. The Coding & Algorithms round covers SQL but can also include basic data structures or algorithm problems in Python or R, so candidates who only drill queries sometimes get caught flat-footed. The other trap is the Case Study: from what candidates report, interviewers pay close attention to how clearly you structure and communicate your analysis, not just whether your methodology is correct.
Duolingo Data Scientist Interview Questions
Product Sense & Metrics (Monetization/Subscriptions)
Expect questions that force you to define success for subscription and ads monetization, choose guardrails, and reason about user value vs revenue tradeoffs. Candidates often stumble by proposing metrics that aren’t actionably tied to product levers or that break under experimentation.
Duolingo launches a 7-day Super free trial in the app store and you see trial starts up but day-30 revenue is flat. What primary success metric, 2 diagnostic metrics, and 2 guardrails do you set, and how do you attribute impact across trial start, conversion, renewal, and retention without double counting?
Sample Answer
Most candidates default to trial start rate or day-7 conversion, but that fails here because free trials shift timing and can cannibalize full-price starts while inflating the top of the funnel. Your primary metric should be incremental 30-day net revenue per eligible user (or per exposed user), decomposed into start rate, $P(\text{convert})$, renewal hazard, and post-trial retention, with attribution done via a revenue waterfall so each stage explains a unique slice. Diagnostic metrics include trial-to-paid conversion and first renewal rate, plus time-to-cancel, while guardrails include learning engagement (DAU or lessons per active user) and support burden (refund or chargeback rate). If the app store delays revenue recognition, you also track realized revenue and a leading proxy (expected LTV) separately so the experiment does not “win” on accounting timing.
You want to forecast incremental subscription revenue from a new paywall that changes price localization by country, but you only have 8 weeks of post-launch data and heavy seasonality. How do you build a forecast that is usable for finance, and what backtests and uncertainty outputs do you require before it is used in planning?
Experiment Design & A/B Testing
Most candidates underestimate how much rigor you’ll need around power, variance reduction, and practical issues like logging, ramp-ups, and policy constraints. You’ll be evaluated on whether your experiment plan would survive real subscription funnels, seasonality, and multiple simultaneous tests.
You are testing a new Super Duolingo paywall copy meant to increase subscription starts; what is your primary success metric and what guardrail metric would you require before shipping?
Sample Answer
Use incremental trial-to-paid conversions per exposed user (or incremental revenue per exposed user) as the primary metric, with retention or learning engagement as a guardrail. Monetization changes can win on starts while hurting long-run value, so you need a metric aligned to cash outcomes, not just clicks. A hard guardrail like D7 active learning days or lesson completion rate catches cannibalization where users churn or disengage after seeing a more aggressive paywall.
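The readout for that primary metric reduces to a two-proportion comparison. A minimal pooled-variance z-test sketch (the counts are made up for illustration):

```python
from math import sqrt
from statistics import NormalDist

def conversion_lift(conv_c, n_c, conv_t, n_t):
    """Absolute lift and two-sided p-value from a pooled two-proportion z-test.
    conv_* = converter counts, n_* = exposed users in each arm."""
    p_c, p_t = conv_c / n_c, conv_t / n_t
    p_pool = (conv_c + conv_t) / (n_c + n_t)
    se = sqrt(p_pool * (1 - p_pool) * (1 / n_c + 1 / n_t))
    z = (p_t - p_c) / se
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    return p_t - p_c, p_value

# Hypothetical readout: 4.0% control vs 4.7% treatment conversion
lift, p = conversion_lift(conv_c=400, n_c=10_000, conv_t=470, n_t=10_000)
```

In practice you would run the same comparison on the guardrail metric and ship only if the primary metric wins without the guardrail regressing.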
A paywall experiment runs for 2 weeks, but Duolingo has strong weekly seasonality and simultaneous pricing tests in some countries; how do you design randomization and analysis to get an unbiased lift estimate for revenue per user?
You are asked to ramp a new annual plan default from 0% to 100%, but finance cares about 90-day revenue and you only have 14 days of observation; how do you set the ramp, the stopping rule, and the estimator so the decision is statistically and financially defensible?
Causal Inference & Attribution (Revenue Impact)
Your ability to separate correlation from product-driven lift is central when revenue moves without clean randomization. You’ll need to justify methods like diff-in-diff, CUPED/trigger analysis, IV, or propensity approaches and articulate assumptions in plain language.
A new paywall is launched to 10% of iOS users, but assignment is by app version so older versions never see it; how do you estimate incremental subscription revenue in the first 14 days post launch? Specify the causal estimator, the comparison groups, and the core assumptions you would validate.
Sample Answer
You could do difference in differences on pre and post revenue using never eligible users as controls, or you could do a matched observational estimate using propensity scores. Diff in diff wins here because assignment is driven by rollout timing and you have clean pre periods, so parallel trends is testable and you avoid high variance and modeling dependence from propensity scoring. You still need to restrict to comparable app versions or include version fixed effects, otherwise you confound treatment with version driven monetization changes. Validate parallel trends on pre period revenue and engagement, and run placebo launches to catch seasonality or app store shocks.
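The 2x2 core of that estimator is simple enough to write down. A toy sketch on hypothetical group-period revenue means (real readouts would use regression-based DiD with version fixed effects and clustered errors; this is only the central contrast):

```python
from statistics import mean

def did_estimate(records):
    """Classic 2x2 difference-in-differences on mean revenue per user.
    records: iterable of (group, period, revenue) with group in
    {'treated', 'control'} and period in {'pre', 'post'}."""
    def cell(group, period):
        return mean(r for g, p, r in records if g == group and p == period)
    return ((cell('treated', 'post') - cell('treated', 'pre'))
            - (cell('control', 'post') - cell('control', 'pre')))

# Toy data: treated gains 0.5, control drifts up 0.2 under parallel trends
records = [('treated', 'pre', 1.0), ('treated', 'post', 1.5),
           ('control', 'pre', 1.0), ('control', 'post', 1.2)]
lift = did_estimate(records)
```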
Duolingo runs an email reactivation campaign that targets lapsed learners, and you need to attribute incremental 30 day subscription revenue to the campaign, but you only observe who was sent and who opened, not a randomized holdout. Propose an identification strategy, explain how you would compute revenue lift, and list the assumptions that could break it.
Forecasting & Financial Modeling (Planning)
Rather than just naming time-series models, you’ll be pushed to build a planning-grade forecast that connects product inputs (pricing, trials, paywall exposure) to revenue outputs. The tricky part is communicating uncertainty, scenario planning, and what you’d monitor when reality diverges.
You need a 6 month plan forecast for Duolingo Super net revenue by month, using weekly cohorts of trials and conversions, churn, and FX. What is your minimal model, what inputs does Finance control, and what would you ship as outputs (point forecast plus uncertainty)?
Sample Answer
Reason through it: Start by forecasting paid subscriber counts, because revenue is just price times active paid users after platform fees and refunds. Build a cohort-based rollforward: new trials, trial-to-paid conversion, paid churn, and price or FX, then aggregate cohorts into monthly active paid and multiply by expected net ARPPU. Finance-controlled knobs are pricing, promo calendar, platform fee assumptions, and FX rates, product-controlled knobs are paywall exposure and trial volume, and you separate them explicitly in the model inputs. Ship a base, upside, downside scenario plus interval bands from uncertainty on conversion and churn (for example Beta-Binomial for rates, then Monte Carlo).
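The uncertainty step can be sketched as a toy Monte Carlo cohort rollforward (all parameters invented for illustration; a planning model would add FX, refunds, and platform fees as explicit inputs):

```python
import random
from statistics import quantiles

def revenue_scenarios(trials_per_month, conv_a, conv_b, churn_a, churn_b,
                      arppu, months=6, sims=2_000, seed=7):
    """Cohort rollforward with Beta-distributed conversion and monthly churn.
    (conv_a, conv_b) and (churn_a, churn_b) are Beta pseudo-counts encoding
    historical rates and their uncertainty."""
    random.seed(seed)
    totals = []
    for _ in range(sims):
        conv = random.betavariate(conv_a, conv_b)
        churn = random.betavariate(churn_a, churn_b)
        total = 0.0
        for start_month in range(months):
            paid = trials_per_month * conv
            for _month in range(start_month, months):
                total += paid * arppu   # net revenue recognized this month
                paid *= 1 - churn       # survivors roll into next month
        totals.append(total)
    pct = quantiles(totals, n=20)                       # 5% steps
    return pct[0], quantiles(totals, n=2)[0], pct[-1]   # p5, median, p95

# Hypothetical inputs: ~5% conversion, ~10% monthly churn, $7 net ARPPU
low, mid, high = revenue_scenarios(10_000, 50, 950, 10, 90, arppu=7.0)
```

Shipping the p5/median/p95 band instead of a single point forecast is what makes the output usable for Finance's scenario planning.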
After a paywall redesign test, conversion lifts for Android look big but churn also increases, and Finance wants to use the experiment to update the annual revenue plan. How do you translate experiment deltas into a planning model without double-counting seasonality and without assuming the test effect is permanent?
You are asked to forecast monthly net revenue for subscriptions and IAP with uncertainty, but your paid user time series has a structural break from an App Store price tier change and a mid-quarter promo. What modeling strategy do you use, how do you quantify uncertainty, and what do you monitor weekly to know the plan is drifting?
Applied Machine Learning & Behavioral Modeling
The bar here isn’t whether you know algorithms, it’s whether you can pick and validate models that explain learner behavior and improve monetization decisions. Expect emphasis on feature/label design, leakage, calibration, interpretability, and turning predictions into actions.
You want to predict 30-day subscriber churn for Duolingo Super to drive targeted retention offers, using daily product events and billing tables. How do you define the label and observation window to avoid leakage, and what validation split would you use given weekly seasonality and product launches?
Sample Answer
This question is checking whether you can prevent label leakage while keeping the problem aligned to an action that can be taken at prediction time. You need a clear cutoff time $t_0$, features computed only from events at or before $t_0$, and a churn label computed strictly after $t_0$ over a fixed horizon (for example, no active paid entitlement in $(t_0, t_0+30]$). Use time-based splits (rolling or forward-chaining) so training only sees the past, and ensure the split respects known shocks (launches) and periodicity (week-of-year). If you cannot state what data is available at scoring time, the model will look great offline and fail in production.
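That cutoff discipline can be sketched for a single user as follows (the event names, table shapes, and dates are hypothetical):

```python
from datetime import datetime, timedelta

def churn_example(events, entitlements, t0, horizon_days=30):
    """Build one leakage-safe training example.
    events: (ts, event_name) tuples; entitlements: (start_ts, end_ts) paid
    periods. Features see only data at or before t0; the label looks
    strictly after t0."""
    features = {
        'lessons_last_30d': sum(
            1 for ts, name in events
            if name == 'lesson_complete' and t0 - timedelta(days=30) <= ts <= t0
        ),
    }
    window_end = t0 + timedelta(days=horizon_days)
    # Churn = no paid entitlement overlapping (t0, t0 + horizon]
    active = any(start <= window_end and end > t0 for start, end in entitlements)
    return features, 0 if active else 1

t0 = datetime(2025, 1, 31)
events = [(datetime(2025, 1, 15), 'lesson_complete'),
          (datetime(2025, 2, 5), 'lesson_complete')]   # post-cutoff: must not leak
entitlements = [(datetime(2025, 1, 1), datetime(2025, 1, 20))]  # lapsed before t0
features, churned = churn_example(events, entitlements, t0)
```

Note that the post-cutoff lesson is excluded from the features while still existing in the raw events, which is exactly the mistake a naive "all history" feature join would make.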
You trained a model that outputs $p(\text{convert to Super in 7 days})$ for free users, and product wants to target the top decile with a discount to maximize incremental revenue while limiting cannibalization. How do you calibrate the probabilities and choose a decision threshold that accounts for treatment effect heterogeneity, not just likelihood to convert?
SQL & Analytics Queries
You’ll likely need to write production-realistic SQL to compute subscription funnel metrics, retention cohorts, and experiment readouts from event logs. Watch for pitfalls like deduping, window functions, timezone boundaries, and defining the correct unit of analysis.
Compute a daily subscription funnel for Duolingo, by app_local_date: unique users who started a paywall view, started a trial, and converted to paid within 7 days of trial start, deduping multiple events per user-day.
Sample Answer
The standard move is to aggregate by user and day with COUNT(DISTINCT user_id) on each funnel step. But here, event deduping and the 7-day linkage matter because trial starts and purchases can happen on different days, and multiple paywall views per day will otherwise inflate the top of the funnel.
/*
Daily subscription funnel by app-local date.
Assumptions:
- events table has one row per event.
- app_local_date is already computed in the app timezone.
- relevant event types: 'paywall_view', 'trial_start', 'purchase'.
- purchase events for subscription have product_type = 'subscription'.
BigQuery Standard SQL
*/
WITH paywall_user_day AS (
SELECT
app_local_date,
user_id
FROM `analytics.events`
WHERE event_name = 'paywall_view'
GROUP BY app_local_date, user_id
),
trial_starts AS (
SELECT
app_local_date AS trial_start_date,
user_id,
MIN(event_ts) AS trial_start_ts
FROM `analytics.events`
WHERE event_name = 'trial_start'
GROUP BY trial_start_date, user_id
),
purchases AS (
SELECT
user_id,
event_ts AS purchase_ts
FROM `analytics.events`
WHERE event_name = 'purchase'
AND product_type = 'subscription'
),
trial_conversions_7d AS (
SELECT
t.trial_start_date,
t.user_id
FROM trial_starts t
JOIN purchases p
ON p.user_id = t.user_id
AND p.purchase_ts >= t.trial_start_ts
AND p.purchase_ts < TIMESTAMP_ADD(t.trial_start_ts, INTERVAL 7 DAY)
QUALIFY ROW_NUMBER() OVER (
PARTITION BY t.user_id, t.trial_start_date
ORDER BY p.purchase_ts
) = 1
)
SELECT
d.app_local_date,
COUNT(DISTINCT d.user_id) AS paywall_view_users,
COUNT(DISTINCT t.user_id) AS trial_start_users,
COUNT(DISTINCT c.user_id) AS paid_within_7d_users
FROM paywall_user_day d
LEFT JOIN trial_starts t
ON t.user_id = d.user_id
AND t.trial_start_date = d.app_local_date
LEFT JOIN trial_conversions_7d c
ON c.user_id = t.user_id
AND c.trial_start_date = t.trial_start_date
GROUP BY d.app_local_date
ORDER BY d.app_local_date;

You are reading out an A/B test on a new annual plan paywall: compute D7 retention for each variant, where retention means any learning session on day 7 after the user first saw the paywall (use app-local dates and keep the unit of analysis at the user).
Compute monthly gross subscription revenue and churned MRR by cohort month, where churned MRR is revenue lost from users who had paid revenue last month but have $0 paid revenue this month, excluding refunds and handling users with multiple subscriptions.
The distribution skews toward questions where you're reasoning about Super Duolingo conversion funnels, streak retention tradeoffs, or paywall pricing, then immediately asked to prove the causal revenue lift of whatever you proposed. That compounding of product sense into causal inference is where most candidates break, because a single interview question can start with "define the right metric for a 7-day trial" and end with "the rollout was by app version, not random assignment, so justify your estimator." The prep mistake that'll cost you: drilling SQL sessionization and ML algorithms while underinvesting in the ability to connect a freemium paywall design decision to an incremental revenue estimate using diff-in-diff or CUPED on Duolingo's overlapping experiment stack.
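CUPED itself is a one-line adjustment worth being able to write down. A minimal sketch using each user's pre-period metric as the covariate (toy data, manual covariance so nothing beyond the standard library is needed):

```python
from statistics import mean, variance

def cuped_adjust(y, x):
    """CUPED: remove pre-period variance from metric y using covariate x
    (e.g., each user's pre-experiment revenue). The mean of y is preserved,
    so the lift estimate is unchanged while its variance shrinks."""
    x_bar, y_bar = mean(x), mean(y)
    theta = (sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
             / sum((xi - x_bar) ** 2 for xi in x))
    return [yi - theta * (xi - x_bar) for yi, xi in zip(y, x)]

# Toy user-level data: post-period metric strongly predicted by pre-period
pre = [1.0, 2.0, 3.0, 4.0, 5.0]
post = [2.1, 3.9, 6.2, 7.8, 10.0]
adjusted = cuped_adjust(post, pre)
```

When the pre-period covariate is highly predictive, the adjusted series has a fraction of the original variance, which is what buys back power on overlapping experiments.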
Practice questions modeled on Duolingo's subscription monetization and experiment design focus at datainterview.com/questions.
How to Prepare for Duolingo Data Scientist Interviews
Know the Business
Official mission
“Our mission is to develop the best education in the world and make it universally available.”
What it actually means
Duolingo's real mission is to provide the highest quality education globally through technology, making it universally accessible. They achieve this by continuously improving their product, prioritizing long-term user growth, and leveraging a freemium business model to fund innovation.
Key Business Metrics
$964M (+41% YoY)
$5B (-74% YoY)
830 (+15% YoY)
Current Strategic Priorities
- Develop the best education in the world and make it universally available
- Evolve from a language learning app into a broader educational platform
- Bridge the gap between online learning and real-world impact
Competitive Moat
Duolingo's company strategy frames the ambition plainly: evolve from a language learning app into a broader education platform while keeping the freemium model that fuels growth. Revenue reached $964M (41% year-over-year growth), yet the company still operates with roughly 830 employees, a ratio that tells you something about how much ownership each person carries.
Their operating principles emphasize directness and speed, two values that shape what interviewers actually want to hear. Read both posts before you prep anything else, then skim the investor relations page for the latest subscriber and DAU numbers.
Most candidates blow the "why Duolingo" question by defaulting to "I believe in accessible education." That's table stakes. A stronger answer shows you understand the specific tension the business lives inside: growing subscription revenue without degrading the free tier that drives top-of-funnel user acquisition. If you can connect your DS skills to that tradeoff (experiment design for conversion flows, causal measurement of monetization changes on retention), you're speaking the company's language instead of reciting its mission statement.
Spend at least two weeks using the app daily before your interview. Notice when you hit a paywall, how streak recovery works, and where upsell prompts appear. That kind of product fluency turns a rehearsed answer into a credible one.
Try a Real Interview Question
Incremental revenue lift from a paywall A/B test
You are given user-level A/B assignment and subscription events with revenue. For each variant, compute $n_{users}$, $n_{converters}$ (at least one subscription within $7$ days after assignment), $conversion\_rate$, $arppu$ (average revenue per assigned user within $7$ days), and $incremental\_revenue$ defined as $$\left(arppu_{treatment} - arppu_{control}\right) \times n_{users,treatment}.$$ Return one row per variant plus a final row for the incremental revenue value.
experiment_assignments

| user_id | assignment_ts       | experiment_id | variant   |
|---------|---------------------|---------------|-----------|
| 101     | 2025-01-01 10:00:00 | paywall_v1    | control   |
| 102     | 2025-01-01 11:00:00 | paywall_v1    | treatment |
| 103     | 2025-01-02 09:00:00 | paywall_v1    | control   |
| 104     | 2025-01-02 12:00:00 | paywall_v1    | treatment |

subscriptions

| user_id | event_ts            | revenue_usd | product |
|---------|---------------------|------------:|---------|
| 101     | 2025-01-03 08:00:00 |        9.99 | monthly |
| 102     | 2025-01-10 10:00:00 |        9.99 | monthly |
| 104     | 2025-01-05 12:30:00 |       59.99 | annual  |
| 104     | 2025-01-06 12:30:00 |        9.99 | monthly |
-- Write a SQL query that produces the requested metrics for experiment_id = 'paywall_v1'.
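One possible shape for the answer, sketched here against an in-memory SQLite database so the numbers on the sample data can be checked. The `datetime(..., '+7 days')` call is SQLite-specific; Redshift or BigQuery would use their own interval syntax, and the final incremental-revenue row could be appended with a `UNION ALL` instead of being computed in Python:

```python
import sqlite3

# Rebuild the sample tables from the prompt in memory.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE experiment_assignments (user_id INT, assignment_ts TEXT, experiment_id TEXT, variant TEXT);
CREATE TABLE subscriptions (user_id INT, event_ts TEXT, revenue_usd REAL, product TEXT);
INSERT INTO experiment_assignments VALUES
  (101,'2025-01-01 10:00:00','paywall_v1','control'),
  (102,'2025-01-01 11:00:00','paywall_v1','treatment'),
  (103,'2025-01-02 09:00:00','paywall_v1','control'),
  (104,'2025-01-02 12:00:00','paywall_v1','treatment');
INSERT INTO subscriptions VALUES
  (101,'2025-01-03 08:00:00',9.99,'monthly'),
  (102,'2025-01-10 10:00:00',9.99,'monthly'),
  (104,'2025-01-05 12:30:00',59.99,'annual'),
  (104,'2025-01-06 12:30:00',9.99,'monthly');
""")

QUERY = """
WITH windowed AS (
    -- LEFT JOIN keeps every assigned user; only revenue inside the
    -- 7-day post-assignment window survives the join conditions.
    SELECT a.variant, a.user_id, s.revenue_usd
    FROM experiment_assignments a
    LEFT JOIN subscriptions s
      ON s.user_id = a.user_id
     AND s.event_ts >= a.assignment_ts
     AND s.event_ts <  datetime(a.assignment_ts, '+7 days')
    WHERE a.experiment_id = 'paywall_v1'
)
SELECT variant,
       COUNT(DISTINCT user_id) AS n_users,
       COUNT(DISTINCT CASE WHEN revenue_usd IS NOT NULL
                           THEN user_id END) AS n_converters,
       1.0 * COUNT(DISTINCT CASE WHEN revenue_usd IS NOT NULL
                                 THEN user_id END)
           / COUNT(DISTINCT user_id) AS conversion_rate,
       SUM(COALESCE(revenue_usd, 0)) / COUNT(DISTINCT user_id) AS arppu
FROM windowed
GROUP BY variant
"""
rows = {r[0]: r for r in conn.execute(QUERY)}

# Final row: (arppu_treatment - arppu_control) * n_users_treatment.
incremental = (rows["treatment"][4] - rows["control"][4]) * rows["treatment"][1]
print(rows, round(incremental, 2))
```

Note the subtlety the sample data is testing: user 102 converts on 2025-01-10, outside the 7-day window, so treatment still has only one converter.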
Duolingo's engineering blog makes clear the org values clean, efficient code, and from what candidates report, the DS coding round reflects that bar. Practice both algorithm problems and complex funnel/sessionization queries at datainterview.com/coding.
Test Your Readiness
How Ready Are You for Duolingo Data Scientist?
Can I define a clear north-star metric for subscriptions (for example, net revenue retention or payer conversion) and explain the supporting metrics and tradeoffs (trial start rate, trial-to-paid, churn, refund rate, ARPPU, LTV)?
The questions you'll face skew toward Duolingo's actual business decisions, so product context matters more than memorized formulas. Sharpen that context with Duolingo-specific practice at datainterview.com/questions.
Frequently Asked Questions
How long does the Duolingo Data Scientist interview process take?
From first recruiter call to offer, expect roughly 4 to 6 weeks. The process typically includes a recruiter screen, a technical phone screen focused on SQL and Python, a take-home or live coding exercise, and then a full onsite (often virtual). Scheduling the onsite can add a week or two depending on interviewer availability. I've seen it move faster for senior candidates Duolingo is actively pursuing.
What technical skills are tested in the Duolingo Data Scientist interview?
SQL and Python are non-negotiable. Beyond that, you'll be tested on experiment design and A/B testing, causal inference, behavioral modeling, and applied machine learning. Duolingo cares a lot about metrics development too, so expect questions about how you'd define and track success for product features. If you're interviewing at a senior level, be ready to talk about end-to-end data solution development and attribution modeling.
How should I tailor my resume for a Duolingo Data Scientist role?
Lead with impact, not tools. Duolingo wants to see that you've designed experiments, built models that shipped, and defined metrics that actually changed decisions. Mention A/B testing experience explicitly. If you've worked on anything related to behavioral modeling, user engagement, or education tech, put that front and center. Keep it to one page, quantify your results with real numbers, and make sure Python and SQL are clearly listed.
What is the total compensation for a Duolingo Data Scientist?
Duolingo pays competitively, especially given their Pittsburgh HQ where cost of living is lower than the Bay Area. For a mid-level Data Scientist, total comp (base plus equity plus bonus) typically falls in the $160K to $220K range. Senior Data Scientists and those with management responsibilities can see total comp push toward $250K to $350K or higher. Equity is a meaningful part of the package since Duolingo is publicly traded (DUOL). Exact numbers depend on level, experience, and negotiation.
How do I prepare for the behavioral interview at Duolingo?
Duolingo's values are very specific, so study them. 'Test it first,' 'Ship it,' and 'Learners first' come up constantly in how they evaluate culture fit. Prepare stories that show you prioritize ruthlessly, embrace ambiguity, and make decisions backed by data rather than opinion. They also value candor, so have an example ready where you gave or received tough feedback constructively. Don't be generic here. Tie every answer back to how it would play out at a mission-driven company focused on accessible education.
How hard are the SQL and coding questions in the Duolingo Data Scientist interview?
The SQL questions are medium to hard. Expect window functions, CTEs, self-joins, and questions that require you to think about user engagement data (think daily active users, streaks, retention cohorts). Python questions lean toward data manipulation with pandas and writing clean, readable code rather than pure algorithm puzzles. You should be comfortable writing queries and scripts under time pressure. Practice with realistic product analytics problems at datainterview.com/coding to get the right feel.
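Streak questions in particular are usually a gaps-and-islands exercise: consecutive dates minus a row counter collapse to a constant key, so grouping on that key isolates each streak. A minimal sketch of the pattern, run against SQLite with a made-up `activity` table (column names are illustrative, not from any actual Duolingo question):

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE activity (user_id INT, activity_date TEXT);
INSERT INTO activity VALUES
  (1,'2025-01-01'),(1,'2025-01-02'),(1,'2025-01-03'),
  (1,'2025-01-05'),(1,'2025-01-06'),
  (2,'2025-01-01'),(2,'2025-01-03');
""")

LONGEST_STREAK = """
WITH grouped AS (
    -- Within a run of consecutive days, date minus row number is constant,
    -- so grp identifies one "island" (streak) per user.
    SELECT user_id,
           julianday(activity_date)
             - ROW_NUMBER() OVER (PARTITION BY user_id
                                  ORDER BY activity_date) AS grp
    FROM activity
)
SELECT user_id, MAX(streak_len) AS longest_streak
FROM (SELECT user_id, grp, COUNT(*) AS streak_len
      FROM grouped
      GROUP BY user_id, grp)
GROUP BY user_id
ORDER BY user_id
"""
streaks = conn.execute(LONGEST_STREAK).fetchall()
print(streaks)  # [(1, 3), (2, 1)]
```

User 1 has runs of 3 and 2 consecutive days, user 2 never has two in a row; the same trick extends to "current streak as of today" by filtering to the island containing the latest date.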
What machine learning and statistics concepts should I know for Duolingo's Data Scientist interview?
Causal inference is a big one. Know the difference between correlation and causation, and be ready to discuss methods like difference-in-differences, propensity score matching, or instrumental variables. A/B testing fundamentals are essential: power analysis, significance testing, common pitfalls like peeking. Applied ML topics include classification, regression, and behavioral modeling. You don't need to derive backpropagation, but you should be able to explain when and why you'd choose one model over another for a real product problem.
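For power analysis specifically, it is worth being able to do the back-of-envelope version without reaching for a library. A sketch using the standard two-proportion z-test approximation with unpooled variances (the function name and the 5% baseline are illustrative assumptions, not from any actual interview):

```python
from math import ceil
from statistics import NormalDist

def sample_size_per_arm(p1, p2, alpha=0.05, power=0.80):
    """Approximate n per arm to detect p1 -> p2 in a two-proportion z-test.

    Uses the common n = (z_a + z_b)^2 * (var1 + var2) / delta^2 rule of thumb
    with unpooled variances; a two-sided test at significance level alpha.
    """
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)  # two-sided critical value
    z_beta = NormalDist().inv_cdf(power)           # power requirement
    variance = p1 * (1 - p1) + p2 * (1 - p2)
    return ceil((z_alpha + z_beta) ** 2 * variance / (p1 - p2) ** 2)

# Detecting a 5% -> 6% conversion lift at alpha = 0.05 with 80% power
# lands around ~8,000+ users per arm -- a useful intuition for why small
# lifts on paywall conversion need large experiments.
print(sample_size_per_arm(0.05, 0.06))
```

The punchline interviewers look for: halving the detectable effect roughly quadruples the required sample, which is why "peeking" early at an underpowered test is such a common pitfall.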
What format should I use to answer behavioral questions at Duolingo?
Use a STAR-like structure but keep it tight. Situation (two sentences max), Task (what was your specific role), Action (what you actually did, not your team), Result (quantified if possible). Duolingo interviewers appreciate concise answers. I've seen candidates ramble for five minutes and lose the room. Aim for 90 seconds to two minutes per answer, then let the interviewer ask follow-ups. Being candid and kind is literally one of their values, so don't oversell or dodge the hard parts of your stories.
What happens during the Duolingo Data Scientist onsite interview?
The onsite is typically four to five rounds spread across a day. Expect a SQL/coding round, a product/metrics case study, a machine learning or statistics deep-dive, and one or two behavioral rounds. The product case often involves Duolingo-specific scenarios like optimizing lesson completion or measuring the impact of a new feature on learner retention. There's usually a presentation or take-home review component where you walk through your analysis. Come prepared to explain your reasoning clearly, not just your results.
What metrics and business concepts should I know before interviewing at Duolingo?
Understand Duolingo's freemium model inside and out. Know how they make money (Super Duolingo subscriptions, ads, Duolingo English Test). Be ready to discuss engagement metrics like DAU/MAU ratio, streak retention, lesson completion rates, and conversion from free to paid. Attribution modeling comes up too, so think about how you'd measure the impact of push notifications or gamification features on long-term retention. Duolingo hit $1B in revenue, so understanding their growth levers will set you apart in case study rounds.
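The DAU/MAU stickiness ratio mentioned above is simple enough to compute straight from an event log, and being able to state its definition precisely is half the battle. A toy sketch with invented data, assuming MAU means distinct actives in a trailing 30-day window (one common convention; internal definitions may differ):

```python
from datetime import date, timedelta

# Hypothetical event log: (user_id, activity_date) pairs.
events = [
    (1, date(2025, 1, 30)), (2, date(2025, 1, 30)), (3, date(2025, 1, 30)),
    (1, date(2025, 1, 29)), (4, date(2025, 1, 10)), (5, date(2025, 1, 5)),
    (6, date(2024, 12, 20)),  # outside the trailing 30-day window
]

def dau_mau(events, day, window_days=30):
    """Stickiness = DAU / MAU, with MAU as distinct actives in the trailing window."""
    dau = {u for u, d in events if d == day}
    start = day - timedelta(days=window_days - 1)
    mau = {u for u, d in events if start <= d <= day}
    return len(dau) / len(mau)

print(dau_mau(events, date(2025, 1, 30)))  # 3 DAU / 5 MAU = 0.6
```

In a case study, the follow-up is usually why the ratio moved: a falling DAU/MAU with stable MAU means users visit less often, which is exactly the behavior streaks and push notifications exist to fight.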
What common mistakes do candidates make in the Duolingo Data Scientist interview?
The biggest one is treating it like a generic tech interview. Duolingo is deeply mission-driven, and candidates who don't connect their answers to education or learner outcomes fall flat. Another common mistake is jumping to complex ML solutions when a simple A/B test or descriptive analysis would answer the question. Duolingo values 'Reduce complexity,' so show that instinct. Finally, don't skip the 'why' behind your technical choices. They care about your reasoning as much as your answer.
How can I practice for the Duolingo Data Scientist technical rounds?
Start with SQL and Python problems that mirror product analytics scenarios. Think user-level engagement data, cohort analysis, and experiment evaluation. datainterview.com/questions has problems designed specifically for data science interviews at product companies like Duolingo. Beyond coding, practice explaining your approach out loud. Duolingo interviewers want to hear your thought process in real time. I'd also recommend running through two or three mock case studies where you define metrics for a hypothetical feature, design an experiment, and interpret results.