Product Sense questions dominate final-round interviews at Meta, Google, Airbnb, Uber, Spotify, and Netflix because they reveal whether you can think like a product owner, not just analyze data. These companies need analysts and scientists who understand business context, can design meaningful experiments, and can translate metric movements into actionable insights. If you nail Product Sense, you signal that you're ready to own metrics and drive product decisions from day one.
The challenge isn't memorizing frameworks; it's demonstrating business intuition under pressure. Consider this: Spotify's daily active users spike 15% during a major outage recovery, but session duration drops 20%. Most candidates rush to blame the algorithm or suggest A/B testing without first asking whether users are simply checking that the service works and then leaving. Strong candidates pause, think about user behavior, and propose multiple hypotheses before jumping to solutions.
Here are the top 29 Product Sense questions organized by the core skills that separate great candidates from average ones.
Product Sense Interview Questions
Top Product Sense interview questions covering the key areas tested at leading tech companies. Practice with real questions and detailed solutions.
Metrics and Goal Setting
Interviewers use metrics questions to test whether you understand the difference between vanity metrics and business drivers. Too many candidates pick engagement metrics without considering monetization, or choose revenue metrics that ignore user experience trade-offs.
The key insight most candidates miss: your north star metric choice reveals your mental model of how the business works. When Meta asks about hiding like counts, weak answers focus on user satisfaction without connecting to creator retention or ad revenue impact.
Start by defining what success means: you translate a vague product prompt into a clear goal, a north star metric, and guardrail metrics. You struggle here when you pick metrics that are easy to compute but don't actually drive decisions.
Meta is considering a feature that lets users hide like counts on their posts. What is your north star metric for success, and what 2 guardrail metrics do you add to avoid optimizing for the wrong thing?
Sample Answer
Most candidates default to easy activity metrics like number of hides toggled or time spent, but that fails here because it does not tell you whether the feature improves the social experience. Your north star should capture the intended outcome, for example the share of active users who create or share content in a meaningful window, with a clear definition like 7-day creators per DAU. Add guardrails for platform health, for example negative feedback rate (hides, reports, unfollows per impression) and retention (D7 or D28), so you do not increase posting while harming long-term engagement. Make sure every metric is attributable to feed experiences where like counts are visible versus hidden; otherwise you will chase noise.
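To make that definition concrete, here is a minimal pandas sketch, assuming a hypothetical event log with user_id, event_date, and event_type columns; the actual tables and event names will differ in practice.

import pandas as pd

# Hypothetical event log (assumed schema): one row per user action.
events = pd.DataFrame({
    "user_id": [1, 1, 2, 3, 3, 4],
    "event_date": pd.to_datetime([
        "2024-05-01", "2024-05-03", "2024-05-02",
        "2024-05-05", "2024-05-06", "2024-05-06",
    ]),
    "event_type": ["post", "view", "view", "post", "share", "view"],
})

as_of = pd.Timestamp("2024-05-06")

# Denominator: users active on the measurement day (DAU).
dau = set(events.loc[events["event_date"] == as_of, "user_id"])

# Numerator: those same users who created or shared in the trailing 7 days.
creator_rows = events[
    events["event_date"].between(as_of - pd.Timedelta(days=6), as_of)
    & events["event_type"].isin(["post", "share"])
]
creators_in_dau = set(creator_rows["user_id"]) & dau

print(f"7-day creators per DAU: {len(creators_in_dau) / len(dau):.0%}")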
Google Search wants to launch an AI-generated answer panel for informational queries. Define success with one primary metric, then list 3 supporting metrics that help you decide whether to expand the rollout.
Uber is testing an in-app tip suggestion redesign for drivers. What goal would you set, and how would you choose between tips per trip and driver earnings as the north star?
Spotify is rolling out a new personalized Home feed layout. You see DAU up 1.5% and time spent up 4%, but next-day retention is flat. What metrics would you set as your success criteria, and how do you interpret this readout?
Airbnb wants to reduce guest support contacts after booking by adding clearer cancellation and check-in info. What is your north star metric, and what 2 guardrails ensure you do not hurt bookings or host experience?
LinkedIn is adding an auto-suggested comment feature to increase conversation on posts. Define a goal, a north star, and 3 guardrails that prevent low-quality engagement and creator churn.
Experiment Design and Causal Thinking
Experiment design questions expose whether you can think causally or just correlate patterns. Candidates often design clean experiments on paper but fail to account for network effects, novelty bias, or measurement challenges that make real-world testing messy.
The fatal mistake: assuming you can randomize users cleanly when products have social connections, marketplace dynamics, or shared infrastructure. At Uber, testing driver incentives affects both sides of the market, but most candidates design single-sided experiments that miss crucial spillover effects.
In this section, you show you can turn a product idea into a testable hypothesis, pick an evaluation method, and avoid common validity traps. Candidates often miss confounders, misuse significance, or forget how the experiment could change user behavior.
Meta is considering showing a new Reels ranking model to increase watch time. Design an online experiment, pick primary and guardrail metrics, and explain how you will avoid interference from social connections.
Sample Answer
Run a cluster-randomized A/B test where the randomization unit is a group of connected users rather than a single user, optimizing for incremental watch time while guarding against negative feedback and creator churn. Your primary metric is total watch time per user, with guardrails like session starts, hides, unfollows, and long-term retention. Interference happens when treatment affects what your friends see or create, so you either cluster by social-graph communities or limit the treatment to ranking at consumption time only and measure spillovers explicitly. You also pre-register the decision rule and run an A/A test to validate instrumentation and variance assumptions.
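As an illustration of the clustering idea, here is a minimal sketch, assuming a hypothetical precomputed mapping from each user to a social-graph community (for example from a community detection pass over the follow graph); the cluster IDs and experiment salt are made up for the example.

import hashlib

def assign_arm(cluster_id: str, salt: str, treatment_share: float = 0.5) -> str:
    """Deterministically assign an entire social-graph cluster to treatment or control."""
    digest = hashlib.sha256(f"{salt}:{cluster_id}".encode()).hexdigest()
    bucket = int(digest[:8], 16) / 0xFFFFFFFF  # pseudo-uniform value in [0, 1]
    return "treatment" if bucket < treatment_share else "control"

# Hypothetical user -> community mapping; users in the same community share one assignment.
user_to_cluster = {"u1": "c12", "u2": "c12", "u3": "c87", "u4": "c87", "u5": "c03"}

assignments = {user: assign_arm(cluster, salt="reels_ranking_v2")
               for user, cluster in user_to_cluster.items()}
print(assignments)

Because the randomization unit is the cluster, you would also analyze at the cluster level, or adjust standard errors for within-cluster correlation, rather than treating users as independent.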
Uber wants to reduce rider cancellations by showing an upfront cancellation fee warning before request. How would you test it, and how would you separate behavior change from selection effects due to riders deciding not to request at all?
Netflix ships a new autoplay preview feature on the home screen. Design an experiment and explain how you would handle novelty effects and multiple outcomes like play starts, watch time, and satisfaction.
Google Search changes the layout to add a larger 'People also ask' module. How would you design a causal evaluation that accounts for position bias, measurement bias in clicks, and long-term user trust?
Airbnb wants to add an AI-generated 'Neighborhood summary' on listing pages. Propose an experiment, identify key confounders, and explain what you would do if hosts change their listing descriptions in response during the test.
Spotify considers sending a push notification that recommends a new playlist when you finish a workout. Design an experiment and explain how you would avoid biased estimates from notification opt-in, time-of-day effects, and repeated exposures.
Diagnosing Metric Movements
Metric movement questions test your debugging instincts and business intuition simultaneously. Weak candidates either panic and list every possible cause, or confidently blame one factor without gathering evidence first.
Smart candidates follow a systematic triage approach: first rule out measurement issues and external factors, then form testable hypotheses ranked by likelihood and business impact. When Spotify sees listening minutes drop while DAUs spike, your first question should be about session measurement, not algorithm changes.
When a core metric spikes or drops, you need a structured debugging plan that narrows the problem quickly. Many candidates jump to a pet theory instead of slicing by funnel stage, segment, platform, and time.
At Uber, completed trips per active rider dropped 6% week over week in one major city, while app opens and ride requests stayed flat. How do you diagnose whether this is a supply issue, a pricing issue, or a measurement issue?
Sample Answer
You could start by brainstorming causes from intuition, or you could decompose the metric into a funnel and validate each step with cuts. The funnel approach wins here because the upstream inputs (app opens and ride requests) are flat, so you need to locate the exact conversion break: request to match, match to pickup, pickup to completion. Slice by time of day, rider segment, and pickup zone to see if the drop concentrates where supply constraints show up. Then check related guardrails like surge prevalence, ETA, cancellation rate, and driver online hours to separate supply and pricing effects from tracking issues.
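A minimal sketch of that funnel decomposition, assuming a hypothetical request-level table with flags for each stage (the real schema and cuts would be richer):

import pandas as pd

# Hypothetical request log for the affected city (assumed schema).
trips = pd.DataFrame({
    "zone": ["downtown", "downtown", "downtown", "airport", "airport", "airport"],
    "hour": [8, 18, 18, 8, 18, 22],
    "matched": [1, 0, 1, 1, 1, 1],
    "picked_up": [1, 0, 0, 1, 1, 1],
    "completed": [1, 0, 0, 1, 1, 0],
})

# Conversion at each funnel step, cut by pickup zone; the broken step and the
# segment where it concentrates point toward supply vs. pricing vs. tracking.
for zone, g in trips.groupby("zone"):
    request_to_match = g["matched"].mean()
    match_to_pickup = g.loc[g["matched"] == 1, "picked_up"].mean()
    pickup_to_complete = g.loc[g["picked_up"] == 1, "completed"].mean()
    print(f"{zone}: request->match {request_to_match:.0%}, "
          f"match->pickup {match_to_pickup:.0%}, pickup->complete {pickup_to_complete:.0%}")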
At Spotify, daily active users are up 10% day over day, but total listening minutes are down 8%. Walk me through how you would debug this without assuming it is an algorithm change.
At Meta, ad revenue is flat, but CPM is up 12% and impressions are down 10% over the same period. What is your diagnostic plan to find the driver and assess whether this is good or bad?
At DoorDash, checkout conversion dropped 3 percentage points on iOS only, starting around 7 pm local time, but traffic and cart adds are unchanged. How would you debug, and what would you do in the first hour?
At Netflix, new user free trial starts are flat, but trial to paid conversion fell 15% week over week in Canada. What analyses would you run to diagnose the root cause and recommend next steps?
At Google Search, overall click-through rate is down 2% day over day, but only on Android, and only for queries with local intent. How would you separate ranking changes from UI changes from logging issues, and what data would you need?
Feature Impact and Launch Evaluation
Launch evaluation questions reveal whether you can balance competing metrics and think beyond immediate outcomes. Most candidates focus on primary success metrics but ignore downstream effects that show up weeks later.
The critical insight: successful launches require measuring leading indicators (immediate user behavior), lagging indicators (retention and satisfaction), and guardrail metrics (preventing negative side effects). DoorDash's utensils toggle might boost basket size today but create delivery confusion next month.
You will be asked to evaluate a new feature or ranking change, including what to measure pre-launch, at launch, and post-launch. A common failure mode is focusing on one metric and missing trade-offs like satisfaction, latency, or downstream effects.
Meta is testing a change to Instagram Reels ranking that increases watch time but may show more repetitive content. What would you measure pre-launch, at launch, and post-launch to decide whether to roll it out?
Sample Answer
Reason through it in stages. First, define success as a balanced scorecard, not just watch time: primary engagement (watch time per session, completion rate), satisfaction (hide and 'not interested' actions, surveys, negative feedback rate), and ecosystem health (creator diversity, new creator reach, a content repetition index). Pre-launch, validate offline ranking metrics and guardrail simulations, then run a small canary to check latency, crash rate, and distribution shifts. At launch, monitor treatment vs control deltas with guardrails like blocks, reports, and session abandonment, plus infra metrics like p95 feed load time. Post-launch, watch for lagging effects like creator churn, long-run retention ($R_7$, $R_{28}$), and concentration metrics like the Gini coefficient of impressions that can drift over weeks.
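As one concrete example of a concentration guardrail, here is a small sketch of a Gini coefficient over per-creator impressions (the counts below are made up):

def gini(values):
    """Gini coefficient: 0 means impressions are spread evenly, values near 1 mean a few creators dominate."""
    xs = sorted(values)
    n = len(xs)
    weighted_sum = sum(rank * x for rank, x in enumerate(xs, start=1))
    return (2 * weighted_sum) / (n * sum(xs)) - (n + 1) / n

# Hypothetical weekly impressions per creator, before vs. after the ranking change.
before = [120, 150, 90, 200, 180, 160]
after = [40, 60, 30, 900, 700, 80]

print(f"Gini before: {gini(before):.2f}, after: {gini(after):.2f}")  # a large jump flags concentration drift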
Google Search is launching a new snippet format that increases click-through rate but slightly increases page load time due to heavier rendering. How do you evaluate whether the launch is worth it, and what guardrails do you set?
DoorDash is adding an in-checkout 'Add utensils and condiments' toggle. It might increase basket size but could slow checkout and increase order issues. What metrics do you use, and how do you tell if it is a net positive?
Airbnb is rolling out a new 'Instant Book eligible' badge intended to boost bookings. How would you design launch evaluation to catch marketplace trade-offs, including host behavior changes?
LinkedIn is testing 'Suggested replies' in messaging to increase reply rate. How would you evaluate impact while detecting harm to conversation quality and user trust?
Netflix changes its homepage ranking to surface more new releases, and early results show higher play starts but lower completion. What is your launch decision framework, and how do you diagnose what is happening?
Product Strategy, Trade-offs, and Growth Loops
Strategy and trade-off questions separate candidates who think tactically from those who understand sustainable growth loops. These questions have no single right answer, but strong candidates demonstrate clear reasoning about resource allocation and long-term impact.
The winning approach: acknowledge the trade-off explicitly, define success criteria for each option, then choose based on which creates stronger network effects or defensible advantages. When Airbnb asks about conversion versus retention investment, your framework matters more than your final choice.
Finally, you need to reason like a product partner: choose where to invest, quantify trade-offs, and explain how you would drive sustainable growth. Candidates struggle when they cannot connect an initiative to a mechanism, such as retention loops, network effects, or marketplace balance.
Meta is seeing a 3% drop in 7-day retention for Reels among new users, but watch time per retained user is up. You can invest one quarter in either improving creator supply (more fresh content) or improving personalization quality (better ranking). Which do you pick, and how do you quantify the trade-off?
Sample Answer
This question is checking whether you can connect an initiative to a growth loop and quantify second-order effects, not just pick a side. You should model the impact on $LTV$ via $$LTV \approx ARPDAU \times \sum_{t=1}^{T} P(\text{active at } t)$$ and treat supply and ranking as levers on different terms: supply shifts the content frontier, while ranking shifts match efficiency. If retention is dropping for new users, bias toward personalization if you can show it improves early session satisfaction, reduces cold-start failures, and compounds via more interactions that further train the model. You still sanity-check supply with leading indicators, like new creator posts per DAU and content freshness, to ensure you are not ranking a thin catalog.
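A tiny sketch of that decomposition with made-up numbers, treating better ranking as a slower retention decay and more supply as a higher ARPDAU, just to show how each lever flows through the formula:

def ltv(arpdau, retention_curve):
    """LTV ~= ARPDAU x sum of P(active at day t) over the horizon."""
    return arpdau * sum(retention_curve)

# Hypothetical 28-day retention curve for new users: P(active at day t).
base_curve = [0.9 * 0.93 ** t for t in range(28)]

baseline = ltv(0.05, base_curve)
ranking_lift = ltv(0.05, [0.9 * 0.95 ** t for t in range(28)])  # better early matches -> slower decay
supply_lift = ltv(0.05 * 1.08, base_curve)                      # fresher catalog -> higher ARPDAU

print(f"baseline {baseline:.2f}, ranking lever {ranking_lift:.2f}, supply lever {supply_lift:.2f}")

Under these assumed inputs the retention lever compounds across the whole curve, which is exactly the comparison you would want to make with real retention and ARPDAU estimates.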
At Airbnb, search conversion is flat, but repeat bookings are down. You can either reduce customer support response time by 30% or increase listing quality by enforcing stricter photo and amenity standards. Which investment is more likely to create a sustainable retention loop, and why?
DoorDash wants to lower delivery ETAs by 2 minutes in a dense city. You can do it by increasing dasher incentives (more supply) or by batching more orders per trip (higher utilization). How do you decide, and what guardrails do you set?
LinkedIn notices that new users who add 5+ connections in the first week have 2x 30-day retention, but nudges to connect feel spammy. Propose a growth loop that increases early connections without harming long-term trust, and specify the key experiment and success metrics.
Spotify is debating between investing in better podcast recommendations or launching a referral program for premium. Both can lift growth, but with different loop dynamics. Which would you prioritize for sustainable growth over 12 months, and what data would you request before deciding?
How to Prepare for Product Sense Interviews
Practice metric decomposition out loud
Break complex metrics like 'revenue per user' into component parts and explain how each piece connects to user behavior. This builds the analytical muscle memory you need when interviewers ask follow-up questions.
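For example, one common decomposition (the exact factors are illustrative and depend on the business model) is:

$$\text{revenue per user} = \frac{\text{payers}}{\text{users}} \times \frac{\text{orders}}{\text{payer}} \times \frac{\text{revenue}}{\text{order}}$$

Each factor maps to a distinct user behavior: converting to a paying user, ordering more often, and spending more per order, which tells you where a movement in the headline metric is actually coming from.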
Study actual product launches at target companies
Read about the Instagram Reels launch, Uber's upfront pricing rollout, or Spotify's podcast push. Understanding real product decisions helps you sound informed about business context and competitive dynamics.
Master the difference between correlation and causation
Practice explaining why two metrics might move together without one causing the other. Interviewers love asking about spurious correlations to test whether you think causally or just pattern-match.
Build mental models for two-sided marketplaces
Understand how changes affect both supply and demand sides at Uber, Airbnb, and DoorDash. Many product sense questions involve marketplace dynamics that trip up candidates who only consider one side.
Time-box your initial response
Spend 30 seconds thinking through the business context before diving into frameworks. Rushed answers that jump straight to metrics often miss crucial business nuances that stronger candidates catch.
How Ready Are You for Product Sense Interviews?
Your marketplace app saw a 12% increase in total orders last month, but customer support tickets also rose. In an interview, how should you set a goal and choose metrics for the next quarter to ensure the growth is healthy?
Frequently Asked Questions
How much product depth do I need for a Product Sense interview as a Data Analyst or Data Scientist?
You do not need to be a PM, but you do need to think in terms of users, goals, and measurable outcomes. Expect to define the problem, propose a few solutions, and pick success metrics with clear trade-offs. Your depth should show you can connect product decisions to data, experiments, and business impact.
Which companies ask the most Product Sense interview questions for data roles?
Consumer tech and marketplace companies tend to ask Product Sense most often, especially teams that run frequent experiments. Expect it at large tech companies, social, e-commerce, rideshare, streaming, and fintech firms, plus growth and product analytics teams at startups. If the role sits close to product decisions, Product Sense questions are likely.
Is coding required in a Product Sense interview for Data Analyst or Data Scientist roles?
Product Sense itself is usually not a coding round; it is about product thinking, metrics, and experimentation. However, many interview loops pair it with SQL or Python to validate that you can operationalize your analysis. Prepare for both, and practice coding at datainterview.com/coding.
How does a Product Sense interview differ between Data Analyst and Data Scientist roles?
As a Data Analyst, you are typically evaluated on metric design, instrumentation, diagnosis, and communicating insights that drive decisions. As a Data Scientist, you are also expected to reason about causal inference, experiment design details, model-driven trade-offs, and longer-term measurement strategy. Both roles need structured thinking, but DS interviews often go deeper on uncertainty and methodology.
How can I prepare for Product Sense interviews if I have no real-world product experience?
Use familiar products and practice framing problems: pick a goal, identify users, propose changes, and define primary and guardrail metrics. Then outline an experiment or analysis plan, including what data you would need and what could bias results. Use datainterview.com/questions to drill common Product Sense prompts and compare your metric choices to strong examples.
What are the most common mistakes candidates make in Product Sense interviews for data roles?
You lose points when you jump to a solution without clarifying the goal, user segment, and constraints. Another common miss is listing many metrics without choosing one north star metric and a few guardrails tied to the product change. You also hurt your answer if you ignore trade-offs, confounders, or how you would validate impact with an experiment or quasi-experiment.

