What SQL questions do data analysts get asked?

Window functions (RANK, DENSE_RANK, ROW_NUMBER, LAG, LEAD), multi-CTE queries, self-joins, conditional aggregation with CASE inside SUM or AVG, second-highest-salary patterns, cohort retention queries, and query optimization at a read-the-plan level. The bar is roughly the same as a data engineer at the entry level for pure SQL writing, but data analysts get more business-flavored prompts (find customers whose spend dropped 20% week-over-week) versus the data-flow prompts a data engineer would face. Window functions are universal in 2026. If you can't write a query that ranks within a partition, that's an automatic fail signal.

What Excel skills should a data analyst know for an interview?

Pivot tables (the single most-tested Excel skill), INDEX/MATCH and XLOOKUP, VLOOKUP (still asked even though XLOOKUP supersedes it), SUMIFS and COUNTIFS, conditional formatting, basic data-cleaning functions (TRIM, CLEAN, TEXT, LEFT/RIGHT/MID), and at least one dynamic-array function (FILTER or UNIQUE). The 2026 Excel question pattern at entry level: 'here's a messy CSV, clean it, build a pivot, surface three insights.' If you can do that in 20 minutes without Googling, you pass the Excel axis. Power Query is a senior-level expectation, not entry-level.

What statistics questions do data analysts get asked at the entry level?

Five categories: descriptive statistics (mean vs median when to use which, standard deviation interpretation, outlier detection), probability fundamentals (independence, conditional probability, Bayes at a light level), hypothesis testing (null vs alternative hypothesis, p-value interpretation, type I vs type II errors), A/B test design (sample size, statistical significance, what a 95% confidence interval actually means), and regression intuition (correlation vs causation, R-squared interpretation, linear vs logistic in plain English). The bar is interpretation, not derivation. You don't need to derive the t-distribution. You need to explain what a p-value of 0.03 means to a non-technical PM.

What Tableau interview questions should I prepare for?

Calculated fields (the syntax differs from SQL), level-of-detail (LOD) expressions (FIXED, INCLUDE, EXCLUDE: the single most-tested Tableau concept above the basics), dashboard actions (filter, highlight, URL), parameters versus filters, the difference between dimensions and measures, table calculations (Tableau's running totals and percentage-of-total), and one real-world dashboard design question (build a sales dashboard with three views). Power BI questions cover similar terrain with DAX as the calculated-field language instead of Tableau's expression language. If the job description names a specific tool, study that one to 70% depth and the other to 30%.

What's the difference between data analyst, data scientist, and data engineer interviews?

Data analyst interviews focus on SQL plus business reasoning plus a viz tool. Light statistics, no ML, no infrastructure. Data scientist interviews go deep on statistics, machine learning, A/B test design, and experiment analysis. Data engineer interviews focus on pipelines, distributed systems, Spark, Airflow, dbt, and infrastructure tradeoffs. The analyst seat is the most reachable for a CS new grad without statistics or ML coursework. The data scientist seat needs at least one stats or probability class plus an ML project on the resume. The data engineer seat needs Python plus SQL plus a portfolio project on the modern data stack. Salary differs too: analysts typically $65K-$95K entry level, scientists $95K-$140K, engineers $85K-$130K in the US.

Should CS new grads pivot from SWE to data analyst in 2026?

For most CS new grads applying since spring 2025 with no SWE offers, the data analyst seat is the most reachable pivot. Lower LeetCode bar (often zero), fewer applicants per opening, and faster time-to-offer per Class-of-2025 cohort reports on r/cscareerquestions through Q1 2026. The honest tradeoff: analyst salaries run about $20K lower than SWE at entry level in the US. The pivot is worth it if your goal is to end the search and start building tenure. The CS coursework that helps most: any database class for SQL, any introductory stats class for the statistics axis, and any project that involved building a dashboard or report. The pivot reads as honest when the resume signals analytics interest and the portfolio includes one real Tableau or Power BI artifact.

How many rounds is a data analyst interview loop?

Entry-level data analyst loops are typically 3-4 rounds: recruiter screen, technical phone screen (SQL plus one stats question plus one Excel question), onsite SQL whiteboard plus business case plus behavioral, and sometimes a final-round chat with the hiring manager. The onsite runs 3-4 hours, meaningfully shorter than a SWE onsite. Mid-market and consulting firms often replace the onsite with a take-home (build a dashboard from a provided dataset, write up three insights) and one follow-up technical screen. The take-home is more common in analytics than in software engineering.

What is a SQL window function and how do I use it in an analyst interview?

A window function computes a value across a set of rows related to the current row, without collapsing the rows the way GROUP BY would. Syntax: function() OVER (PARTITION BY col ORDER BY col). The most-asked window functions in data analyst interviews are ROW_NUMBER (assign sequential numbers within a partition), RANK and DENSE_RANK (rank with and without gaps), LAG and LEAD (access previous or next row in a partition), and SUM or AVG with OVER (running totals and moving averages). Canonical interview prompt: 'find the second-highest salary per department.' Solve with DENSE_RANK() OVER (PARTITION BY department ORDER BY salary DESC) and filter to rank = 2. Memorize this pattern.

What is the difference between a pivot table and a SQL GROUP BY?

A pivot table is an Excel feature that reshapes data from long format (one row per observation) to wide format (one row per group with categories as columns). A SQL GROUP BY collapses rows into aggregated values but keeps the long format. Pivot tables also let you add row totals, column totals, percent-of-total calculations, and drill-downs interactively without writing code. The interview-relevant trap: when a hiring manager says 'pivot the data,' they usually mean the Excel-style reshape. If they say 'aggregate the data by region,' they usually mean GROUP BY. Listen for the verb.

How do I prepare for a data analyst interview in 30 days as a CS new grad?

Week 1: SQL deep dive (window functions, multi-CTE, conditional aggregation, optimization). 4 hours per day on a free practice platform. Week 2: Excel plus pivot tables plus one stats refresh. 3 hours per day Excel hands-on, 1 hour per day on stats interpretation drills. Week 3: Pick Tableau or Power BI (whichever the target job lists), build one portfolio dashboard on a public dataset, study the tool's calculated-field language to a working level. Week 4: business case studies (4-5 worked examples), behavioral STAR prep, and three timed mock interviews. The plan compresses to 20 days for someone with strong CS coursework foundations and stretches to 45 days for someone weaker on SQL.

What's a data analyst business case study question?

A business prompt where the interviewer asks you to reason through a metric question without writing code. Example: 'Daily active users dropped 15% on Monday. Walk me through how you'd investigate.' The expected answer shape: clarify the metric definition, segment the drop by user cohort and platform and geo and feature, form three hypotheses, propose the SQL queries you'd run to test each, and state what would confirm or rule out each hypothesis. Strong answers stay calm under prompted ambiguity. Weak answers panic and start naming tools instead of thinking through the data. The case study round is the highest-variance round in a data analyst loop because it tests business sense, which can't be crammed in a week.

What soft skills do data analyst interviewers grade for?

Three things show up in nearly every behavioral round: communication (can you explain a chart to a non-technical PM without using jargon), curiosity (do you ask clarifying questions before diving into the SQL or do you guess at the requirement), and tenacity (when a query gives you a weird answer do you investigate or do you submit the weird answer). The behavioral round in analytics often probes one specific scenario: a stakeholder asked for a metric, you built the dashboard, they pushed back on the number, what did you do. The expected answer involves checking your work, asking what number they expected, and reconciling the difference. The trap answer: defending your number without checking.

What's the salary range for entry-level data analysts in 2026?

Entry-level data analyst base salaries in the US run $65K-$95K depending on company tier and location, with mid-market employers concentrated at $70K-$85K. Large public tech employers push past $95K with equity. The narrative that analytics pays less than engineering is partially true at the entry level (about $15-20K gap) and narrows meaningfully by year 3-4 when senior analyst and analytics engineer roles approach $130K-$160K. For a CS new grad whose primary goal is ending the search, the analyst entry point is a faster path to tenure than continuing to apply for SWE roles that aren't biting.

What's the most common data analyst interview mistake CS new grads make?

Underestimating SQL. CS new grads who took one database course assume SQL is solved and bomb the window-function round. The interview SQL bar is meaningfully higher than coursework SQL. The second-most-common mistake: skipping the dashboard portfolio. A data analyst pivoter without a Tableau or Power BI artifact on GitHub or a portfolio site reads as someone who hasn't done the work. The third mistake: treating Excel as beneath them. Interviewers ask Excel questions to filter out CS grads who refuse to learn the tools the team actually uses. Excel is the lingua franca of analytics. Learn it.

Data Analyst Interview Questions for 2026: 40+ Questions Across SQL, Excel, Statistics, Tableau + the SWE-to-DA Pivot Guide

Q: What do data analyst interview questions test in 2026?

Six axes: SQL fluency at the window-function-plus-multi-CTE level, Excel beyond VLOOKUP (pivot tables, INDEX/MATCH, XLOOKUP, conditional aggregations), statistics at the A/B-test-interpretation bar (p-values, confidence intervals, sample size reasoning), a data-viz tool deep dive (Tableau, Power BI, or Looker), one or two business case studies under a time limit, and the behavioral round most loops still close on. The LeetCode bar is meaningfully lower than a SWE loop, often skipped entirely below FAANG tier. Loops typically run 3-4 rounds with one 4-5 hour onsite, which is shorter than a pure SWE pipeline.

Data analyst interviews in 2026 test SQL fluency at the window-function level, Excel beyond VLOOKUP, statistics at the A/B-test interpretation bar, and at least one data-viz tool (Tableau or Power BI). The LeetCode bar is near zero. For CS new grads whose SWE pipelines have stalled, the data analyst seat is the most reachable entry point into the analytics stack. This guide covers 40+ questions across six categories plus the SWE-to-DA pivot strategy.

By Sam K., Founder, InterviewChamp.AI · Last updated 2026-06-04

40 min read

What data analyst interview questions test in 2026

A data analyst interview in 2026 tests six things: SQL fluency at the window-function-plus-multi-CTE level, Excel beyond VLOOKUP (pivot tables especially), statistics at the A/B-test-interpretation bar, one data-viz tool deep dive (Tableau, Power BI, or Looker), one or two business case studies under a time limit, and the behavioral round that closes nearly every loop.

The LeetCode bar is near zero. Below FAANG-tier analytics teams, algorithm rounds are typically skipped entirely. That's the opportunity for CS new grads whose SWE pipelines have stalled and the trap for grads who assume "no LeetCode" means "easy interview." The trap: SQL is graded more strictly here than in a SWE loop, and the business case round tests something CS coursework rarely drills, which is talking about data to a non-technical audience under prompted ambiguity.

The format varies by company size. A 3-4 round loop at large public employers, sometimes a single onsite at mid-market companies, often a take-home dashboard plus one technical round at early-stage startups. The six axes hold across nearly every tier. Master those, walk into any data analyst loop, hold your own.

The 2026 hiring environment matters. Analytics seats stayed open while SWE pipelines tightened. Class-of-2025 grads who pivoted from SWE applications to analyst applications in Q4 reported meaningfully shorter time-to-offer than peers who kept grinding SWE pipelines through Q1 2026. The analyst seat isn't a consolation prize. It's a faster path to tenure for the new grad whose primary goal is ending the search.

Here's the avatar this guide is written for. CS new grad, May 2025, mid-tier state school, 3.4 GPA. 487 applications since graduation. 14 interviews. Zero offers. 11 months in. Took the database class and one stats class in college. Knows SQL at the SELECT-FROM-WHERE level from coursework. Can build a pivot table in Excel because the marketing internship four summers ago made him build them. Used Tableau once for a class project. Has been on LeetCode until 3am, refreshing LinkedIn at 10am and seeing classmates post FAANG offers. That's the candidate the analyst pivot was made for.

How data analyst questions differ from data engineer and data scientist questions

CS new grads exploring the data career space often conflate these three roles. The interview formats differ meaningfully. Calibrate before you commit to a prep plan.

A data analyst interview centers on SQL plus business reasoning plus a viz tool. Light statistics, no machine learning, no distributed systems, no Spark or Airflow or dbt. The day-to-day work is building dashboards, answering ad-hoc business questions, and supporting product or operations teams with metrics. The interview rounds reflect that: heavy SQL, light algorithms, business cases as the differentiator.

A data engineer interview goes deep on pipelines, partitioning, distributed systems, and the modern data stack (Spark, Airflow, dbt, cloud warehouses). The work is moving data, not analyzing it. The interview rounds reflect that: heavy SQL plus system design oriented around data flow.

A data scientist interview goes deep on statistics, machine learning, A/B test design, and experimentation. The work is building models and running experiments. The interview rounds reflect that: hypothesis testing, regression, sometimes a coding round in pandas plus scikit-learn, an A/B test design case study.

Three CS-coursework signals map to each:

Database course completed → data analyst path opens. SQL is partly there. Window functions and query optimization need refresh.
Statistics or probability course completed → data scientist path opens. Foundation is there. ML coursework or a portfolio model is the wedge to break in.
Distributed systems or operating systems course completed → data engineer path opens. Systems thinking transfers. Python plus dbt plus Airflow portfolio is the wedge.

For a CS new grad with no stats coursework and no ML projects but solid database fundamentals, the analyst seat is the most reachable entry point. The pivot is shorter (about 30 days of focused prep) and the saturation is meaningfully lower than SWE.

The 40+ data analyst interview questions you should rehearse

What follows is a structured rehearsal set covering the six categories that show up most. Each question carries a brief sample answer outline. Not a canned response, but the bones of a strong answer at the entry-level bar. Adapt the language to your own voice. The structure is the load-bearing part.

SQL interview questions for data analysts (10 questions)

SQL is the highest-ROI prep area. It appears in nearly every loop: the phone screen, the onsite whiteboard, and often baked into the business case round. A window function is a SQL operation that computes a value across a set of rows related to the current row without collapsing them the way GROUP BY does, and it is the single concept that separates a pass from a fail at the entry-level analyst bar in the 2026 hiring cycle. Master these ten.

Q1. Find the second-highest salary in an employees table.

Sample answer using DENSE_RANK:

SELECT salary
FROM (
  SELECT salary, DENSE_RANK() OVER (ORDER BY salary DESC) AS rnk
  FROM employees
) ranked
WHERE rnk = 2;

Alternative one-liner: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);. The DENSE_RANK version generalizes to "find the Nth-highest" cleanly. Memorize the pattern.

Q2. What's the difference between RANK, DENSE_RANK, and ROW_NUMBER?

All three assign sequential numbers within a partition based on ORDER BY. ROW_NUMBER always gives unique sequential numbers (1, 2, 3, 4) regardless of ties. RANK gives the same rank to ties and skips the next value (1, 2, 2, 4). DENSE_RANK gives the same rank to ties without skipping (1, 2, 2, 3). The interview-relevant question: when do you use which? ROW_NUMBER for paging or dedup. RANK when gaps matter (Olympic medals). DENSE_RANK when gaps don't matter (the second-highest pattern).

Q3. Write a query to find customers whose spend increased more than 20% week-over-week.

WITH weekly AS (
  SELECT customer_id, DATE_TRUNC('week', order_date) AS wk, SUM(amount) AS spend
  FROM orders
  GROUP BY 1, 2
),
with_prev AS (
  SELECT customer_id, wk, spend,
         LAG(spend) OVER (PARTITION BY customer_id ORDER BY wk) AS prev_spend
  FROM weekly
)
SELECT customer_id, wk, spend, prev_spend
FROM with_prev
WHERE prev_spend > 0 AND (spend - prev_spend) / prev_spend > 0.20;

LAG is the load-bearing function. The query teaches the multi-CTE pattern interviewers expect: clean the data, compute the lag, filter at the end.

Q4. What's the difference between WHERE and HAVING?

WHERE filters rows before aggregation. HAVING filters groups after aggregation. The classic trap: trying to use WHERE on an aggregate. WHERE COUNT(*) > 5 fails. HAVING COUNT(*) > 5 works. Use WHERE for raw-row filters (date > '2026-01-01') and HAVING for aggregate filters (total spend > $1000).

Q5. Write a query that pivots monthly sales into one row per region with months as columns.

SELECT region,
  SUM(CASE WHEN month = 1 THEN sales ELSE 0 END) AS jan,
  SUM(CASE WHEN month = 2 THEN sales ELSE 0 END) AS feb,
  SUM(CASE WHEN month = 3 THEN sales ELSE 0 END) AS mar
FROM monthly_sales
GROUP BY region;

Conditional aggregation with CASE inside SUM is the pivot pattern in standard SQL. Some dialects have PIVOT syntax (SQL Server, Oracle). PostgreSQL has crosstab in the tablefunc extension. The CASE-SUM version is universal.

Q6. What is a self-join and when do you use one?

A self-join is a JOIN of a table to itself. Useful for hierarchical data (employees joined to their managers from the same employees table) and for comparing rows in the same table (find pairs of customers from the same city). Syntax requires aliasing the table twice:

SELECT e.name AS employee, m.name AS manager
FROM employees e
LEFT JOIN employees m ON e.manager_id = m.id;

Q7. Find the third-highest distinct salary using a single SELECT.

SELECT DISTINCT salary
FROM employees
ORDER BY salary DESC
OFFSET 2 ROWS FETCH NEXT 1 ROW ONLY;

Alternative: SELECT MIN(salary) FROM (SELECT DISTINCT salary FROM employees ORDER BY salary DESC LIMIT 3) t;. The DENSE_RANK version also generalizes. Know all three so you can pick the one the interviewer is asking about.

Q8. What's the difference between INNER JOIN, LEFT JOIN, RIGHT JOIN, and FULL OUTER JOIN?

INNER JOIN returns only rows where the keys match in both tables. LEFT JOIN returns all rows from the left table plus matching rows from the right, with NULLs where no match exists. RIGHT JOIN is the mirror image. FULL OUTER JOIN returns all rows from both tables with NULLs filling gaps. The interview-relevant test: when do you reach for each? LEFT JOIN when you want to keep all customers regardless of whether they ordered. INNER JOIN when you only care about customers who placed an order. RIGHT JOIN is rare in practice. FULL OUTER is the choice when you're auditing two tables for missing data.

Q9. Write a query to find the top 3 customers by spend in each region.

WITH ranked AS (
  SELECT region, customer_id, SUM(amount) AS spend,
         ROW_NUMBER() OVER (PARTITION BY region ORDER BY SUM(amount) DESC) AS rn
  FROM orders
  GROUP BY region, customer_id
)
SELECT region, customer_id, spend
FROM ranked
WHERE rn <= 3;

The "top N per group" pattern is one of the most-asked SQL questions in analyst interviews. PARTITION BY plus ROW_NUMBER plus filter. Memorize the shape.

Q10. How do you read an EXPLAIN plan and identify a slow query?

Three things to look for: sequential scans on large tables (should be index scans), nested-loop joins on large tables (should be hash or merge joins above some row count), and high estimated cost on a single step (the bottleneck). The fix usually involves adding an index on the join or filter column, rewriting a subquery as a JOIN, or breaking a complex query into multi-step CTEs. The interviewer isn't expecting you to optimize the query in 5 minutes. They're checking whether you know what an EXPLAIN plan is and what to look at first.

Honest take from watching new grads in this round: don't memorize EXPLAIN plan jargon you can't apply. If the interviewer asks about EXPLAIN and you've never read one, say so and offer to reason about query cost from first principles. "I haven't read an EXPLAIN plan in production, but I'd start by checking whether each step is reading from an index or scanning the full table." That's the right answer. The wrong answer is fake confidence followed by jargon that falls apart on the follow-up.

Excel interview questions for data analysts (8 questions)

Excel is the lingua franca of analytics. CS new grads who refuse to learn Excel get filtered out fast. A pivot table is the Excel feature that reshapes long-format data into a grouped summary with categories as columns, and it is the most-tested Excel skill in an analyst loop as of 2026. Master these eight.

Q11. How do you build a pivot table from a dataset and what does it summarize?

Select the data range, Insert > PivotTable, drag fields into Rows (categorical grouping), Columns (categorical comparison), Values (numeric aggregation, default SUM), and Filters (slicer). Pivot tables summarize by aggregating numeric values across categorical groups, equivalent to a SQL GROUP BY but with interactive drill-down, percent-of-total, and grand totals built in. The interview trap: explain a pivot table in plain English to a non-technical PM. "It's a summary table that shows totals broken down by category."

Q12. What's the difference between VLOOKUP and XLOOKUP?

VLOOKUP searches the leftmost column of a range and returns a value from a column to the right by index. Limited to left-to-right lookups, error-prone with column index changes, and slow on large datasets. XLOOKUP (Excel 365 and 2021+) searches any column, returns from any direction, defaults to exact match, and supports custom error handling with the if-not-found argument. Use XLOOKUP if it's available. Know VLOOKUP because most legacy spreadsheets and many interview questions still reference it.

Q13. How do you use INDEX/MATCH and why is it sometimes preferred over VLOOKUP?

=INDEX(return_range, MATCH(lookup_value, lookup_range, 0)) returns a value from any column by matching against any column. The 0 in MATCH means exact match. The advantage over VLOOKUP: it works left-to-right or right-to-left, doesn't break when columns are inserted, and runs faster on large datasets. INDEX/MATCH was the pro-analyst pattern before XLOOKUP. It still shows up in interview questions and in legacy spreadsheets.

Q14. What does SUMIFS do and how is it different from SUMIF?

SUMIFS sums values from a range that satisfy multiple conditions across other ranges. SUMIF supports only one condition. Syntax: =SUMIFS(sum_range, criteria_range1, criteria1, criteria_range2, criteria2, ...). Example: total revenue for customers in California where the order date is in 2026. SUMIFS is asked over SUMIF because multi-condition aggregations are the realistic analyst task. COUNTIFS and AVERAGEIFS follow the same pattern.

Q15. How do you clean a dirty dataset in Excel?

A standard cleanup sequence: TRIM (remove leading and trailing whitespace), CLEAN (strip non-printable characters), PROPER or UPPER or LOWER (normalize case), TEXT (format dates and numbers consistently), Find and Replace (mass-fix known typos), Remove Duplicates (button on the Data tab), and Data Validation (lock down acceptable values going forward). For more complex cleanups, Power Query is the senior-level tool. The interview question is usually: "here's a messy CSV, walk me through how you'd clean it." Explain the sequence.

Q16. What's the difference between a relative and an absolute cell reference?

A relative reference (A1) adjusts when you copy the formula. An absolute reference ($A$1) stays fixed. Mixed references ($A1 or A$1) lock one dimension. Use absolute references for constants (a tax rate cell), lookup ranges in VLOOKUP, and any reference you don't want to shift when copying down or across. The shortcut to toggle: F4 on the highlighted reference. Pressing F4 cycles through A1 → $A$1 → A$1 → $A1.

Q17. How do you build a running total in Excel?

The classic approach uses an absolute starting reference: in cell B2, write =SUM($A$2:A2). Drag down. Each row sums from the fixed starting point to the current row. The modern approach uses the SCAN dynamic-array function in Excel 365: =SCAN(0, A2:A100, LAMBDA(acc, val, acc + val)). The interview-relevant version is usually the absolute-reference approach because it works in all versions of Excel.

Q18. What is a dynamic-array formula and which functions return one?

Dynamic-array formulas spill results across multiple cells without requiring you to enter them as array formulas with Ctrl+Shift+Enter. FILTER, UNIQUE, SORT, SEQUENCE, RANDARRAY, and XLOOKUP can all return dynamic arrays. Example: =UNIQUE(A2:A100) returns the distinct values from the range as a single spilled range. The 2026 expectation at the entry level: know that FILTER and UNIQUE exist, know they spill, know they're the modern replacement for many old array formulas. Power Query and PivotTables are still the heavyweight tools for full reshaping.

If I had to pick three Excel features for a CS grad to learn first, it'd be pivot tables, XLOOKUP, and SUMIFS. Spend a day on each. That's the 80/20 of analyst Excel. Anyone who tells you to start with Power Query or VBA macros is wasting your prep budget.

Statistics interview questions for data analysts (7 questions)

The statistics bar at entry level is interpretation, not derivation. You don't need to prove the central limit theorem. A p-value is the probability of seeing your result, or something more extreme, if there were no real effect, and being able to explain that to a non-technical PM in one sentence matters more than any formula you can recite. You need to explain what a confidence interval means to that same PM. Master these seven.

Q19. What's the difference between mean and median, and when do you use which?

The mean is the arithmetic average. The median is the middle value when the data is sorted. Use the mean when the distribution is roughly symmetric. Use the median when the distribution is skewed or has outliers. Classic example: average household income in a region is misleading because a few high-income outliers pull the mean up. The median better represents the typical household. The 2026 interview question pattern: "Salary data has a long right tail. Which measure do you report?" Median. Always.

Q20. What is a p-value and how do you interpret p = 0.03?

A p-value is the probability of observing the data (or something more extreme) if the null hypothesis is true. p = 0.03 means there's a 3% chance of seeing this result by random chance if there's no real effect. The conventional threshold for "statistically significant" is p < 0.05. The interview-relevant nuance: p-values do NOT tell you the probability that the null is true, or the size of the effect, or whether the result is practically meaningful. A statistically significant 0.1% lift in conversion might not be worth shipping. State both significance and effect size.

Q21. What is a 95% confidence interval and how do you explain it?

A 95% confidence interval is a range of values such that, if you repeated the experiment many times, 95% of the resulting intervals would contain the true population parameter. The plain-English explanation for a non-technical audience: "We're 95% confident the true value is in this range, plus or minus this margin." The trap interpretation: "There's a 95% probability the true value is in this specific interval." Technically wrong (the true value is fixed, not random). In practice, most stakeholders use the plain-English version and that's fine.

Q22. What's the difference between a Type I and Type II error?

A Type I error is a false positive: rejecting the null hypothesis when it's true. A Type II error is a false negative: failing to reject the null when it's false. The interview-relevant tradeoff: lowering the significance threshold (alpha) reduces Type I errors but increases Type II. In an A/B test context: Type I means you ship a change that didn't help. Type II means you reject a change that would have helped. Most product teams care more about Type I (don't ship false positives) than Type II.

Q23. How do you design an A/B test for a new feature?

Four steps. (1) Define the metric. Pick one primary metric, two or three secondary metrics, and at least one guardrail metric (a metric that should NOT change). (2) Calculate sample size. Use a power calculation that takes the expected effect size, the significance threshold (usually 0.05), and the desired power (usually 0.80). (3) Randomize. Assign users to control or treatment with a deterministic hash on user_id to avoid cohort drift. (4) Run for at least one full week. Avoid stopping early. Avoid peeking at results and stopping when significant (this inflates Type I error). Calculate the test outcome at the predetermined sample size.

Q24. What's the difference between correlation and causation?

Correlation is a statistical association between two variables. Causation means one variable causes a change in the other. Correlation does not imply causation because of confounding variables (a third variable causing both), reverse causality (the dependent variable affects the independent), and chance. The classic example: ice cream sales correlate with drowning deaths. Both are caused by hot weather. Neither causes the other. The interview-relevant move: when stakeholders ask "does X cause Y?", check whether the data supports causation (controlled experiment, A/B test) or only correlation (observational data).

Q25. What is R-squared and how do you interpret a value of 0.65?

R-squared (the coefficient of determination) measures the proportion of variance in the dependent variable explained by the independent variable(s) in a regression model. R-squared of 0.65 means 65% of the variation in the outcome is explained by the predictors. The remaining 35% is unexplained (noise plus omitted variables). The interview-relevant nuance: high R-squared doesn't mean the model is correct. It means the model fits the training data. Always check on held-out data. Also: in social-science contexts, R-squared of 0.3 might be excellent. In physics, 0.99 might be insufficient. Context matters.

Data visualization interview questions: Tableau and Power BI (8 questions)

Pick the tool the job description lists. Study it to 70% depth, and the other tool to 30%. Most concepts transfer. Master these eight.

Q26. What's a calculated field and how do you write one in Tableau?

A calculated field is a Tableau formula that creates a new measure or dimension from existing fields. Syntax uses Tableau's expression language. Example: [Sales] - [Cost] for profit, or IF [Revenue] > 1000 THEN "High" ELSE "Low" END for a tier. Calculated fields run at query time, so they're slower than pre-computed columns in the source data. Reach for calculated fields when the metric definition needs to live in the dashboard, not in the warehouse.

Q27. What is a level-of-detail (LOD) expression in Tableau and when do you use one?

LOD expressions let you compute aggregations at a different granularity than the view. Three types: FIXED (ignore the view's dimensions and aggregate at the fixed level), INCLUDE (add a dimension to the view's level), and EXCLUDE (remove a dimension from the view's level). Example: { FIXED [Customer ID]: SUM([Sales]) } gives total sales per customer regardless of the dimensions on the view. Use cases: customer-level metrics on a date-level view, percent-of-total calculations, cohort analysis. LOD expressions are the single most-tested concept above the Tableau basics.

Q28. What's the difference between a dimension and a measure in Tableau?

Dimensions are categorical fields used to slice or group the data (product, region, date). Measures are numeric fields you aggregate (sales, count, profit). Tableau automatically classifies fields when you connect a data source, but you can change the classification. Some fields can be both: order_id can be a dimension when grouping orders or a measure when counting them (COUNT(order_id)).

Q29. How do you create a parameter in Tableau and how is it different from a filter?

A parameter is a user-controllable single value (a number, date, or string) that you can reference in calculations, filters, or reference lines. A filter restricts which rows appear in the view. The difference: a filter directly filters data; a parameter is a user input that you wire into a calculation or filter. Example: a parameter "Top N" wired into a Top N filter lets the user choose 5, 10, or 25 from a dropdown.

Q30. What's a Tableau dashboard action and what are the three types?

Dashboard actions are interactive behaviors that respond to user clicks or hovers. Three types: Filter actions (clicking a mark filters another view), Highlight actions (clicking highlights matching marks across views), and URL actions (clicking opens an external URL or another view). Use Filter actions for drill-down dashboards. Use Highlight actions to show relationships across multiple views. Use URL actions to link to a detail page or an external system.

Q31. What is DAX and how is it different from Tableau's expression language?

DAX (Data Analysis Expressions) is Power BI's formula language. Used for calculated columns, measures, and calculated tables. Syntax differs from Tableau but covers similar concepts: filter context, row context, and aggregation. Example DAX measure: Total Sales = SUM(Orders[Sales]). Example DAX calculated column: Margin = Orders[Sales] - Orders[Cost]. Power BI interview questions probe DAX the way Tableau interview questions probe LOD expressions.

Q32. How do you design a sales dashboard with three views?

Standard sales dashboard structure: (1) KPI summary at the top, with total sales, total orders, average order value, gross margin, as big-number tiles. (2) Time series in the middle, a monthly sales trend with a year-over-year comparison line. (3) Categorical breakdown at the bottom, with sales by region, product category, or sales rep, as a bar chart or treemap. Add filters for date range, region, and product. Add a dashboard action so clicking a region on the bar chart filters the trend line. The interview prompt is rarely "build this in 45 minutes." It's "sketch out the design and tell me why each view matters."

Q33. What's the difference between Tableau Public and Tableau Desktop?

Tableau Public is the free version that publishes dashboards to a public Tableau Public profile. Data must be public. Tableau Desktop is the paid product that connects to private data sources and publishes to Tableau Server or Tableau Cloud (paid). For a CS new grad building a portfolio, Tableau Public is the right tool. Build dashboards on public datasets (Kaggle, data.gov, fivethirtyeight) and link your Tableau Public profile from your resume.

Business case study interview questions for data analysts (5 questions)

A business case study is a metric-reasoning prompt where the interviewer asks you to think through a data question out loud without writing code, and it is the highest-variance round in a data analyst loop because it tests business sense, which can't be crammed in a week. Practice the structure on these five.

Q34. Daily active users dropped 15% on Monday. Walk me through how you'd investigate.

Structure: (1) Clarify the metric. Is DAU defined as users with at least one session, or at least one specific event? Did the definition change? (2) Segment the drop. By platform (iOS, Android, web), by geography, by user cohort (new, returning, power), by feature (which features got used less). (3) Form three hypotheses. A bug deployed Friday or over the weekend. A holiday or external event affecting traffic. A measurement issue (tracking pipeline broke). (4) Propose the queries. Show me Monday DAU split by platform versus the previous four Mondays. Check the release log for Friday deploys. Check the tracking pipeline health. (5) State what would confirm each hypothesis. Strong answers stay calm under prompted ambiguity. Weak answers panic.

Q35. The product team wants to launch a new feature. How would you design the experiment?

Structure: (1) Define the primary metric. What outcome does the feature aim to improve? Conversion rate, retention, revenue per user? Pick one primary, two or three secondary, one guardrail (a metric that shouldn't move). (2) Sample size calculation. Use a power analysis: expected effect size, alpha = 0.05, power = 0.80. Most calculators online. (3) Randomization unit. User ID is standard. Hash the ID so the assignment is deterministic. (4) Duration. At least one full week to capture weekday-weekend variance. Calculate at the predetermined sample size, not when significance is hit. (5) Analysis. Primary metric first. Check guardrails. Segment by major user cohort to look for heterogeneous effects.

Q36. A stakeholder says the conversion rate dropped from 5% to 4% last week. What do you check first?

First, ask what they want to do about it. The question isn't "investigate the drop." It's "is this real and what action does it imply." Then check: (1) Is the metric definition the same as last week? (2) Is the denominator the same population (some metric pipelines silently change population)? (3) Is the drop statistically significant given the sample size? A 5% to 4% drop in a sample of 1,000 might be noise. In a sample of 1,000,000 it's a real event. (4) Segment to find the drop's source. Once you've localized it, only then form hypotheses about why. Strong analysts ask before investigating.

Q37. How would you measure the success of a customer support team?

Five candidate metrics: time-to-first-response, time-to-resolution, ticket volume per agent per day, customer satisfaction score (CSAT), and first-contact resolution rate. The interview question isn't "list metrics." It's "pick the right combination." The trap: optimizing for time-to-resolution alone incentivizes agents to close tickets without resolving them. The combination that resists gaming: CSAT (the customer judges) plus first-contact resolution (the customer didn't need to come back) plus volume normalized by complexity (don't compare a senior agent's hard tickets to a junior agent's easy ones).

Q38. Senior leadership asks you to build a dashboard for the executive team. What questions do you ask before building?

Five questions: (1) Who's the audience and what's their reading level, finance-numerate execs or product PMs? (2) What's the cadence, do they look at it daily, weekly, or in monthly reviews? (3) What decisions does the dashboard drive, board-deck context, weekly operating reviews, ad-hoc questions? (4) What's the data lag they can tolerate, real-time or yesterday-fresh? (5) What's the canonical source for each metric, is there an existing definition or are we creating one? Strong analysts gather requirements before building. Weak analysts build, present, and rework. The first set of questions is the deliverable.

Behavioral and soft-skills questions for data analysts (5 questions)

The behavioral round closes nearly every analyst loop. Three things get graded: communication, curiosity, and tenacity. Use the STAR framework for each answer.

Q39. Tell me about a time you had to explain a complex analysis to a non-technical stakeholder.

Structure: (Situation) The context, who the stakeholder was, why the analysis mattered. (Task) What were you trying to communicate? (Action) How did you structure the explanation? Lead with the answer, then the why, then the caveats. Use a chart or a one-number summary, not the methodology. (Result) Did the stakeholder make the decision? Did the analysis change anything? Strong answer signals communication maturity. Weak answer dives into technical detail the interviewer didn't ask for.

Q40. Tell me about a time a stakeholder pushed back on your numbers.

Structure: (S) A real moment when a PM or exec questioned a metric you produced. (T) You needed to defend the number or find the discrepancy. (A) The right move: don't defend reflexively. Ask what number they expected and where they got it. Reconcile the difference. Sometimes you're wrong (a join error, a stale dashboard). Sometimes they're wrong (a different metric definition, a different time window). Sometimes both definitions are valid and the disagreement is about which to use. (R) The reconciled answer, ideally with a documented decision on which definition to use going forward.

Q41. Tell me about a time you noticed an error in your work after presenting it.

Structure: (S) You presented a metric or analysis. (T) You noticed an error after the meeting. (A) The right move: own it immediately. Email the stakeholders with the correction and the corrected analysis. Explain what was wrong and why. Don't bury it or wait for someone else to find it. (R) The trust you preserve by self-correcting beats the small embarrassment of admitting the error. Senior analysts in 2026 universally rank this behavior as the single biggest seniority signal.

Q42. Tell me about a time you had to learn a new tool quickly.

Structure: (S) A project that required a tool you didn't know, like Tableau, Power BI, dbt, or a specific database. (T) You needed to get to working-proficient fast. (A) Steps: free tutorial videos for the basics, a small toy project to internalize the syntax, then the real project with frequent searches for specific issues. The signal: you learned by building, not by reading. Self-taught analysts who can ramp on a new tool in a week beat formally-trained analysts who need a course.

Q43. Why do you want to be a data analyst instead of a software engineer?

For a CS new grad, this is the pivot question. Three honest framings: (1) Coursework signal: "I enjoyed the database class more than the algorithms class. Building dashboards from real data felt more impactful than solving abstract problems." (2) Career-shape signal: "I want to work close to the business side. Analytics gives me access to product decisions in a way pure engineering doesn't." (3) Market signal: "I've been applying for SWE roles since spring 2025 and the saturation is brutal. Analytics is where I can ship work and build tenure faster." Honesty wins. Recruiters in 2026 see this pivot constantly and respect the candidates who can name the why without flinching.

How to prepare for a data analyst interview (5-7 steps)

A focused four-week prep plan for a CS new grad with strong database fundamentals and no formal analytics experience. Adjust if your starting point differs.

Week 1: SQL deep dive. Four hours per day on window functions (RANK, DENSE_RANK, ROW_NUMBER, LAG, LEAD), multi-CTE queries, self-joins, conditional aggregation, and query optimization. Use a free practice platform with real datasets. End the week able to handle the 10 SQL questions above without hesitation.
Week 2: Excel plus pivot tables plus light statistics. Three hours per day on Excel hands-on. Pivot tables, INDEX/MATCH, XLOOKUP, SUMIFS, dynamic-array functions, data cleanup. Build one real cleanup-and-analysis project on a public CSV. One hour per day on statistics interpretation. Drill p-value reading, confidence intervals, correlation versus causation, A/B test design.
Week 3: Pick Tableau or Power BI and build one portfolio dashboard. Choose the tool the target job lists. Complete the official free training. Build one portfolio dashboard on a public dataset with at least three views (KPI summary, time series, categorical breakdown). Publish it on Tableau Public or a GitHub README with screenshots.
Week 4: Business case studies plus behavioral plus mock interviews. First three days on case study practice with five worked examples. Days 4-5 on STAR behavioral prep, writing 6-8 short stories. Days 6-7 on three timed mock interviews. SQL whiteboard, business case, behavioral.
Morning of the interview: warm up with your dashboard portfolio. Pull up your Tableau Public or Power BI artifact. Rehearse the 60-second walkthrough you'd give if asked about your dashboard project. Refresh the top 10 SQL patterns. Ten minutes. Then walk in.

Data analyst interview format by role type

The same analyst title gets interviewed differently depending on industry, company size, and team structure. The breakdown for five common analyst-track openings in 2026:

Role variant	SQL depth	Excel depth	Stats depth	Viz tool	Case study weight	LeetCode	Behavioral
Product Analyst (large tech)	High	Low	Medium	Heavy (Tableau / Looker)	High	Sometimes one round	Medium
Business Intelligence Analyst	High	Medium	Low	Heavy (Power BI / Tableau)	Medium	Almost never	Medium
Marketing Analyst	High	High	Medium	Heavy (one tool)	Medium-High	Never	Medium
Financial Analyst (tech-adjacent)	Medium	Very High	Low	Light	High	Never	High
Operations Analyst	High	High	Low	Medium	High	Almost never	Medium

Two patterns to notice. First, SQL is universal. Every variant tests it at a meaningful depth. Second, Excel weight varies by role. Marketing and Finance variants treat Excel as essential. Product analyst variants treat it as background. If you're targeting Product Analyst at a large tech company, deprioritize Excel polish in favor of SQL depth plus a Tableau or Looker portfolio. If you're targeting Marketing or Finance Analyst at a mid-market company, your Excel skills are 40% of the interview signal.

How to handle data analyst questions you've never seen

You will hit at least one question you haven't seen. A specific Excel function, a stats concept you forgot, a calculated-field syntax in the wrong tool. The candidate who freezes loses the round. The candidate who reasons out loud often passes even when they get the wrong answer.

A four-step pattern for unfamiliar questions:

1. Restate the question. Slow down. Confirm what's being asked. Buys you 10 seconds of thinking time and signals careful reasoning. Example: "So you're asking how I'd compute year-over-year growth in Tableau without precomputing it in SQL. Let me think about that."

2. State what you know and what you don't. Don't pretend. "I know there's a way to do this with a table calculation or an LOD expression. I'm not sure which one is cleaner for this case. Let me reason through both." Calibration beats confidence.

3. Reason from first principles. If you don't remember the answer, derive it. "Year-over-year growth is current-year minus previous-year, divided by previous-year. In SQL that's a self-join or a LAG window function. In Tableau, the LAG-equivalent is a table calculation called LOOKUP." That trail of reasoning is the work the interviewer is grading.

4. Test the hypothesis if you can. "Can I sketch the query on the whiteboard?" or "Can I pull up Tableau and try it?" works in most rounds. If allowed, do it. If not, walk through it verbally with as much specificity as you can.

Tricky analyst questions are rarely about memorization. They're about whether you understand the underlying model: aggregation grain, filter context, the difference between row-level and group-level calculations. Show the reasoning. Show the understanding.

Common data analyst interview mistakes for CS new grads

The most-reported mistakes from new-grad data analyst interviews in the 2025-2026 hiring cycle, in order of frequency:

Treating SQL as solved. CS new grads who took one database course assume SQL is done and bomb the window-function round. The bar at the entry-level analyst interview is meaningfully above coursework SQL. Spend the week. Window functions, multi-CTE, conditional aggregation, self-joins. Drill them.

Refusing to learn Excel. Some CS grads treat Excel as beneath them. "I'll just use Python or SQL." Interview signal: this candidate won't learn the tools the team uses. Lose the round. Learn pivot tables, INDEX/MATCH, XLOOKUP, SUMIFS. Three days of Excel practice pays off for years.

Skipping the portfolio dashboard. A pivoter without a Tableau or Power BI artifact reads as someone who hasn't done the work. The portfolio is the credibility anchor. Eight to twelve hours of focused work produces a single dashboard that makes the pivot story believable.

Panicking on case studies. The business case round tests calm under ambiguity. CS new grads who treat it as a technical problem fail. The structure to internalize: clarify, segment, hypothesize, propose the queries, state confirmation criteria. Practice five worked examples until the structure is automatic.

Quoting p-values without explaining them. Saying "the p-value was 0.03" without explaining what that means to the non-technical stakeholder is a senior-level red flag. Always state significance plus effect size plus practical interpretation. The interviewer is checking communication, not just stats fluency.

Underestimating the behavioral round. Analysts spend half their time talking to stakeholders. The behavioral round probes that skill. New grads who breeze past STAR prep arrive without specific examples and ramble. Write 6-8 short stories in advance.

One thing I'd add from watching new grads pivot in 2026: the SWE-to-DA pivot works when the candidate stops trying to prove they're a programmer who happens to be in an analyst interview. The analyst seat is a different craft. Lean into it. The CS coursework helps but doesn't substitute. Build the portfolio. Drill the SQL. Talk like an analyst.

How to switch from software engineer to data analyst in 2026

The SWE-to-DA pivot is the most reachable path off the stalled SWE pipeline for a CS new grad with database fundamentals. The pivot story works when three things line up: a resume that signals analytics interest, a portfolio that backs the signal, and a prep plan that closes the technical gap. Here's how each piece works.

Resume positioning. Don't rewrite SWE internships to claim analyst experience you didn't have. Do reframe the language to highlight the analytics-adjacent work that was there. Original line: "Built backend services in Python and Node.js." Reframed line: "Built Python data-processing services for [X] API integration, including SQL pipelines that aggregated 50K events daily into reporting tables for the ops team." Both lines describe the same work. The second signals analytics vocabulary to the resume screener.

Add a portfolio section. Two artifacts, documented at a real-code level:

A Tableau Public dashboard or Power BI report on a public dataset. Three views minimum (KPI summary, time series, categorical breakdown). Six to ten hours of focused work. Publish it. Link from your resume.
A SQL portfolio repo on GitHub: 10-15 well-commented queries against a public dataset, each solving a real business question. Window functions, multi-CTE, query optimization examples. Document the dataset and the questions in a README. Eight to ten hours.

The portfolio is the credibility anchor. Recruiters who are skeptical of the pivot become believers when they see a real dashboard and a real SQL repo.

Drop SWE-only signals from the top of the resume. If your resume opens with "Implemented Dijkstra's algorithm in C++" and "Built a React frontend for a hackathon," cut those from the top. Push them to a "Other Projects" section below the analytics portfolio. The top of the resume signals what role you want next.

Update the LinkedIn headline. Current: "Software Engineer | Python, Java, JavaScript." Updated: "Data Analyst | SQL, Tableau, Excel, Python | CS New Grad." The headline is what recruiters search for. If "data analyst" isn't in there, you don't surface in their searches.

Address the salary delta honestly with yourself. Entry-level analyst salaries run about $15-20K lower than entry-level SWE salaries in the US. The math: $75K analyst beats $0 unemployed. Five years from now the analyst path opens senior-analyst, analytics-engineer, and data-scientist seats that pay competitively. The pivot trades a year of "almost-SWE" applications for an actual job that builds tenure.

The hard delta: business sense. The technical gap from CS-new-grad to entry-level analyst is small. SQL plus Excel plus one viz tool plus light stats. Four weeks of focused prep closes it. The hard delta is the business case round. New grads who never had to interpret a metric for a PM struggle with case study questions. The fix: read five worked case studies and practice the structure out loud. Mock interview tools that drill case studies in real time accelerate this gap closure. A $3 trial buys enough timed case reps to walk into the real loop able to clarify, segment, and hypothesize without freezing, which is the difference between another rejection and the offer that ends the search.

One pattern from watching candidates run this pivot: the SWE candidate at month 11 with 487 applications and 14 interviews and zero offers usually thinks the analyst path is a step down. It isn't. The base is $15-20K lower at entry. The work is meaningfully closer to product and business decisions. The senior analyst track at year three pays comparably to senior SWE. The candidate who pivots in 2026 doesn't lose a career. They end the search.

The pivot reads as honest when:

The portfolio dashboard exists and the candidate can walk through it for 60 seconds.
The SQL repo exists and the candidate can explain three of the queries.
The candidate doesn't claim experience they don't have.
The "why analyst not engineer" answer is coherent and specific.

The pivot reads as dishonest when:

The resume claims analyst experience that doesn't appear in the work history.
The candidate can't talk about the portfolio in the technical screen.
The "why analyst" answer is "SWE wasn't hitting and I needed something."

Be honest. Build the portfolio. Drill the SQL. Walk in.

Data analyst vs data scientist vs data engineer: which seat fits a CS new grad?

CS new grads exploring the data career space need to pick one path before committing to a prep plan. Each role is reachable from a CS-new-grad background; the angles differ.

Axis	Data Analyst	Data Scientist	Data Engineer
Primary skill	SQL + business reasoning + viz tool	Statistics + ML + experimentation	Pipeline design + tooling
SQL bar	High (window functions, optimization)	Medium-High (analytical SQL)	High (window functions, optimization)
Python bar	Low-Medium (pandas, basic ETL)	High (numpy, pandas, scikit-learn, sometimes PyTorch)	Medium (data-processing patterns)
Stats / math bar	Low-Medium (interpretation)	High (hypothesis testing, regression, ML)	Low (basic probability)
ML bar	Almost none	Core competency	Almost none
LeetCode bar	Almost none	One round at FAANG	Low (one round at FAANG)
Viz tool	Heavy (Tableau, Power BI, Looker)	Light (matplotlib, plotly)	Almost none
Tooling	Excel, Tableau, SQL, sometimes dbt	Jupyter, scikit-learn, ML libraries	Spark, Airflow, dbt, warehouses
Day-to-day	Dashboards, ad-hoc reports, business questions	Models, experiments, A/B tests	Pipelines, infrastructure, data flow
Entry-level US salary (2026)	$65K-$95K	$95K-$140K	$85K-$130K
Time-to-offer for CS pivot	30-45 days prep	60-90 days prep	30-45 days prep
Saturation (vs SWE)	Meaningfully lower	Lower	Meaningfully lower
CS coursework signal needed	Database class	Stats + ML class	Distributed systems or OS class

The honest pivot recommendations for a CS new grad in 2026:

Data analyst if your strongest CS coursework was the database class, you've never enjoyed statistics, and your primary goal is ending the search fast.
Data scientist if you took stats and ML courses and enjoyed them, you have at least one ML portfolio project, and you can afford 60-90 days of focused prep.
Data engineer if you took distributed systems or OS, enjoy systems thinking, and want to work close to the infrastructure side.

Most CS grads with no stats coursework pivot best into data analyst or data engineer. The analyst path is faster (30-45 days). The engineer path pays slightly more long-term. Pick by which prep stack matches your existing strengths.

The full data engineer pivot is covered in the data engineer interview questions guide for CS new grads. The systems-thinking content there transfers if you decide engineering fits better than analytics.

How to ace the SQL whiteboard round (the highest-stakes round)

The SQL whiteboard is the single highest-stakes round in a data analyst loop. The format: 30-45 minutes, the interviewer writes the schema on the board or shares it on screen, you write queries by hand or in a shared editor without execution.

Three things win this round:

1. Narrate as you write. Silent SQL is a known anti-pattern. Talk through your reasoning out loud while writing. "I need to join orders to customers to get the customer region. Then I'll filter to last week. Then I'll aggregate by region. Then I'll order by total spend descending." The narration buys you partial credit if the syntax slips.

2. Start with the simplest query that gets close. Don't write the optimal multi-CTE solution on the first pass. Write a brute-force version that returns roughly the right shape. Then ask the interviewer if they want to see optimizations. They'll usually say yes. The partial-credit version protects you if time runs out.

3. State your assumptions about the schema and data. "I'm assuming the orders table has one row per order line. If it's one row per order with a JSON array of items, the query changes." Assumptions are seniority signals. New grads who don't verbalize them get marked down for sloppiness.

The trap that fails most candidates: writing window functions wrong. The OVER clause syntax. The PARTITION BY. The ORDER BY inside the OVER. Drill these in the week before the interview until you can write RANK() OVER (PARTITION BY department ORDER BY salary DESC) without looking it up.

The gap between knowing the pattern and saying it cleanly while someone watches is the whole game. If you want to feel that pressure before it counts, run a timed SQL-whiteboard mock and rehearse narrating the query out loud until the words come out in your own voice on the first take.

Data analyst interview cheat sheet

A one-page reference of the top idioms across SQL, Excel, statistics, and Tableau. The morning-of warmup.

#	Idiom	Domain	Use case
1	`RANK() OVER (PARTITION BY x ORDER BY y)`	SQL	Rank within group
2	`LAG(col) OVER (PARTITION BY x ORDER BY y)`	SQL	Compare to previous row
3	`SUM(CASE WHEN c THEN val ELSE 0 END)`	SQL	Conditional aggregation
4	`WITH cte1 AS (...), cte2 AS (...) SELECT ...`	SQL	Multi-CTE pipeline
5	`ROW_NUMBER() OVER (PARTITION BY g ORDER BY ts DESC)`	SQL	Top-1 per group
6	Pivot table > Rows / Columns / Values / Filters	Excel	Cross-tab summary
7	`=XLOOKUP(val, lookup_range, return_range)`	Excel	Modern lookup
8	`=SUMIFS(sum_range, c1_range, c1, c2_range, c2)`	Excel	Multi-condition sum
9	`=UNIQUE(range)`	Excel	Distinct values
10	`=FILTER(range, condition)`	Excel	Conditional filter
11	`p-value < 0.05` ≈ statistically significant	Stats	Hypothesis test threshold
12	95% CI = "true value likely in this range"	Stats	Plain-English interpretation
13	Sample size depends on effect size + alpha + power	Stats	A/B test design
14	Correlation ≠ causation	Stats	Always check for confounders
15	`{ FIXED [Customer]: SUM([Sales]) }`	Tableau	Fixed-grain aggregation
16	Calculated field: `IF [x] > 100 THEN "High" ELSE "Low" END`	Tableau	Conditional tier
17	Dashboard action: Filter / Highlight / URL	Tableau	Interactivity
18	Parameter > Dropdown > Wire into calc	Tableau	User input
19	Total Sales = SUM(Orders[Sales])	Power BI DAX	Measure syntax
20	KPI > Trend > Breakdown	Dashboard design	Three-view default

Memorize the SQL and Excel rows first. The Tableau and Power BI rows depend on which tool the target job uses.

Key terms

The analytics vocabulary surface is wider than SWE. Get these wrong and the interviewer flags you. Get them right and you pass the credibility check.

Window function: A SQL function that computes a value across a set of rows related to the current row without collapsing the rows the way GROUP BY would. Examples: RANK, ROW_NUMBER, LAG, LEAD, SUM with OVER. Used for ranking within partitions, comparing to previous rows, and running totals. The single most-tested SQL concept in 2026 analyst interviews.
Pivot table: An Excel feature that reshapes data from long format (one row per observation) to wide format (one row per group with categories as columns). Equivalent to a SQL GROUP BY with the categorical dimension promoted to columns. Pivot tables also support row totals, percent-of-total, and drill-downs interactively without writing code. The most-tested Excel skill in analyst interviews.
Calculated field: A formula in Tableau (or DAX measure in Power BI) that creates a new measure or dimension from existing fields at query time. Examples: profit = sales - cost, or a tier classification with IF/THEN logic. Runs slower than pre-computed columns in the source data because it executes per query. Tableau interview questions probe the syntax. Power BI DAX questions probe the same concept with different syntax.
Level-of-detail (LOD) expression: A Tableau construct that aggregates at a different granularity than the view. Three flavors: FIXED (ignore the view's dimensions), INCLUDE (add a dimension), EXCLUDE (remove a dimension). Used for customer-level metrics on a date-level view, percent-of-total calculations, and cohort analysis. The most-tested Tableau concept above the basics.
A/B test: An experiment where users are randomly assigned to a control group and a treatment group to measure the causal effect of a change. The interview question pattern: how would you design an A/B test for X feature? Expected answer covers metric selection, sample size calculation, randomization unit, duration, and analysis plan. The most-asked stats concept in product analyst interviews.
p-value and confidence interval: A p-value is the probability of observing the result (or more extreme) if the null hypothesis is true. p < 0.05 is the conventional threshold for "statistically significant." A 95% confidence interval is a range that would contain the true population parameter 95% of the time across repeated experiments. Interviewers test interpretation, not derivation.
Correlation vs causation: Correlation is a statistical association between two variables. Causation means one variable causes a change in the other. Correlation doesn't imply causation because of confounding variables, reverse causality, and chance. Established only through controlled experiments (A/B tests) or careful observational analysis with strong identifying assumptions. The most-confused stats concept in analyst interviews.
KPI (Key Performance Indicator): A primary metric a team tracks to measure progress against goals. Strong KPIs are specific, measurable, time-bound, and actionable. Example: weekly active users for a consumer product, gross merchandise volume for a marketplace, monthly recurring revenue for a SaaS business. The first view of nearly every executive dashboard is the KPI summary.
Dimension vs measure: In Tableau and Power BI, a dimension is a categorical field used to slice or group (product, region, date). A measure is a numeric field you aggregate (sales, count, profit). Tools auto-classify fields when you connect a data source. Some fields can be both depending on usage. The first question to ask when looking at a new dataset: which fields are dimensions and which are measures?
ETL and ELT: ETL (Extract, Transform, Load) extracts data from source systems, transforms it in a separate staging engine, then loads it into the warehouse. ELT (Extract, Load, Transform) loads the raw data into the warehouse first and transforms in place. Modern data stacks are predominantly ELT. Analysts mostly consume the output of ETL/ELT pipelines, but interview questions probe the vocabulary.

The vocabulary surface is wider than a SWE interview because the tooling is wider. Knowing the words above gets you through the credibility check. Knowing what's behind them (when each pattern applies, what the tradeoffs are) gets you through the technical depth check.

Related guides

The data analyst interview is one of several interview formats a CS new grad may face in the 2026 cycle. The following guides close adjacent gaps:

Data engineer interview questions for CS new grads: the engineer pivot from SWE. Heavier on Spark, Airflow, and distributed systems than the analyst path.
System design basics for new grads: the SWE-flavored framework. Analysts rarely face SWE-style system design, but the underlying systems thinking helps in the data flow case studies.
Technical phone screen tactics: the phone-screen format is similar across SWE, DA, and DE. The content differs (more SQL and stats, less LeetCode).
Mock interview practice methodology: the four-mode approach (solo / peer / paid / AI) applies as much to DA prep as to SWE prep.
CS new-grad interview loop: the end-to-end onsite map. Analyst loops are slightly shorter than SWE loops and skip the LeetCode rounds.
STAR behavioral frameworks: the behavioral round is identical across DA, DE, and SWE. The framework matters.
Python interview questions for 2026: pandas plus light Python shows up in some analyst loops, especially Product Analyst at large tech.

Pick the gap, jump to the matching cornerstone, close the gap, return to analyst prep. That is the loop.

The data analyst interview in 2026 is one of the most CS-new-grad-friendly entry points into the broader tech-hiring market. Lower LeetCode bar than a SWE loop, broader vocabulary surface, more entry-level openings per applicant, faster time-to-offer. The pivot is real and honest if you study the material. Thirty days of focused prep on the SQL plus Excel plus stats plus Tableau stack closes the gap from CS-new-grad-SWE to interview-ready entry-level analyst.

InterviewChamp.AI runs realistic data analyst mocks on every interview surface: the SQL whiteboard, the Excel-and-pivot screen, the business case round, the behavioral. One install, every surface. Start a practice session, narrate as you build the query or the dashboard, get scored on what your interviewer is scoring, and walk into Monday's phone screen ready.

About the author: Sam K. is the founder of InterviewChamp.AI, building AI interview prep for the new-grad CS market and writing about the modern interview gauntlet from the inside.

Related guides

Interview Process

DevOps Interview Questions for 2026: 45+ Questions Across CI/CD, IaC, Observability, Kubernetes, Deployments, SRE, and Incident Response

DevOps interview questions in 2026 cluster into six areas: CI/CD pipelines, infrastructure-as-code, monitoring and observability, containers and orchestration, deployment strategies (blue-green and canary), and the SRE plus incident-response thinking that ties them together. The new-grad gap is reasoning about production trade-offs you've never owned on call. This guide gives 45+ questions across those areas, the culture questions interviewers slip in, and the honest framing that works when you've never carried a pager.

Sam K. · 2026-06-04

Interview Process

Docker Interview Questions for 2026: 40+ Q's Across Images vs Containers, the Layer Cache, Dockerfiles, Volumes, Networking, Compose, and the Orchestration Intro New Grads Get Wrong

Docker interview questions in 2026 split between definition recall (what's an image vs a container, what's a layer) and the build-and-debug scenarios that separate the candidate who ran `docker run` once from the one who actually shipped a containerized app. Expect questions on the layer cache, Dockerfile authoring, volumes, networking, docker-compose, multi-stage builds, containers vs VMs, and a closing intro to orchestration. This guide gives 40+ questions with answer outlines and a three-week prep plan that doesn't ask you to fake production experience.

Sam K. · 2026-06-04

Interview Process

JavaScript Interview Questions for 2026: 45+ Questions on Closures, the Event Loop, Promises, this-Binding, and the Live-Coding Tasks Interviewers Actually Ask

JavaScript interview questions in 2026 cluster into three buckets: language internals (closures, the event loop, prototypal inheritance, this-binding, hoisting), async reasoning (promises, async/await, microtasks), and live coding (debounce, throttle, deep clone, array transforms). The new-grad trap is reciting MDN definitions you can't apply when the follow-up asks why your code logs that value. This guide gives 45+ questions with answer outlines, the live-coding tasks you'll actually face, and how to sound like you've shipped JavaScript instead of memorized it.

Sam K. · 2026-06-04

What do data analyst interview questions test in 2026?: Six axes: SQL fluency at the window-function-plus-multi-CTE level, Excel beyond VLOOKUP (pivot tables, INDEX/MATCH, XLOOKUP, conditional aggregations), statistics at the A/B-test-interpretation bar (p-values, confidence intervals, sample size reasoning), a data-viz tool deep dive (Tableau, Power BI, or Looker), one or two business case studies under a time limit, and the behavioral round most loops still close on. The LeetCode bar is meaningfully lower than a SWE loop, often skipped entirely below FAANG tier. Loops typically run 3-4 rounds with one 4-5 hour onsite, which is shorter than a pure SWE pipeline.
What SQL questions do data analysts get asked?: Window functions (RANK, DENSE_RANK, ROW_NUMBER, LAG, LEAD), multi-CTE queries, self-joins, conditional aggregation with CASE inside SUM or AVG, second-highest-salary patterns, cohort retention queries, and query optimization at a read-the-plan level. The bar is roughly the same as a data engineer at the entry level for pure SQL writing, but data analysts get more business-flavored prompts (find customers whose spend dropped 20% week-over-week) versus the data-flow prompts a data engineer would face. Window functions are universal in 2026. If you can't write a query that ranks within a partition, that's an automatic fail signal.
What Excel skills should a data analyst know for an interview?: Pivot tables (the single most-tested Excel skill), INDEX/MATCH and XLOOKUP, VLOOKUP (still asked even though XLOOKUP supersedes it), SUMIFS and COUNTIFS, conditional formatting, basic data-cleaning functions (TRIM, CLEAN, TEXT, LEFT/RIGHT/MID), and at least one dynamic-array function (FILTER or UNIQUE). The 2026 Excel question pattern at entry level: 'here's a messy CSV, clean it, build a pivot, surface three insights.' If you can do that in 20 minutes without Googling, you pass the Excel axis. Power Query is a senior-level expectation, not entry-level.
What statistics questions do data analysts get asked at the entry level?: Five categories: descriptive statistics (mean vs median when to use which, standard deviation interpretation, outlier detection), probability fundamentals (independence, conditional probability, Bayes at a light level), hypothesis testing (null vs alternative hypothesis, p-value interpretation, type I vs type II errors), A/B test design (sample size, statistical significance, what a 95% confidence interval actually means), and regression intuition (correlation vs causation, R-squared interpretation, linear vs logistic in plain English). The bar is interpretation, not derivation. You don't need to derive the t-distribution. You need to explain what a p-value of 0.03 means to a non-technical PM.
What Tableau interview questions should I prepare for?: Calculated fields (the syntax differs from SQL), level-of-detail (LOD) expressions (FIXED, INCLUDE, EXCLUDE: the single most-tested Tableau concept above the basics), dashboard actions (filter, highlight, URL), parameters versus filters, the difference between dimensions and measures, table calculations (Tableau's running totals and percentage-of-total), and one real-world dashboard design question (build a sales dashboard with three views). Power BI questions cover similar terrain with DAX as the calculated-field language instead of Tableau's expression language. If the job description names a specific tool, study that one to 70% depth and the other to 30%.
What's the difference between data analyst, data scientist, and data engineer interviews?: Data analyst interviews focus on SQL plus business reasoning plus a viz tool. Light statistics, no ML, no infrastructure. Data scientist interviews go deep on statistics, machine learning, A/B test design, and experiment analysis. Data engineer interviews focus on pipelines, distributed systems, Spark, Airflow, dbt, and infrastructure tradeoffs. The analyst seat is the most reachable for a CS new grad without statistics or ML coursework. The data scientist seat needs at least one stats or probability class plus an ML project on the resume. The data engineer seat needs Python plus SQL plus a portfolio project on the modern data stack. Salary differs too: analysts typically $65K-$95K entry level, scientists $95K-$140K, engineers $85K-$130K in the US.
Should CS new grads pivot from SWE to data analyst in 2026?: For most CS new grads applying since spring 2025 with no SWE offers, the data analyst seat is the most reachable pivot. Lower LeetCode bar (often zero), fewer applicants per opening, and faster time-to-offer per Class-of-2025 cohort reports on r/cscareerquestions through Q1 2026. The honest tradeoff: analyst salaries run about $20K lower than SWE at entry level in the US. The pivot is worth it if your goal is to end the search and start building tenure. The CS coursework that helps most: any database class for SQL, any introductory stats class for the statistics axis, and any project that involved building a dashboard or report. The pivot reads as honest when the resume signals analytics interest and the portfolio includes one real Tableau or Power BI artifact.
How many rounds is a data analyst interview loop?: Entry-level data analyst loops are typically 3-4 rounds: recruiter screen, technical phone screen (SQL plus one stats question plus one Excel question), onsite SQL whiteboard plus business case plus behavioral, and sometimes a final-round chat with the hiring manager. The onsite runs 3-4 hours, meaningfully shorter than a SWE onsite. Mid-market and consulting firms often replace the onsite with a take-home (build a dashboard from a provided dataset, write up three insights) and one follow-up technical screen. The take-home is more common in analytics than in software engineering.
What is a SQL window function and how do I use it in an analyst interview?: A window function computes a value across a set of rows related to the current row, without collapsing the rows the way GROUP BY would. Syntax: function() OVER (PARTITION BY col ORDER BY col). The most-asked window functions in data analyst interviews are ROW_NUMBER (assign sequential numbers within a partition), RANK and DENSE_RANK (rank with and without gaps), LAG and LEAD (access previous or next row in a partition), and SUM or AVG with OVER (running totals and moving averages). Canonical interview prompt: 'find the second-highest salary per department.' Solve with DENSE_RANK() OVER (PARTITION BY department ORDER BY salary DESC) and filter to rank = 2. Memorize this pattern.
What is the difference between a pivot table and a SQL GROUP BY?: A pivot table is an Excel feature that reshapes data from long format (one row per observation) to wide format (one row per group with categories as columns). A SQL GROUP BY collapses rows into aggregated values but keeps the long format. Pivot tables also let you add row totals, column totals, percent-of-total calculations, and drill-downs interactively without writing code. The interview-relevant trap: when a hiring manager says 'pivot the data,' they usually mean the Excel-style reshape. If they say 'aggregate the data by region,' they usually mean GROUP BY. Listen for the verb.
How do I prepare for a data analyst interview in 30 days as a CS new grad?: Week 1: SQL deep dive (window functions, multi-CTE, conditional aggregation, optimization). 4 hours per day on a free practice platform. Week 2: Excel plus pivot tables plus one stats refresh. 3 hours per day Excel hands-on, 1 hour per day on stats interpretation drills. Week 3: Pick Tableau or Power BI (whichever the target job lists), build one portfolio dashboard on a public dataset, study the tool's calculated-field language to a working level. Week 4: business case studies (4-5 worked examples), behavioral STAR prep, and three timed mock interviews. The plan compresses to 20 days for someone with strong CS coursework foundations and stretches to 45 days for someone weaker on SQL.
What's a data analyst business case study question?: A business prompt where the interviewer asks you to reason through a metric question without writing code. Example: 'Daily active users dropped 15% on Monday. Walk me through how you'd investigate.' The expected answer shape: clarify the metric definition, segment the drop by user cohort and platform and geo and feature, form three hypotheses, propose the SQL queries you'd run to test each, and state what would confirm or rule out each hypothesis. Strong answers stay calm under prompted ambiguity. Weak answers panic and start naming tools instead of thinking through the data. The case study round is the highest-variance round in a data analyst loop because it tests business sense, which can't be crammed in a week.
What soft skills do data analyst interviewers grade for?: Three things show up in nearly every behavioral round: communication (can you explain a chart to a non-technical PM without using jargon), curiosity (do you ask clarifying questions before diving into the SQL or do you guess at the requirement), and tenacity (when a query gives you a weird answer do you investigate or do you submit the weird answer). The behavioral round in analytics often probes one specific scenario: a stakeholder asked for a metric, you built the dashboard, they pushed back on the number, what did you do. The expected answer involves checking your work, asking what number they expected, and reconciling the difference. The trap answer: defending your number without checking.
What's the salary range for entry-level data analysts in 2026?: Entry-level data analyst base salaries in the US run $65K-$95K depending on company tier and location, with mid-market employers concentrated at $70K-$85K. Large public tech employers push past $95K with equity. The narrative that analytics pays less than engineering is partially true at the entry level (about $15-20K gap) and narrows meaningfully by year 3-4 when senior analyst and analytics engineer roles approach $130K-$160K. For a CS new grad whose primary goal is ending the search, the analyst entry point is a faster path to tenure than continuing to apply for SWE roles that aren't biting.
What's the most common data analyst interview mistake CS new grads make?: Underestimating SQL. CS new grads who took one database course assume SQL is solved and bomb the window-function round. The interview SQL bar is meaningfully higher than coursework SQL. The second-most-common mistake: skipping the dashboard portfolio. A data analyst pivoter without a Tableau or Power BI artifact on GitHub or a portfolio site reads as someone who hasn't done the work. The third mistake: treating Excel as beneath them. Interviewers ask Excel questions to filter out CS grads who refuse to learn the tools the team actually uses. Excel is the lingua franca of analytics. Learn it.

What data analyst interview questions test in 2026

How data analyst questions differ from data engineer and data scientist questions

The 40+ data analyst interview questions you should rehearse

SQL interview questions for data analysts (10 questions)

Excel interview questions for data analysts (8 questions)

Statistics interview questions for data analysts (7 questions)

Data visualization interview questions: Tableau and Power BI (8 questions)

Business case study interview questions for data analysts (5 questions)

Behavioral and soft-skills questions for data analysts (5 questions)

How to prepare for a data analyst interview (5-7 steps)

Data analyst interview format by role type

How to handle data analyst questions you've never seen

Common data analyst interview mistakes for CS new grads

How to switch from software engineer to data analyst in 2026

Data analyst vs data scientist vs data engineer: which seat fits a CS new grad?

How to ace the SQL whiteboard round (the highest-stakes round)

Data analyst interview cheat sheet

Key terms

Related guides

Related guides

DevOps Interview Questions for 2026: 45+ Questions Across CI/CD, IaC, Observability, Kubernetes, Deployments, SRE, and Incident Response

Docker Interview Questions for 2026: 40+ Q's Across Images vs Containers, the Layer Cache, Dockerfiles, Volumes, Networking, Compose, and the Orchestration Intro New Grads Get Wrong

JavaScript Interview Questions for 2026: 45+ Questions on Closures, the Event Loop, Promises, this-Binding, and the Live-Coding Tasks Interviewers Actually Ask

Frequently asked questions