In today’s data-driven landscape, mastering advanced SQL techniques isn’t just valuable for day-to-day analysisβ€”it’s essential for showcasing your expertise during live-coding interviews where you’ll need to efficiently manipulate complex datasets under pressure.

1️⃣ Running Totals & Cumulative Sums

βœ… Key Function: SUM() OVER (ORDER BY column)
βœ… Use Case: Running totals of revenue, sales, or cumulative counts.

Example: Compute cumulative revenue over time.

SELECT order_date, total_amount, 
       SUM(total_amount) OVER (ORDER BY order_date) AS running_total 
FROM orders;

2️⃣ Moving Averages (Rolling Window Aggregates)

βœ… Key Function: AVG() OVER (ORDER BY column ROWS BETWEEN N PRECEDING AND CURRENT ROW)

Example: Compute 7-day rolling average of revenue.

SELECT order_date, total_amount, 
       AVG(total_amount) OVER (ORDER BY order_date ROWS BETWEEN 6 PRECEDING AND CURRENT ROW) AS moving_avg 
FROM orders;

3️⃣ Handling NULLs

βœ… Key Functions:

  • COALESCE(column, default_value) β†’ Replace NULL with a default value
  • NULLIF(value1, value2) β†’ Returns NULL if both values are equal
  • IS NULL / IS NOT NULL β†’ Filter NULL values

Example: Replace missing values with 0.

SELECT customer_id, COALESCE(total_amount, 0) AS total_amount 
FROM orders;

Example: Handle division by zero using NULLIF.

SELECT revenue / NULLIF(orders, 0) 
FROM sales;

4️⃣ Date & Time Manipulation

βœ… Key Functions:

  • DATE_TRUNC() β†’ Truncate date to year, month, day, etc.
  • EXTRACT(YEAR FROM date) β†’ Get year, month, or day from a date
  • DATEADD(interval, value, date) β†’ Add/subtract days/months
  • DATEDIFF(end_date, start_date) β†’ Difference between dates

Example: Find total sales per month.

SELECT DATE_TRUNC('month', order_date) AS month, SUM(total_amount) 
FROM orders 
GROUP BY month;

Example: Find orders placed in the last 7 days.

SELECT * FROM orders 
WHERE order_date >= CURRENT_DATE - INTERVAL '7 days';

5️⃣ CTEs (Common Table Expressions) & Subqueries

βœ… CTEs (WITH cte AS (…)) β†’ Make queries more readable
βœ… Subqueries β†’ Nested queries within SELECT, FROM, or WHERE

Example: Compute top customers by revenue using a CTE.

WITH customer_revenue AS (
  SELECT customer_id, SUM(total_amount) AS total_revenue 
  FROM orders 
  GROUP BY customer_id
)
SELECT * FROM customer_revenue 
ORDER BY total_revenue DESC 
LIMIT 10;

Example: Find customers who spent more than the average order amount using a subquery.

SELECT customer_id, total_amount 
FROM orders 
WHERE total_amount > (SELECT AVG(total_amount) FROM orders);

6️⃣ Recursive CTEs (Hierarchical Data)

βœ… Key Function: Recursive WITH
βœ… Use Case: Hierarchical relationships (e.g., org charts, folder structures).

Example: Find all employees reporting to a specific manager.

WITH RECURSIVE employee_hierarchy AS (
  SELECT employee_id, manager_id, 1 AS depth 
  FROM employees 
  WHERE manager_id IS NULL -- Start with the CEO
  
  UNION ALL
  
  SELECT e.employee_id, e.manager_id, h.depth + 1 
  FROM employees e 
  JOIN employee_hierarchy h ON e.manager_id = h.employee_id
)
SELECT * FROM employee_hierarchy;

7️⃣ Joins (Inner, Left, Right, Full, Self Joins, Cross Joins)

βœ… Use Case: Combine data from multiple tables efficiently.

Example: Find customer orders and their details.

SELECT customers.name, orders.order_date, orders.total_amount 
FROM customers 
LEFT JOIN orders ON customers.customer_id = orders.customer_id;

8️⃣ EXISTS vs. IN vs. JOIN (Performance Considerations)

βœ… Key Functions:

  • EXISTS β†’ Stops at first match (efficient for large datasets)
  • IN β†’ Works well for small lists but slower for large datasets
  • JOIN β†’ Preferred when retrieving related columns

Example: Find customers who have placed at least one order.

SELECT name 
FROM customers 
WHERE EXISTS (
  SELECT 1 
  FROM orders 
  WHERE orders.customer_id = customers.customer_id
);

9️⃣ GROUP BY vs. DISTINCT vs. Window Functions

FeatureGROUP BYDISTINCTWindow Functions
Use CaseAggregatesRemoves duplicatesRanking, cumulative sums
PerformanceMediumFastCan be slow with large data

Example: Count unique customers per region.

SELECT region, COUNT(DISTINCT customer_id) 
FROM customers 
GROUP BY region;

πŸ”Ÿ Pivoting & Unpivoting Data (CASE WHEN & PIVOT)

βœ… Key Functions:

  • CASE WHEN for manual pivoting
  • PIVOT() (if supported)
  • UNPIVOT() for reversing pivot tables

Example: Pivot sales data by region.

SELECT 
  SUM(CASE WHEN region = 'North' THEN total_amount END) AS north_sales,
  SUM(CASE WHEN region = 'South' THEN total_amount END) AS south_sales
FROM orders;

1️⃣1️⃣ Web Scraping & Large-Scale Data Aggregation

βœ… Key Techniques:

  • Extracting data from APIs (JSON functions in SQL)
  • Combining web-scraped datasets using UNION
  • Cleaning and normalizing extracted data

Example: Aggregating data from multiple sources into a single table.

SELECT name, address 
FROM web_data_source1 
UNION 
SELECT name, address 
FROM web_data_source2;

1️⃣2️⃣ Performance Optimization (Indexing, Query Execution Plans)

βœ… Key Concepts:

  • Use indexes (CREATE INDEX) to speed up queries.
  • Analyze query performance with EXPLAIN ANALYZE.
  • Optimize JOIN orders (use smaller tables first).
  • Use LIMIT or TOP to improve efficiency in large queries.

Example: Add an index to speed up searches.

CREATE INDEX idx_orders_customer ON orders (customer_id);

πŸ›  SQL Interview Practice Checklist

  • βœ… Aggregation Queries (SUM, COUNT, AVG, MIN, MAX, GROUP BY)
  • βœ… Window Functions (Running Sums, Moving Averages, Ranking, LAG/LEAD)
  • βœ… Joins (INNER, OUTER, SELF, CROSS Joins)
  • βœ… Performance Tuning (INDEXES, Query Execution Plans, EXISTS vs. IN)
  • βœ… Recursive Queries (Recursive CTEs, Hierarchical Data Processing)
  • βœ… Pivoting Data (CASE WHEN, PIVOT(), UNPIVOT())
  • βœ… Web Scraping & Large-Scale Data Aggregation (JSON, API Data, UNION, ETL Processes)
  • βœ… Handling NULLs (COALESCE, NULLIF, IS NULL)
  • βœ… Date/Time Manipulation (DATE_TRUNC, EXTRACT, DATEDIFF)