Data Science

Python for Data Analysis: A Pandas Handbook

A practical Pandas reference for data analysts: loading data, filtering, groupby aggregations, merging DataFrames, pivot tables, rolling window functions, and a SQL-to-Pandas cheat sheet.

Efaix Admin (Verified)

4 min · May 7, 2026

Reviewed

Data Science · Beginner

Data Storytelling for Data Analysts

Learn to communicate data insights that drive action: story structure, audience-tailored communication, insight headlines, the SCR framework, chart annotations, and the most common storytelling mistakes to avoid.

Data Cleaning and Preparation

Why Data Cleaning Matters: Data cleaning (also called data wrangling or data preparation) is the process of detecting and correcting (or removing) corrupt, inaccurate, or irreleva

Dashboard Design Principles for Data Analysts

Learn how to design effective dashboards: choosing the right chart type, establishing visual hierarchy, using colour correctly, writing consistent SQL metric definitions, and the most common design mistakes to avoid.

Statistical Analysis Fundamentals for Data Analysts

A practical introduction to the statistics every data analyst needs: descriptive statistics, probability distributions, hypothesis testing, confidence intervals, linear regression, and a guide to choosing the right test.

Data Cleaning and Preprocessing for Data Analysts

A practical guide to cleaning raw data: handling missing values with imputation, removing duplicates, standardising formats, detecting outliers, and running cleaning pipelines in both Python and SQL.

Exploratory Data Analysis: A Practical Guide

A systematic guide to EDA: data quality audits, univariate and bivariate analysis, correlation matrices, SQL profiling queries, and the red flags every analyst should know how to spot.

A/B Testing and Experimentation for Data Analysts

A complete guide to designing and analysing A/B tests: sample size calculation, two-proportion z-tests in Python, SQL experiment queries, common pitfalls, and an introduction to multi-armed bandit methods.

Funnel Analysis for Data Analysts

Learn how to build sequential conversion funnels using SQL and Python, calculate drop-off rates at each step, segment by user dimensions, and identify where users abandon the journey.

Cohort Analysis for Data Analysts

Learn how to build retention and LTV cohort tables in SQL and Python, read the triangle heatmap, and avoid the common pitfalls that lead to misleading conclusions.

SQL Window Functions for Data Analysts

Master SQL window functions — ranking, LAG/LEAD, running totals, moving averages, and sessionisation — with practical examples for every common analytical pattern.

Feature Engineering for Machine Learning

A comprehensive guide to creating, transforming, encoding, and selecting features that improve ML model accuracy — with Python examples using pandas and scikit-learn.

Data Cleaning and Preparation

A practical guide to detecting and fixing the most common data quality issues — missing values, duplicates, outliers, type errors, and inconsistent categories — with Python code examples using pandas.

Time Series Analysis for Data Analysts

What Is Time Series Data? A time series is a sequence of observations recorded at successive, equally-spaced points in time. Unlike cross-sectional data — which captures a snapshot

SQL for Data Analysis: Joins, Aggregations, and Window Functions

Why SQL Is the Core Language of Data Analysis: Structured Query Language (SQL) remains the most widely used tool in a data analyst's toolkit. Unlike programming languages that requi

Customer Segmentation and Clustering

What Is Customer Segmentation? Customer segmentation is the process of dividing a customer base into distinct groups of individuals who share similar characteristics — such as beha

A/B Testing and Experimentation for Data Analysts

Why Experimentation Is Central to Data-Driven Decision Making: A/B testing (formally called a randomised controlled experiment) is the gold standard for establishing causal relati

Regression Analysis for Data Analysts

What Is Regression Analysis? Regression analysis is a statistical technique for modelling the relationship between a dependent variable (the outcome you want to predict or explain)

Data Warehousing and ETL/ELT Concepts

The Role of a Data Warehouse in an Analytics Stack: A data warehouse is a centralised repository that integrates data from multiple source systems, organises it for analytical queri

Efaix Admin (Verified)

4 min · Apr 29, 2026