Beginner’s Guide to Exploratory Data Analysis

Table Overview of Exploratory Data Analysis Phase What to check Red flags What to record (reproducibility) 1) Dataset shape # rows/cols, file sources, time range Weirdly small/huge, mixed sources w/o labels Data snapshot: counts, source paths, date pulled 2) Schema sanity dtypes, parsing issues, category levels IDs stored as floats, dates as strings dtype conversions […]