The Analysis Workflow
Each previous lesson covered one step: exploring, counting, deriving, visualizing. This lesson chains them into a complete workflow, from raw data to a finished answer.
Each previous lesson covered one step: exploring, counting, deriving, visualizing. In practice, these steps chain together into a repeatable workflow.
Here's that workflow on a small sales dataset. This one has missing values, unlike the clean companies data.
Step 1: Size up the data (familiar from the first lesson):
Step 2: Check for gaps. Two Sales entries are missing:
What will be the output?
Step 3: Fill the gaps. Replace missing Sales with the column average:
What will be the output?
Step 4: Answer a question. Which region sells more on average?
What will be the output?
Step 5: Make the answer visual:
Five steps: explore → clean → analyze → visualize → answer. Each feeds into the next.
Real workflows loop back. A chart might reveal outliers that require more cleaning, or spark an entirely new question.
What will be the output?