Data Analysis Fundamentals

The Analysis Workflow

Each previous lesson covered one step: exploring, counting, deriving, visualizing. This lesson chains them into a complete workflow, from raw data to a finished answer.


Each previous lesson covered one step: exploring, counting, deriving, visualizing. In practice, these steps chain together into a repeatable workflow.

Here's that workflow on a small sales dataset. This one has missing values, unlike the clean companies data.

Step 1: Size up the data (familiar from the first lesson):

Python
Output

Step 2: Check for gaps. Two Sales entries are missing:

Python
Output

What will be the output?

Python

Step 3: Fill the gaps. Replace missing Sales with the column average:

Python
Output

What will be the output?

Python

Step 4: Answer a question. Which region sells more on average?

Python
Output

What will be the output?

Python

Step 5: Make the answer visual:

Python

Five steps: explore → clean → analyze → visualize → answer. Each feeds into the next.

Real workflows loop back. A chart might reveal outliers that require more cleaning, or spark an entirely new question.


What will be the output?

Python