The Decisions Behind The Dashboard
Your final insight is shaped by what you remove, group, and calculate.
Every choice you make while cleaning data changes what the reader sees at the end.
It changes what they believe. It can even change the recommendation you give.
When you drop rows with missing values, you are removing real people or real transactions from your analysis. They did not stop existing. They are just no longer being counted.
When you group small categories into a bucket called “Other,” you are deciding those groups are not worth showing separately.
When you cap a number that looks too high, you are changing the shape of your data before anyone else sees it.
Each choice changes what comes next. That is why data cleaning should not be treated like a small step you hide in the background. It is part of the story.
Cleaning Is Not Just a Technical Task
Many beginners treat data cleaning like a simple checklist. They remove repeat rows, fix blank cells, and make sure each column is in the right format. These are good steps, but they are only part of the job. The real skill is knowing why each step matters.
For example, if a customer dataset has missing location values, removing those rows may make sense if your analysis is about location. But your final result now only represents customers with known locations.
If a sales dataset has one very large order, removing or capping that value may make the chart easier to read. But it could also hide an important customer or a major deal.
If a job dataset has 80 different job titles, grouping similar titles may make the project easier to understand. But it also means the small differences between those titles may get hidden.
That is why cleaning is not just about fixing mess. It is about making careful choices.
Do Not Hide The Choices You Made
Many portfolio projects include one short line like this: “I cleaned the data before analysis.”
It does not tell the reader anything useful. What did you clean? Why did you clean it? What changed after you cleaned it?
Those are the questions that matter.
A hiring manager, client, or team lead is not only looking at your final dashboard. They are also looking at how you think. If your cleaning step is vague, the project can feel like a list of tasks, not a clear analysis.
There is a difference between doing tasks and explaining decisions. Explaining decisions is the part of the job that actually matters.
No One Wants To Read A Manual
This does not mean every project needs a long section called “Data Cleaning.”
Nobody wants to read two pages of small technical steps like trimming spaces, renaming columns, or fixing a typo in a header. Those things are useful, but they do not always matter to the story.
A better approach is to explain the cleaning choices that shaped the analysis.
For example:
“I removed rows where the region field was blank because this analysis compares sales by region. Those rows made up about 3% of the data.”
Or:
“I grouped five small product categories into ‘Other’ so the chart would be easier to read. Together, they made up less than 2% of total sales.”
That one sentence tells the reader what changed, why it changed, and how much it affected the project.
That is enough. The goal is not to prove you did every cleaning step. The goal is to help the reader trust the path you took.
Which Choices Actually Need Explaining
Not every cleaning step needs an explanation. But some choices should almost always be explained because they can change the final result.
If you drop rows with missing values in an important column, mention it.
If you remove outliers, mention it.
If you group categories together, mention it.
If you filter the data to one time period, one region, or one group of customers, mention it.
These choices affect what the analysis includes and excludes. They also affect how the reader should understand the final chart.
Once you know which choices matter, the next step is explaining them in a simple way.
Three Questions Every Note Should Answer
A good cleaning note answers three things: what did you change, why did you change it, and how did it affect the analysis.
Instead of writing “I handled missing values,” write: “I removed rows with missing prices because price was needed to calculate revenue. This kept the analysis focused on complete records.”
Instead of writing “I removed outliers,” write: “I removed orders above $10,000 because they were flagged as test entries in the dataset. Keeping them would have pushed the average order value too high.”
Instead of writing “I created new columns,” write: “I built a profit margin column so each product could be compared by percentage, not just total profit.”
These short notes show that you understand your data, not just the tool.
Companies hire analysts to make smart choices. They want people who can look at messy data, choose a clear path, and explain that path to others.
When you explain your cleaning steps, you show that you can make careful choices and explain them clearly. That is what makes the project feel stronger.


