A correct result that is misinterpreted can be as harmful as an incorrect result.
Here are a few examples of errors of communication, taken from a talk Nick Radcliffe gave to the 2023 Toronto Conference on Reproducibility:
Errors of Interpretation (a.k.a. Type VI Errors)
In 1998, the Mars Climate Orbiter was lost because NASA worked in SI units
while its contractor Lockheed Martin works in imperial units,
at a cost of about $125m (USD, 1998).
Report
What do these prefixes mean?
or probability of crashing? You’d want to know.
When a population is segmented on some variable, e.g. spend, and then
measured again later across the previously-allocated segments, the values
for the segments are likey to be more similar because of
regression to the mean.
This is usually not a novel or meaningful finding.
Randall Munroe nails it: https://xkcd.com/2303.
Those errors of interpretation that are errors of communication
are also “Type VI” errors.
Those who fail to learn lessons from history, and those who learn the wrong lessons,
are doomed to repeat it.
Each digit in the total shown comes from a single line of data,
and those water bodies are of different orders of magnitude.
The zeros in the non-total lines reflect significant figures,
and the key takeaway is that 97% of all the Earth’s water is in
oceans, not that the volume is known to 6 significant figures.
(It is not.)
Be careful talking about changes in percentages.
are everywhere.
Learn how to avoid making them.
Learn what they are.
Seriously. ISO8601
is almost always the right choice in data.
In 2022, who is spending more, US or China?
What happened to gun deaths after Florida introduced its
Stand Your Ground law?
Two chapters of the book, Test-Driven Data Analysis of the TDDA Book are specifically concerned with errors of communication.
Additionally, several of TDDA’s Checklists are directly relevant to this topic. Those specifically about errors of communication are:
In the case of errors of process, the most relevant checklists are:
Additionally
covers both classes of errors of interpretation (formulation as well as communication).
If you would like help with data science that is communicated clearly, talk to us. We cannot guarantee we will produce outputs so clear they are impossible to misinterpret, but we take output quality seriously and have the scars on our backs from errors of communication we have witnessed, been subject to, and—yes—occasionally committed ourselves. But we have written the book on how to avoid them.