What Should “Correlation Does Not Imply Causation” Be Replaced With?

I shed an invisible tear whenever I hear “correlation does not imply causation” which the otherwise excellent swivel (a website about correlations) emphasizes. Of course, there’s truth to it. It saddens me because:

  • It’s dismissive. It is often used to dismiss data from which something can be learned. The life-saving notion that smoking causes lung cancer was almost entirely built on correlations. For too long, these correlations were dismissed.
  • It’s misleading. In real life, nothing unfailingly implies causation. In my experience, every data set has more than one interpretation. To “imply” causation requires diverse approaches and correlations are often among them.
  • It’s a missed opportunity — namely, an opportunity to make a more nuanced statement about what we can learn from the data.
  • It’s dogmatic (see “Jane Jacobs on Scientific Method”). Some correlations, such as those from “natural experiments,” imply causation much more than others. I suspect it does more harm than good to lump all of them together.
  • 13 thoughts on “What Should “Correlation Does Not Imply Causation” Be Replaced With?

    1. Well said.

      In the end, all we really have is correlation — all scientific models are based on correlation of model predictions with observations. If our observations don’t match model predictions, then the model is abandoned or changed.

      And you are exactly right that correlation is a good starting point for looking for cause/effect models.

      So I suppose the phrase should really be “correlation does not imply that there is a cause-effect model that will stand up to other experiments or analysis”.

    2. Yes, that would be an improvement. My candidate would be something like “Correlations are generally — not always, but usually — less persuasive evidence for causality than experiments.” To make explicit the implied comparison.

    3. Well, if you’re asking for a catchy replacement phrase, I think that “correlation suggests causation” isn’t bad. It can be used to reply to people saying “correlation doesn’t imply causation”. You can say “but it *does* suggest it”.

    4. The swivel people, on their blog, link to this description of correlation on Good Math, Bad Math, which seems to get it about right.

      The catch – and it’s a big one – is that correlation does strongly suggest a causal relationship. (There’s a Yale professor of statistics who’s famous for saying something close to “Correlation is not the same thing as causation – but it’s a darn good hint!”. ) It may not be the case that X causes Y or Y causes X – but if there’s a strong correlation between them, you should suspect that there’s a causal relationship

      So if they agree with the post they approvingly linked to, then it sounds like they understand the issues. And given that their site seems to be mainly concerned with public data gathered from uncontrolled circumstances, in most of the use-cases of their site, much caution *is* needed when inferring causality, so their approach to it might be justified by their (presumably statistically naive) audience.

    5. The statement “correlation does strongly suggest a causal relationship” strikes me as too strong. It’s too easy to think of exceptions.

      The Good Math, Bad Math essay you link to goes on to say:

      To show causation, you need to show a mechanism for the cause, and demonstrate that mechanism experimentally.

      This does not describe how scientists in my field (experimental psychology) act. Usual behavior is to infer X causes Y if every time you (intentionally) change X, Y changes. The author is a computer scientist; that explains the lack of familiarity.

    6. Yeah, that point about inference based on manipulating random variables was brought up in the comments.

      But do you really think that statement goes too far? I mean, a case could be made that correlation virtually always implies a causal relationship, just not necessarily a direct one. If X and Y are correlated, and both caused by Z, they have a causal relationship with each other–and indirect one via Z. So saying that “correlation does strongly suggest a causal relationship” seems to me just to mean that when you see correlation, you should definitely look for the causal relationship, but it doesn’t tell you to assume that it’s a direct relationship. (Or if it is, which direction it’s in.)

    7. I am not familiar with the concept of an “indirect” causal relationship. If I were, I might agree with you. To me, “causal relationship” means what you call “direct causal relationship.”

    8. Seth, great post, you are spot on. We basically whimped out after we launched Swivel after all the barrage of protests from academics and data vigilantes about our correlation meters. Even the Wikipedia article about correlations says something along the lines of, “correlation doesn’t always mean causation but it sure is a hint.” And when some say, well, maybe two trends are correlated because there’s something else that’s causing both to change, that’s great: it made you think about what that third thing could be. So, I took out the link from our code just now, and it will disappear after we refresh our site. We’re not some kind of anointed data gatekeepers trying to protect people from the dangers of correlation. People, look at the data and make up your own mind. That’s what Swivel is about.

    9. In the end, all we really have is correlation

      Tim, this isn’t entirely fair to say. In many fields (and in mine in particular) all we choose to look at are correlations. However, we also have temporal precedence. If one thing doesn’t precede the other thing, it is hard to argue for a causal relationship no matter how nicely the correlational analysis works out.

      a case could be made that correlation virtually always implies a causal relationship

      If X and Y are correlated, it suggests at least two models, (1) X -> Y and (2) Y -> X, one of which is probably wrong. This is before considering any common causes of both.

    10. Thanks, Dmitry, I really appreciate that comment. You should know.

      That you got a “barrage of protests from academics and data vigilantes” is interesting. There’s something backwards about that. Surely the point of data collection and analysis is to learn something. Just like you say, it makes a lot more sense to emphasize what you do learn rather than what you don’t.

    11. An indirect relationship is where X -> A -> B -> Y, or A -> B -> X and A -> C -> Y. Of course, there’s a limit to how indirect it can be before the limited correlation between each of the steps multiplies together making the effect of A, B, and C on X and Y less negligible. If that path doesn’t explain all the correlation, you look for additional mechanisms.

    12. I happened to see this quote from Jensen just now:

      Of course, a statistical correlation between two variables doesn’t imply direct causality of the one variable by the other, nor does it imply the absence of causation. A lack of correlation, however, is more apt to imply the absence of causation. Investigation can’t afford to eschew correlations as clues to causation, which of course must be established by other kinds of analysis. At an exploratory stage of investigation, reciting the “correlation is not causation” mantra is a premature criticism.

      https://psycprints.ecs.soton.ac.uk/archive/00000019/

      Seth, Jensen says that inspection time is highly g-loaded, looks like another possible metric for measuring mental acuity re omega-3 intake…

    13. Re the Jensen quote: I was once interviewing a job candidate in my office. She said something like “but of course correlation does not imply causality.” I said, “wait a minute. it’s not that simple. doesn’t the absence of correlation suggest the absence of causality?” She thought about it, and agreed with me.

      Re inspection time: That wouldn’t be a good measure. To determine inspection time you show stimuli very briefly (e.g., 200 msec) many times. If a whole trial takes 4 seconds, then only a small fraction of it (200 msec/4000 msec) involves brain work. That’s the opposite of what I want.

    Leave a Reply

    Your email address will not be published. Required fields are marked *