Thursday , April 25 2024
Home / Lars P. Syll / Data without theory is always treacherous

Data without theory is always treacherous

Summary:
Data without theory is always treacherous Data without theory can lead to bogus inferences … Before being comforted or alarmed, consider whether it makes sense to extrapolate. Is there a persuasive reason why the future can be predicted simply by looking at the past? Or is that wishful thinking? Or nothing at all? … Remember that even random flips can yield striking, even stunning, patterns that mean nothing at all … A statistical comparison of two things is similarly unpersuasive unless there is a logical reason why they should be related … Ask yourself whether the people who did the study thought before calculating. The central problem with the present ‘machine learning’ and ‘big data’ hype is that so many — falsely — think that they can get away with

Topics:
Lars Pålsson Syll considers the following as important:

This could be interesting, too:

Lars Pålsson Syll writes The importance of ‘causal spread’

Lars Pålsson Syll writes Applied econometrics — a messy business

Lars Pålsson Syll writes Feynman’s trick (student stuff)

Lars Pålsson Syll writes Difference in Differences (student stuff)

Data without theory is always treacherous

Data without theory is always treacherousData without theory can lead to bogus inferences …

Before being comforted or alarmed, consider whether it makes sense to extrapolate. Is there a persuasive reason why the future can be predicted simply by looking at the past? Or is that wishful thinking? Or nothing at all? …

Remember that even random flips can yield striking, even stunning, patterns that mean nothing at all …

A statistical comparison of two things is similarly unpersuasive unless there is a logical reason why they should be related … Ask yourself whether the people who did the study thought before calculating.

The central problem with the present ‘machine learning’ and ‘big data’ hype is that so many — falsely — think that they can get away with analysing real-world phenomena without any (commitment to) theory. But — data never speaks for itself. Without a prior statistical set-up, there actually are no data at all to process. And — using a machine learning algorithm will only produce what you are looking for.

Machine learning algorithms always express a view of what constitutes a pattern or regularity. They are never theory-neutral.

Clever data-mining tricks are not enough to answer important scientific questions. Theory matters.

Lars Pålsson Syll
Professor at Malmö University. Primary research interest - the philosophy, history and methodology of economics.

Leave a Reply

Your email address will not be published. Required fields are marked *