From Lars Syll Data without theory can lead to bogus inferences … Before being comforted or alarmed, consider whether it makes sense to extrapolate. Is there a persuasive reason why the future can be predicted simply by looking at the past? Or is that wishful thinking? Or nothing at all? … Remember that even random flips can yield striking, even stunning, patterns that mean nothing at all … A statistical comparison of two things is similarly unpersuasive unless there is a logical reason why they should be related … Ask yourself whether the people who did the study thought before calculating. The central problem with the present ‘machine learning’ and ‘big data’ hype is that so many — falsely — think that they can get away with analysing real-world phenomena without any (commitment to)
Topics:
Lars Pålsson Syll considers the following as important: Uncategorized
This could be interesting, too:
John Quiggin writes Trump’s dictatorship is a fait accompli
Peter Radford writes Election: Take Four
Merijn T. Knibbe writes Employment growth in Europe. Stark differences.
Merijn T. Knibbe writes In Greece, gross fixed investment still is at a pre-industrial level.
from Lars Syll
Data without theory can lead to bogus inferences …
Before being comforted or alarmed, consider whether it makes sense to extrapolate. Is there a persuasive reason why the future can be predicted simply by looking at the past? Or is that wishful thinking? Or nothing at all? …
Remember that even random flips can yield striking, even stunning, patterns that mean nothing at all …
A statistical comparison of two things is similarly unpersuasive unless there is a logical reason why they should be related … Ask yourself whether the people who did the study thought before calculating.
The central problem with the present ‘machine learning’ and ‘big data’ hype is that so many — falsely — think that they can get away with analysing real-world phenomena without any (commitment to) theory. But — data never speaks for itself. Without a prior statistical set-up, there actually are no data at all to process. And — using a machine learning algorithm will only produce what you are looking for.
Machine learning algorithms always express a view of what constitutes a pattern or regularity. They are never theory-neutral.
Clever data-mining tricks are not enough to answer important scientific questions. Theory matters.