← Back to Part I: What Is Science?

Chapter 1: The Demarcation Problem

What distinguishes science from non-science and pseudoscience?

The demarcation problem is the philosophical question of how to distinguish between science and non-science, and more particularly between science and pseudoscience. It is one of the oldest and most central problems in the philosophy of science, with roots stretching back to Aristotle. Despite centuries of effort, no universally accepted criterion of demarcation has been found — leading some philosophers to declare the problem unsolvable and others to insist that solving it is more urgent than ever.

The stakes are high. If we cannot say what science is, how can we defend it against imitators? How can courts decide what counts as “scientific evidence”? How can funding agencies distinguish legitimate research from pseudoscientific projects? The demarcation problem is not merely a philosopher’s puzzle — it is a practical problem with consequences for education, law, medicine, and public policy.

1.1 Historical Attempts at Demarcation

Aristotle’s Demonstrative Knowledge

Aristotle’s Posterior Analytics provides the first systematic account of scientific knowledge (episteme). For Aristotle, genuine science proceeds from first principles that are true, primary, immediate, better known than, prior to, and explanatory of the conclusion. Scientific knowledge is demonstrative — it is knowledge of the causes of things, obtained through syllogistic reasoning from self-evident axioms.

This model dominated Western intellectual life for nearly two millennia. Its key features were: (1) certainty — scientific knowledge admits of no doubt; (2) necessity — scientific truths could not be otherwise; (3) universality — science deals with what is always the case, not with particulars. On this view, the demarcation between science and opinion is sharp: science yields certain knowledge, while opinion is merely probable.

“We suppose ourselves to possess unqualified scientific knowledge of a thing, as opposed to knowing it in the accidental way in which the sophist knows, when we think that we know the cause on which the fact depends, as the cause of that fact and of no other, and, further, that the fact could not be other than it is.”— Aristotle, Posterior Analytics I.2, 71b9–12

Bacon’s Inductivism

Francis Bacon’s Novum Organum (1620) broke decisively with Aristotelian science. Bacon argued that scientific knowledge must be built up from careful observation and systematic induction, not deduced from first principles. His method of eliminative induction — using tables of presence, absence, and degrees — was designed to extract the “forms” (underlying causes) of natural phenomena.

For Bacon, the demarcation criterion was methodological: science follows a disciplined inductive procedure, while non-science relies on authority, tradition, and uncritical generalization. He identified four “idols” (idola) that obstruct genuine inquiry: the Idols of the Tribe (cognitive biases shared by all humans), the Idols of the Cave (individual biases), the Idols of the Marketplace (confusions arising from language), and the Idols of the Theatre (dogmatic philosophical systems).

“The subtlety of nature is greater many times over than the subtlety of the senses and understanding; so that all those specious meditations, speculations, and glosses in which men indulge are quite from the purpose, only there is no one by to observe it.”— Francis Bacon, Novum Organum I, Aphorism 10

Logical Positivism: The Verification Principle

The Vienna Circle (discussed in detail in Chapter 4) proposed the verification principleas a demarcation criterion: a statement is meaningful if and only if it is either analytically true (true by definition) or empirically verifiable. Statements that are neither — including the pronouncements of metaphysics, theology, and ethics — are not false but literally meaningless.

The verification principle faced devastating objections. It appeared to be self-refuting: the principle itself is neither analytic nor empirically verifiable, so by its own criterion it is meaningless. Successive reformulations (weak verification, confirmability in principle) only multiplied the problems. By the 1950s, even sympathizers acknowledged that the verification principle could not serve as a criterion of demarcation. This failure set the stage for Popper’s alternative.

1.2 Popper’s Falsifiability Criterion

Karl Popper (1902–1994) proposed what remains the most influential solution to the demarcation problem. In The Logic of Scientific Discovery (1934/1959) and Conjectures and Refutations(1963), Popper argued that the distinguishing feature of science is not that its theories are verifiable, but that they are falsifiable.

A theory is scientific if and only if it makes predictions that could, in principle, be shown to be false by observation or experiment. The key logical insight is an asymmetry between verification and falsification: no finite number of observations can verify a universal statement (“All swans are white”), but a single observation can falsify it (one black swan). Science advances not by confirming theories but by subjecting them to severe tests and eliminating those that fail.

“It must be possible for an empirical scientific system to be refuted by experience... I shall not require of a scientific system that it shall be capable of being singled out, once and for all, in a positive sense; but I shall require that its logical form shall be such that it can be singled out, by means of empirical tests, in a negative sense: it must be possible for an empirical scientific system to be refuted by experience.— Karl Popper, The Logic of Scientific Discovery (1934), §6

Popper’s motivating examples were drawn from early 20th-century intellectual life in Vienna. He was struck by the contrast between Einstein’s general theory of relativity — which made the bold, risky prediction that light would be deflected by gravity (confirmed by Eddington’s 1919 eclipse observations) — and the theories of Marx, Freud, and Adler, which seemed compatible with any possible observation.

Freudian psychoanalysis, Popper argued, could explain any behavior: a man who pushes a child into water to drown and a man who sacrifices his life to save the child could both be “explained” by the theory (the first by repression, the second by sublimation). A theory that is compatible with everything explains nothing. It is precisely the riskiness of a scientific theory — its willingness to “stick its neck out” — that distinguishes it from pseudoscience.

Criticisms of Falsifiability

Popper’s criterion, while enormously influential, has been subjected to powerful objections:

  • The Duhem-Quine problem: No scientific hypothesis faces the tribunal of experience alone. Every test involves auxiliary assumptions (about instruments, background conditions, etc.). When a prediction fails, it is always logically possible to blame an auxiliary assumption rather than the core theory. This means no theory is strictly falsifiable in isolation (see Chapter 6).
  • Too broad: Some paradigmatically non-scientific claims are falsifiable. “All marriages between Sagittarians and Leos end in divorce” is falsifiable but is clearly astrology, not science.
  • Too narrow: Some paradigmatically scientific claims are not straightforwardly falsifiable. Existential statements like “There exist black holes” cannot be falsified by any finite number of failed observations. Probabilistic theories (e.g., in quantum mechanics) do not make categorical predictions.
  • Historical inaccuracy: Kuhn showed that scientists do not, in practice, abandon theories at the first sign of falsification. Anomalies are common and are typically shelved rather than treated as refutations.

1.3 Kuhn’s Puzzle-Solving Criterion

Thomas Kuhn (1922–1996) offered a radically different account. In The Structure of Scientific Revolutions (1962), Kuhn argued that the distinctive feature of mature science is not falsifiability but the presence of a paradigm — a shared framework of theories, methods, standards, and exemplary problem-solutions (exemplars) that defines a scientific community.

For Kuhn, what distinguishes science from non-science is not any logical feature of its theories but a sociological one: mature sciences engage in puzzle-solvingwithin the framework of an accepted paradigm. Just as a crossword puzzle has a definite solution and rules for finding it, scientific problems have solutions constrained by paradigmatic assumptions. A scientist who fails to solve a puzzle has failed as a scientist, not refuted the paradigm.

“One of the things a scientific community acquires with a paradigm is a criterion for choosing problems that, while the paradigm is taken for granted, can be assumed to have solutions. To a great extent these are the only problems that the community will admit as scientific or encourage its members to undertake.”— Thomas Kuhn, The Structure of Scientific Revolutions (1962), p. 37

On Kuhn’s view, astrology fails to be a science not because it is unfalsifiable (it makes many falsifiable predictions) but because it never developed a tradition of puzzle-solving. When astrological predictions fail, there is no mechanism for identifying and correcting errors within the framework. Compare this with astronomy, where discrepancies between prediction and observation generate productive research programmes.

Kuhn’s criterion has its own difficulties. It seems to make demarcation a sociological rather than an epistemological matter. It is also unclear how to apply it to immature sciences (those that have not yet achieved paradigm status) or to interdisciplinary fields. Nonetheless, Kuhn’s insight that science is a communal activity governed by shared standards has profoundly shaped subsequent discussion.

1.4 Lakatos’s Research Programmes

Imre Lakatos (1922–1974) attempted to synthesize the insights of Popper and Kuhn. His methodology of scientific research programmes distinguishes between the hard core of a theory (which is treated as irrefutable by methodological decision), a protective belt of auxiliary hypotheses (which can be modified to absorb anomalies), a negative heuristic (which directs researchers away from modifying the hard core), and a positive heuristic (which guides the development of the protective belt).

For Lakatos, the demarcation criterion is progressiveness. A research programme is scientifically progressive if it occasionally predicts novel facts — facts not used in the construction of the theory. A programme is degenerating if it only accommodates facts after the fact, through ad hoc modifications. Science is characterized by progressive research programmes; pseudoscience by degenerating ones.

“A research programme is said to be progressing as long as its theoretical growth anticipates its empirical growth, that is, as long as it keeps predicting novel facts with some success... it is stagnating if its theoretical growth lags behind its empirical growth, that is, as long as it gives only post-hoc explanations either of chance discoveries or of facts anticipated by, and discovered in, a rival programme.”— Imre Lakatos, “Falsification and the Methodology of Scientific Research Programmes” (1970)

Lakatos’s approach has the virtue of being more historically accurate than Popper’s: it acknowledges that scientists rightly hold on to theories in the face of anomalies, as long as the programme continues to be progressive. However, critics (especially Feyerabend) argued that the distinction between progressive and degenerating programmes can only be made in retrospect, and that programmes often appear to degenerate before making a dramatic comeback.

1.5 Laudan’s “Demise of the Demarcation Problem”

In a landmark 1983 paper, Larry Laudan argued that the demarcation problem is a pseudo-problem. Every proposed criterion, he contended, either fails to exclude clear cases of pseudoscience or excludes clear cases of science. There is no set of necessary and sufficient conditions that captures the extension of “science.”

Laudan proposed that we abandon the science/non-science distinction in favor of a more fine-grained assessment of the epistemic credentials of individual claims and methods. Rather than asking “Is this scientific?” we should ask “Is this well-supported by evidence?” “Is the methodology reliable?” “Are the inferences valid?” The label “science” is, on Laudan’s view, a honorific that obscures rather than illuminates.

“If we would stand up and be counted on the side of reason, we ought to drop terms like ‘pseudo-science’ and ‘unscientific’ from our vocabulary; they are just hollow phrases which do only emotive work for us.”— Larry Laudan, “The Demise of the Demarcation Problem” (1983)

Laudan’s skepticism has been influential but not universally accepted. Many philosophers (including Paul Thagard, Massimo Pigliucci, and Sven Ove Hansson) have argued that the failure to find a simple criterion does not mean that demarcation is impossible. It may require a multi-criteria approach rather than a single litmus test.

1.6 Thagard’s Multi-Criteria Approach

Paul Thagard proposed that pseudoscience should be identified not by a single feature but by a constellation of characteristics. On his account, a theory or discipline is pseudoscientific if and only if:

  1. It has been less progressive than alternative theories over a long period of time
  2. The community of practitioners makes little attempt to develop the theory toward solutions of the problems
  3. The community shows no concern for attempts to evaluate the theory in relation to others
  4. The community is selective in considering confirmations and disconfirmations

Thagard’s approach is explicitly comparative: a theory is pseudoscientific relative to a more progressive alternative. This handles cases like astrology (which is pseudoscientific relative to astronomy) without requiring a single sharp criterion. However, it has difficulty with cases where there is no clear alternative (as some have argued about string theory).

Comparison of Demarcation Criteria

PhilosopherCriterionStrengthsWeaknesses
PopperFalsifiabilityClear, logical, captures scientific risk-takingDuhem-Quine problem; too broad and too narrow
KuhnPuzzle-solving within a paradigmHistorically accurate; handles astrology wellSociological rather than epistemological; vague
LakatosProgressive research programmesCombines Popper and Kuhn; novel predictionsOnly assessable retrospectively
LaudanNo criterion; assess individual claimsAvoids oversimplificationAbandons a practically important distinction
ThagardMulti-criteria (comparative)Handles complex cases; flexibleRequires an alternative for comparison

1.7 Case Studies

Astrology

Perhaps the most commonly cited example of pseudoscience. Popper: astrology is unfalsifiable because its predictions are too vague. Kuhn: astrology fails because it never developed a puzzle-solving tradition. Thagard: astrology is pseudoscientific relative to astronomy, having made no progress over centuries.

Interestingly, astrology does make testable predictions — and these have been tested and found wanting (e.g., Shawn Carlson’s 1985 double-blind test published in Nature). The issue is that the astrological community does not respond to disconfirmation.

String Theory

A provocative contemporary case. Critics (including Lee Smolin and Peter Woit) have argued that string theory is “not even wrong” — that it makes no testable predictions and therefore fails Popper’s criterion.

Defenders respond that string theory is mathematically progressive, that it has produced novel insights (e.g., the AdS/CFT correspondence), and that the demand for direct falsifiability is too narrow. This debate shows that demarcation criteria have consequences for funding and prestige within physics itself.

Homeopathy

Homeopathy claims that water retains a “memory” of substances previously dissolved in it, and that extreme dilutions (often beyond Avogadro’s number) have therapeutic effects. It is falsifiable, and it has been repeatedly falsified: systematic reviews and meta-analyses consistently show no effect beyond placebo.

On every proposed criterion, homeopathy qualifies as pseudoscience: it is incompatible with well-established chemistry and physics, it has made no novel predictions, its practitioners show no concern for negative evidence, and there are well-established alternatives (evidence-based medicine).

Psychoanalysis

Popper’s original target. Freudian theory, Popper argued, can explain any behavior and is therefore unfalsifiable. However, Adolf Grünbaum argued in The Foundations of Psychoanalysis(1984) that Freud’s theory does make testable predictions — and that many of them have been falsified.

The status of psychoanalysis remains contested. Some aspects (e.g., the existence of unconscious mental processes) have been vindicated by cognitive science, while others (e.g., penis envy, the Oedipus complex as universal) are widely regarded as refuted or untestable.

1.8 The Legal Dimension

The demarcation problem has been litigated in US courts, most famously in cases involving the teaching of creationism in public schools. These cases have forced judges to articulate criteria for what counts as “science.”

McLean v. Arkansas Board of Education (1982)

Judge William Overton ruled that “creation science” is not science. He proposed five criteria for science: (1) it is guided by natural law; (2) it has to be explanatory by reference to natural law; (3) it is testable against the empirical world; (4) its conclusions are tentative; (5) it is falsifiable. This Popperian ruling has been criticized by Laudan and others as philosophically naive, but it had enormous practical impact.

Daubert v. Merrell Dow Pharmaceuticals (1993)

The US Supreme Court established criteria for the admissibility of scientific expert testimony: (1) whether the theory or technique can be and has been tested; (2) whether it has been subjected to peer review; (3) the known or potential rate of error; (4) general acceptance in the relevant scientific community. The Daubert standard remains the governing framework for scientific evidence in US federal courts.

Note how Daubert implicitly adopts a multi-criteria approach, combining elements of Popper (testability), Kuhn (community acceptance), and Lakatos (track record). This suggests that in practice, demarcation requires multiple criteria rather than a single litmus test.

1.9 Contemporary Assessment

Where does the demarcation problem stand today? Several positions are defended in the contemporary literature:

  • Multi-criteria approaches (Pigliucci, Hansson, Mahner): Science is characterized by a cluster of features — testability, progressiveness, theoretical integration, methodological rigor, community standards — none of which is individually necessary or sufficient.
  • Family resemblance (Dupre, Irzik & Nola): “Science” is a Wittgensteinian family-resemblance concept. There is no essence of science, but there are overlapping similarities that connect the various sciences.
  • Social epistemological approaches (Longino): Science is demarcated not by the content of its theories but by the social processes through which they are produced — including critical scrutiny, shared standards, uptake of criticism, and equality of intellectual authority.
  • Pragmatic approaches: The demarcation problem need not be solved in the abstract. Different contexts (courts, funding agencies, classrooms) may appropriately use different criteria.
“The demarcation between science and pseudoscience is not merely a problem of armchair philosophy: it is of vital social and political importance.”— Massimo Pigliucci, Nonsense on Stilts (2010)

Key Takeaways

  1. The demarcation problem asks for a principled distinction between science and non-science/pseudoscience.
  2. Popper’s falsifiability criterion is the most famous answer but faces serious objections (Duhem-Quine, too broad/narrow).
  3. Kuhn shifted the question from logical features of theories to sociological features of scientific communities.
  4. Lakatos offered a sophisticated synthesis: science is characterized by progressive research programmes.
  5. Laudan declared the problem dead; most contemporary philosophers disagree but accept that no single criterion suffices.
  6. Multi-criteria and family-resemblance approaches are now dominant.
  7. The problem has practical consequences for law, education, medicine, and public policy.