Two doctors, opposite advice, both right
You go to your annual physical. Your doctor looks at your bloodwork, notes your borderline cholesterol, and tells you to cut back on eggs and saturated fat. You nod. This is what doctors have said for decades. Then you mention this to a friend who happens to be a cardiologist, and she tells you the opposite: dietary cholesterol has minimal impact on blood cholesterol for most people, the saturated fat story is more complicated than the guidelines suggest, and the real issue is refined carbohydrates and metabolic health. She cites recent meta-analyses. She is not a fringe practitioner. She trained at the same kind of institution as your primary care doctor.
You now hold two expert opinions that directly contradict each other, from two credentialed professionals, both citing published research. Your instinct — everyone's instinct — is to resolve the contradiction as quickly as possible. Pick one. Discard the other. Get back to knowing what to eat for breakfast.
That instinct will cost you. Because the disagreement between these two doctors is not a malfunction. It is the most informative thing either of them told you. Their contradiction is a precise map of a fault line in nutritional science — a place where the evidence base genuinely does not converge, where methodological choices determine conclusions, and where the state of human knowledge is honestly uncertain. Picking one expert and ignoring the other does not resolve the uncertainty. It just makes you unaware of it.
This lesson is about what to do when experts disagree — not how to pick a winner, but how to extract the information that the disagreement itself carries.
Expert disagreement is the norm, not the exception
Expert disagreement is not rare. It is the default state of every domain that matters. In economics, Nobel laureates line up on opposite sides of minimum wage effects, inflation mechanisms, and fiscal policy. In medicine, the Cochrane Collaboration was founded in 1993 precisely because individual experts could not be trusted to synthesize evidence objectively — systematic reviews replaced individual expert judgment with structured protocols. In psychology, the replication crisis revealed that expert consensus on hundreds of findings was built on studies that could not be reproduced. In technology, credentialed experts disagree about AI timelines, architectural patterns, and management methodologies.
If you wait for experts to agree before making decisions, you will never make a decision. The question is not whether experts disagree. It is what to do with the disagreement.
Goldman's framework: evaluating rival experts
Alvin Goldman, the philosopher who essentially founded modern social epistemology, addressed this problem directly in his 2001 paper "Experts: Which Ones Should You Trust?" Goldman's question was precise: when two credentialed experts disagree, and you are a novice in their field, what evidence can you use to determine which expert to believe?
Goldman identified five sources of evidence available to novices:
1. The arguments each expert presents. Even as a non-expert, you can assess whether an argument is internally consistent, whether it addresses counterarguments, and whether its reasoning is transparent. An expert who explains the mechanism behind their claim gives you more to evaluate than one who simply asserts authority.
2. Agreement from other experts. If ninety-seven percent of climate scientists agree on anthropogenic warming and three percent dissent, the weight of numbers matters — not because truth is democratic, but because widespread expert agreement suggests the evidence is converging. A lone dissenter is not automatically wrong, but the burden of explanation is higher.
3. Appraisals from meta-experts. These are experts on expertise itself — people who evaluate the quality of research methodologies, the strength of evidence bases, and the reliability of different expert communities. Cochrane reviewers are meta-experts. So are statisticians who evaluate study design. Their assessments of the disagreement are often more informative than either side's first-order claims.
4. Evidence of interests and biases. An expert funded by the sugar industry who concludes that sugar is harmless warrants different treatment than an independent researcher reaching the same conclusion. Bias does not automatically invalidate a claim, but it changes the weight you assign to it. Goldman emphasized that this includes both financial conflicts of interest and ideological commitments that might shape which evidence an expert foregrounds.
5. Track records. Goldman argued this may be the novice's single best source of evidence. Has the expert made verifiable predictions that turned out correct? Have their past recommendations produced the outcomes they predicted? A track record of calibrated, accurate forecasting is harder to fake than credentials, publications, or rhetorical skill.
The Goldman framework does not eliminate the difficulty of navigating expert disagreement. But it transforms the task from "pick the expert I like" to "systematically evaluate the evidence for each expert's reliability." That transformation is the beginning of genuine epistemic agency in the face of contradiction.
Tetlock's discovery: most experts are bad at prediction
Goldman's framework becomes sharper when combined with Philip Tetlock's empirical findings about expert performance. In 2005, Tetlock published the results of a twenty-year study tracking 28,000 predictions by 284 political experts. The headline finding was brutal: the average expert performed barely better than random chance. Dart-throwing chimpanzees would have done comparably.
But Tetlock's deeper finding was more nuanced and more useful. Not all experts failed equally. Tetlock borrowed Isaiah Berlin's distinction between hedgehogs (who know one big thing) and foxes (who know many things), and found that the two groups performed very differently.
Hedgehog experts — those who organized their thinking around a single powerful theory or framework — were worse than chance. They were confident, articulate, and media-friendly, but their predictions were systematically terrible. Their one-big-idea approach made them incapable of updating when evidence pushed against their framework. They explained away disconfirming evidence rather than incorporating it.
Fox experts — those who drew on multiple frameworks, maintained calibrated uncertainty, and aggregated information from diverse sources — performed significantly better. They were less confident, less quotable, and less likely to appear on television. But they were right more often, and crucially, they knew when they did not know.
Tetlock's work reveals that the form of expert disagreement carries information beyond the content. When a hedgehog disagrees with a fox, the disagreement often tells you that the hedgehog's single framework has encountered a case it cannot handle, and the fox's integrative approach is producing a more nuanced (and likely more accurate) assessment. The hedgehog's confidence is a bug, not a feature. The fox's uncertainty is a feature, not a bug.
This directly informs how you process expert contradictions. When two experts disagree, do not just evaluate their conclusions. Evaluate their cognitive style. Is one reasoning from a single powerful framework and the other aggregating across multiple frameworks? Tetlock's data strongly suggests you should weight the fox more heavily — not because foxes are always right, but because their thinking process is better calibrated to genuinely uncertain domains.
How medicine institutionalized disagreement
The most mature system humanity has built for handling expert disagreement is evidence-based medicine, and its central tool is the systematic review.
Before systematic reviews, medicine handled expert disagreement the way most fields still do: senior physicians would reach conclusions based on their clinical experience, cite the studies they found most compelling, and issue authoritative pronouncements. When experts disagreed, the field defaulted to the most prestigious or most vocal authority. This produced decades of medical practice based on what turned out to be wrong — hormone replacement therapy recommended to all postmenopausal women, routine episiotomies during childbirth, bed rest for back pain, aggressive treatment of mildly elevated blood pressure in the elderly.
The Cochrane Collaboration, founded in 1993, institutionalized a different approach. Instead of asking "which expert is right," systematic reviews ask "what does the totality of evidence show when we control for methodological quality?" Every relevant study is identified, evaluated for bias, and synthesized using predefined protocols. The result is not an expert opinion. It is a structured assessment of what the evidence actually supports, including explicit acknowledgment of where the evidence is insufficient, contradictory, or absent.
The critical insight from this system: when a Cochrane review finds that experts disagree about a treatment, the review does not pick a side. It maps the disagreement. It identifies whether the contradiction stems from different study populations, different outcome measures, different follow-up periods, different methodological quality, or genuinely conflicting evidence. The disagreement is decomposed into its structural components, and each component is assessed independently.
This is the most sophisticated thing you can do with any expert contradiction: instead of asking "who is right," ask "why do they disagree?" The answer to that question — different evidence bases, different methodological standards, different definitions of the outcome, different time horizons, different populations studied — is where the real information lives.
The ensemble principle: disagreement as diagnostic
Machine learning provides a precise formal analog. In ensemble methods, multiple independently trained models are combined to produce predictions. The key discovery in this field is that an ensemble of diverse, imperfect models routinely outperforms any single model — even the best individual model in the group.
The mechanism is instructive. Each model captures some aspects of the underlying pattern and misses others. Their errors, if the models are sufficiently diverse, are uncorrelated. When you aggregate their predictions — through voting, averaging, or more sophisticated combination methods — the individual errors cancel out and the shared signal strengthens. A single model with a twenty percent error rate, combined with four other diverse models of similar quality, can produce a majority-vote ensemble error rate below six percent, provided the errors are genuinely independent.
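That arithmetic can be checked directly. Assuming each model errs independently, the probability that a majority vote is wrong follows from the binomial distribution. A minimal sketch (the function name is illustrative):

```python
from math import comb

def majority_vote_error(n_models: int, p_err: float) -> float:
    """Probability a majority vote is wrong, assuming each model
    errs independently with probability p_err."""
    k_needed = n_models // 2 + 1  # errors needed to flip the vote
    return sum(
        comb(n_models, k) * p_err**k * (1 - p_err) ** (n_models - k)
        for k in range(k_needed, n_models + 1)
    )

print(majority_vote_error(1, 0.20))  # single model: 0.2
print(majority_vote_error(5, 0.20))  # five diverse models: ~0.058
```

The independence assumption is doing real work here: if the five models share the same blind spot, their errors correlate and the benefit shrinks. The same caveat applies to experts trained in the same tradition.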
But the ensemble's most valuable output is not its prediction. It is its uncertainty estimate — derived directly from the degree of disagreement among the component models. When all models agree, the ensemble is confident. When models diverge, the ensemble reports high uncertainty. Recent research has shown that this disagreement-based uncertainty is one of the most reliable indicators available for predicting where the ensemble's own accuracy will degrade. The disagreement does not just flag current uncertainty. It predicts future error.
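A minimal sketch of that mechanism, using the spread across member predictions as the uncertainty signal (the function and the sample numbers are illustrative, not drawn from any particular library):

```python
import statistics

def ensemble_predict(predictions: list[float]) -> tuple[float, float]:
    """Aggregate member predictions into a single estimate.
    Returns (mean prediction, disagreement), where disagreement
    is the standard deviation across members."""
    mean = statistics.fmean(predictions)
    disagreement = statistics.pstdev(predictions)
    return mean, disagreement

# Members converge: the ensemble is confident.
print(ensemble_predict([0.70, 0.72, 0.71]))  # low spread

# Members diverge: the ensemble reports high uncertainty.
print(ensemble_predict([0.20, 0.90, 0.55]))  # high spread
```

Both calls return roughly the same mean, but only the spread tells you how much to trust it. That is the number most expert summaries throw away.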
A 2024 study in Science Advances demonstrated that an ensemble of twelve large language models achieved forecasting accuracy statistically indistinguishable from aggregated human crowd predictions. The mechanism was the same: diverse models, aggregated through their disagreements, produced better calibrated predictions than any single model.
Your epistemic situation when facing expert contradiction is structurally identical. Each expert is a model — trained on different data (different clinical experience, different research emphases, different patient populations), with different architectural biases (different theoretical frameworks, different disciplinary norms). Their disagreement is your uncertainty estimate. It tells you where the evidence is thin, where hidden variables are operating, and where your confidence should be lowest.
The lesson from ensemble methods is not "ignore all experts because they disagree." It is "use the disagreement itself as your most honest measure of how much is actually known."
The philosophical problem: what should you believe?
The philosophical literature on peer disagreement addresses the hardest version of this problem. David Christensen argues for the conciliatory position: when an epistemic peer disagrees with you, rationality requires reducing your confidence, because their disagreement is evidence that the question is harder than your certainty suggests. Richard Feldman pushes further — when genuine peers disagree, the only rational response is for both to suspend judgment, because the existence of an equally qualified dissenter means the evidence itself is ambiguous.
The opposing "steadfast" view, defended by Thomas Kelly and others, holds that you can rationally maintain your position if your first-order evidence is strong enough. The mere fact that someone smart disagrees does not automatically override your assessment.
The practical synthesis: the existence of expert disagreement is always evidence that the question is harder than it looks. It does not tell you to stop believing things. It tells you to calibrate your confidence downward and invest effort in understanding why the disagreement exists. Christensen's insight is not "give up your views." It is "treat disagreement as data about the difficulty of the problem."
The nutrition case: what happens when you suppress the signal
Return to the dietary fat story from the opening — it is the clearest case study of what happens when expert disagreement gets resolved by authority rather than investigation.
In 1961, the American Heart Association recommended reducing saturated fat, based largely on Ancel Keys's epidemiological work. By 1980, the US government enshrined "low-fat" in the Dietary Guidelines. But this was never unanimous consensus. John Yudkin, a British physiologist, published Pure, White, and Deadly in 1972, arguing that sugar — not fat — was the primary dietary driver of heart disease. Yudkin was credentialed. His evidence was real. But Keys was more politically connected and more aligned with institutional momentum. Yudkin's dissent was marginalized.
The consequences were enormous. Low-fat guidelines drove food manufacturing toward high-sugar, low-fat products. Obesity and diabetes rates climbed. The expert consensus was wrong in a way that the existing disagreement had predicted — Yudkin's dissenting position contained the information needed to avoid the error. The field chose to suppress the dissent rather than investigate the gap.
In 2015, the Dietary Guidelines Advisory Committee removed cholesterol as a nutrient of concern. The "settled science" of 1980 had been unsettled by exactly the evidence the dissenting experts had pointed toward decades earlier. The lesson is not "all dissenting experts are right." It is: when credentialed experts disagree, the disagreement marks an unresolved empirical question. Suppressing it destroys the signal. Investigating it is how you find the hidden variables the field has not yet resolved.
A protocol for processing expert disagreement
Understanding that expert disagreement carries information is the conceptual shift. Extracting the information requires a practice.
Step 1: Map the disagreement precisely. Do not settle for "Expert A says X and Expert B says not-X." Specify the exact claims. Often, experts who appear to contradict each other are actually making claims about different populations, time horizons, outcome measures, or definitions. The first step is disambiguating whether you are facing a genuine contradiction or a surface one that dissolves under scrutiny.
Step 2: Apply Goldman's five sources. For each expert, assess: the quality and transparency of their arguments, the degree of agreement from other experts in the field, the assessments of meta-experts and systematic reviews, any interests or biases that might shape their conclusions, and their track records of calibrated prediction. This is not a formula that produces a clear winner. It is a structured method for weighting the disagreement rather than resolving it prematurely.
Step 3: Ask why they disagree, not who is right. Identify the structural source of the contradiction. Different evidence bases? Different methodological standards? Different theoretical frameworks? Different definitions of key terms? The answer to "why" is almost always more informative than the answer to "who." The fat-versus-sugar debate persisted because the two camps were using different study designs (epidemiological versus interventional), different outcome measures (blood lipids versus cardiovascular events), and different populations. The disagreement was an artifact of methodological pluralism, not of one side being stupid.
Step 4: Calibrate your confidence to the disagreement. The wider and more persistent the expert disagreement, the lower your confidence should be in any single position. This is not paralysis. It is calibration. You can still act — you have to eat breakfast, you have to make investment decisions, you have to raise your children. But you act with uncertainty proportional to the genuine state of knowledge, rather than with false certainty borrowed from whichever expert you found most persuasive.
Step 5: Record the disagreement in your contradiction journal. Note the domain, the opposing positions, the structural sources of disagreement you identified, and your current best assessment. Return to it when new evidence emerges. Expert disagreements are living contradictions — they evolve as new studies are published, new methods are developed, and new data becomes available. Your record of the disagreement becomes a running log of how an entire field's knowledge is developing.
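One possible shape for such a journal entry, sketched as a Python dataclass. All field names are hypothetical; this is an illustration of what to capture, not a prescribed format:

```python
from dataclasses import dataclass, field
from datetime import date

@dataclass
class DisagreementEntry:
    """One living contradiction, revisited as new evidence arrives."""
    domain: str
    position_a: str
    position_b: str
    structural_sources: list[str]  # why they disagree (Step 3)
    current_assessment: str
    logged: date = field(default_factory=date.today)
    updates: list[str] = field(default_factory=list)

entry = DisagreementEntry(
    domain="nutrition",
    position_a="Dietary saturated fat drives heart disease",
    position_b="Refined sugar is the primary dietary driver",
    structural_sources=[
        "epidemiological vs interventional study designs",
        "blood lipids vs cardiovascular events as outcomes",
    ],
    current_assessment="Genuine contradiction; confidence low",
)
entry.updates.append("2015: cholesterol removed as a nutrient of concern")
```

The `updates` list is the point: an entry you never revisit is a verdict, while an entry you keep appending to is a record of a field's knowledge evolving.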
The cost of choosing sides too fast
The deepest risk in expert disagreement is not confusion. It is premature certainty. When you pick one expert and dismiss the other, you collapse a genuinely uncertain situation into a falsely certain one. You stop tracking the disagreement, stop looking for new evidence, and stop being open to the possibility that the dismissed expert was pointing at something real.
Tetlock's hedgehogs failed not because they were unintelligent but because they were too certain — one big theory, everything contradicting it explained away. The foxes succeeded because they held multiple frameworks in tension and maintained calibrated uncertainty. Your goal is not to become an expert in every domain where experts disagree. It is to become skilled at reading the disagreement itself — asking what it reveals about what is not yet known, what hidden variables the experts are implicitly disagreeing about, and what a more complete model would look like.
Those questions will not always produce clean answers. But they will always produce better calibrated understanding than picking a side and calling it settled.
The bridge forward
You now have a framework for handling expert contradiction — Goldman's evaluation criteria, Tetlock's hedgehog-fox distinction, medicine's systematic review model, and the ensemble principle. But expert disagreement is the external version of a more intimate problem. The contradictions that matter most to your epistemic development are the ones inside you — between beliefs you hold, values you endorse, and commitments that pull in incompatible directions.
In L-0375, you will learn that those internal contradictions are not failures of your thinking. They are growth edges — precise markers of the places where your epistemic system is ready to evolve.