Facing the Facts about Test Score Gaps

The genetic hypothesis is the most plausible causal explanation

Dec 14, 2023

Written by Gregory Conner.

The existence of race-related differences in average intelligence test scores, such as the gap between the average scores of self-identified black and white Americans, is widely acknowledged, but the underlying causes remain a source of contention. In this essay I first review multiple strands of evidence for a causal role of genetic variation associated with biogeographic ancestry. I focus on the black-white test score gap; this gap is not the only observed difference in average intelligence test scores across racial groups but is the best studied. I describe seven mutually supportive lines of evidence indicating that this test score gap is partly caused by genetic variation between the black and white racial groups related to their differing average proportions of African and European ancestry. I reject as implausible the widely promulgated theory that the gap is entirely environmental. Other observed test score gaps, such as the white-East Asian gap, may also have partially genetic causes. I then address the sensitive question of whether public acknowledgement or suppression of these findings is wise social policy. The durable, diverse, and widespread evidence against the environment-only theory is now so strong that continued adherence to that theory may be unviable and counterproductive.

Although evidence has been steadily accumulating for many years, public discussion of genetic causes of race-related test score gaps is treated as toxic, particularly within the administrative structures of government, academic, and research institutions. The dominant institutional view is that genetic explanations for test score gaps should not be explored, any positive findings supporting such explanations must be suppressed, and when questioned publicly about such findings researchers and policy analysts must state that the evidence shows that the causes are entirely environmental. In other words, a widespread institutional noble lie has prevailed. A noble lie is a statement by knowledgeable elites that is factually untrue but is believed to be justified because it promotes another social good, such as social equity or harmony.

The taboo against criticism of the environment-only theory is not usually stated explicitly, but this taboo is deeply embedded in social and institutional practices. Research grant applications and submitted research papers that are in any way critical of the environment-only theory are swiftly rejected. Academics and others who publicly cast doubt on the veracity of the environment-only theory, even if their comments are quite circumspect, are often harassed by activists and administrators and sometimes fired.

Earlier papers have listed and discussed multiple lines of evidence contradicting the environment-only theory. I do not repeat all the technical details covered in earlier papers; I provide an up-to-date and accessible overview, with footnotes and citations relegated to a technical version publicly available online. The first goal of the essay is to convince readers that a large and diverse body of evidence shows that the environment-only theory is false, at least for the case of the black-white test score gap. I then discuss the implications of this large body of contrary empirical evidence for the social desirability and long-term viability of the environment-only theory as the dominant public paradigm.

Definitions of self-identified race and ethnicity and of biogeographic ancestry:

There are two distinct systems of categorization associated with racial identity, one socially defined and the other genetically defined. The socially defined categorization relies on the standard Self-identified Race and Ethnicity (SIRE) question which is a staple of social scientific research. Individuals are simply asked to describe their own race and ethnicity by choosing one or more from a list of race/ethnicity categories.

The other race-related category definition has a genetic foundation. Following the completion of the human genome project in 2003, researchers discovered that they could genetically divide human populations into biogeographic ancestry groups based on patterns of genetic similarity and difference. For analysis of the US population, the most appropriate biogeographic ancestry groups are the five groups: Eurasian, African, Amerindian, East Asian, and Oceanian. Recent advancements in statistical methodology now enable very accurate measurements of proportional ancestry at the continental level.

The two race-related category definitions are conceptually distinct, but there are some strong statistical relationships between them. In the US population, self-identified membership in socially constructed racial categories is highly correlated with membership in the corresponding biogeographic ancestry groups defined genetically. Only a few individuals show a clear discordance between genetic cluster membership and their self-identified race/ethnicity.

The test score gaps are measured using the SIRE categories, not the biogeographic ancestry categories. The key scientific question to be confronted in this essay is whether the link between SIRE and test scores is partly due to cognition-related genetic variation, as posited by the mixed genetic/environmental theory, or whether this link is entirely non-causal as posited by the environment-only theory.

Cognitive ability test scores and their meaning:

Cognitive ability has been defined by Rindermann as “The ability to think (intelligence), knowledge (the store of true and relevant knowledge) and the intelligent use of knowledge.” Cognitive ability can be measured using intelligence tests or by using academic performance tests since these tend to correlate strongly with general cognitive ability.

Intelligence test scores can be rescaled by multiplying all scores by a constant and/or by adding a constant to all of them. The two most commonly used scales are the IQ scale, in which the mean test score is set to 100 and the standard deviation is set to 15, and the standardized scale, in which the mean test score is set to zero and the standard deviation is set to 1. A gap of 1.0 on the standardized scale equates to a gap of 15 on the IQ scale.
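The rescaling described above is a linear map between the two scales. A minimal sketch in Python follows; the function names are illustrative and not drawn from any cited source.

```python
def standardized_to_iq(z, mean=100.0, sd=15.0):
    """Map a standardized score (mean 0, SD 1) to the IQ scale (mean 100, SD 15)."""
    return mean + sd * z


def iq_to_standardized(iq, mean=100.0, sd=15.0):
    """Map an IQ-scale score back to the standardized scale."""
    return (iq - mean) / sd


def gap_to_iq_points(gap_sd, sd=15.0):
    """Convert a difference expressed in SD units to IQ points.

    The additive constant cancels when two scores are subtracted,
    so only the multiplicative constant affects a gap.
    """
    return sd * gap_sd
```

Because the added constant drops out of any difference between two scores, a gap depends only on the multiplier, which is why 1.0 on the standardized scale corresponds to 15 on the IQ scale.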

Intelligence test scores statistically correlate with social and economic outcomes including academic achievement, career progression, employment stability, and long-term income. The predictive power of intelligence test scores does not vary notably across blacks and whites, so that the predictive value of a given score for educational and career achievement is not notably impacted by the racial self-identification of the individual.

Multiple lines of evidence on race-related test score gaps

This section reviews multiple lines of evidence relevant to evaluating whether observed intelligence test score differences across self-identified racial and ethnic groups can be attributed to genetic variation associated with biogeographic ancestry or whether these observed differences are entirely environmentally caused. The lines of evidence presented below vary in their strength; researchers reasonably disagree about the importance and/or strength of the various strands. Two principles of scientific inference are critical for reading the section. One: the reader must balance the totality of evidence across these diverse, multiple lines of evidence. Two: the reader must be able to decouple their political/moral concerns about racism from their objective evaluation of the evidence on purely scientific grounds. This rational decoupling does not diminish the importance of careful consideration of whether public acknowledgement of the findings could generate an increase in racism; those concerns are treated separately later in the essay.

I place a strong focus on the black-white test score gap which is the most carefully examined cognitive ability gap across socially defined racial groups; some other race-related test score gaps are mentioned briefly.

First line of evidence: The intractable black-white test score gap

The environment-only theory predicts that the black-white test score gap can be closed by eliminating environmental impediments to black cognitive development and learning. After six decades of aggressive policy measures in the US to address the gap, a very substantial black-white test score gap stubbornly remains.

There was a decline in the gap on educational performance tests during the 1970s and 1980s, probably due to improved educational opportunities for the black population born in the late 1950s and after, but no noticeable change over the subsequent thirty-year period, post-1990; see Murray (2007) and Murray (2021) for a detailed discussion.

Roth et al. conducted a comprehensive meta-analysis of 105 studies, including standard general intelligence tests, academic achievement tests, mass-conscription general intelligence tests of military personnel, and private employment-based tests. The meta-analysis was restricted to studies meeting various quality control criteria and covers an aggregate sample of 6,246,729 individuals. On the basis of this wide-ranging meta-analysis, they concluded that the black-white general intelligence test score gap is approximately one standard deviation (15 IQ points). Estimates from two recent, carefully constructed cognitive development research databases reproduce this value. The Philadelphia Neurodevelopmental Cohort is a population-representative database consisting of 9,421 eight- to twenty-one-year-olds in the Philadelphia metropolitan area, all of whom self-identified their racial and ethnic identities and took the Penn Computerized Neurocognitive Battery of cognitive tests from 2010-2013. The general intelligence factor scores computed from their test results gave a black-white score gap of 1.01 standard deviations. The Adolescent Brain Cognitive Development database consists of a nationally representative sample of 10,370 nine- and ten-year-olds, who took the NIH Toolbox Cognitive Battery of tests in the period 2018-2021, and had their parents or guardians declare their racial and ethnic identities. The estimated general intelligence factor test score difference between white and black individuals in this database is 1.03 standard deviations.

The stubborn existence of the test score gap despite half a century of ameliorative policies is matched by its geographical uniformity. Reardon et al. obtained access to over 100 million student achievement test scores from the period 2009-2013, covering virtually the entire US public school population, for a varying selection of test scores from school grades three through eight during those four years. Carefully aggregating across school grades and test scoring scales, they found that the average black-white school achievement test score difference equalled 0.70 standard deviations. Using an empirical Bayesian estimate to control measurement error, they found a positive black-white test score gap in every single one of the 2,899 school districts in their sample. Under the mixed environment/genetic theory with a substantial genetic component to the gap, this uniform statistical outcome across 2,899 school districts is unsurprising. It would be a remarkable outcome if the environment-only theory were true.

Second line of evidence: Cross-national comparisons of average cognitive ability

The environment-only theory of the black-white gap predicts that the gap will exist in countries/regions where racial discrimination-linked disadvantages push down black scores. The evidence indicates, on the contrary, that the gap exists worldwide.

The environment-only theory attributes the US black-white test score gap to environmental features which are specific to the US, including the legacy of black enslavement in the 18th and 19th centuries, the effects of pervasive racial discrimination, and associated dislocations in the US black learning environment. These US-specific explanations become strained once international test score evidence is included in the analysis. The international data on test score performance shows a uniform, worldwide underperformance of all populations with predominantly African biogeographic ancestry on cognitive tests when compared to populations with predominantly European or East Asian biogeographic ancestry.

The World Bank gives harmonized national average academic achievement test scores for students in 174 countries. The scores are calibrated so that an individual student’s score of 300 corresponds to minimal attainment and 625 is advanced attainment. The 36 sub-Saharan African countries in the database have an average national score of 366.3. The two Caribbean countries in the database whose populations have predominantly sub-Saharan ancestry, Jamaica and Haiti, have average national scores of 387.1 and 337.8, respectively, in the same very low range as those in sub-Saharan Africa. The 41 European countries in the database have an average national score of 486.6 and the 11 East Asian countries have an average national score of 491.0; the US national score is 511.8.

Gust et al. used statistical inference and a comprehensive micro database of international and regional achievement tests to map academic achievement of youths in 159 countries on to a common measurement scale. They estimated that only 6% of youth in sub-Saharan Africa had the basic academic skills necessary to be internationally competitive in the modern global economy. This compared to 72% in Europe, 76% in North America, and 71% in East Asia and the Pacific.

At the national level, there can be an interaction effect between genetic variation and environmental influences on academic performance: nations where average cognitive ability is low tend to have low per-capita income and a less enriched educational environment. This interaction effect together with a partially genetic component to the gap can explain the extremely large academic achievement and IQ gaps between sub-Saharan Africa and North America/Europe/East Asia. A purely environmental explanation is much more difficult to justify.

Third line of evidence: Spearman’s Hypothesis

The environment-only theory predicts that the test score gap will be larger for intelligence test questions which are more susceptible to environmental influences. Empirically, the opposite holds: the gap is larger for intelligence test questions more strongly linked to genetic influences and smaller for test questions more strongly linked to environmental influences.

Some intelligence test questions have a stronger dependence on underlying general intelligence than others; for example, correctly recalling strings of digits (conventionally called digit span) tends to be less indicative of underlying general intelligence than correctly recalling and reciting strings of digits in reverse order (conventionally called reverse-digit span). A question or sub-test which is closely tied to underlying general intelligence is said to have a high g factor loading, referring to its strong link to the g factor of general intelligence. Scores on sub-tests with high g factor loadings tend to have higher heritability, whereas scores on subtests with low g factor loadings tend to be more susceptible to environmental influences.

Spearman’s Hypothesis, fully explicated by Jensen but named after Charles Spearman, posits that if the black-white gap has substantial genetic causes then the relative size of the gap across intelligence sub-tests will be positively correlated with the relative g factor loadings of the subtests. Conversely, if the gap has only environmental causes as posited by the environment-only theory then this correlation should be zero. The evidence shows that the correlation is strongly positive, suggesting substantial genetic causes of the gap. There is also some evidence of an analogous positive correlation when comparing the performance of Hispanics and whites and when comparing (opposite sign gap) East Asians and whites. This indicates that there may also be a genetic component to the (smaller) Hispanic-white test score gap and the (opposite in sign) East Asian-white test score gap. Warne provides a detailed discussion of empirical evidence regarding the Spearman hypothesis and its implication that the black-white test score gap is partially genetically caused.

Fourth line of evidence: The low power of shared environment in explaining cognitive ability

The environment-only theory posits that test score gaps can be explained by within-group shared environmental influences which differ across individuals with different racial identities. Given the low statistical power of shared environmental influences in explaining observed variation in cognitive ability, the implied difference between black and white environments would need to be extraordinarily large to fully explain the magnitude of the observed US black-white test score gap.

Jensen pointed out that the magnitude of the US black-white test score gap, and its presence over the entire life cycle from young adolescence to retirement age, is very difficult to reconcile with established findings regarding the magnitude of shared environmental influences on test scores. Using a reasonable linear approximation, the variation in cognitive ability across individuals can be statistically decomposed into three components, 1. genetic variation (inherited from parents’ DNA), 2. shared environmental influences (common environmental effects across siblings raised together), and 3. non-shared environmental influences (individual-specific environmental influences which are not common within the family). The environment-only theory of a racial test score gap attributes all the test score gap to component 2, in particular to the difference in average shared environment across the two racial groups. The problem with this explanation is that in the contemporary US environment of universal public education and adequate child nutrition the magnitude of shared-environment variation in cognitive ability is very small.

Fifth line of evidence: Admixture regression with cognitive test scores

The environment-only theory predicts that the gap will be related to black and white social identity and not related to African/European genetic admixture except as this admixture proxies for social identity. Admixture regression finds the opposite: the gap is strongly related to African/European genetic admixture and shows almost no statistically identifiable connection to black/white social identity.

Admixture regression tests provide the cleanest and most robust evidence for a substantial genetic contribution to the observed black-white test score gap. Although the details are technical, the basic idea is not. The regression relates individuals’ test scores to their proportions of biogeographic ancestry (what many would call race), such as African, European, East Asian, and Amerindian. The regression analysis also includes self-identified race and ethnicity (SIRE) and often other socially based variables that might affect cognitive ability. By including both admixture proportions and SIRE variables, admixture regression identifies the separate influences on cognitive ability of racial identity (captured by the SIRE identity variables) and genetic variation (captured by the admixture proportions determined from DNA). The environment-only theory predicts that the SIRE variables and other socially defined variables will explain score disparities and that the genetically defined admixture proportions will not; the mixed genetics/environment theory predicts that both sets of explanatory variables, including crucially the genetic admixture proportions, will have explanatory power.

The admixture regression technique relies on the biological feature that inter-ancestry mating randomly combines the genetic variants responsible for differences in cognitive ability across individuals in proportion to their ancestries. For example, if genetic variation contributes to differences in cognitive ability across African and European ancestries and inter-ancestry mating is random with respect to cognitive ability, then an individual who has 60% African and 40% European ancestry will have a 60%-40% expected weighting of the cognitive-ability related genetic variants from these two populations. This leads naturally to a simple linear relationship between individuals’ vectors of ancestry proportions and their expected cognitive abilities.

Admixture regression greatly improves the ability to separate environmental and genetic causes of ethnic and racial group differences in an array of medical, anthropological, and behavioral traits and was once championed by environmentalists as a way to solve the causality debate. Admixture regression analysis has been used over the last two decades by genetic epidemiologists and other researchers to study race and ethnicity related differences in alcohol dependence, height, asthma risk, cardiovascular disease, sleep depth, cigarette smoking behavior, metabolomics, cancer, and diabetes. With the empirical success of admixture regression in explaining ethnicity and race related differences in this wide array of medical and behavioural traits, the application of the technique to cognitive ability differences is natural and inevitable. It is “controversial” in this application only due to the political sensitivity of the findings.

Across a wide range of studies, there is a substantial and highly significant negative regression coefficient linking cognitive ability test scores to African admixture proportions. Other ancestry admixture proportions also show some statistically significant explanatory power; East Asian ancestry often has a positive coefficient, but the evidence is less conclusive than the negative coefficient associated with African ancestry. In many studies the SIRE variables are surprisingly unimportant: their estimated coefficients are often relatively small and sometimes statistically insignificant. Multiple studies using a range of different data sources all find substantial and statistically significant negative coefficients for African ancestry explaining cognitive ability in US sample data.

Fuerst explored the robustness of the finding of a negative impact of African ancestry on cognitive ability by testing the sensitivity of the result to sample selection rules and included/excluded explanatory variables. Across the full set of different tests, African ancestry had a strong negative impact on cognitive ability, whereas the SIRE-linked identity variables often had a measured impact not significantly different from zero. In terms of statistical fit, black/white identity was not a major source of the test score gap; it was mostly due to genetic variation associated with biogeographic ancestry—in particular, African versus European biogeographic ancestry.

Sixth line of evidence: Brain size differences across biogeographic ancestries

The environment-only theory does not predict any difference in average brain size linked to biogeographic ancestry. Since brain size is correlated with cognitive ability, the alternative theory of mixed genetic/environment causes fits naturally with brain size differences across ancestries. The evidence finds substantial differences, with on average larger brains among individuals with higher proportions of European versus African ancestry.

One of the early pieces of scientific evidence pointing toward a genetic component to test score gaps was the discovery of average brain size differences between individuals with African, European, and East Asian ancestry. Individuals with African ancestry have average cranial volume 6% below those with European ancestry, whereas individuals with East Asian ancestry have average cranial volume 1.3% higher. Although the link between brain size and cognitive ability is not conclusive, the implication is strong. Statistical analysis shows a moderate correlation between brain size and cognitive ability.

The African/European average brain size difference was first identified by pioneering American scientist Samuel Morton in the 19th century, but this scientific knowledge was institutionally “forgotten” following the 1981 publication of Stephen J. Gould’s widely praised book, The Mismeasure of Man. Gould claimed that his own re-examination of the brain size evidence showed the earlier work to be sloppy and motivated by racial prejudice. He accused Morton and other scientists who had found such differences of deliberate or clumsy errors, in part motivated by their in-built racial animosity toward non-whites.

During the summer of 1971 I spent several weeks reanalyzing Morton’s data … In short, and to put it bluntly, Morton’s summaries are a patchwork of fudging and finagling in the clear interest of controlling a priori convictions.

Gould made clear that Morton’s purportedly “fudged and finagled” measurements were motivated by Morton’s racial prejudices:

Morton began his first and largest work, the Crania Americana of 1839, with a discourse on the essential character of human races. His statements immediately exposed his prejudices.

Gould’s intimation that any research showing race-related brain size differences was motivated by the researcher’s racial prejudices helped discourage further inquiry.

It took the technical innovation of magnetic resonance imaging (MRI) to restore previous scientific knowledge of average brain size differences after Gould’s warmly received critique. The pre-MRI measurements, which Gould had disparaged as the product of gross incompetence by scientists motivated by their racial prejudice, were shown to be accurate.

Rushton noted that before Gould prepared the 1996 revised version of his book, newly available MRI techniques verified the pioneering measurements by Morton that Gould extensively attacks, yet Gould made no substantive textual adjustments or mention of this fact in his revision. Gould’s book is best understood not as a popular science book but rather as a political document designed to persuade people of the environment-only theory irrespective of scientific evidence. Although he never stated it explicitly for obvious reasons, Gould presumably felt that perpetuation of the noble lie was more important than adhering to scientific accuracy. Viewed from this alternative perspective, the book must be acknowledged as a resounding success. The book’s powerful influence on the political milieu within the academic and scientific establishment helped to slow down the pace of new findings which contradict the environment-only theory, and to prevent such findings’ public dissemination.

Seventh line of evidence: New findings on the recent evolution of intelligence-related genetic variation

Recent genomic research shows that patterns in genetic variation associated with higher intelligence increased substantially in the European population over the last ten thousand years. Parallel evolution in other regions could have similar effects after the African dispersal of humankind to multiple continents, but the impact on average intelligence of any such parallel evolution is unlikely to be exactly uniform worldwide.

Until the early years of this century, the genetic component of human cognitive ability could only be observed indirectly, by using heritability-linked sorting variables such as in twin and adoption studies. With the completion of the Human Genome Project and the subsequent genomic revolution, a new frontier has opened for the measurement of the genetic component of many human traits. Researchers have been able to create genetic-variant-based indices for human traits, including cognitive ability. These indices, called polygenic risk indices, map an individual’s genotyped DNA (which is a list of the individual’s genetic variants) into a single number that partly captures the genetic component of the observed trait for that individual. An individual’s measured value from application of the index to their genotyped DNA is called their polygenic risk score.
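The mapping from a genotype to a single number can be sketched as follows. The essay does not say how any particular index is built. This sketch assumes the common construction of a weighted sum of per-variant allele counts, and the function name, weights, and counts are hypothetical values chosen only for illustration.

```python
def polygenic_score(allele_counts, weights):
    """Weighted sum of allele counts (assumed construction, not from the essay).

    allele_counts: number of copies (0, 1, or 2) of the scored allele per variant.
    weights: per-variant weights, e.g. effect sizes estimated in an association study.
    """
    if len(allele_counts) != len(weights):
        raise ValueError("allele_counts and weights must have the same length")
    return sum(w * c for w, c in zip(weights, allele_counts))


# Hypothetical three-variant index applied to one genotype.
example_weights = [0.5, -0.25, 0.125]
example_counts = [2, 1, 0]
example_score = polygenic_score(example_counts, example_weights)
```

Real indices aggregate many thousands of variants, and the weights are estimated rather than known, which is one reason scores built in one sample may not be directly comparable in another.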

Kuipers et al. estimated a dynamic statistical model of genetic variation linked to several human traits, thereby tracking the evolution of modern European ancestry from earlier genetic ancestry. Their findings rely on the massive database of genotyped archaic human DNA maintained by the Reich Laboratory at Harvard University. The Reich Laboratory database provides genotyped records of recovered archaic DNA from human skeletal remains time-dated using radiocarbon techniques. Each genotyped DNA record is geolocated to its recovery site. The Reich Laboratory database has particularly strong coverage across the European continental region. Kuipers et al. were able to map 827 samples of European-area archaic DNA genotypes into a modern polygenic risk index for intelligence, and thereby calculate polygenic risk scores for intelligence for these archaic genotyped DNA samples. They repeated the procedure for 250 modern European genotyped DNA samples. Using this data they tracked the intelligence-related polygenic risk scores of the European population over the long period from 40 thousand years ago to modern day. They found a strong positive trend in polygenic risk scores for intelligence over the period from ten thousand years ago to modern day. Based on these findings, they noted:

The strong increase in social complexity resulting from the Neolithic revolution and the process of urbanization and occupational specialization are likely factors that could have driven the evolutionary advantage of improved intelligence-related scores.

Parallel evolution in other continental regions could have analogous impacts on genetic variation there, but it is implausible that any such parallel evolution would result in a completely homogeneous worldwide statistical distribution of cognitive-ability-related genetic variation.

Early research comparing polygenic risk scores across continental ancestries supports the existence of cognitive ability differences tied to the evolution of genetic variation after humanity’s African dispersal. The cognitive-ability-related average polygenic risk score for academic achievement of black SIRE individuals is significantly lower than that of white SIRE individuals. Piffer directly connected average polygenic score differences to biogeographic ancestries (African, European, East Asian) and showed that the observed relationship between polygenic score differences and ancestries partly explained the observed relationship between average test score differences and ancestries. Lasker et al. found that twenty percent of the observed negative relationship between African ancestry and cognitive ability shown via admixture regression was explained by the lower polygenic risk scores associated with higher African ancestry. All these cross-ancestry findings are not yet definitive since they are dependent to some degree upon the comparability of cross-ancestry polygenic risk scores.

For the lines of evidence based on new genomic methods a troubling new counterstrategy does not attempt to logically argue against the findings (though a few scholars do engage the genetic literature, which is laudable). Instead, the new counterstrategy is to block their publication and eliminate genetic data access for anyone who makes such findings public, on the grounds that the results are stigmatizing toward vulnerable minorities. This enforced-ignorance counterstrategy is not logically coherent if one simultaneously claims that the environment-only theory is true. If the theory were true, then new research findings would tend to support rather than refute it, potentially reducing stigma.

Should the Findings be Openly Acknowledged or Suppressed?

The multiple lines of evidence surveyed above all point toward the mixed genetic/environment theory and against the environment-only theory of the black-white test score gap. This has been noted previously by others. Jensen and Rushton and Gottfredson described the consilience of the multiple-component evidence supporting the mixed genetic/environment theory. They noted that the lines of evidence on the black-white test score gap are based on different empirical methodologies and widely diverse data sources, yet the mixed genetic/environment theory explains them seamlessly: the various empirical findings complement each other and point toward a cohesive theoretical structure. The environment-only theory, on the other hand, relies on a patchwork of explanations to counter each separate line of empirical evidence against it. Winegard et al. made a similar point and discussed how the observed findings might reflect regional variation in recent human evolutionary history. Warne carefully delineated five lines of evidence and concluded that they show that the environment-only theory is false. Fuerst et al. used the Adolescent Brain Cognitive Development database of genomic and test score data on ten thousand U.S. adolescents to empirically replicate and expand upon Warne’s five lines of evidence.

Is it plausible for a rational and honest individual to believe the environment-only theory of the black-white test score gap? The two modifiers “rational and honest” are critical. Emotional or spiritual beliefs need not have rational foundations and therefore a sanctified belief in the environment-only theory cannot be ruled out. Also, some commentators take a moral stance that it is best to dishonestly espouse the environment-only theory “whether or not” it is true; see, e.g., Dennett, who stated this explicitly. The willingness to be dishonest for a good cause is usually left unstated since directly stating that one is lying reveals the lie. Also, the best way to propagate a lie is to believe it, so individuals who feel morally bound to lie about the environment-only theory are likely to internalize that dishonesty and effectively “believe it” by any observable criteria.

Advocates of the environment-only theory have offered rejoinders to all seven lines of evidence discussed above. A problem with piecemeal responses to the various lines of evidence is that even if some of the responses are at least plausible, when aggregated together the responses go from plausible to implausible. Seven diverse, extensive lines of evidence against the environment-only theory are too many for that theory to remain credible. If one is scientifically rational and honest, one is forced to acknowledge the mixed genetic/environment theory as the clearly superior theory of the black-white test score gap. Other test score gaps may also have a genetic component.

Reasons that the environment-only theory dominates in the public arena

A large group of thoughtful and informed people, many of them presumably familiar with the contrary evidence, remain active proponents of the environment-only theory. It seems clear (although usually unspoken) that many scientifically informed people have personally decided that only environmental explanations should be encouraged in public discourse and that any evidence pointing toward partially genetic causes should be downplayed or suppressed completely. This is justifiable if one believes that avoiding any potential increase in racial hostility more than outweighs any costs of scientific dishonesty. Such people feel an ethical obligation to publicly espouse the theory, to deliberately obfuscate or downplay the strong evidence against it, and to block renegade research which might reveal its flaws. The relative size of this activist group is difficult to document explicitly, for the obvious reason that the noble lie cannot be openly acknowledged without damaging it.

By its very nature, publicly advocating the noble lie is logically complex. A proponent cannot state openly “everyone should suppress the evidence about the falsity of the environment-only theory in order to realize the social benefits from popular ignorance.” There is a Bertrand Russell-like paradox: a person cannot publicly espouse the noble lie without thereby undermining it; arguments must be hidden in subterfuge. Cofnas documented some of the garbled combinations of scientific half-truths intermixed with emotion-laden statements of moral principles that result when prominent thinkers attempt to shield the noble lie from criticism while denying its existence.

Not all advocates of the environment-only theory support it on purely rational grounds. For many people, the environment-only theory is such a deeply cherished component of their belief system that they self-censor evidence contradicting it; the environment-only theory acts effectively as a sanctified belief rather than a scientific one. These sanctified belief holders emotionally equate a strong commitment to the moral stance that “all people are created equal” to a scientific statement that “all races are identical” in terms of the statistical distribution of genetic variation as it impacts average cognitive ability conditional upon race. These two large groups, those who oppose honest disclosure of the weaknesses in the environment-only theory on ethical grounds and those who accept the theory unquestioningly as a sanctified belief, together constitute a formidable force. They shield the environment-only theory from most public criticism, protecting it as a hegemonic doctrine that it is dangerous to contradict.

Many individuals outside these two groups are privately aware of the evidence and personally oppose scientific dishonesty but cannot publicly express any misgivings for fear of personal or professional retribution. This can be a logical and morally justifiable personal strategy: renegades who openly criticize the environment-only theory can face severe consequences. Rindermann et al. surveyed the opinions of intelligence experts (anonymously) on the sources of the black-white intelligence score gap. In the survey, 49% of experts attributed the gap to 50% or more genetic causes, and over 80% attributed the gap to at least 20% genetic causes. Only 16% adhered to the publicly dominant environment-only theory, assigning 0% of the black-white test score gap to genetic causes. An earlier poll of intelligence experts had similar findings. These polls expose a giant chasm between the private views of intelligence experts and publicly available “expert opinion,” which almost never mentions the possibility of partially genetic causes of the gap. These anonymous polls make clear that many experts are aware of the evidence but choose not to speak or write about it in non-anonymous communication channels.

Genetic explanations of test score gaps are distasteful to modern sensibilities, socially uncomfortable, and may (possibly) increase racial disharmony in American society. Most people only tangentially interested in the topic find it intellectually challenging and uncomfortable to consider alternative theories and have no incentive to go against the reigning orthodoxy. For this reason and those described above, the environment-only theory reigns supreme in all major academic and research institutions.

The social desirability of the noble lie about test score gaps

Given the overwhelming predominance of the environment-only theory in the public sphere, is its position as the reigning orthodoxy secure despite the strong evidence against it? Should a false scientific theory be promoted in the interest of racial harmony? This subsection considers these difficult questions.

It is impossible to reliably assess the claim that public honesty about statistical links between genetic ancestry and cognitive ability would open the floodgates to white racial supremacist political movements or revive segregationist sentiment in US society. Public honesty could have exactly the opposite effect: it might improve social harmony between races rather than worsen it, since it offers the possibility of knowledge-based solutions to social discord rather than requiring ignorance-based ones. Which outcome is the more likely consequence of public openness on this topic cannot be reliably forecast.

The widespread policy of suppression and censorship, although done with good intentions, can have nasty unintended consequences. With the slow but inevitable accretion of these existing findings into broader public awareness, the proportion of the educated populace who can be convinced to honestly believe this scientifically untenable theory will continue to shrink, thereby lowering the policy’s social benefits and increasing its social costs. Compared to earlier decades, the accumulation of contrary evidence is now so strong that it is no longer possible for a well-informed individual to accept the environment-only theory by giving it the benefit of the doubt: there is not enough doubt left. A well-meaning policy of information suppression can be very destructive of trust in scientific, academic, and media institutions. If the dominant strategy continues to block honest discussion, cynicism toward the research, media and policy establishment will increase as more people realize that mainstream authorities are being disingenuous. As the noble lie becomes more transparent to more people, it may dangerously erode social trust and democratic legitimacy.

Current findings on race-related test score gaps generate a clash between two very common, deeply cherished beliefs: the belief that all people in positions of responsibility should work diligently toward eliminating racial animosity in society, and the belief that open and objective scientific methods should be employed to advance human knowledge. The harsh reality of partially genetic causes of test score gaps puts these two cherished beliefs in conflict. Aiding the progress of science by openly acknowledging current findings might potentially contribute to increasing racial animosity (there is no guarantee that this would happen, but the possibility cannot be denied). On the other hand, suppressing current findings is a clear violation of the core belief that one should encourage open scientific discourse to advance human knowledge and social progress. Everyone aware of the findings is forced to choose which cherished belief matters more to them; there is no universal guidance as to which is the “correct” choice. Depending upon one’s personal preferences, background, and upbringing, a deep personal dedication to the fight against racism might override any concerns about abandoning scientific principles of openness. For others, unimpeded scientific inquiry is so central to their beliefs that it outweighs any social policy considerations. Advocates on both sides will be tempted to rationalize away any internal conflict: scientists who are devout anti-racists will convince themselves that the noble lie is actually true; individuals with a passionate devotion to the pursuit of scientific truth will convince themselves there is no risk of stirring racial animosity by being honest. Such rationalizations push the two sides further apart and, furthermore, the two opposing camps never have an opportunity to debate their differing views.

Acknowledging a lie reveals the lie, hence those in favor of the noble lie strategy do not usually publicly defend the strategy or engage in honest debate regarding its strengths and weaknesses. This leaves no natural mechanism for open, democratic discussion between opponents and proponents. Who gets to decide on this harshly enforced policy of dishonesty and censorship? Some method must be found to allow democratic society to openly discuss the noble lie’s costs, benefits, and net social value.

Conclusion

The core goal of science is to help society learn true facts about the world, but some scientific findings can be extremely difficult for society to absorb. This is the case with research on the observed differences in cognitive ability across racial and ethnic groups. In the mid-twentieth century, a politically and socially appealing theory arose that all such observed differences were entirely due to the differing environmental stresses experienced by these groups and not at all to genetic variation across biogeographic ancestries such as African, European, and East Asian. Although the evidence in support of this environment-only theory was always tentative, the theory has been aggressively propounded by its many passionate advocates, with powerful backing from government, academic and research institutions. The evidence against the theory has grown progressively stronger over time, but its public dominance has so far survived unimpeded. The penalties for publicly doubting its veracity can be extremely harsh.

This essay violates the widely imposed taboo against publicly questioning the environment-only theory. Seven major weaknesses are reviewed, focussing on the special case of the black-white test score gap. Confronted by this strong, diverse evidence, the environment-only explanation of the black-white test score gap does not retain scientific credibility. The hegemonic doctrine that the black-white test score gap has only environmental causes endures as a socially appealing falsehood, a Noble Lie. Other race-related test score gaps may also have a genetic component.

Although it is no longer credible, the environment-only explanation of test score gaps continues to hold an iron grip on acceptable public discourse. This reflects several strong influences: one, people worry that public acceptance of partially genetic explanations might provide a spur to racial discord and this leads them to deny or downplay adverse findings; two, some individuals regard the absence of genetic influences on racial test score gaps as a sacred value that cannot be questioned; three, for most people it is socially comfortable to accept the environment-only theory but uncomfortable to consider the alternative; and four, the potentially career-destroying reactions of aggressive activists keep many doubters quiet. Partially genetic explanations are broached in private conversations between trusted confidants, or via anonymous communication channels, or by renegade voices outside the circle of institutionally acceptable discourse.

There is a reasonable argument, rarely made explicitly, that fabricated confirmation of the environment-only theory serves a positive social role by aiding racial harmony. This must be balanced against the costs of public deception and the concomitant erosion of social trust. The tradeoff depends to some degree upon the credibility of the theory. The public dominance of the environment-only theory needs a careful reevaluation since it is no longer consistent with rapidly accumulating evidence.

Gregory Connor is newly retired from his position as Professor of Finance, Maynooth University, Ireland. He previously taught at the London School of Economics, the University of California, Berkeley, and Northwestern University.
