please confirm that you agree to abide by our usage policies. For example, on tests of spatial skill, requiring visualization and imagery, men and boys tend to score higher than do women and girls. This latter determination commonly based on effect size, which is easily obtained by squaring the partial r value. Dickens, W. T., & Flynn, J. R. (2006). Put another way, there is always more within-group variability in performance on mental tests than between-group variability. 2. This condition is referred to variously as homogeneity of regression across groups, simultaneous regression, or fairness in prediction. 34, pp. Bias in Psychological Assessment - Wiley Online Library Explain the cultural test bias hypothesis. A number and variety of criteria need to be explored further before the question of bias is empirically resolved. Bias in psychological assessment: An empirical review and recommendations. Related to inappropriate test content mentioned earlier, this position asserts that the tests measure different constructs when used with children from other than the middle-class culture on which the tests are largely based, and thus do not measure minority intelligence validly. Because most psychologists are White and speak only standard English, they may intimidate Black and other ethnic minorities and so examiner and language bias result. Describe the results of research on the presence of bias in other internal features of educational and psychological tests. (DOC) Cultural Bias in Psychological Assessment - Academia.edu ), The bell curve wars: Race, intelligence, and the future of America (pp. 82-113). 545573). New York, NY: Plenum Press This chapter provides a thorough review of the literature. Rushton, J. P., & Jensen, A. R. (2005). Journal of School Psychology, 25, 309312. As a result of the inappropriate standardization samples, the tests are unsuitable for use with minority children. Prior to examining the evidence on the CTBH, the concept of culture-free testing and the definition of mean differences in test scores as test bias merit attention (Special Interest Topic 15.5). A large reliability coefficient does not eliminate the possibility of bias, but as reliability is lowered, the probability that bias will be present increases. (1999). The extent to which these factors influence individual as opposed to group performance is difficult to determine. The structure of ability tests for other groups has been researched less extensively, but evidence thus far with Chinese, Japanese, and Native Americans does not show substantially different factor structures for these groups. Psychology, Public Policy, and Law, 11, 235294. 545576). Understanding the Phenomena of Cultural Bias With Examples A variety of views toward bias have been expressed in many sources; many with differing opinions offer scholarly, nonpolemical attempts directed toward a resolution of the issue. Feature Flags: { They also prompted the development of new, up-to-date IQ tests and more frequent revisions or updating of older tests. The scoring of the items is improper, unfairly penalizing the minority child because the test author has a Caucasian middle-class orientation that is reflected in the scoring criterion. Gender Issues in Psychological Testing of Personality and Abilities This represents inequitable social consequences. Certainly data reflecting racial differences on various aptitude measures have been interpreted to indicate support for a hypothesis of genetic differences in intelligence and implicating one group as superior to another. There are simply too many examples of specific abilities and even sensory capacities that have been shown to differ unmistakably across human populations. Figure 15.1 demonstrates an ICC that uses a one-parameter (also known as the Rasch Model after its originator) unidimensional model which is widely used in aptitude testing. can also be characterized by the extent to which the characteristics of the test itself are related to the construct being measured, and not to characteristics that are associated with attributes of a specific group. This had been obvious to the measurement community for quite some time prior to Diana, but it had not found its way into practice. This chapter traces the historical roots of the CTBH to the present day, provides important distinctions regarding different definitions of test bias that are critical for empirical examination of the issue, presents common objections to the use of psychological testing, and describes how test authors and publishers detect bias in psychological tests. In the third and final section, we discuss the remaining challenges and the future directions related to multicultural issues in clinical psychological assessment. Lower test scores for minorities, then, may reflect only this intimidation and difficulty in the communication process, not lower ability. While we find this explanation somewhat vague and lacking specificity for research purposes, in experimental research regarding mental testing outcomes, stereotype threat is most often operationalized as being given a test that is described as diagnostic of ones ability and/or being asked to report ones race prior to testing. 41100). Thus, to enable fairness across such settings, it is important to establish guidelines for use across multiple settings. Describe the relationship between bias and reliability. Indeed, there are still disputes as to the nature and number of processes such as emotion and motivation. It is the inference that tests demonstrating such differences are inherently biased that is faulty. Some good readings on this issue for follow up include the following works: Nomura, J. M., Stinnett, T., Castro, F., Atkins, M., Beason, S., Linden, S., Wiechmann, K. (2007, March). On the other hand, it also refers to the bias created due to the norms of the majority ethnic group. Although tests may accurately predict a variety of outcomes for middle-class children, they do not predict successfully any relevant behavior for minority group members. } Although some biased items are nearly always found, they seldom account for more than 25% of the variance in performance and often, for every item favoring one group, there is an item favoring the other group. 3355). It is also the appeal of the egalitarian fallacy. There are four key court cases to consider when reviewing this question, two from California and one each from Illinois and Georgia. In R. L. Linn (Ed. Such attempts at solutions are difficult. Cultural Bias in Psychological Testing. American Psychologist, 30, 1541 This is the report of a group appointed by the APAs Board of Scientific Affairs to study the use of psychological and educational tests with disadvantaged students--An early and influential article. As the discipline of psy-chology has expanded, the application of psychological assessment has also developed in response to new areas of practice. Mastering Modern Psychological Testing pp 573613Cite as, 3 @kindle.com emails can be delivered even when you are not connected to wi-fi, but note that service fees apply. In T. B. Gutkin & C. R. Reynolds (Eds. Culture loading must be viewed on a continuum from general (defining the culture in a broad, liberal sense) to specific (defining the culture in narrow, highly distinctive terms). Arriving at a consensual definition of bias in prediction is also a difficult task. Virtually all tests in current use are bound in some way by their cultural specificity. Two computer programs widely used to estimate item and latent parameters are LOGIST and BILOG, which use the joint maximum likelihood (JML) and marginal maximum likelihood (MML) methods, respectively. Diane Halpern (1997) has written extensively on gender differences in cognitive abilities. The evaluation of bias in prediction under the Cleary et al. Administration of a test in English to an individual for whom English is a poor second language is inexcusable both ethically and legally, regardless of any bias in the tests themselves (unless of course, the purpose of the test is to assess English-language skills). on the Manage Your Content and Devices page of your Amazon account. Reynolds, C. R., Lowe, P. A., & Saenz, A. ) in the prediction of college performance (typically measured by grade point average). Additionally, some research that has taken a thorough look at the issue using multiple statistical approaches has argued that stereotype threat may have just the opposite effect at times from what was originally proposed by Steele and Aronson (e.g., see Nomura et al., 2007). The committee report (Cleary, Humphreys, Kendrick, & Wesman, 1975) was subsequently published in the official journal of the APA, American Psychologist. Further, there are objections to the use of the standard criteria against which tests are validated with minority cultural groups. Anastasi, A., & Urbina, S. (1997). are generally inadequate from a statistical or psychometric perspective (e.g., Anastasi & Urbina, 1997). "corePageComponentGetUserInfoFromSharedSession": true, A more recently proposed method for assessing test bias is comparative item selection (Reynolds, 1998). Hilliard (1979), one of the more vocal critics of IQ tests on the basis of cultural bias, pointed out early in test bias research that one of the potential ways of studying bias involves the comparison of factor analytic results of test studies across race. American Psychologist, 59(1), 713. In the statistical sense, this is bias. In an impressive review of 866 Black-White prediction comparisons from 39 studies of test bias in personnel selection, Hunter, Schmidt, and Hunter (1979) concluded that there was no evidence to substantiate hypotheses of differential or single-group validity with regard to the prediction of the job performance across race for Blacks and Whites. Psychologists and educators will need to keep abreast of new findings in the area. In general, these studies have found either no difference in the prediction of criterion performance for Blacks and Whites or a bias (underprediction of the criterion) against Whites. The question of cultural bias in psychological assessment is serious and merits our full scholarly attention. Total loading time: 0 If the bias is a function of a nominal cultural variable (e.g., ethnicity or gender), then the test has a cultural bias. (The Milwaukee Project was discussed in Chap. A biased assessment is one that systematically underestimates or overestimates the value of the variable it is designed to measure. Paper presented to the annual meeting of the National Association of School Psychologists, New York. Therefore we see two components to the threatbeing told ones ability is to be judged on a test of mental ability and secondly, being asked to report ones racial identification, or at least believing it to be relevant in some way to the evaluation of examination results (although some argue either component is sufficient to achieve the effect). Department of Psychology, University of Nevada, Las Vegas, Las Vegas, NV, USA, You can also search for this author in Over the past decade, psychologists have come under criticism for maintaining a mainstream cultural status quo in clinical practice. Then enter the name part Dent, Harold E. Despite its reputation for being a scientific and precise tool for measurement, psychological testing is a culturally biased procedure that results in discrimination against minority groups, particularly against minority students. Chinn, P. C. (1979). Why is psychometric research on bias in mental testing so often ignored? (AERA et al., 2014) present four different ways that fairness is typically used in the context of assessment. Factors related to individual characteristics can restrict accessibility can interfere with measuring the construct. Are these criticisms of test items accurate? Reynolds, C. R. (2000). (1993). 379440). Sandoval and Milles (1979) two major conclusions were that 1) judges are not able to detect items which are more difficult for a minority child than an Anglo child, and 2) the ethnic background of the judge makes no difference in accuracy of item selection for minority children (p. 6). It seems obvious to us now that whenever persons are assessed in other than their native language, the validity of the results as traditionally interpreted would not hold up, at least in the case of ability testing. These are questions for empirical resolution rather than armchair speculation, which is certainly abundant in the evaluation of test bias. Nondiscriminatory testing of minority and exceptional children. Standards for educational and psychological testing. On the other hand, carrying out Psychological research can make us more aware of cultural variations and differences. That is, regardless of the level of performance on the independent variable, the direction and degree of error in the estimation of the criterion (systematic over- or underprediction) will remain the same. Reynolds, C. R., & Kamphaus, R. W. (2003). American Psychologist, 52, 11031114 A good discussion of the topic with special emphasis on educational implications and alternative assessment methods. method for detecting item bias arises from applications of item-response theory (IRT), followed by a thoughtful, logical analysis of item content (Reynolds, 2000). The final explanation (i.e., Category 4) is embodied in the CTBH introduced earlier in this chapter. Cultural Biases in Interpretation and Meaning of Words in Assessment What is considered wise in one society may not be considered wise in another; the value and meaning of intelligence depends on cultural norms. A visual representation of DIF is the region between group As and group Bs item characteristic curves (ICCs). Those components are the following: each individual's personal self-assignment of identity position (e.g., I am a woman), the identity assigned by external observers to an individual (e.g., he is a man), and . 306, 1972; 495 F. Supp. Sandoval and Milles (1979) results indicated that the judges were not able to differentiate between items that were more difficult for minorities and items that were of equal difficulty across groups. (1995). Making the problem of bias in mental testing even more complex, not all mental tests are of the same quality; some are certainly psychometrically superior to others. There are two concepts of special importance in this definition. (1975) definition (known as the regression definition) is quite straightforward. However, it is clearly possible that an unbiased test might be considered unfair by at least some. https://doi.org/10.1007/978-3-030-59455-8_15, DOI: https://doi.org/10.1007/978-3-030-59455-8_15, eBook Packages: Behavioral Science and PsychologyBehavioral Science and Psychology (R0). A number of attempts have been made to develop a culture-free (sometimes referred to as culture fair) intelligence test. In concluding the discussion of fairness, the Standards notes that fairness should not be perceived as equality of testing outcomes for relevant population subgroups. Bias in test use. Earlier in the study of item bias, it was hoped that the empirical analysis of tests at the item level would result in the identification of a category of items having similar content as biased and that such items could then be avoided in future test development (Flaugher, 1978). Because, as previously noted, no detectable pattern or common characteristic of individual items statistically shown to be biased has been observed (given reasonable care at the item writing stage), it seems reasonable to question the armchair or expert minority panel approach to determining biased items. Raising intelligence: Clever Hans, Candides, and the Miracle in Milwaukee. Much effort has been expended to determine why group differences occur but we do not know for certain why they exist. Unfortunately, these objections are frequently stated, still, as facts, on rational rather than empirical grounds. For example, Richardson (1993) holds that researchers have not satisfactorily settled the debate over whether intelligence tests measure general intelligence or a European cognitive style. In particular, indigenous peoples throughout the world have pointed out that clinical psychologists, in both research and . He prohibited (or enjoined) the use of IQ tests with Black children in the California public schools. A related issue that is also likely involved in the professions reluctance to abandon the CTBH is a failure to separate the CTBH from questions of etiology. Sandoval, J., & Mille, M. P. W. (1979). Reynolds, Willson, and Chatman (1984) provide more information on the development of this method. After identifying the eight most racially discriminating and eight least racially discriminating items on the Wonderlic Personnel Test, Jensen (1976) asked panels of five Black psychologists and five Caucasian psychologists to sort out the eight most and eight least discriminating items when only these 16 items were presented to them. The purpose of the present study was to examine the relationship between culture and the clinical practice of psychological assessment. New York, NY: Norton. Bias in Psychological Assessment . Test developers and publishers also routinely examine tests for potentially biasing factors as well, prior to making tests commercially available. There is no question concerning the validity of measures of height or weight of anyone in any culture. "corePageComponentUseShareaholicInsteadOfAddThis": true, of Georgia, 1984). Psychological testing (7th ed.). In support of Bias in Mental Testing and scientific inquiry. Race and IQ: A theory-based review of the research in Richard Nisbetts intelligence and how to get it. Bias in a test may be found to exist in any or all of these categories of validity evidence. Disputes among different camps in learning were controversial and of long duration. Helms, J. E. (1992). Note: Equal slopes and intercepts result in homogeneity of regression where the regression lines for different groups are the same. However, culture-free tests There is no single Lutz, FL: Psychological Assessment Resources. Homogeneity of regression is illustrated in Fig. 178208). Members of various ethnic, religious, or other groups that have a cultural system in some way unique may well be able to identify items that contain material that is offensive, and the elimination of such items is proper. Put another way, if well-constructed and properly standardized tests are biased, then less standardized, more subjective approaches are almost certain to be at least as biased and probably more so. Journal of Psychoeducational Assessment, 2(3), 219224. Nevertheless, certain myths persist in the writings and actions of many professional psychologists who are either unaware of this research or choose to ignore it. Standardization and cultural bias as impediments to the scientific study and validation of intelligence. Test bias in the assessment of intelligence and personality. Although IQ and other aptitude test score differences undoubtedly occur, the differences do not indicate deficits or superiority by any group, especially in relation to the personal worth of any individual member of a given group or culture. Those who support mean differences as an indication of test bias state correctly that there is no valid a priori scientific reason to believe that intellectual or other cognitive performance levels should differ across race. Although some of the same individuals testified in this case, several new opinions were offered. In C. R. Reynolds & R. T. Brown (Eds. To the legal mind, bias denotes illegal discriminatory practices while to the lay mind it may conjure notions of prejudicial attitudes. It is imperative that the evaluation of bias in testing be undertaken from the standpoint of scholarly inquiry and debate. Cultural bias is the interpretation of any phenomena based on one's own cultural standards. Unfortunately, measured differences themselves have often been inferred to indicate genetic differences and therefore the genetically based intellectual inferiority of some groups. An example of an item characteristic curve or ICC. A considerable body of literature currently exists failing to substantiate cultural bias against native-born American ethnic minorities with regard to the use of well-constructed, adequately standardized intelligence and aptitude tests. New York, NY: Norton. Steele, C. M., Spencer, S. J., & Aronson, J. ), Educational measurement (3rd ed., pp. method for the accurate determination of the degree to which educational and psychological tests measure a distinct construct. So far, we view the findings of racial equalization due to the neutralization of the so-called stereotype effect as a statistical artifact, but the concept remains interesting, is not yet fully understood, and we may indeed be proven wrong! Any differences with regard to any aspect of the distribution of mental test scores indicate that something is wrong with the test itself. At present, only scattered and inconsistent evidence for bias exists. Sex differences in intelligence: Implications for education. However, these cases and other societal factors did foster much research that has brought us closer to a scientific resolution of the issues. Information concerning the home, community, and school environment must all be evaluated in individual decisions. This method also requires large samples for stable results. Contending with group image: The psychology of stereotype and social identity threat. Such models specify various parameters that describe the behavior of the item within the model; most IRT models include one, two, or three parameters, which may be graphically represented in an item characteristic curve (ICC). Render date: 2023-06-27T21:36:27.775Z The items require the child to use information in arriving at an answer that minority or disadvantaged individuals have not had equal opportunity to learn. Journal of Personality and Social Psychology, 69(5), 797811. Just as there is no a priori basis for deciding that differences exist, there is no a priori basis for deciding that differences do not exist. The differences have persisted at relatively constant levels for quite some time and under a variety of methods of investigation. Behavioral and Brain Sciences, 3, 352. Certainly, the greater the cultural specificity of a test item, the greater the likelihood of the item being biased when used with individuals from other cultures. It is believed that sex hormone levels and social factors both influence the development of these differences. is added to your Approved Personal Document E-mail List under your Personal Document Settings New York, NY: Plenum Press. Both the degree of error in prediction and the direction of the bias will vary as a function of level of performance on the independent variable. One major, carefully studied explanation is that the tests are biased in some way against certain groups. Black Americans reduce the racial IQ gap: Evidence from standardization samples. With regard to a resolution of bias, we believe that were a scholarly trial to be held, with a charge of cultural bias brought against mental tests, the jury would likely return the verdict other than guilty or not guilty that is allowed in British lawnot proven. Until such time as a true resolution of the issues can take place, we believe the evidence and positions taken in this chapter accurately reflect the state of our empirical knowledge concerning bias in mental tests. They argued that such differences were created by a variable they deemed Stereotype Threat. More recently, they defined stereotype threat as follows: When a negative stereotype about a group that one is part of becomes relevant, usually as an interpretation of ones behavior or an experience one is having, stereotype threat is the resulting sense that one can then be judged or treated in terms of the stereotype or that one might do something that would inadvertently confirm it (Steele, Spencer, & Aronson, 2002, p. 389). In S. Fraser (Ed. This method essentially holds the total score constant across the groups, and the resulting differences may be used to identify problematic items if it is significant and particularly if meaningful. Race-ethnicity and measured intelligence: Educational implications. The bias identified has tended to overpredict how well minority children will perform in academic areas and to underpredict how well White children will perform. The practical implications of such bias have been pointed out previously and are the issues over which most of the court cases have been fought. Bias in the content Many different achievement tests and teacher-made, classroom-specific tests need to be employed in future studies of predictive bias. These problems with test items cause the items to be more difficult than they should actually be when used to assess minority individuals. Many challenges remain, especially that of understanding the continued higher failure rates (relative to the majority ethnic population of the United States) of some ethnic minorities in the public schools (while other ethnic minorities have a success rate that exceeds the majority population) and the disproportionate referral rates by teachers of these children for special education placement. As is typical of group differences in intellectual abilities, the variability in performance within groups (i.e., males and females) is much larger than the mean difference between groups (Neisser et al., 1996). This chapter provides a discussion of multicultural issues in clinical psychological assessment.
What To Do Before Receiving Holy Communion,
10 Day London Paris Itinerary,
Canada Driver Job Visa,
Emergent Bilingual Definition Texas,
Coast Guard Selected Reserve Direct Commission,
Articles C
