Home Contents

FAQ
 

Frequently Asked Questions

  1. Why did you do this experiment?

  2. What were the independent and dependent variables in the experiment?

  3. What hypotheses were tested in the experiment?

  4. Why did you choose Integrative Complexity as the dependent measure? 

  5. Why did you expect to find race effects on Integrative Complexity?

  6. Can you elaborate on the reliability of the Integrative Complexity measure that was used in the study? 

  7. Why did you use the topics of child labor and death penalty for your discussions?

  8. How did you come to decide on your sample size?  

  9. What differences did you find between the three sites: Stanford, UCLA and Maryland?    

  10. Did you encounter any problems in conducting the experiment as planned?

  11. Why did you report two types of statistical analyses – one at the level of individuals and one at the level of groups? 

  12. Isn’t an experiment unnatural? How can you draw conclusions from an experiment to the real world?

  13. Diversity as represented in your study means Blacks and Whites.  Why did you make that choice?

  14. Why did you find an effect for race in the pre-discussion essay?

  15. Why do you think that current contact with diversity was a relevant factor in your results? 

  16. In your results, minority opinion expressed by the collaborators seemed to exert a larger influence than did race.  Does that compromise your claim that race matters in higher education?

  17. What important questions from your study should social psychology address? 

  18. What relevance does the experiment have to legal challenges to affirmative action in college admissions?

  19. What are your qualifications to conduct the study?  

  20. Who funded your study?

  21. Did you design the study so that it was guaranteed that you would get results that you found? 

 

  1. Why did you do this experiment?

The purpose of this experiment was to extend current social science research on the impact of racial diversity on college students.  The extant research contains clear and consistent evidence that racial diversity contributes positively to students' social and intellectual development across a variety of domains.  These studies, however, do not include designs that incorporate random assignment of participants, and they typically rely on self-reported outcome measures.  Our study extends the knowledge base by using an experimental design and a behaviorally-based outcome measure.

 

  1. What were the independent and dependent variables in the experiment?

There were two dependent variables, Integrative Complexity and Perceived Novelty.  Integrative Complexity, the primary dependent variable of interest, is a measure of complex reasoning or thinking developed in the social psychological literature.  Scores of Integrative Complexity are coded by raters from student essays, and is therefore a "performance-based" measure or a measure based upon a production task.  As a companion to the behavioral data, Perceived Novelty measures students' perception of how novel the arguments of the collaborator were and how much the collaborator made the student think.

There were two primary independent variables.  Racial diversity, the main focus of the study, was manipulated with the use of either a Black or White research collaborator who participated in the group discussion with three White participants.  Since theory predicts enhanced integrative complexity with the presence of a minority opinion, we also manipulated (via the research collaborator) minority opinion in the group.

We controlled for several variables in the data analyses.  They included university site, discussion issue, age, gender, and current contact with racially diverse others.

 

  1. What hypotheses were tested in the experiment?

There were four hypotheses tested with regard to racial diversity.

With regard to Integrative Complexity:

(1)  In essays written prior to discussion, students in groups with a Black collaborator will exhibit higher levels of integrative complexity, relative to those in groups with a White collaborator.

(2)  In essays written after discussion, students in groups with a Black collaborator will exhibit higher levels of integrative complexity, relative to those in groups with a White collaborator.

(3)  In the "transfer" essays written on a different topic, students in groups with a Black collaborator will exhibit higher levels of integrative complexity, relative to those in groups with a White collaborator.

With regard to Perceived Novelty:

(4) After discussion, students will view the collaborator as contributing more to novel perspectives and as more influential when the collaborator is Black than when the collaborator is White.

 

  1. Why did you choose Integrative Complexity as the dependent measure?

We were interested in measuring outcomes that reflected behavior and relied less on self-reports.  We also preferred a measure that had a been used in the academic, scientifically-based literature.  We consulted broadly with colleagues in experimental social psychology and discovered one measure being used by Professor Deborah Gruenfeld of the Stanford Business School, a social psychologist specializing in small group interactions. Integrative Complexity, originally developed by Peter Suedfeld and Philip Tetlock, is a versatile measure that has been used in hundreds of studies. The content of essays and speeches are coded for integrative complexity along two dimensions: the extent of differentiation of ideas, and the extent to which the ideas are integrated using overarching principles.  The measure yields an overall score of integrative complexity, ranging from 1 to 7, and the scale is independent of specific attitudes toward an idea.  The scale has been used in a variety of applications.  One study analyzed Supreme Court opinions for integrative complexity, and found that unanimous decisions were less complex than split decisions.  Another study found a positive effect for the presence of members of groups who held minority opinions.  Specifically, those who participated in discussion groups where diverse opinions were expressed resulted in higher integrative complexity.  Integrative complexity has also been linked with higher grades in college students. These considerations led us to decide that integrative complexity would be an ideal measure for our experiment.

 

  1. Why did you expect to find race effects on Integrative Complexity?

Our expectation is based upon the theory of minority influence in social psychology.  The presence of a minority opinion stimulates more complex or divergent thinking because an individual feels they must take diverse views into account in their reasoning processes.  We expected the presence of racial diversity to have a similar kind of effect in that the presence of racial diversity stimulates an individual to consider the possibility of difference or deviance within an intellectual exchange.  In the American cultural context, racial diversity is commonly associated with variance in experiences, aspirations, values, and attitudes.  Individuals responding to racial diversity, then, are responding to this variance.  In our experiment, we scripted the arguments of the collaborators so that both Black and White collaborators held the same opinions, and yet race effects were detected.  As we predicted, therefore, race carries with it an additional "stimulant" to complexity.

We had thought that racial diversity might interact with opinion diversity such that the effects of opinion diversity would be stronger when the collaborator was a member of a different race.  We found no such interaction, perhaps indicating that when members of a different race agreed with participants, that too created greater complexity.

 

  1. Can you elaborate on the reliability of the Integrative Complexity measure that was used in the study? 

We had six different coders who read essays.  We trained our coders using the methods developed by Suedfeld, Tetlock and others.  In all, the coders scored over 180 pieces of writing each during the training process.  Different sets of three coders read each of the participants' essays.  We preformed a multilevel analysis that examined how much the coders agreed with one another to determine the reliability of ratings.  We found that the reliabilities were 0.70 for the pre-discussion measure of integrative complexity and 0.62 for the post-discussion measure.  There are two related reasons why the reliability of the post-discussion measure is not so high.  First, essays were shorter because some participants only wrote what new thoughts they had as a result of the discussion.  Second, the post-discussion measure is in essence a change measure and change measures typically have low reliability.  However, it is important to note that despite their typically low reliability, change measures can nonetheless show experimental effects.  For example, the paper by Overall & Woodward cited in the paper shows that our ability to detect an existing effect can be maximized in the analysis of change scores when the reliability of change scores is zero.

 

  1. Why did you use the topics of child labor and death penalty for your discussions?

Topics were chosen based upon the degree of racial content in the issue.  The primary topic, child labor, was set in a fictional developing country in Asia and judged to contain very little racial content.  In other words, we believed the topic would not stimulate the issue of race in an American context.  The death penalty was chosen as a topic without any explicit racial content, but one on which opinion polls suggest that Blacks and Whites hold opposing positions. 

 

  1. How did you come to decide on your sample size?  

Several factors entered into our decision concerning sample size.  Among those factors were having enough power (i.e., the probability of obtaining a statistically significant effect when the effect actually exists in the population we are studying) and our budget.  We reasoned that if we had a small to moderate effect size of 0.3 (an effect size is a standardized measure of difference between the means of the populations we are studying), we would need a sample size of 350 to have an 80% chance of detecting such an effect.  Fortunately we were able to secure funding for a study with about 350 cases.  Our final same size was 357.

 

  1. What differences did you find between the three sites: Stanford, UCLA and Maryland?    

In all of our analyses, we controlled for differences between the three sites because we were not specifically interested in the ways in which students at the three sites differed, for example, in Integrative Complexity. However, we did check to see if the effects we found, for example, for racial diversity, varied across the different sites and we discovered that they did not (i.e., there were no interaction effects of any of the independent variables with site). This suggests that the study was implemented in a consistent way across the three sites.

 

  1. Did you encounter any problems in conducting the experiment as planned?

We intended to manipulate majority and minority opinion by selecting participants who all held the same opinion with regard to the discussion issue (in the case of child labor, for example, they were all against it) and then randomly placing them in groups where the collaborator either shared their position or disagreed with their position.  After reading the discussion prompt, some participants (15%) took a position that was inconsistent with their opinion on the screening instrument.  The result was that instead of just having groups where either the collaborator agreed with the three other students or disagreed with all of them, we also had some groups where the collaborator agreed with one other student and some groups where the collaborator agreed with two other students.  In other words, we had a continuous condition of student agreement with the collaborator.  The effect on our analysis was to simply change opinion diversity from a dichotomous variable to a continuous one.  Moreover, our results are essentially the same when we analyze only groups in which the participants did not take a position different from the one they had indicated in the screening survey.

 

  1. Why did you report two types of statistical analyses – one at the level of individuals and one at the level of groups? 

The data from the study occur at different levels.  They are at the level of the group, the three persons who discuss the issue together, and the level of the individual.  If there is a “group effect,” it means that the groups differ from one another in ways that can not just be explained by the differences between the individuals in the groups.  On the other hand, if there is no true "group effect" and the only level of importance is the individual, then the variation between groups can be entirely explained by who is in the group.  We tested for the presence of group effects in all of our analyses.  In the analyses of integrative complexity we found no evidence for a group effect.  However, in the analysis of perceived novelty, there was an effect due to group.  The analysis of this variable, therefore, also included the group effect in the formal model.

 

  1. Isn’t an experiment unnatural? How can you draw conclusions from an experiment to the real world?

Any experiment is by definition artificial because its key characteristic is the isolation, control, and manipulation of variables.  Putting an idea to experimental test is the most rigorous way to determine if a variable really causes another variable, what has come to be called "internal validity."  In every area of science, theories that are developed through observations and experiences are usually eventually tested by using an experiment.  In our case, we believed from the results of other non-experimental studies of college students that there was sufficient evidence to test experimentally the hypothesis that racial diversity leads to positive cognitive outcomes.  Despite the necessarily artificiality of an experiment, we worked very hard to ensure that we tested the hypothesis in a way that matched the context to which we hope our results will generalize:  the university context.  We tried to create an experiment that mimics reality in essential ways.  Our experiment used actual college students attending selective universities, the exact population to which we wished to generalize our findings.  The students sat in small groups with peers and discussed and wrote about complex social topics, similar to what they are expected to do in seminars and sections of lecture courses.  The artificiality was introduced by the fact that the treatment was a single session that lasted one hour, rather than discussions that take place over the course of an entire academic term.  We would like to have had an experiment in the context of a real course and observed students over the entire course, but that would have made the design more complicated and prohibitively expensive.

 

  1. Diversity as represented in your study means Blacks and Whites.  Why did you make that choice?

This was purely a pragmatic decision. The theory about diversity contributing to cognitive complexity can be applied to all forms of diversity, but we felt in this first study, we could only look at one minority group.  We choose African-Americans for several reasons, the most important being that White participants would immediately know that the person was different from them. We would hope that future lines of research would extend this work to examine whether the results can be replicated with other forms of diversity (e.g., adding a White person to an all Black group).

 

  1. Why did you find an effect for race in the pre-discussion essay?

We asked our participants to write their first essay before any discussion had taken place to see whether there would be an effect simply from the anticipation of participating in a discussion with a racially diverse group.  Our results showed a marginally significant effect, indicating evidence of the "priming" of complexity, due perhaps, to the expectation of novelty coincident with racial diversity. 

 

  1. Why do you think that current contact with diversity was a relevant factor in your results? 

Contact with racially different others, based on self-report, had an effect such that those who had little or no contact showed the biggest gains in Integrative Complexity after discussion had taken place. Indeed, for those with frequent contact, there were much smaller differences between those in racially diverse or homogeneous groups. This speaks directly to the argument for why campus diversity is so important, in that students who have not been previously exposed to racially diversity are intellectually challenged by the new social setting.

 

  1. In your results, minority opinion expressed by the collaborators seemed to exert a larger influence than did race.  Does that compromise your claim that race matters in higher education?

No.  With regard to the importance of race in higher education, our experiment should not be seen as the sole or decisive source of evidence.  The existing evidence clearly demonstrates beneficial effects of race for students in the domains viewed as central to the missions of higher education institutions.  Our study adds further evidence supporting that general conclusion.  Opinion diversity could also be seen as important for higher education, and that point is not disputed here.  Rather, in addition to previous work demonstrating the role of racial diversity in increasing cultural awareness, enhancing social and intellectual self-confidence, increasing the frequency of discussions about race and other social issues, and positively impacting community involvement and other civic participation, our study shows that racial diversity also contributes to the development of higher-order thinking processes.  Opinion diversity has not been shown to have a similar broad range of effects among college students.

 

  1. What important questions from your study should social psychology address? 

There is an unfortunate paucity of research in social psychology on the effects of inter-racial interactions on cognitive outcomes.  Our study indicates that racial diversity, in classroom-like settings as well as among students' informal contacts, does affect people's integrative complexity.  We would hope that the field of social psychology, which has often benefited from the study of applied social problems (e.g., the study of authoritarianism following World War II), would begin to explore these phenomena in more detail.

 

  1. What relevance does the experiment have to legal challenges to affirmative action in college admissions?

The constitutionality of affirmative action rests essentially on one premise – that racial diversity serves a compelling interest for higher education.  A large body of social science research has been conducted over the past five or six years with the intention of determining whether that premise can be substantiated.  In the most recent judgment on affirmative action in undergraduate admissions (Gratz v. Bollinger & Grutter v. Bollinger), the Supreme Court ruled that the social science evidence presented a strong rationale for the compelling interest premise.  Critics of the research, however, continue to raise concerns over specific aspects of how the scientific research was conducted.  Our experiment addresses those concerns directly.

 

  1. What are your qualifications to conduct the study?  

The team envelops expertise in higher education research and experimental psychology.  Anthony Lising Antonio, Mitchell Chang, and Jeffrey Milem all received their training at the pre-eminent center for higher education research at UCLA.  Each of these three experts has published widely in the area of diversity and higher education.  Kenji Hakuta, David Kenny, and Shana Levin come out of the tradition in psychology where experiments are the norm.  Hakuta is an experimental psycholinguist by training, and has not conducted research in higher education, but has extensive experience in bringing empirical research to bear on important social policy issues.  Kenny is an experimental social psychologist who has expertise in statistics and research methodology and has conducted substantive work in analyzing dyadic and small group data.  Levin is an experimental social psychologist whose expertise is in the psychology of race and racism.  This combination of expertise in theory, methodology, and policy makes the team unique. 

 

  1. Who funded your study?

The study was funded by Carnegie Corporation of New York, the Ford Foundation, the William and Flora Hewlett Foundation, the James Irvine Foundation, and a private donation from the Richard Parsons Family Foundation.

 

  1. Did you design the study so that it was guaranteed that you would get results that you found? 

While we expected to obtain results that diversity would lead to greater levels of Integrative Complexity, one could argue that the results might have been the opposite.  For instance, one might think that Whites might have been uncomfortable with a Black student in the group and that this anxiety might have interfered with the processing of information.  Additionally, Blacks as outgroup members to Whites may not have been listened to carefully by White participants, resulting in no stimulant to complex thinking.  Thus, it was hardly a certainty that we would have found the results that we found.  Moreover, given the short duration of the experiment, the effect might have been too small to measure.

 

 

Home ]

Send mail to wperez@stanford.edu with questions or comments about this web site.
Last modified: 02/04/03