Which test is a statistical method to determine if two categorical variables have a significant correlation between them * 1 point t test chi

Published on May 23, 2022 by Shaun Turney. Revised on September 14, 2022.

A Pearson’s chi-square test is a statistical test for categorical data. It is used to determine whether your data are significantly different from what you expected. There are two types of Pearson’s chi-square tests:

  • The chi-square goodness of fit test is used to test whether the frequency distribution of a categorical variable is different from your expectations.
  • The chi-square test of independence is used to test whether two categorical variables are related to each other.

    Chi-square is often written as Χ2 and is pronounced “kai-square” (rhymes with “eye-square”). It is also called chi-squared.

    What is a chi-square test?

    Pearson’s chi-square (Χ2) tests, often referred to simply as chi-square tests, are among the most common nonparametric tests. Nonparametric tests are used for data that don’t follow the assumptions of parametric tests, especially the assumption of a normal distribution.

    If you want to test a hypothesis about the distribution of a categorical variable you’ll need to use a chi-square test or another nonparametric test. Categorical variables can be nominal or ordinal and represent groupings such as species or nationalities. Because they can only have a few specific values, they can’t have a normal distribution.

    Note: Parametric tests can’t test hypotheses about the distribution of a categorical variable, but they can involve a categorical variable as an independent variable (e.g., ANOVAs).

    Test hypotheses about frequency distributions

    There are two types of Pearson’s chi-square tests, but they both test whether the observed frequency distribution of a categorical variable is significantly different from its expected frequency distribution. A frequency distribution describes how observations are distributed between different groups.

    Frequency distributions are often displayed using frequency distribution tables. A frequency distribution table shows the number of observations in each group. When there are two categorical variables, you can use a specific type of frequency distribution table called a contingency table to show the number of observations in each combination of groups.

    Example: Bird species at a bird feeder Frequency of visits by bird species at a bird feeder during a 24-hour period
    Bird speciesFrequency
    House sparrow 15
    House finch 12
    Black-capped chickadee 9
    Common grackle 8
    European starling 8
    Mourning dove 6

    A chi-square test (a chi-square goodness of fit test) can test whether these observed frequencies are significantly different from what was expected, such as equal frequencies.

    Example: Handedness and nationality Contingency table of the handedness of a sample of Americans and Canadians
    Right-handedLeft-handed
    American236 19
    Canadian157 16

    A chi-square test (a test of independence) can test whether these observed frequencies are significantly different from the frequencies expected if handedness is unrelated to nationality.

    The chi-square formula

    Both of Pearson’s chi-square tests use the same formula to calculate the test statistic, chi-square (Χ2):

       

    Which test is a statistical method to determine if two categorical variables have a significant correlation between them * 1 point t test chi

    Where:

    • Χ2 is the chi-square test statistic
    • Σ is the summation operator (it means “take the sum of”)
    • O is the observed frequency
    • E is the expected frequency

    The larger the difference between the observations and the expectations (O − E in the equation), the bigger the chi-square will be. To decide whether the difference is big enough to be statistically significant, you compare the chi-square value to a critical value.

    What can proofreading do for your paper?

    Scribbr editors not only correct grammar and spelling mistakes, but also strengthen your writing by making sure your paper is free of vague language, redundant words and awkward phrasing.

    Which test is a statistical method to determine if two categorical variables have a significant correlation between them * 1 point t test chi
    Which test is a statistical method to determine if two categorical variables have a significant correlation between them * 1 point t test chi

    See editing example

    When to use a chi-square test

    A Pearson’s chi-square test may be an appropriate option for your data if all of the following are true:

    1. You want to test a hypothesis about one or more categorical variables. If one or more of your variables is quantitative, you should use a different statistical test. Alternatively, you could convert the quantitative variable into a categorical variable by separating the observations into intervals.
    2. The sample was randomly selected from the population.
    3. There are a minimum of five observations expected in each group or combination of groups.

    Types of chi-square tests

    The two types of Pearson’s chi-square tests are:

    • Chi-square goodness of fit test
    • Chi-square test of independence

    Mathematically, these are actually the same test. However, we often think of them as different tests because they’re used for different purposes.

    Chi-square goodness of fit test

    You can use a chi-square goodness of fit test when you have one categorical variable. It allows you to test whether the frequency distribution of the categorical variable is significantly different from your expectations. Often, but not always, the expectation is that the categories will have equal proportions.

    Example: Hypotheses for chi-square goodness of fit testExpectation of equal proportions
    • Null hypothesis (H0): The bird species visit the bird feeder in equal proportions.
    • Alternative hypothesis (HA): The bird species visit the bird feeder in different proportions.

    Expectation of different proportions

    • Null hypothesis (H0): The bird species visit the bird feeder in the same proportions as the average over the past five years.
    • Alternative hypothesis (HA): The bird species visit the bird feeder in different proportions from the average over the past five years.

    Chi-square test of independence

    You can use a chi-square test of independence when you have two categorical variables. It allows you to test whether the two variables are related to each other. If two variables are independent (unrelated), the probability of belonging to a certain group of one variable isn’t affected by the other variable.

    Example: Chi-square test of independence
    • Null hypothesis (H0): The proportion of people who are left-handed is the same for Americans and Canadians.
    • Alternative hypothesis (HA): The proportion of people who are left-handed differs between nationalities.

    Other types of chi-square tests

    Some consider the chi-square test of homogeneity to be another variety of Pearson’s chi-square test. It tests whether two populations come from the same distribution by determining whether the two populations have the same proportions as each other. You can consider it simply a different way of thinking about the chi-square test of independence.

    McNemar’s test is a test that uses the chi-square test statistic. It isn’t a variety of Pearson’s chi-square test, but it’s closely related. You can conduct this test when you have a related pair of categorical variables that each have two groups. It allows you to determine whether the proportions of the variables are equal.

    Example: McNemar’s testSuppose that a sample of 100 people is offered two flavors of ice cream and asked whether they like the taste of each. Contingency table of ice cream flavor preference
    Like chocolateDislike chocolate
    Like vanilla47 32
    Dislike vanilla8 13
    • Null hypothesis (H0): The proportion of people who like chocolate is the same as the proportion of people who like vanilla.
    • Alternative hypothesis (HA): The proportion of people who like chocolate is different from the proportion of people who like vanilla.

    There are several other types of chi-square tests that are not Pearson’s chi-square tests, including the test of a single variance and the likelihood ratio chi-square test.

    How to perform a chi-square test

    The exact procedure for performing a Pearson’s chi-square test depends on which test you’re using, but it generally follows these steps:

    1. Create a table of the observed and expected frequencies. This can sometimes be the most difficult step because you will need to carefully consider which expected values are most appropriate for your null hypothesis.
    2. Calculate the chi-square value from your observed and expected frequencies using the chi-square formula.
    3. Find the critical chi-square value in a chi-square critical value table or using statistical software.
    4. Compare the chi-square value to the critical value to determine which is larger.
    5. Decide whether to reject the null hypothesis. You should reject the null hypothesis if the chi-square value is greater than the critical value. If you reject the null hypothesis, you can conclude that your data are significantly different from what you expected.

    How to report a chi-square test

    If you decide to include a Pearson’s chi-square test in your research paper, dissertation or thesis, you should report it in your results section. You can follow these rules if you want to report statistics in APA Style:

    • You don’t need to provide a reference or formula since the chi-square test is a commonly used statistic.
    • Refer to chi-square using its Greek symbol, Χ2. Although the symbol looks very similar to an “X” from the Latin alphabet, it’s actually a different symbol. Greek symbols should not be italicized.
    • Include a space on either side of the equal sign.
    • If your chi-square is less than zero, you should include a leading zero (a zero before the decimal point) since the chi-square can be greater than zero.
    • Provide two significant digits after the decimal point.
    • Report the chi-square alongside its degrees of freedom, sample size, and p value, following this format: Χ2 (degrees of freedom, N = sample size) = chi-square value, p = p value).
    Example: Reporting a chi-square testThere was no significant relationship between handedness and nationality, Χ2 (1, N = 428) = 0.44, p = .505.

    Practice questions

    Frequently asked questions about chi-square tests

    What is the difference between quantitative and categorical variables?

    Quantitative variables are any variables where the data represent amounts (e.g. height, weight, or age).

    Categorical variables are any variables where the data represent groups. This includes rankings (e.g. finishing places in a race), classifications (e.g. brands of cereal), and binary outcomes (e.g. coin flips).

    You need to know what type of variables you are working with to choose the right statistical test for your data and interpret your results.

    Sources in this article

    We strongly encourage students to use sources in their work. You can cite our article (APA Style) or take a deep dive into the articles below.

    This Scribbr article

    Turney, S. (September 14, 2022). Chi-Square (Χ²) Tests | Types, Formula & Examples. Scribbr. Retrieved October 10, 2022, from https://www.scribbr.com/statistics/chi-square-tests/

    Is this article helpful?

    You have already voted. Thanks :-) Your vote is saved :-) Processing your vote...

    Which test is a statistical method to determine if two categorical variables have a significant correlation between them?

    One statistical test that does this is the Chi Square Test of Independence, which is used to determine if there is an association between two or more categorical variables.

    How do you determine if there is a significant relationship between two categorical variables?

    This test is used to determine if two categorical variables are independent or if they are in fact related to one another. If two categorical variables are independent, then the value of one variable does not change the probability distribution of the other.

    What is chi

    The Chi-Squared test is a statistical hypothesis test that assumes (the null hypothesis) that the observed frequencies for a categorical variable match the expected frequencies for the categorical variable.

    What is the chi

    A chi-square test is a statistical test used to compare observed results with expected results. The purpose of this test is to determine if a difference between observed data and expected data is due to chance, or if it is due to a relationship between the variables you are studying.

    Which statistical test is used to identify whether there is a relationship between two categorical variables 40?

    The chi-square test is an overall test for detecting relationships between two categorical variables. If the test is significant, it is important to look at the data to learn the nature of the relationship.

    What test would you use to determine if two categorical variables are independent?

    The Chi-square test of independence is a statistical hypothesis test used to determine whether two categorical or nominal variables are likely to be related or not.