Question 1

What exactly is a chi-square test testing?

Accepted Answer

A Pearson chi-square test asks whether observed category counts deviate from what some null hypothesis predicts. The classic goodness-of-fit variant — which this calculator runs — tests whether a single categorical distribution matches a fully specified expected distribution. The null hypothesis says the population proportions are exactly the ones you supplied; the alternative says they are not. There is also a chi-square test of independence on a contingency table, which tests whether two categorical variables are associated; the formula is identical but the expected counts come from row and column totals and df = (rows − 1)(cols − 1). Both share the same χ² statistic and the same chi-square sampling distribution.
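As a rough illustration of the statistic both tests share, here is a minimal Python sketch. The function names and the sample counts are assumptions made for this example; this is not the calculator's actual code.

```python
def chi_square_statistic(observed, expected):
    """Pearson statistic: sum of (O - E)^2 / E over all cells."""
    if len(observed) != len(expected):
        raise ValueError("observed and expected must have the same length")
    if any(e <= 0 for e in expected):
        raise ValueError("expected counts must be strictly positive")
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


def independence_expected(table):
    """Expected counts for a test of independence: row total * column total / N."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    expected = [[r * c / n for c in col_totals] for r in row_totals]
    df = (len(row_totals) - 1) * (len(col_totals) - 1)
    return expected, df


# Goodness of fit: illustrative counts against a fully specified expectation.
gof_stat = chi_square_statistic([315, 108, 101, 32],
                                [312.75, 104.25, 104.25, 34.75])

# Independence: same statistic, but expected counts come from the margins.
table = [[10, 20], [30, 40]]
expected, df = independence_expected(table)
flat_observed = [x for row in table for x in row]
flat_expected = [x for row in expected for x in row]
ind_stat = chi_square_statistic(flat_observed, flat_expected)
```

The only difference between the two uses is where the expected counts come from: supplied directly for goodness of fit, derived from row and column totals for independence.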

Question 2

What if I do not have expected counts — only ratios?

Accepted Answer

Enter the ratios in the expected box and the calculator rescales them automatically so the expected total matches the observed total. For example, if you want to test a 9 : 3 : 3 : 1 ratio against four observed counts that sum to 556, you can type "9 3 3 1" directly — the calculator multiplies by 556/16 to get the expected counts (312.75, 104.25, 104.25, 34.75). This is equivalent to typing the rescaled values yourself. Ratios must all be strictly positive; a hypothesised zero count is not a valid expected value because the test divides by it.
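The rescaling described above can be sketched in a few lines of Python. The function name `rescale_expected` and the observed counts are assumptions for illustration (the counts were chosen only because they sum to 556); the calculator's own implementation may differ.

```python
def rescale_expected(observed, ratios):
    """Scale hypothesised ratios so they sum to the observed total."""
    if len(observed) != len(ratios):
        raise ValueError("need one ratio per observed category")
    if any(r <= 0 for r in ratios):
        raise ValueError("ratios must be strictly positive")
    n = sum(observed)          # observed total, e.g. 556
    total = sum(ratios)        # ratio total, e.g. 9 + 3 + 3 + 1 = 16
    return [r * n / total for r in ratios]


observed = [315, 108, 101, 32]             # illustrative counts, sum = 556
expected = rescale_expected(observed, [9, 3, 3, 1])
# expected is [312.75, 104.25, 104.25, 34.75]
```

Because only the proportions matter, entering "9 3 3 1", "18 6 6 2" or the rescaled counts themselves leads to the same expected vector.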

Question 3

What does it mean if my expected counts are very small?

Accepted Answer

The chi-square distribution is the limiting null distribution of the statistic as expected counts grow large. A widely cited rule of thumb is that every expected count should be at least 5; some authors accept a looser rule in which every expected count is at least 1 and no more than 20% of them fall below 5. With sparser data the asymptotic approximation breaks down and the reported p-value can be inaccurate. Two remedies: pool adjacent rare categories until the expected counts pass the threshold, or use an exact test (such as the multinomial exact test, or Fisher’s exact test for contingency tables). The likelihood-ratio G-test is sometimes suggested as well, but it relies on the same large-sample chi-square approximation, so it does not by itself fix sparse cells. The calculator does not refuse to compute χ² for sparse cells; it is your responsibility to check the assumption.
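Since the calculator leaves this check to you, a small helper along the following lines can flag sparse cells before you trust the p-value. It is only a sketch: the function name, the dictionary keys and the example vectors are assumptions, and the thresholds simply encode the two rules of thumb mentioned above.

```python
def sparse_cell_report(expected, threshold=5.0):
    """Summarise how well the expected counts meet the usual rules of thumb."""
    below = [i for i, e in enumerate(expected) if e < threshold]
    share_below = len(below) / len(expected)
    return {
        "min_expected": min(expected),
        "cells_below_threshold": below,          # indices of sparse cells
        "share_below_threshold": share_below,
        # Strict rule: every expected count is at least the threshold.
        "strict_ok": not below,
        # Looser rule: all counts >= 1 and at most 20% below the threshold.
        "loose_ok": min(expected) >= 1.0 and share_below <= 0.20,
    }


dense = sparse_cell_report([312.75, 104.25, 104.25, 34.75])
mixed = sparse_cell_report([2.0, 10.0, 10.0, 10.0, 10.0, 10.0])
sparse = sparse_cell_report([0.5, 10.0, 10.0])
```

When `strict_ok` is false, the indices in `cells_below_threshold` are the natural candidates for pooling with a neighbouring category.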

Question 4

Why is the p-value right-tailed?

Accepted Answer

Under the null hypothesis the observed counts should sit close to the expected counts, so χ² should be close to its expected value of df. The further the observations stray from the expected distribution, in any direction, the larger χ² becomes, because every term (Oᵢ − Eᵢ)²/Eᵢ is non-negative and grows with the squared deviation. There is no "negative χ²" to indicate the opposite direction, so only large values of the statistic are evidence against the null, and the p-value is the right-tail probability P(χ²_df ≥ observed χ²). A χ² value close to zero is suspicious for a different reason: it suggests the observed data fit the hypothesis too closely, possibly because the data were edited or the expected distribution was tuned to the sample.
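To make the right-tail probability concrete, the sketch below computes P(χ²_df ≥ x) for integer df using only the Python standard library. The function name `chi2_sf` is an assumption for this example, and the method (a recurrence on the regularised upper incomplete gamma function) is one possible approach, not necessarily what the calculator does internally.

```python
import math


def chi2_sf(x, df):
    """Right-tail probability P(chi2_df >= x) for integer df >= 1."""
    if df < 1:
        raise ValueError("df must be a positive integer")
    if x <= 0:
        return 1.0
    z = x / 2.0
    # Closed-form starting points: df = 2 gives exp(-z), df = 1 gives erfc(sqrt(z)).
    if df % 2 == 0:
        a, q = 1.0, math.exp(-z)
    else:
        a, q = 0.5, math.erfc(math.sqrt(z))
    # Recurrence Q(a + 1, z) = Q(a, z) + z^a * exp(-z) / Gamma(a + 1),
    # applied until a reaches df / 2.
    while a < df / 2.0:
        q += math.exp(a * math.log(z) - z - math.lgamma(a + 1.0))
        a += 1.0
    return q


# A statistic near the 5% critical value for df = 3 gives p close to 0.05.
p_critical = chi2_sf(7.815, 3)

# A small statistic (close fit) gives a large right-tail p-value.
p_close_fit = chi2_sf(0.470, 3)
```

Only the upper tail is summed, so a statistic far below df yields a p-value near 1 rather than a small one.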

Question 5

How are degrees of freedom decided?

Accepted Answer

For a goodness-of-fit test with k categories and a fully specified expected distribution (no parameters estimated from the data), df = k − 1. The minus one comes from the constraint that observed and expected counts must sum to the same N. If you estimate m parameters of the expected distribution from the data (for example, fitting a Poisson mean to the observed counts to test Poisson goodness-of-fit), df = k − 1 − m. This calculator assumes the expected vector you supply (or the implicit uniform one) is fully specified, so it reports df = k − 1. If you estimated parameters first, the reported p-value is based on too many degrees of freedom and is therefore too large; recompute it with df = k − 1 − m.
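The adjustment is simple enough to write down directly. The helper below is a hypothetical illustration (its name and argument names are assumptions), not part of the calculator.

```python
def gof_degrees_of_freedom(k, estimated_params=0):
    """df for a goodness-of-fit test: k categories minus 1, minus m fitted parameters."""
    df = k - 1 - estimated_params
    if df < 1:
        raise ValueError("not enough categories for the number of fitted parameters")
    return df


# Fully specified expectation over 4 categories, as the calculator assumes.
df_specified = gof_degrees_of_freedom(4)

# Poisson fit over 6 count bins with the mean estimated from the data (m = 1).
df_poisson = gof_degrees_of_freedom(6, estimated_params=1)
```

In the Poisson case the calculator would report df = 5, so the p-value should be recomputed with df = 4.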

Question 6

Should I use a chi-square test or Fisher’s exact test?

Accepted Answer

For a single categorical distribution (the goodness-of-fit case this calculator runs) the exact alternative is the multinomial exact test, not Fisher’s. For a 2×2 contingency table comparing two categorical variables, Fisher’s exact test is preferred whenever any expected count is small (< 5): it conditions on the observed marginals and computes an exact p-value rather than relying on the chi-square approximation. Chi-square is fine when all expected counts are comfortably large and is the standard choice for big samples. Fisher’s test does generalise to tables larger than 2×2, but the computation becomes expensive as the table and the sample grow, so chi-square (or the likelihood-ratio G-test) is the practical default there.
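For readers curious what "conditioning on the observed marginals" looks like in practice, here is a minimal sketch of a two-sided Fisher exact test for a 2×2 table, using only the Python standard library. The function name and the example table are assumptions. The two-sided rule used here (sum the probabilities of all tables no more likely than the observed one) is a common convention but not the only one.

```python
from math import comb


def fisher_exact_2x2(a, b, c, d):
    """Two-sided Fisher exact p-value for the table [[a, b], [c, d]]."""
    row1, row2 = a + b, c + d
    col1 = a + c
    n = row1 + row2
    denom = comb(n, col1)

    def prob(x):
        # Hypergeometric probability of x in the top-left cell, margins fixed.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    lo = max(0, col1 - row2)       # smallest feasible top-left count
    hi = min(row1, col1)           # largest feasible top-left count
    p_obs = prob(a)
    # Sum over all tables at most as probable as the observed one
    # (small relative tolerance guards against floating-point ties).
    return sum(p for p in (prob(x) for x in range(lo, hi + 1))
               if p <= p_obs * (1 + 1e-9))


p_weak = fisher_exact_2x2(3, 1, 1, 3)     # 34/70
p_strong = fisher_exact_2x2(4, 0, 0, 4)   # 2/70
```

The enumeration covers only the feasible values of one cell, which is why the 2×2 case is cheap; larger tables require enumerating many more configurations.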

Chi-Square Test Calculator
