The name is absent

Stata Technical Bulletin

sg57 An immediate command for two-way tables

Nicholas J. Cox, University of Durham, UK, FAX (011)-44-91-374-2456, [email protected]

The syntax for the tab2i command is

tab2i #11 #12 [...] ∖ #21 #22 [∙∙∙] [∖ ∙∙∙] [. replace ]

where #_x1, #_x2, etc., are zeros or positive integers showing the frequencies in a two-way table, and backslashes separate rows
of the table. There must be at least two rows and at least two columns in the table.

Option

replace indicates that the variables listed by the command are to be left as the current data in place of whatever data were
there. These variables are row and column indices, observed and expected frequencies, and Pearson and adjusted residuals.

Explanation

A chi-squared test for association of the row and column variables in a two-way table of frequencies is featured in most first
courses in statistics. In Stata, this test is provided by the immediate command tabi or by the command tabulate. However,
neither produces output of expected (fitted, predicted) frequencies or of residuals. Most data analysts wish to glance at least
briefly at such results.

tab2i is an alternative to tabi that does produce this output. In a two-way table of frequencies, the observed frequency in
row a and column j of the table y_ij is compared with the expected frequency y_ij. Under the null hypothesis of independence,
the expected frequencies are calculated from row totals yt₊, column totals y₊j, and the table total y₊₊ by

Vij —

yi+ У+j

У++

The chi-squared statistic is then

2 _ ∖ ' (yij yij)²λ -ʌ ʌ-

The residuals produced by tab2i come in two flavors. First, Pearson residuals (also called standardized or chi-residuals)
are the (appropriately signed) square roots of each cell’s contribution to the Pearson chi-squared statistic. The Pearson residuals
are thus

¾⅛^, ~ ytj

Vyij

Under the null hypothesis, the Pearson residuals approximately follow Gaussian (normal) distributions with mean 0 and variance
less than 1. Consequently, one rough rule of thumb is to look especially carefully at any residual greater than 2 in magnitude.

Second, adjusted residuals are Pearson residuals divided by an estimate of their standard error

₁-2i+y₁-2±n

У++ Jy y++J

so that they are distributed more like Gaussians with mean 0 and variance 1.

Example

Jacqueline Tivers (1985) interviewed 400 women with young children in the London Borough of Merton in September
1977. In one analysis, she looked at the cross-tabulation of the age at which women finished full-time education and whether
they used a library regularly. The table of frequencies did not come with a chi-squared statistic or residuals.

More intriguing information

1. The name is absent
2. Les freins culturels à l'adoption des IFRS en Europe : une analyse du cas français
3. The name is absent
4. Orientation discrimination in WS 2
5. The name is absent
6. Change in firm population and spatial variations: The case of Turkey
7. The name is absent
8. Performance - Complexity Comparison of Receivers for a LTE MIMO–OFDM System
9. International Financial Integration*
10. The name is absent