Creating a 2000 IES-LFS Database in Stata



PROVIDE Project Technical Paper 2005:1

February 2005


PSU:      <observations>

Number of PSUs

26177

Population size =  11221840

-----------

Mean |

Estimate

Std. Err.

[95% Conf.

Interval]

-------------------

Deff

totinc |
_ _ _ _ _ _ _ _ _ _ _ _

42793.12

725.4612

41371.18

44215.06

-------------------

1.232494

_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _

The ci command with the aweight option gives the exact same point estimates as svymean
with pweight. However, the standard errors are different since aweight uses a different
formula for the standard error. The standard error of the point estimate, which should not be
confused with the standard deviation of a variable, as well as the 95% confidence interval of
the estimate, is slightly larger when
pweight is used.

As argued before it is also important to specify clustering and stratification if applicable.
The IES 2000 data used in this example made use of clusters (PSUs) and stratification along
provincial and rural/urban lines (variable
provloc). In the two examples that follow svyset psu
psuno
is specified, and thereafter svyset strata provloc is added.12 In each instance the point
estimates are shown. Notice the effect on the standard error, confidence interval and
deff (see
section 2.2.3).

. svyset psu psuno

. svymean totinc

Survey mean estimation

pweight: wgtselect

Strata:   <one>

PSU:      psuno

Number of obs    =     26177

Number of strata =         1

Number of PSUs   =      2956

Population size  =  11221840

-----------

Mean |

Estimate

Std. Err.

[95% Conf.

Interval]

-------------------

Deff

totinc |
_ _ _ _ _ _ _ _ _ _ _ _

42793.12

1042.246

40749.52

44836.72

-------------------

2.543879

_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _

. svyset strata provloc

. svymean totinc

Survey mean estimation

pweight: wgtselect

Strata:   provloc

PSU:      psuno

Number of obs    =     26177

Number of strata =        18

Number of PSUs   =      3327

Population size  =  11221840

-----------

Mean |

Estimate

Std. Err.

[95% Conf.

Interval]

-------------------

Deff

totinc |
_ _ _ _ _ _ _ _ _ _ _ _

42793.12

975.1044

40881.25

44704.99

-------------------

2.226684

_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _

12 Variable psuno is supplied by Statistics South Africa. Variable provloc, which represents the strata in the
survey, was created by grouping variables for province (
prov) and location.

9

© PROVIDE Project



More intriguing information

1. XML PUBLISHING SOLUTIONS FOR A COMPANY
2. FISCAL CONSOLIDATION AND DECENTRALISATION: A TALE OF TWO TIERS
3. A Bayesian approach to analyze regional elasticities
4. Expectation Formation and Endogenous Fluctuations in Aggregate Demand
5. The name is absent
6. Shifting Identities and Blurring Boundaries: The Emergence of Third Space Professionals in UK Higher Education
7. Towards Learning Affective Body Gesture
8. How much do Educational Outcomes Matter in OECD Countries?
9. Multiple Arrhythmogenic Substrate for Tachycardia in a
10. THE UNCERTAIN FUTURE OF THE MEXICAN MARKET FOR U.S. COTTON: IMPACT OF THE ELIMINATION OF TEXTILE AND CLOTHING QUOTAS