Creating a 2000 IES-LFS Database in Stata



PROVIDE Project Technical Paper 2005:1

February 2005


PSU:      <observations>                  Number of PSUs   =     26177
                                          Population size  =  11221840

------------------------------------------------------------------------------
    Mean |   Estimate    Std. Err.   [95% Conf. Interval]        Deff
---------+--------------------------------------------------------------------
  totinc |   42793.12    725.4612    41371.18    44215.06    1.232494
------------------------------------------------------------------------------

The ci command with aweight gives exactly the same point estimate as svymean with pweight.
However, the standard errors differ because aweight uses a different formula for the standard
error. The standard error of the point estimate (which should not be confused with the
standard deviation of a variable) and the 95% confidence interval of the estimate are both
slightly larger when pweight is used.
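The comparison described above can be sketched in the syntax of more recent Stata releases. This is an illustration only, not the commands used in this paper: it assumes a release that supports the svy: prefix and the ci means form (earlier releases use ci totinc [aweight = wgtselect] instead), and that the IES 2000 data are already in memory with the variables totinc and wgtselect.

```stata
* Sketch only: assumes a recent Stata release and the IES 2000 data in memory

* Analytic weights: same point estimate, different standard error formula
ci means totinc [aweight = wgtselect]

* Probability weights, with each observation treated as its own PSU
svyset _n [pweight = wgtselect]
svy: mean totinc
```

Both commands should report the same mean of totinc; only the standard error and confidence interval are expected to differ.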

As argued before, it is also important to specify clustering and stratification where
applicable. The IES 2000 data used in this example made use of clusters (PSUs) and
stratification along provincial and rural/urban lines (variable provloc). In the two examples
that follow, svyset psu psuno is specified first, and thereafter svyset strata provloc is
added (see footnote 12). In each instance the point estimates are shown. Notice the effect on
the standard error, confidence interval and deff (see section 2.2.3).
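The svyset psu and svyset strata commands in the examples that follow belong to the syntax of older Stata releases. As a rough sketch only, assuming a later release that supports the svy: prefix and that the IES 2000 data are already in memory with the variable names used in this paper, each of the two designs could be declared in a single svyset call:

```stata
* Sketch only: assumes a Stata release with the svy: prefix, data in memory

* Design 1: clustering on PSUs, no stratification
svyset psuno [pweight = wgtselect]
svy: mean totinc
* Report the design effect for the mean just estimated
estat effects, deff

* Design 2: clustering on PSUs plus stratification by province and location
svyset psuno [pweight = wgtselect], strata(provloc)
svy: mean totinc
estat effects, deff
```

In this form the weight, PSU and strata are set together, so the design cannot be left half specified between commands.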

. svyset psu psuno

. svymean totinc

Survey mean estimation

pweight:  wgtselect                       Number of obs    =     26177
Strata:   <one>                           Number of strata =         1
PSU:      psuno                           Number of PSUs   =      2956
                                          Population size  =  11221840

------------------------------------------------------------------------------
    Mean |   Estimate    Std. Err.   [95% Conf. Interval]        Deff
---------+--------------------------------------------------------------------
  totinc |   42793.12    1042.246    40749.52    44836.72    2.543879
------------------------------------------------------------------------------

. svyset strata provloc

. svymean totinc

Survey mean estimation

pweight:  wgtselect                       Number of obs    =     26177
Strata:   provloc                         Number of strata =        18
PSU:      psuno                           Number of PSUs   =      3327
                                          Population size  =  11221840

------------------------------------------------------------------------------
    Mean |   Estimate    Std. Err.   [95% Conf. Interval]        Deff
---------+--------------------------------------------------------------------
  totinc |   42793.12    975.1044    40881.25    44704.99    2.226684
------------------------------------------------------------------------------
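The three sets of results above can be tied together through the design effect. As a sketch, assuming deff is the usual ratio of the design-based variance of the mean to the variance that simple random sampling of the same size would give, the reported standard error divided by the square root of deff recovers the implied simple random sampling standard error. The subscripts "design" and "srs" are notation introduced here for illustration and are not taken from the paper.

```latex
\[
\text{deff} = \frac{\widehat{V}_{\text{design}}(\bar{y})}{\widehat{V}_{\text{srs}}(\bar{y})}
\quad\Longrightarrow\quad
\widehat{SE}_{\text{srs}} = \frac{\widehat{SE}_{\text{design}}}{\sqrt{\text{deff}}}
\]
\[
\frac{725.4612}{\sqrt{1.232494}} \approx
\frac{1042.246}{\sqrt{2.543879}} \approx
\frac{975.1044}{\sqrt{2.226684}} \approx 653.5
\]
```

All three reported standard errors therefore rest on the same baseline, and only the design information changes: specifying the PSUs raises deff from about 1.23 to about 2.54, and adding the strata lowers it to about 2.23.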

12 Variable psuno is supplied by Statistics South Africa. Variable provloc, which represents the strata in the
survey, was created by grouping variables for province (prov) and location.


© PROVIDE Project


