Creating a 2000 IES-LFS Database in Stata



PROVIDE Project Technical Paper 2005:1

February 2005


PSU:      <observations>

Number of PSUs

26177

Population size =  11221840

-----------

Mean |

Estimate

Std. Err.

[95% Conf.

Interval]

-------------------

Deff

totinc |
_ _ _ _ _ _ _ _ _ _ _ _

42793.12

725.4612

41371.18

44215.06

-------------------

1.232494

_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _

The ci command with the aweight option gives the exact same point estimates as svymean
with pweight. However, the standard errors are different since aweight uses a different
formula for the standard error. The standard error of the point estimate, which should not be
confused with the standard deviation of a variable, as well as the 95% confidence interval of
the estimate, is slightly larger when
pweight is used.

As argued before it is also important to specify clustering and stratification if applicable.
The IES 2000 data used in this example made use of clusters (PSUs) and stratification along
provincial and rural/urban lines (variable
provloc). In the two examples that follow svyset psu
psuno
is specified, and thereafter svyset strata provloc is added.12 In each instance the point
estimates are shown. Notice the effect on the standard error, confidence interval and
deff (see
section 2.2.3).

. svyset psu psuno

. svymean totinc

Survey mean estimation

pweight: wgtselect

Strata:   <one>

PSU:      psuno

Number of obs    =     26177

Number of strata =         1

Number of PSUs   =      2956

Population size  =  11221840

-----------

Mean |

Estimate

Std. Err.

[95% Conf.

Interval]

-------------------

Deff

totinc |
_ _ _ _ _ _ _ _ _ _ _ _

42793.12

1042.246

40749.52

44836.72

-------------------

2.543879

_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _

. svyset strata provloc

. svymean totinc

Survey mean estimation

pweight: wgtselect

Strata:   provloc

PSU:      psuno

Number of obs    =     26177

Number of strata =        18

Number of PSUs   =      3327

Population size  =  11221840

-----------

Mean |

Estimate

Std. Err.

[95% Conf.

Interval]

-------------------

Deff

totinc |
_ _ _ _ _ _ _ _ _ _ _ _

42793.12

975.1044

40881.25

44704.99

-------------------

2.226684

_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _

12 Variable psuno is supplied by Statistics South Africa. Variable provloc, which represents the strata in the
survey, was created by grouping variables for province (
prov) and location.

9

© PROVIDE Project



More intriguing information

1. Surveying the welfare state: challenges, policy development and causes of resilience
2. LABOR POLICY AND THE OVER-ALL ECONOMY
3. The name is absent
4. Competition In or For the Field: Which is Better
5. The name is absent
6. On the Desirability of Taxing Charitable Contributions
7. The name is absent
8. Models of Cognition: Neurological possibility does not indicate neurological plausibility.
9. Whatever happened to competition in space agency procurement? The case of NASA
10. References
11. Human Development and Regional Disparities in Iran:A Policy Model
12. The Tangible Contribution of R&D Spending Foreign-Owned Plants to a Host Region: a Plant Level Study of the Irish Manufacturing Sector (1980-1996)
13. The name is absent
14. Personal Experience: A Most Vicious and Limited Circle!? On the Role of Entrepreneurial Experience for Firm Survival
15. SAEA EDITOR'S REPORT, FEBRUARY 1988
16. Internationalization of Universities as Internationalization of Bildung
17. The name is absent
18. A Hybrid Neural Network and Virtual Reality System for Spatial Language Processing
19. Policy Formulation, Implementation and Feedback in EU Merger Control
20. The name is absent