PROVIDE Project Technical Paper 2005:1
February 2005
Table 8 continued...
Uncoded |
True |
Total |
Miscoded |
Not |
Total obs. (C + D + E) | |
Public transport | ||||||
P1504Q01 |
25,259 |
9 |
25,268 |
113 |
882 |
26,263 |
P1504Q02 |
25,263 |
9 |
25,272 |
109 |
882 |
26,263 |
P1504Q03 |
25,263 |
9 |
25,272 |
109 |
882 |
26,263 |
P1504Q04 |
25,262 |
9 |
25,271 |
110 |
882 |
26,263 |
P1504Q05 |
25,263 |
9 |
25,272 |
109 |
882 |
26,263 |
P1504Q06 |
25,263 |
10 |
25,273 |
109 |
881 |
26,263 |
P1504Q07 |
25,263 |
9 |
25,272 |
109 |
882 |
26,263 |
P1504Q08 |
25,263 |
9 |
25,272 |
109 |
882 |
26,263 |
P1504total |
25,258 |
9 |
25,267 |
114 |
882 |
26,263 |
Cost of other sport/recreation | ||||||
goods | ||||||
P2003Q01 |
24,726 |
- |
24,726 |
102 |
1,435 |
26,263 |
P2003Q02 |
24,736 |
1 |
24,737 |
92 |
1,434 |
26,263 |
P2003Q03 |
24,735 |
2 |
24,737 |
93 |
1,433 |
26,263 |
P2003Q0401 |
24,731 |
1 |
24,732 |
97 |
1,434 |
26,263 |
P2003Q05 |
24,734 |
- |
24,734 |
94 |
1,435 |
26,263 |
P2003Q06 |
24,735 |
- |
24,735 |
93 |
1,435 |
26,263 |
P2003Q07 |
24,735 |
1 |
24,736 |
93 |
1,434 |
26,263 |
P2003Total_______________________ |
21,405 |
- |
21,405 |
3,423 |
1,435 |
26,263 |
From Table 8 it is clear that the numbers of true missing values are quite low. The only
sections that contain more than ten missing values in certain variables are the two sections on
monthly housing costs. Changing missing values to zeroes is justifiable given the small
number of true missing values. It is better to rather lose information content of a few true
missing variables by changing it to zeroes than lose the entire observation due to the adding-
up restrictions in Stata. As explained previously missing values in expenditure categories
from domworkerh.dta and homegrownh.dta were also changed to zero given that these were
optional section in the questionnaire.
The only other anomaly in Table 8 is variable P2003Total. Answering this question was
not optional, and hence the large number of uncoded observations is strange. However, as
explained below, it was established that this was a result of a coding error. Since this is a sub-
total it could simply be recalculated.
Another concern relates to questions 3.1 and 3.2 in the housing section. The questionnaire
requests respondents to answer either the section labelled monthly housing cost if rented or
the section labelled monthly housing cost if owned. In 21 cases households answered both
sections. It does not seem highly unlikely that some households own property (excluding
holiday homes) as well as rent property. The list below shows the reported values for each of
the questions in these two sections. There appears to be no duplication (apart from record
number 26015 - levy is reported twice). The error is small enough and is unlikely to affect
results in any great deal, and is consequently ignored.
41
© PROVIDE Project
More intriguing information
1. Konjunkturprognostiker unter Panik: Kommentar2. The name is absent
3. The magnitude and Cyclical Behavior of Financial Market Frictions
4. Industrial Cores and Peripheries in Brazil
5. DISCUSSION: ASSESSING STRUCTURAL CHANGE IN THE DEMAND FOR FOOD COMMODITIES
6. Subduing High Inflation in Romania. How to Better Monetary and Exchange Rate Mechanisms?
7. The name is absent
8. Estimated Open Economy New Keynesian Phillips Curves for the G7
9. The name is absent
10. Impacts of Tourism and Fiscal Expenditure on Remote Islands in Japan: A Panel Data Analysis