Estimating the Technology of Cognitive and Noncognitive Skill Formation

diagonalization used in linear factor analyses, we do not work with random vectors. Instead,
we work with their densities. This approach offers the advantage that the problem remains
linear even when the random vectors are related nonlinearly.

The conditional independence requirement of Assumption 2 is weaker than the full in-
dependence assumption traditionally made in standard linear factor models as it allows for
heteroscedasticity. Assumption 3 requires θ, Z₁ , Z₂ to be vectors of the same dimensions,
while Assumption 4 can be satisfied even if Z3 is a scalar. The minimum number of mea-
surements needed for identification is therefore 2L + 1, which is exactly the same number of
measurements as in the linear, classical measurement error case.

Versions of Assumption 3 appear in the nonparametric instrumental variable literature
(e.g., Newey and Powell, 2003; Darolles et al., 2002). Intuitively, the requirement that
p_Z₁ |_Z₂ (Z₁ |Z₂ ) forms a bounded complete family requires that the density of Z₁ vary suffi-
ciently as Z₂ varies (and similarly for pθ∣Z₁ (θ∣Z₁)).¹9

Assumption 4 is automatically satisfied, for instance, if θ is univariate and a3 (θ, ε3) is
strictly increasing in θ. However, it holds much more generally. Since a3 (θ, ε3) is nonsepa-
rable, the distribution of Z₃ conditional on θ can change with θ, thus making it possible for
Assumption 4 to be satisfied even if a3 (θ, ε3) is not strictly increasing in θ.

Assumption 5 specifies how the observed Z1 is used to determine the scale of the un-
observed θ. The most common choices of the functional Ψ would be the mean, the mode,
the median, or any other well-defined measure of location. This specification allows for non-
classical measurement error. One way to satisfy this assumption is to normalize a1 (θ, ε1) to
be equal to θ + ε₁ , where ε₁ has zero mean, median or mode. The zero mode assumption
is particularly plausible for surveys where respondents face many possible wrong answers
but only one correct answer. Moving the mode of the answers away from zero would there-
fore require a majority of respondents to misreport in exactly the same way— an unlikely
scenario. Many other nonseparable functions can also satisfy this assumption. With the
distribution of pθ (θ) in hand, we can identify the technology using the analysis presented
below in Section 3.4.

Note that Theorem 2 does not claim that the distributions of the errors εj or that the
functions a_j∙ (∙, ∙) are identified. In fact, it is always possible to alter the distribution of ε_j∙ and
the dependence of the function α_j∙ (∙, ∙) on its second argument in ways that cancel each other
out, as noted in the literature on nonseparable models.²⁰ However, lack of identifiability of
¹⁹In the case of classical measurement error, bounded completeness assumptions can be phrased in terms of
primitive conditions requiring nonvanishing characteristic functions of the distributions of the measurement
errors as in Mattner (1993). However, apart from this special case, very little is known about primitive
conditions for bounded completeness, and research is still ongoing on this topic. See d’Haultfoeuille (2006).
²⁰See Matzkin (2003, 2007).

More intriguing information

1. THE CHANGING STRUCTURE OF AGRICULTURE
2. Keynesian Dynamics and the Wage-Price Spiral:Estimating a Baseline Disequilibrium Approach
3. Natural hazard mitigation in Southern California
4. Insecure Property Rights and Growth: The Roles of Appropriation Costs, Wealth Effects, and Heterogeneity
5. Spectral calibration of exponential Lévy Models [1]
6. The name is absent
7. The name is absent
8. EFFICIENCY LOSS AND TRADABLE PERMITS
9. Infrastructure Investment in Network Industries: The Role of Incentive Regulation and Regulatory Independence
10. Fiscal Policy Rules in Practice