A Rational Analysis of Alternating Search and Reflection Strategies in Problem Solving



Figure 8: The property-retrieval strategy

Figure 9: The find-fact-on-feedback strategy

general rule that wants to create a new example. The second
strategy, find-fact-on-feedback, is demonstrated in figure 9.
If the model has decided it will retrieve the weights, it still
cannot predict an answer, because it doesn’t even know what
the available answers are. So a “I don’t know” production
fires, after which the environment hopefully will give some
feedback. Suppose we have a child facing a real beam, it can
see that the answer is “left”. The strategy then tries to find
some fact in declarative memory that can help to predict the
answer. This can be an arbitrary fact, but since “beam”, “2”,
“3” and “left” are all part of the goal, ACT-R ensures that
facts containing these elements, or having associations with
them, are likely candidates. So 3 is-greater-than 2 is a possi-
ble candidate, particularly if there is already an association
between beam and greater-than, i.e. the child already knows
that beams have something to do with the fact that one thing
is greater than another.

Results

Simulations of the model, discussed in detail in Taatgen
(1996), show that it can indeed infer the correct rules for the
beam task. If the model already has an association between
beams and weight and between beams and greater-than, the
correct rules can be inferred using only a few examples. If
the model has no prior associations at all, it may need as
much as 40 examples, and in some runs it cannot even find
the correct rules at all.

When the model starts out with a wrong hypothesis, for
example that the labels have to be used to predict the out-
come, it shows a behavior similar to the explore-impasse-
insight-execute scheme: it first learns a lot of irrelevant rules
to predict the answer using the labels, then reaches a stage in
which it tries to explain a single example over and over again
but fails in doing this, after which it rejects the rule that
examines the labels, creates a rule that examines the weights,
and quickly derives the rest of the rules needed within a few
trials. Figure 10 shows an estimation of the time spent at

Figure 10: Estimated time spent at each trial before and after
the “insight” trial

each trial before and after the moment the model creates the
rule to examine weight instead of label.

Same model, other task: discrimination-shift

Another interesting property of the model is that its rules are
general, and can be applied to other tasks. A task that can be
modeled using the same production rules is discrimination-
shift learning (Kendler & Kendler, 1959). Figure 11 shows
an example of this task: subjects have to learn to discrimi-
nate the four stimuli in two reinforcement categories, for
example white is positive and black is negative. After the
subjects has made 10 consecutive correct predictions, the
reinforcement scheme is changed: either a reversal-shift, in
which all stimuli that received previous positive reinforce-

Figure 11: Example of discrimination-shift learning



More intriguing information

1. Ahorro y crecimiento: alguna evidencia para la economía argentina, 1970-2004
2. Why unwinding preferences is not the same as liberalisation: the case of sugar
3. Has Competition in the Japanese Banking Sector Improved?
4. The name is absent
5. Philosophical Perspectives on Trustworthiness and Open-mindedness as Professional Virtues for the Practice of Nursing: Implications for he Moral Education of Nurses
6. The name is absent
7. SOME ISSUES CONCERNING SPECIFICATION AND INTERPRETATION OF OUTDOOR RECREATION DEMAND MODELS
8. Disturbing the fiscal theory of the price level: Can it fit the eu-15?
9. EXPANDING HIGHER EDUCATION IN THE U.K: FROM ‘SYSTEM SLOWDOWN’ TO ‘SYSTEM ACCELERATION’
10. Literary criticism as such can perhaps be called the art of rereading.
11. The name is absent
12. The name is absent
13. THE ANDEAN PRICE BAND SYSTEM: EFFECTS ON PRICES, PROTECTION AND PRODUCER WELFARE
14. TOWARDS THE ZERO ACCIDENT GOAL: ASSISTING THE FIRST OFFICER MONITOR AND CHALLENGE CAPTAIN ERRORS
15. Insecure Property Rights and Growth: The Roles of Appropriation Costs, Wealth Effects, and Heterogeneity
16. Fertility in Developing Countries
17. Structure and objectives of Austria's foreign direct investment in the four adjacent Central and Eastern European countries Hungary, the Czech Republic, Slovenia and Slovakia
18. The name is absent
19. Der Einfluß der Direktdemokratie auf die Sozialpolitik
20. Anti Microbial Resistance Profile of E. coli isolates From Tropical Free Range Chickens