Page 235 - Deep Learning
P. 235

218                         Adaptation

               A plausible approach is to divide errors into types based on their origins.
            The concept of general problem-solving methods suggests a particular hypoth-
                                                         27
            esis about the origin and hence the correction of errors.  At the first encounter
            with the target task, a learner is not so much practicing as solving the problem
            of how to perform the practice task. In the earliest stage of practicing – the
            very first training trials – he cannot yet have a strategy that is adapted to the
            specific task (or else the task is not unfamiliar). But if he is acting, he is, by defi-
            nition, executing some strategy or another. General methods are not adapted
            to the particular structure of any one task environment. They are widely appli-
            cable, but inefficient. Their function is not to produce solutions but to enable
            the learner to generate task-relevant behaviors vis-à-vis an unfamiliar task.
               Like other strategies, general strategies can be described as collections of
            rules. The latter serve as templates for task-specific rules. For example, the gen-
            eral heuristic, if you want to switch on an electrical device, then push its power
            button, when applied to a particular device such as a computer projector, might
            generate the specific rule, if you want to turn on a computer projector of brand
            X, then push the red button to the right. The conditions on such rules (i.e., the
            part between if and then) are likely to be incomplete because the general rules
            do not contain any knowledge about the specific situation to which they are
            applied. Consequently, a rule has some probability of matching situations in
            which the action it recommends is not, in fact, appropriate, correct or useful.
            To continue the projector example, perhaps the projector works only if it is
            switched on after being hooked up to the computer, so a more complete rule
            is, if you want to turn on a computer projector of brand X, and the projector has
            been hooked up to the computer, then push the red button to the right. Without
            this additional condition, the rule will recommend pushing the power button
            even in situations in which the projector and the computer are not yet con-
            nected, perhaps causing a communication problem between the two devices.
            I refer to errors caused by overly general rules as displacement errors because
            they arise when an action was recommended even though it should not have
            been and thereby displaced whatever action would otherwise have been per-
                                28
            formed in that situation.  The claim that a significant proportion of errors in
            real situations is due to displacement is the Displacement Hypothesis.
               Evidence that errors are generated because rules and strategies are overly
            general at the outset of learning has been found in multiple task domains; see
                                                                           29
            Table 7.1. Overregularization in language acquisition is a classical example.
            Children at a certain age apply linguistic rules to cases that constitute excep-
            tions to those rules. An example is the overregularization of past tense forma-
            tion. That is, the tendency to use the suffix “-ed” even in the case of irregular
   230   231   232   233   234   235   236   237   238   239   240