Page 407 - Deep Learning
P. 407

390                         Conclusion

            prior experience and consequently experience a higher rate of non-monotonic
            change. But the paradoxical implication of turbulence is that a higher rate
            of change will not necessarily be associated with a higher level of success. A
            stronger disposition to change might not by itself be useful unless we can also
            learn something about the conditions under which we are better off projecting
            prior experience and the conditions under which we are better off drawing
            back to leap. The ideal is not to change with high frequency, but to project
            prior experience when the situation at hand is like prior situations and hence
            does in fact lend itself to be handled on the basis of experience, and to draw
            back and leap when the situation at hand is, in fact, essentially different and
            requires a novel response.
               It is tempting to believe that we can learn to differentiate one class of situ-
            ations from the other, and hence become more accurate in choosing between
            extrapolating and leaping. After all, a person’s lifetime contains many examples
            of both types of situations, and the study of history extends that database over
            generations. If we analyze this accumulated experience from the deep learning
            point of view, we might be able identify conditions that reliably characterize
            situations in which we should project rather than leap and vice versa, and so
            improve the accuracy of our choices.
               A moment of reflection reveals the fallacy hiding in this temptation. Our
            knowledge of the situations in which projection worked well and the situa-
            tions in which drawing back to leap worked better is itself a strand in our past
            experience. It is subject to the same turbulence as all experience: We cannot
            know whether our past experience of when past experience is useful, is itself
            useful in any future situation. That is, we cannot project whatever experience
            we have of projecting past experience onto the next situation. Are we better
            off overriding that experience and making choices as to when to project and
            when to override in some novel way? The argument against the projectability
            of past experience applies recursively to our past experience of projecting past
            experience; similar with our experience of drawing back to leap. In program-
            ming terminology, turbulence is recursive; that is, it applies to itself. Hence, we
            cannot, even in principle, learn from experience when to project prior experi-
            ence and when to override it.
               Philosophers  recognize  this  as  a  version  of  “Hume’s  problem”  or  “the
            problem of induction.”  David Hume asked an epistemological question: How
                               3
            can we know that a regularity induced from past examples is true of the entire
            set that the examples were drawn from? The problem, said Hume, is that if
            we apply a principle of inductive inference, what guarantees the validity of
            that principle? If the principle is itself a generalization over successful past
   402   403   404   405   406   407   408   409   410   411   412