Page 368 - Understanding Machine Learning
P. 368

Proof of the Fundamental Theorem of Learning Theory
           350

                 i.i.d. instances from X with labels according to c we have that any ERM hypothesis
                 has a true error of at most 
.
                                         c
                 Proof. Define the class H ={c    h : h ∈ H},where c     h = (h \ c) ∪ (c \ h). It is
                                                                                        c
                 easy to verify that if some A ⊂ X is shattered by H then it is also shattered by H
                                                           c
                 and vice versa. Hence, VCdim(H) = VCdim(H ). Therefore, using Theorem 28.3
                                                                                        c
                 we know that with probability of at least 1 − δ, the sample S is an 
-net for H .
                 Note that L D (h) = D(h     c). Therefore, for any h ∈ H with L D (h) ≥ 
 we have that

                 |(h  c)∩S| > 0, which implies that h cannot be an ERM hypothesis, which concludes
                 our proof.
   363   364   365   366   367   368   369   370   371   372   373