Page 255 - Data Science Algorithms in a Week
P. 255

The Utilization of Case-Based Reasoning                237



























                       Figure 3. Different paths in the EDs.

                       The Nearest Neighbor Approach

                          Algorithms  such  as  K  nearest  neighbor  or  R  nearest  neighbor  are  deployed  to
                       determine the similarities between the attributes of both the new case we are seeking a
                       solution for, and the cases stored in the case-base. Similarities are then normalized to fall
                       between zero “0” and one “1” or as percentages. These functions use various similarity
                       metrics such as Euclidean distance, city block distance, probabilistic similarity measures,
                       and geometric similarity metrics. Similarity percentages are retrieved using a predefined
                       parameter  value  “K”.  However,  in  the  R  nearest  neighbors,  cases  with  similarities
                       percentages (see equation below) that are greater than or equal to a predefined value “R”
                       are retrieved.






                          where,
                          NC represents the new case
                          SCs are stored cases in the case-base.
                          n is the number of attributes in each case
                          w is weight, and
                          f is the similarity function.
   250   251   252   253   254   255   256   257   258   259   260