Page 255 - Data Science Algorithms in a Week

P. 255

The Utilization of Case-Based Reasoning 237

Figure 3. Different paths in the EDs.

The Nearest Neighbor Approach

Algorithms such as K nearest neighbor or R nearest neighbor are deployed to
determine the similarities between the attributes of both the new case we are seeking a
solution for, and the cases stored in the case-base. Similarities are then normalized to fall
between zero “0” and one “1” or as percentages. These functions use various similarity
metrics such as Euclidean distance, city block distance, probabilistic similarity measures,
and geometric similarity metrics. Similarity percentages are retrieved using a predefined
parameter value “K”. However, in the R nearest neighbors, cases with similarities
percentages (see equation below) that are greater than or equal to a predefined value “R”
are retrieved.

where,
NC represents the new case
SCs are stored cases in the case-base.
n is the number of attributes in each case
w is weight, and
f is the similarity function.

250 251 252 253 254 255 256 257 258 259 260