Estimated Probability
Probability that a generic pattern , drawn from pdf , belongs to an arbitrary region of the feature space:
If out of data are in , we can estimate via the relative frequency:
Now consider this:

Letβs re-define as , where the subscript means the number of data in the region. ~Ex.: means the region is formed by samples of data.
NOTE: The volume does not depend on the number of data (yet), we decide the area , so we decide the volume.
Let be our pattern of interest. ~Ex.: we are searching for the pdf of males in a specific university Then we use the region to estimate from data.
We define:
- be the volume of .
- be the number of data (out of ) in .
- be the -th estimate of , ~Ex.: .
Asymptotic Necessary and Sufficient Conditions: we want to ensure :
- (to guarantee convergence)
β To satisfy these conditions, we can do one of two things:
- Fix a proper volume, say and determine consequently. (Parzen Window).
- Fix (~Ex.: ), and determine consequently, in such a way that exactly patterns fall in (-nearest neighbor).
Original Files:
