Estimated Probability

Probability that a generic pattern , drawn from pdf , belongs to an arbitrary region of the feature space:

If out of data are in , we can estimate via the relative frequency:

Now consider this:

Let’s re-define as , where the subscript means the number of data in the region. ~Ex.: means the region is formed by samples of data.

NOTE: The volume does not depend on the number of data (yet), we decide the area , so we decide the volume.


Let be our pattern of interest. ~Ex.: we are searching for the pdf of males in a specific university Then we use the region to estimate from data.

We define:

  • be the volume of .
  • be the number of data (out of ) in .
  • be the -th estimate of , ~Ex.: .

Asymptotic Necessary and Sufficient Conditions: we want to ensure :

  1. (to guarantee convergence)

β‡’ To satisfy these conditions, we can do one of two things:

  1. Fix a proper volume, say and determine consequently. (Parzen Window).
  2. Fix (~Ex.: ), and determine consequently, in such a way that exactly patterns fall in (-nearest neighbor).

Original Files: