RBF (Radial Basis Function) Networks

A generalized linear discriminant
RBF NNs are conceptually similar to K-Nearest Neighbour (K-NN): the predicted value for an item is likely to be about the same as that of other items whose predictor variables have close values. One difference is that here we train a Neural Network, whereas K-NN has to store every training sample, so K-NN requires much more space.
Definition of Radial Basis Function

RBF Networks are 2-layer NNs (input layer, 1 hidden layer, output layer).
All weights between the input layer and the hidden layer are equal to $1$ .
There could be bias terms: $b_{i}$ .
The RB Function (Radial Basis Function), or kernel, is defined as:

$$\varphi_{k} (\underline{x}) = e^{- \frac{\| \underline{x} - \underline{\mu_{k}} \|^{2}}{2 \sigma_{k}^{2}}}$$
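As a minimal sketch of how a single kernel could be evaluated, assuming the Gaussian form with the squared Euclidean distance in the exponent: the function name `rbf_kernel` and the sample points are hypothetical, chosen only for illustration.

```python
import math

def rbf_kernel(x, mu, sigma):
    """Gaussian RBF: exp(-||x - mu||^2 / (2 * sigma^2)).

    x and mu are equal-length sequences of floats, sigma is the kernel width.
    """
    sq_dist = sum((xi - mi) ** 2 for xi, mi in zip(x, mu))
    return math.exp(-sq_dist / (2.0 * sigma ** 2))

mu = [1.0, 2.0]                                   # hypothetical center
at_center = rbf_kernel([1.0, 2.0], mu, sigma=1.0)  # distance 0 from the center
one_sigma = rbf_kernel([2.0, 2.0], mu, sigma=1.0)  # distance 1 from the center
far_away = rbf_kernel([5.0, 2.0], mu, sigma=1.0)   # distance 4 from the center
```

The activation is largest at the center and shrinks as the input moves away from it, with $\sigma_{k}$ controlling how fast it shrinks.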

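A simple RBF Network with just 1 hidden layer will have this form:

$$y_{i} = \sum_{j = 1}^{k} w_{ij} \varphi_{j} (\underline{x}) + b_{i}$$

where $\varphi_{j}$ is the kernel of the $j$-th hidden unit (with its own center and width) and $k$ is the number of hidden units.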
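The forward pass can be sketched as follows. This is an illustrative reading of the formula, assuming Gaussian kernels with a squared distance in the exponent; `rbf_forward` and all parameter values are hypothetical.

```python
import math

def rbf_kernel(x, mu, sigma):
    """Gaussian RBF: exp(-||x - mu||^2 / (2 * sigma^2))."""
    sq_dist = sum((xi - mi) ** 2 for xi, mi in zip(x, mu))
    return math.exp(-sq_dist / (2.0 * sigma ** 2))

def rbf_forward(x, centers, sigmas, weights, biases):
    """Return the outputs y_i = sum_j w_ij * phi_j(x) + b_i.

    weights[i][j] connects hidden unit j to output i.
    """
    # Hidden layer: one kernel activation per hidden unit.
    phi = [rbf_kernel(x, mu, s) for mu, s in zip(centers, sigmas)]
    # Output layer: a plain linear combination of the activations.
    return [sum(w * p for w, p in zip(w_row, phi)) + b
            for w_row, b in zip(weights, biases)]

# Made-up network: 2 inputs, 2 hidden units, 1 output.
centers = [[0.0, 0.0], [1.0, 1.0]]
sigmas = [1.0, 1.0]
weights = [[2.0, -1.0]]
biases = [0.5]

y = rbf_forward([0.0, 0.0], centers, sigmas, weights, biases)
```

Note that the output is linear in $w_{ij}$ and $b_{i}$ once the hidden activations are fixed, which is why the output parameters can later be fitted with linear methods.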

With Gaussian RB Functions the network realizes a mixture of Gaussians, hence RBF Networks are particularly suitable for PDF estimation.
Like MLPs, RBF Networks are “universal” approximators.

Learning is supervised and minimizes the cost:

$$C (\tau, w) = \frac{1}{2} \sum_{i} (\hat{y}_{i} - y_{i})^{2}$$
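A small sketch of this cost, assuming it is a sum of squared differences between two equal-length lists of values; `squared_error_cost` and the numbers are made up for illustration.

```python
def squared_error_cost(predictions, targets):
    """Half the sum of squared differences between predictions and targets."""
    return 0.5 * sum((p - t) ** 2 for p, t in zip(predictions, targets))

# Differences are 0, 1 and -2, so the squared differences are 0, 1 and 4.
cost = squared_error_cost([1.0, 2.0, 3.0], [1.0, 1.0, 5.0])
```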

And we usually consider 2 approaches:

Via gradient descent over $C (w)$ , we learn all the parameters: $w_{ij}$ , $b_{i}$ , $\underline{μ_{k}}$ and $σ_{k}$ .
$\underline{μ_{k}}$ and $σ_{k}$ are estimated statistically first; the remaining parameters $w_{ij}$ and $b_{i}$ are then estimated via linear algebra methods (such as matrix inversion), or via gradient descent as in the first approach.
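The second approach can be sketched on a toy 1-D regression problem. Everything specific here is an assumption made for illustration: the data, the heuristic of taking every 5th sample as a center, using the center spacing as the width, and the learning rate. The output parameters are fitted with the gradient descent variant rather than matrix inversion.

```python
import math

def rbf_kernel(x, mu, sigma):
    """1-D Gaussian RBF: exp(-(x - mu)^2 / (2 * sigma^2))."""
    return math.exp(-((x - mu) ** 2) / (2.0 * sigma ** 2))

# Toy data (made up): 21 samples of one sine period on [0, 2].
xs = [i / 10.0 for i in range(21)]
ts = [math.sin(math.pi * x) for x in xs]

# Stage 1: fix centers and width from the inputs alone (targets unused).
# Assumed heuristic: every 5th sample is a center, width = center spacing.
centers = xs[::5]
sigma = centers[1] - centers[0]

# The hidden activations do not change during stage 2, so compute them once.
phi = [[rbf_kernel(x, mu, sigma) for mu in centers] for x in xs]

def predict(w, b, row):
    return sum(wj * pj for wj, pj in zip(w, row)) + b

def cost(w, b):
    return 0.5 * sum((predict(w, b, row) - t) ** 2 for row, t in zip(phi, ts))

# Stage 2: gradient descent on the output weights and bias only.
w = [0.0] * len(centers)
b = 0.0
lr = 0.01  # small enough to keep this quadratic problem stable
initial_cost = cost(w, b)
for _ in range(2000):
    errors = [predict(w, b, row) - t for row, t in zip(phi, ts)]
    grad_w = [sum(e * row[j] for e, row in zip(errors, phi))
              for j in range(len(w))]
    grad_b = sum(errors)
    w = [wj - lr * g for wj, g in zip(w, grad_w)]
    b -= lr * grad_b
final_cost = cost(w, b)
```

Because the centers and widths are frozen, stage 2 is a linear least-squares problem in $w_{ij}$ and $b_{i}$, so a direct linear solve could replace the loop.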

NOTE: With RBF Networks we can apply gradient-ASCENT to the ML (Maximum Likelihood) criterion in order to estimate PDFs.

The ML method only works if the weights between the last hidden layer and the output layer sum up to $1$ .

This can’t be done with MLPs because the constraint $\int p (x) d x = 1$ is violated: MLPs realize mixtures of activation functions that are not inherently PDFs.
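A hedged sketch of the idea in 1-D: it assumes normalized Gaussian components (each integrating to $1$) and keeps the output weights summing to $1$ by passing unconstrained values through a softmax, which is one possible way to enforce the constraint and is not taken from the notes. Centers and widths stay fixed for brevity, and the data and names are made up.

```python
import math

def gaussian_pdf(x, mu, sigma):
    """Normalized 1-D Gaussian density (integrates to 1)."""
    return (math.exp(-((x - mu) ** 2) / (2.0 * sigma ** 2))
            / (sigma * math.sqrt(2.0 * math.pi)))

def softmax(z):
    m = max(z)
    e = [math.exp(v - m) for v in z]
    s = sum(e)
    return [v / s for v in e]

def mixture_pdf(x, weights, mus, sigmas):
    return sum(w * gaussian_pdf(x, mu, s)
               for w, mu, s in zip(weights, mus, sigmas))

def log_likelihood(data, weights, mus, sigmas):
    return sum(math.log(mixture_pdf(x, weights, mus, sigmas)) for x in data)

# Toy data (made up): six samples near 0, two samples near 4.
data = [-0.3, 0.1, 0.2, -0.1, 0.4, 0.0, 3.8, 4.1]
mus, sigmas = [0.0, 4.0], [1.0, 1.0]  # fixed in this sketch
logits = [0.0, 0.0]                   # unconstrained values behind the weights

ll_before = log_likelihood(data, softmax(logits), mus, sigmas)
lr = 0.1
for _ in range(200):
    weights = softmax(logits)
    grad = [0.0, 0.0]
    for x in data:
        p = mixture_pdf(x, weights, mus, sigmas)
        for j in range(2):
            # Share of sample x explained by component j.
            resp = weights[j] * gaussian_pdf(x, mus[j], sigmas[j]) / p
            grad[j] += resp - weights[j]
    # ASCENT: step along the gradient to increase the log-likelihood.
    logits = [z + lr * g for z, g in zip(logits, grad)]
weights = softmax(logits)
ll_after = log_likelihood(data, weights, mus, sigmas)

# Riemann-sum check over [-10, 14] that the mixture integrates to about 1.
step = 0.01
integral = step * sum(mixture_pdf(-10.0 + i * step, weights, mus, sigmas)
                      for i in range(2401))
```

With weights that sum to $1$ and components that are themselves PDFs, the mixture remains a valid PDF after every update, which is what lets the likelihood be maximized directly.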

Definition of Radial Basis Function

An RBF is so named because the radial distance from the center is the argument to the function, in our case:

$$\varphi_{k} (\underline{x}) = \exp \left\{ - \frac{\| \underline{x} - \underline{\mu_{k}} \|^{2}}{2 \sigma_{k}^{2}} \right\}$$

The further a neuron is from the point being evaluated, the less influence it has.
