Rosenblatt’s Perceptron Algorithm

Given a training set $L$ with targets $y_{i}$ taking values $\pm 1$ , find $\overset{w}{^}$ and $t$ such that the hyperplane perpendicular to $\overset{w}{^}$ correctly separates the examples and $t$ is the number of times that $\overset{w}{^}$ is updated.

INITIALIZE: Set $\overset{w}{^}_{0} \leftarrow 0$ , $t \leftarrow 0$ , $j \leftarrow 1$ , and $m \leftarrow 0$ .
NORMALIZE: Compute $R$ for all $i = 1, \dots, l$ set $\overset{x}{^}_{i} \leftarrow (\overset{x}{^}_{i}, R)^{T}$ .
CARROT OR STICK ?: If $y_{j} \overset{w}{^}^{T} \overset{x}{^}_{j} \leq 0$ , set $\overset{w}{^} \leftarrow \overset{w}{^} + η y_{j} \overset{x}{^}_{j}$ , $t \leftarrow t + 1$ , $m \leftarrow m + 1$ .
ALL TESTED ?: Set $j \leftarrow j + 1$ ; If $j \neq = l$ go back to step 3.
NO MISTAKES ?: If $m = 0$ , the algorithm terminates; set $\overset{w}{^} \leftarrow (w, b / R)$ and return $(\overset{w}{^}, t)$ .
TRY AGAIN: Set $j \leftarrow 1$ , $m \leftarrow 0$ , and go back to step P3.

$y_{j} \overset{w}{^}^{T} \overset{x}{^}_{j} \leq 0$ : because $y_{i}$ can only be $- 1$ or $+ 1$ , (two classes), the classification in this case is defined as sign agreement, because the supervisor stop the algorithm only when the sign agrees ( $y_{j} \overset{w}{^}^{T} \overset{x}{^}_{j} > 0$ )

This algorithm can also be seen as a NN where the activation function is the sign function

ReLU

Rectified Linear Unit

Another activation function that can be substituted to the sigmoid function.

Prevents saturation for $a > 0$ but not for $a < 0$ .

Also note that the derivate for $a = 0$ doesn’t exist. So we have to directly specify its value in the code (not too difficult)

Robust Linear Separation

The Rosenblatt’s perceptron algorithm perform a robust linear separation

Robust because all the points are divided from the linear separation by a factor of $δ$ (distance), this value is not known at prior, it can be found as the $min$ of the distances from the line of linear-separation and all the points.