Machine Learning (Chapter 17): Perceptron Learning

Introduction

Perceptron learning is a fundamental concept in machine learning and neural networks, providing the basis for understanding more complex algorithms. This chapter delves into the Perceptron algorithm, a linear classifier used for binary classification tasks.

1. The Perceptron Model

The Perceptron is a simple linear classifier that maps input features to a binary output. It consists of input features, weights, a bias term, and an activation function. The goal of the Perceptron learning algorithm is to find a set of weights and a bias that correctly classify the input data.

The Perceptron model can be described mathematically as follows:

  1. Mathematical Formulation

    Given an input vector $\mathbf{x} = [x_1, x_2, \ldots, x_n]$ and a weight vector $\mathbf{w} = [w_1, w_2, \ldots, w_n]$, the output $y$ of the Perceptron is computed using the following formula:

    $y = \text{sign}(\mathbf{w} \cdot \mathbf{x} + b)$

    where:

    • $\mathbf{w} \cdot \mathbf{x}$ denotes the dot product between the weight vector $\mathbf{w}$ and the input vector $\mathbf{x}$.
    • $b$ is the bias term.
    • $\text{sign}$ is the activation function that returns $+1$ if its argument is non-negative, and $-1$ otherwise (matching the code below, which maps a net input of exactly $0$ to $+1$).
  2. Perceptron Learning Algorithm

    The Perceptron Learning Algorithm adjusts the weights and bias based on the prediction error. Given a dataset with input features $\mathbf{x}_i$ and corresponding labels $y_i$, the update rules for the weights and the bias are:

    $\mathbf{w} \leftarrow \mathbf{w} + \eta (y_i - \hat{y}_i) \mathbf{x}_i$

    $b \leftarrow b + \eta (y_i - \hat{y}_i)$

    where:

    • $\eta$ is the learning rate.
    • $y_i$ is the true label.
    • $\hat{y}_i$ is the predicted label.

    Note that the factor $(y_i - \hat{y}_i)$ is zero whenever the prediction is correct, so only misclassified examples change the model. The algorithm iteratively updates the weights and bias until convergence or until a predefined number of iterations is reached; the short numerical sketch below steps through a single update.
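To make the update rule concrete, here is a minimal numerical sketch of one prediction and one update step. The values of w, b, eta, x_i, and y_i are toy numbers chosen for illustration, not taken from any dataset in this chapter:

python:

import numpy as np

# Toy values mirroring the symbols above (illustrative only)
w = np.array([0.0, 0.0])    # weight vector w
b = 0.0                     # bias term b
eta = 0.1                   # learning rate eta
x_i = np.array([1.0, 2.0])  # one input vector x_i
y_i = -1                    # its true label, in {-1, +1}

# Forward pass: sign(w . x_i + b), mapping a net input of 0 to +1
y_hat = 1 if np.dot(w, x_i) + b >= 0 else -1  # here y_hat = +1, a misclassification

# Update rules: w <- w + eta*(y_i - y_hat)*x_i and b <- b + eta*(y_i - y_hat)
w = w + eta * (y_i - y_hat) * x_i  # [-0.2, -0.4]
b = b + eta * (y_i - y_hat)        # -0.2

# After the update, this example is classified correctly
print(1 if np.dot(w, x_i) + b >= 0 else -1)  # -1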

2. Python Code Example

Let's illustrate the Perceptron learning algorithm with a Python example using the NumPy library. We'll train it to classify points in a 2D space into two classes.

python:

import numpy as np


class Perceptron:
    def __init__(self, learning_rate=0.01, n_iter=1000):
        self.learning_rate = learning_rate
        self.n_iter = n_iter

    def fit(self, X, y):
        # Initialize weights and bias to zero, then run n_iter epochs
        self.weights = np.zeros(X.shape[1])
        self.bias = 0.0
        self.errors_ = []
        for _ in range(self.n_iter):
            errors = 0
            for xi, target in zip(X, y):
                # The update is nonzero only when the sample is misclassified
                update = self.learning_rate * (target - self.predict(xi))
                self.weights += update * xi
                self.bias += update
                errors += int(update != 0.0)
            self.errors_.append(errors)  # misclassifications per epoch

    def predict(self, X):
        # Threshold the net input at zero: >= 0 maps to +1, otherwise -1
        return np.where(self.net_input(X) >= 0.0, 1, -1)

    def net_input(self, X):
        # w . x + b
        return np.dot(X, self.weights) + self.bias


# Example usage
if __name__ == "__main__":
    # Training data: XOR problem
    X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])
    y = np.array([-1, 1, 1, -1])  # Labels for XOR

    perceptron = Perceptron(learning_rate=0.1, n_iter=10)
    perceptron.fit(X, y)

    print("Weights:", perceptron.weights)
    print("Bias:", perceptron.bias)

    # Predictions
    predictions = perceptron.predict(X)
    print("Predictions:", predictions)

Explanation of the Code:

  • Initialization: We initialize the Perceptron with a learning rate and number of iterations.
  • Training (fit method): We initialize weights and bias, then update them based on the prediction errors.
  • Prediction (predict method): We use the trained weights and bias to predict labels for new data.
  • Example Usage: The XOR problem is used as a deliberately hard test case: a single Perceptron cannot solve XOR, because the two classes are not linearly separable, so some predictions will remain wrong no matter how long it trains. A contrasting run on linearly separable data follows below.
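To see the algorithm succeed, here is a minimal sketch reusing the Perceptron class defined above (the AND setup is an illustrative addition, assumed to run in the same script). The AND problem is linearly separable, so the Perceptron converges:

python:

# AND problem: only (1, 1) is positive, so a single line separates the classes
X_and = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])
y_and = np.array([-1, -1, -1, 1])

perceptron_and = Perceptron(learning_rate=0.1, n_iter=10)
perceptron_and.fit(X_and, y_and)

print("Predictions:", perceptron_and.predict(X_and))  # matches y_and
print("Errors per epoch:", perceptron_and.errors_)    # reaches 0 and stays there

Because the classes are linearly separable, the Perceptron convergence theorem guarantees that the number of misclassifications drops to zero after finitely many updates, which the errors_ list makes visible.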

Conclusion

The Perceptron algorithm is a foundational technique in machine learning, providing insight into linear classification and the learning process. Despite its simplicity, it lays the groundwork for more advanced models and neural networks. Understanding its mathematical formulation and implementation helps build a strong foundation for studying more complex algorithms.
