Machine Learning (Chapter 9): Multivariate Regression

Introduction

Multivariate regression is an extension of simple linear regression that deals with multiple independent variables. It aims to model the relationship between two or more features and a dependent variable. This chapter explores the concept, mathematical formulation, and practical implementation of multivariate regression.

Mathematical Formulation

In multivariate regression, we predict the dependent variable $y$ using multiple independent variables $x_1, x_2, \ldots, x_n$. The relationship can be described using the following linear equation:

$$y = \beta_0 + \beta_1 x_1 + \beta_2 x_2 + \cdots + \beta_n x_n + \epsilon$$

where:

  • $y$ is the dependent variable.
  • $\beta_0$ is the intercept.
  • $\beta_1, \beta_2, \ldots, \beta_n$ are the coefficients of the independent variables $x_1, x_2, \ldots, x_n$.
  • $\epsilon$ is the error term.

The goal is to estimate the coefficients $\beta_0, \beta_1, \beta_2, \ldots, \beta_n$ such that the sum of squared errors (residuals) between the predicted values and the actual values is minimized.
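As a quick numeric illustration of the equation above, the short sketch below evaluates the model for a single observation with two features. The coefficient and feature values are made up purely for illustration (they are not estimated from any data), and the error term $\epsilon$ is omitted:

python:
import numpy as np

# Made-up coefficients: intercept beta_0 and slopes beta_1, beta_2
beta_0 = 1.0
beta = np.array([2.0, 0.5])

# One observation with features x_1 = 3, x_2 = 4
x = np.array([3.0, 4.0])

# y_hat = beta_0 + beta_1*x_1 + beta_2*x_2
y_hat = beta_0 + beta @ x
print(y_hat)  # 1.0 + 2.0*3.0 + 0.5*4.0 = 9.0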

Cost Function

The cost function for multivariate regression is based on the Mean Squared Error (MSE); the extra factor of $\frac{1}{2}$ is included to simplify the gradient:

$$J(\beta) = \frac{1}{2m} \sum_{i=1}^{m} \left( h_\beta(x^{(i)}) - y^{(i)} \right)^2$$

where:

  • $m$ is the number of training examples.
  • $h_\beta(x^{(i)})$ is the hypothesis function (i.e., the predicted value) for the $i$-th training example.
  • $y^{(i)}$ is the actual value for the $i$-th training example.

The hypothesis function $h_\beta(x)$ in multivariate regression is:

$$h_\beta(x) = \beta_0 + \beta_1 x_1 + \beta_2 x_2 + \cdots + \beta_n x_n$$
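To make the cost concrete, here is a minimal NumPy sketch of $J(\beta)$. The function name compute_cost and the variable X_b are illustrative, and the sketch assumes the feature matrix has been augmented with a leading column of ones so that $\beta_0$ is treated like any other coefficient:

python:
import numpy as np

def compute_cost(X_b, y, beta):
    """J(beta) = 1/(2m) * sum of squared residuals.

    X_b  : (m, n+1) matrix whose first column is all ones (for beta_0)
    y    : (m,) vector of actual values
    beta : (n+1,) vector [beta_0, beta_1, ..., beta_n]
    """
    m = len(y)
    residuals = X_b @ beta - y  # h_beta(x^(i)) - y^(i) for every example
    return residuals @ residuals / (2 * m)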

Gradient Descent

To minimize the cost function, we use gradient descent. The update rule for the coefficients is:

$$\beta_j := \beta_j - \alpha \frac{\partial J(\beta)}{\partial \beta_j}$$

where $\alpha$ is the learning rate, and $\frac{\partial J(\beta)}{\partial \beta_j}$ is the partial derivative of the cost function with respect to $\beta_j$:

$$\frac{\partial J(\beta)}{\partial \beta_j} = \frac{1}{m} \sum_{i=1}^{m} \left( h_\beta(x^{(i)}) - y^{(i)} \right) x_j^{(i)}$$
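The update rule can be vectorized so that every coefficient is adjusted simultaneously. The sketch below (the names gradient_descent, alpha, and n_iters are illustrative) applies the rule for a fixed number of iterations, again assuming a leading column of ones in the feature matrix; the small data set is made up for demonstration:

python:
import numpy as np

def gradient_descent(X_b, y, alpha=0.01, n_iters=1000):
    """Minimize J(beta) with batch gradient descent."""
    m, n = X_b.shape
    beta = np.zeros(n)                            # start from all-zero coefficients
    for _ in range(n_iters):
        gradient = X_b.T @ (X_b @ beta - y) / m   # partial derivative for every beta_j
        beta = beta - alpha * gradient            # simultaneous update of all coefficients
    return beta

# Tiny usage example: prepend a column of ones so beta[0] plays the role of beta_0
X = np.array([[1.0, 2.0], [2.0, 3.0], [3.0, 4.0], [4.0, 5.0]])
y = np.array([3.0, 5.0, 7.0, 9.0])
X_b = np.hstack([np.ones((X.shape[0], 1)), X])
print(gradient_descent(X_b, y, alpha=0.05, n_iters=5000))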

Python Implementation

Below is a Python implementation of multivariate regression using scikit-learn and numpy.

python:
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error

# Example data
# Features: [x1, x2]
X = np.array([[1, 2], [2, 3], [3, 4], [4, 5]])

# Target variable
y = np.array([3, 5, 7, 9])

# Create and fit the model
model = LinearRegression()
model.fit(X, y)

# Predictions
y_pred = model.predict(X)

# Coefficients
intercept = model.intercept_
coefficients = model.coef_
print(f'Intercept: {intercept}')
print(f'Coefficients: {coefficients}')

# Model performance
mse = mean_squared_error(y, y_pred)
print(f'Mean Squared Error: {mse}')

Explanation

  1. Data Preparation: We define our feature matrix X and target vector y.
  2. Model Training: We create an instance of LinearRegression and fit it to our data.
  3. Predictions: We use the model to predict the target values for our input features; a short sketch of predicting on unseen inputs follows this list.
  4. Evaluation: We calculate the Mean Squared Error to assess the model’s performance.
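Once the model is fitted, it can also be applied to feature combinations it has not seen before. Continuing the snippet above, the values in X_new are made up for illustration:

python:
# Predict the target for two new, made-up feature pairs [x1, x2]
X_new = np.array([[5, 6], [6, 7]])
print(model.predict(X_new))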

Conclusion

Multivariate regression allows us to model complex relationships between multiple features and a target variable. By understanding the mathematical foundation and applying it through practical implementation, we can make accurate predictions and gain insights from our data.
