Why We Cannot Use Linear Regression For Classification Task? (2024)

Why We Cannot Use Linear Regression For Classification Task? (1)

Report this article

Korvage Information Technology

Best software development company in UAE. Provides cost effective, deep domain expertise & result oriented IT solutions.

Published Mar 19, 2024

+ Follow

Linear regression is a statistical technique used to understand the relationship between two continuous variables by fitting a straight line to the data points. However, it's not suitable for classification tasks where the goal is to predict which category or class an observation belongs to.

One fundamental reason why linear regression isn't apt for classification tasks is its output nature. Linear regression predicts continuous values along a straight line, which makes sense for predicting quantities like house prices or temperature. But in classification, we're interested in categorical outcomes, like whether an email is spam or not, or whether a tumor is benign or malignant. These outcomes can't be accurately represented by a straight line.

Sign in

Stay updated on your professional world

By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.

New to LinkedIn? Join now

Insights from the community

Statistics How can you address underfitting in linear regression?
Artificial Intelligence How can you use Linear Regression to prioritize product features?
Predictive Modeling How do you incorporate exogenous variables and covariates in SVM models for time series forecasting?
Artificial Neural Networks How do you incorporate prior knowledge or constraints into GANs' loss functions?
Artificial Neural Networks How do you balance the trade-off between complexity and simplicity of your loss function?
Data Analytics How do you validate ensemble model performance?
Time Series Analysis What are some common sources of error and bias in time series forecasting?
Machine Learning You're on a tight schedule for model optimization. How do you decide which hyperparameters to tune first?
Critical Thinking How can you determine the transparency of a predictive model?
Artificial Neural Networks How do you handle outliers or noisy data when using loss functions for regression problems?

Others also viewed

How to Choose right ML model? Somesh Kumar Sahu 8mo
BxD Primer Series: Bayesian Model Averaging (BMA) Ensemble Mayank K. 1y
BxD Primer Series: Vector Autoregression Time Series Models Mayank K. 1y
BxD Primer Series: Stacking Ensemble Models Mayank K. 1y
Diffusion Models: A Comprehensive Overview Madan Agrawal 3w
BxD Primer Series: Logistic Regression Models Mayank K. 1y
The Impact of Anomaly Detection Algorithms on Material Test Reports Star Software 2mo
BxD Primer Series: Bagging Ensemble Models Mayank K. 1y
BxD Primer Series: Lasso Regression Models, L1 regularization in general and comparison with L2 regularization Mayank K. 1y
BxD Primer Series: Linear Discriminant Analysis (LDA) for Dimensionality Reduction Mayank K. 1y

Explore topics

Sales
Marketing
IT Services
Business Administration
HR Management
Engineering
Soft Skills
See All

FAQs

Why We Cannot Use Linear Regression For Classification Task? ›

There are two things that explain why Linear Regression is not suitable for classification. The first one is that Linear Regression deals with continuous values whereas classification problems mandate discrete values. The second problem is regarding the shift in threshold value when new data points are added.

Read On ›

Can you use linear regression for classification tasks by putting a threshold on its output? ›

You can, of course, but there are two problems with this: It is probably not reasonable to assume a linear relationship between the two variables. In particular, if it leads to negative (or >1) predictions for relevant values of the predictor, it is certainly not reasonable.

Discover More Details ›

Can we use regression models for classification? ›

Linear regression can be used for classification too. On the diabetes data, use the NominalToBinary filter to convert the two classes, which are nominal, to the numeric values 0 and 1, and apply linear regression. The result is a predicted number between 0 and 1 for each instance.

Why do we need to use classification instead of regression? ›

Regression algorithms solve regression problems such as house price predictions and weather predictions. Classification algorithms solve classification problems like identifying spam e-mails, spotting cancer cells, and speech recognition.

See Details ›

Why not to use linear regression? ›

Furthermore, the data must not include one or a few extreme values since these may create a false sense of relationship in the data even when none exists. If these assumptions are not met, the results of linear regression analysis may be misleading.

Find Out More ›

Why is linear regression not suitable for classification? ›

Tell Me More ›

What is the difference between the output of regression and classification task? ›

The most significant difference between regression vs classification is that while regression helps predict a continuous quantity, classification predicts discrete class labels.

Show Me More ›

Can we use linear regression for multi class classification? ›

For multiclass problems, multinomial logistic regression would typically be used rather than a combination of multiple regular logistic regression models. By analogy, one could instead use least squares linear regression with multiple outputs.

Explore More ›

Can you combine regression and classification? ›

Some prediction problems require predicting both numeric values and a class label for the same input. A simple approach is to develop both regression and classification predictive models on the same data and use the models sequentially.

Can linear regression be used for image classification? ›

Linear regression can be used for image recognition by treating each pixel as an independent variable and assigning a weight to it. The weight represents how much each pixel contributes to the label of the image. To make a prediction, the model sums up the weighted pixels and compares it to a threshold.

Show Me More ›

Why is logistic regression better than linear regression for classification? ›

Both methods are used for prediction purposes. Linear regression predicts a continuous outcome, while logistic regression predicts a categorical outcome, specifically the probability of the outcome belonging to a particular class.

Read The Full Story ›

What is the regression model Cannot be used for? ›

Regression cannot be used to identify non-linear relationships between two variables.

See Details ›

In which situation is classification better than regression? ›

Robustness to outliers: Classification models are typically more robust to outliers and data errors than regression models. This is because classification models only focus on the categorical relationship between the input and output variables rather than the exact numerical relationship.

Get More Info Here ›

What are the disadvantages of linear regression? ›

One of the main disadvantages of using linear regression for predictive analytics is that it is sensitive to outliers and noise. Outliers are data points that deviate significantly from the rest of the data, and noise is random variation or error in the data.

Why linear regression is wrong? ›

There are lots of reasons why linear regression may perform "so bad". A linear regression model may in fact be appropriate but there is a lot of noise in the data. In other words, the explanatory variables that you have simply don't explain enough of the variation in the response.

What are the major problems of linear regression? ›

Five problems that lie in the scope of this article are:

Non-Linearity of the response-predictor relationships.
Correlation of error terms.
A non-constant variance of the error term [Heteroscedasticity]
Collinearity.
Outliers and High Leverage Points.

Jan 2, 2021

View Details ›

What is a threshold linear regression? ›

Threshold regression models are a diverse set of non-regular regression models that all depend on change points or thresholds. They provide a simple but elegant and interpretable way to model certain kinds of nonlinear relationships between the outcome and a predictor.

In what situations is linear regression not a suitable method? ›

Answer: Linear regression is not suitable for classification because it predicts continuous outcomes rather than discrete classes.

Learn More ›

What is thresholding for classification? ›

The classification threshold in machine learning is a boundary or a cut-off point used to assign a specific predicted class for each object. You need to set this threshold when working with probabilistic machine learning models. These models do not assign the label directly.

Discover More Details ›

In what situations is linear regression a suitable method? ›

If you work for a business or organization, you can use linear regression to help them make informed decisions. Specifically, if you collect large data amounts, this process may assist with your analysis efforts because you can use it to transform raw data into purposeful data.

Show Me More ›

Why We Cannot Use Linear Regression For Classification Task? (2024)

Korvage Information Technology

Best software development company in UAE. Provides cost effective, deep domain expertise & result oriented IT solutions.

Recommended by LinkedIn

More articles by this author

Sign in

Insights from the community

Others also viewed

Explore topics

FAQs

Why We Cannot Use Linear Regression For Classification Task? ›

Why is logistic regression better than linear regression for classification? ›