Linear and logistic regression models: when to use and how to interpret them? (2024)

  • Journal List
  • J Bras Pneumol
  • v.48(6); 2022
  • PMC9747134

As a library, NLM provides access to scientific literature. Inclusion in an NLM database does not imply endorsem*nt of, or agreement with, the contents by NLM or the National Institutes of Health.
Learn more: PMC Disclaimer | PMC Copyright Notice

Linear and logistic regression models: when to use and how to interpret them? (1)

Jornal Brasileiro de Pneumologia

J Bras Pneumol. 2022; 48(6): e20220439.

Published online 2022 Nov 25. doi:10.36416/1806-3756/e20220439

Horacio Matias Castro1,2 and Juliana Carvalho Ferreira1,3

Author information Copyright and License information PMC Disclaimer

PRACTICAL SCENARIO

A secondary analysis1 of a study designated “Integrating Palliative and Critical Care,” a cluster randomized trial, was conducted to explore differences in receipt of elements of palliative care among patients who died in the ICU with interstitial lung disease (ILD) or COPD in comparison with those who died of cancer. The authors used two methods of multiple regression analysis: linear regression to estimate the impact of COPD and ILD, in comparison with that of cancer, on the length of ICU stay, and logistic regression to evaluate the effects of COPD and ILD on the presence or absence of elements of palliative care. All regression models were adjusted for confounders (age, sex, minority status, education level, among others) of the association between the patient diagnosis and palliative care outcomes.

INTRODUCTION

Linear and logistic regressions are widely used statistical methods to assess the association between variables in medical research. These methods estimate if there is an association between the independent variable (also called predictor, exposure, or risk factor) and the dependent variable (outcome).2

The association between two variables is evaluated with simple regression analysis. However, in many clinical scenarios, more than one independent variable may be associated with the outcome, and there may be the need to control for confounder variables. When more than two independent variables are associated with the outcome, multiple regression analysis is used. Multiple regression analysis evaluates the independent effect of each variable on the outcome, adjusting for the effect of the other variables included in the same regression model.

WHEN TO USE LINEAR OR LOGISTIC REGRESSION?

The determinant of the type of regression analysis to be used is the nature of the outcome variable. Linear regression is used for continuous outcome variables (e.g., days of hospitalization or FEV1), and logistic regression is used for categorical outcome variables, such as death. Independent variables can be continuous, categorical, or a mix of both.

In our example, the authors wanted to know if there was a relationship between cancer, COPD, and ILD (baseline disease; the independent variables) with two different outcomes. One outcome was continuous (length of ICU stay) and the other one was categorical (presence or absence of elements of palliative care). Therefore, two models were built: a linear model to examine the association between baseline disease (chronic pulmonary disease or cancer) and length of ICU stay, and a logistic regression analysis to examine the association between the baseline disease and being in receipt of elements of palliative care.

HOW TO INTERPRET RESULTS OF REGRESSION ANALYSIS?

Regression models are performed within statistical packages, and the output results include several parameters, which can be complex to interpret. Clinicians who are learning the basics of regression models should focus on the key parameters presented in Chart 1.

Chart 1

Most important parameters in regression analyses and their interpretations.

ParameterLinear regressionLogistic regression
Direction and strength of the association between the independent variable and the dependent variable (outcome)Beta coefficient:
Describes the (expected) average change in the outcome variable for each one-unit change in the independent variable for continuous variables, or the average change in the outcome variable for one category of the independent variable compared with a reference category for categorical variables
OR:
The OR for a continuous independent variable is interpreted as the change in the odds of the outcome occurring for every one-unit increase in the independent variable

The OR for categorical independent variables is interpreted as the increase or decrease in odds between two categories (e.g., men vs women)

OR = 1: no association; OR > 1: positive association or risk factor; and OR < 1: negative association or protective factor

Example (for a continuous independent variable)The expected increase in FEV1 for each centimeter increase in heightThe expected increase in the odds of death for each increase of one year of age among patients with sepsis
Example (for a categorical independent variable)The expected increase in FEV1 for men compared with women with the same height and ageThe expected increase in the odds of death for men compared with women among COVID-19 patients
Precision of the estimateThe 95% CI of the beta coefficientThe 95%CI of the OR
Statistical significanceThe p value (significant when < 0.05)The p value (significant when < 0.05)

Open in a separate window

In our example, the baseline disease-COPD, ILD, or cancer (the reference category)-is the independent variable, and length of ICU stay and receipt of palliative care elements are the outcomes of interest. In addition, the regression models also included other independent variables considered as potential confounders, such as age, sex, and minority status. In the linear regression model, the length of ICU stay for patients with ILD was longer than for those with cancer (β = 2.75; 95% CI, 0.52-4.98; p = 0.016), which means that, on average, having ILD increased the length of ICU stay in 2.75 days when compared with the length of ICU stay among cancer patients. In the logistic regression model, the authors found that patients with ILD, when compared with cancer patients, were less likely to have any documentation of their pain assessment in the last 24 h of life (OR = 0.43; 95% CI, 0.19-0.97; p = 0.042), which means that having ILD decreased the odds of documentation of pain assessment by more than half.

KEY POINTS

  • Linear and logistic regressions are important statistical methods for testing relationships between variables and quantifying the direction and strenght of the association.

  • Linear regression is used with continuous outcomes, and logistic regression is used with categorical outcomes.

  • These procedures require expertise in regression model building and typically require the assistance of a biostatistician.

REFERENCES

1. Brown CE, Engelberg RA, Nielsen EL, Curtis JR. Palliative Care for Patients Dying in the Intensive Care Unit with Chronic Lung Disease Compared with Metastatic Cancer. Ann Am Thorac Soc. 2016;13(5):684–689. doi:10.1513/AnnalsATS.201510-667OC. [PMC free article] [PubMed] [CrossRef] [Google Scholar]

2. Bzovsky S, Phillips MR, Guymer RH, Wykoff CC, Thabane L, Bhandari M. The clinician's guide to interpreting a regression analysis. Eye (Lond) 2022;36(9):1715–1717. doi:10.1038/s41433-022-01949-z. [PMC free article] [PubMed] [CrossRef] [Google Scholar]

Articles from Jornal Brasileiro de Pneumologia are provided here courtesy of Sociedade Brasileira de Pneumologia e Tisiologia (Brazilian Thoracic Society)

Linear and logistic regression models: when to use and how to interpret them? (2024)
Top Articles
Can I Use My Credit Card in Canada? | Capital One
Just Ask: Would wrapping credit cards in foil prevent theft of information?
$4,500,000 - 645 Matanzas CT, Fort Myers Beach, FL, 33931, William Raveis Real Estate, Mortgage, and Insurance
2018 Jeep Wrangler Unlimited All New for sale - Portland, OR - craigslist
Regal Amc Near Me
Moviesda Dubbed Tamil Movies
Decaying Brackenhide Blanket
Rainfall Map Oklahoma
How Quickly Do I Lose My Bike Fitness?
Craigslist Greenville Craigslist
Spelunking The Den Wow
Ap Chem Unit 8 Progress Check Mcq
Charmeck Arrest Inquiry
Fredericksburg Free Lance Star Obituaries
Operation Cleanup Schedule Fresno Ca
ARK: Survival Evolved Valguero Map Guide: Resource Locations, Bosses, & Dinos
Iu Spring Break 2024
Lehmann's Power Equipment
Lawson Uhs
Kirksey's Mortuary - Birmingham - Alabama - Funeral Homes | Tribute Archive
Naval Academy Baseball Roster
Caring Hearts For Canines Aberdeen Nc
Sister Souljah Net Worth
Naya Padkar Gujarati News Paper
Devotion Showtimes Near Regency Buenaventura 6
Workshops - Canadian Dam Association (CDA-ACB)
Kirk Franklin Mother Debra Jones Age
Mami No 1 Ott
Valley Craigslist
Gesichtspflege & Gesichtscreme
Craigs List Jax Fl
417-990-0201
Khatrimmaza
47 Orchid Varieties: Different Types of Orchids (With Pictures)
Jay Gould co*ck
Senior Houses For Sale Near Me
T&J Agnes Theaters
Linabelfiore Of
Bay Focus
Collier Urgent Care Park Shore
More News, Rumors and Opinions Tuesday PM 7-9-2024 — Dinar Recaps
Newsweek Wordle
Energy Management and Control System Expert (f/m/d) for Battery Storage Systems | StudySmarter - Talents
Best GoMovies Alternatives
18006548818
Free Crossword Puzzles | BestCrosswords.com
Lorton Transfer Station
Phone Store On 91St Brown Deer
Abigail Cordova Murder
Used Auto Parts in Houston 77013 | LKQ Pick Your Part
Ippa 番号
Latest Posts
Article information

Author: Pres. Carey Rath

Last Updated:

Views: 6011

Rating: 4 / 5 (41 voted)

Reviews: 80% of readers found this page helpful

Author information

Name: Pres. Carey Rath

Birthday: 1997-03-06

Address: 14955 Ledner Trail, East Rodrickfort, NE 85127-8369

Phone: +18682428114917

Job: National Technology Representative

Hobby: Sand art, Drama, Web surfing, Cycling, Brazilian jiu-jitsu, Leather crafting, Creative writing

Introduction: My name is Pres. Carey Rath, I am a faithful, funny, vast, joyous, lively, brave, glamorous person who loves writing and wants to share my knowledge and understanding with you.