[Q] Unbalanced binary outcome variable

Is there an issue for having an “unbalanced” sample size for a dependent outcome measure in a logistic regression?

For example, if I am measuring the likelihood for lung disease , and out of 2500 subjects, only 200 have lung disease (and 2,300 do not), can I still use this as an outcome variable?

Beyond logistic regression, does this unevenness matter for classification methods?

Thank you in advance

submitted by /u/ajr139
[link] [comments]

Published by

Nevin Manimala

Nevin Manimala is interested in blogging and finding new blogs https://nevinmanimala.com

Leave a Reply

Your email address will not be published. Required fields are marked *