Skip to main content

Questions tagged [binning]

Binning means grouping a continuous variable into discrete categories. It is particularly used in reference to histograms, but could also be used more generally in the sense of coarsening.

4 votes
1 answer
265 views

I have binned loss data where each bin is defined by: A minimum loss and maximum loss (the bin boundaries) A probability of occurrence for that bin The probabilities across all bins sum to 1. ...
Benjamin Acar's user avatar
2 votes
1 answer
76 views

Assuming a randomized experiment, with the randomization stratified on a discretized version of a continuous baseline covariate (e.g. age groups, cutoff of a clinical score). We know that ...
Martin Modrák's user avatar
2 votes
0 answers
53 views

I am trying to compare the counts obtained from linear binning and logarithmic binning for a given dataset X, but I am observing unexpected results in the logarithmic binning case. Linear Binning: if ...
an_idiot_noob's user avatar
0 votes
0 answers
39 views

This is a follow-up question to What is the benefit of breaking up a continuous predictor variable? Is binning of continuous data always bad for statistical tests? [duplicate] From the above ...
Ommo's user avatar
  • 454
11 votes
3 answers
654 views

As part of a collaboration, I've been asked to fit a model with a continuous response $Y$ and an ordinal predictor $X$ (levels 1 to 5). The dataset owner is after an answer that is inherently binary: ...
mkt's user avatar
  • 21.2k
9 votes
1 answer
172 views

We have been teaching sample size calculation for the comparison of two groups A and B. I was asked to provide a mathematical explanation of why it reduces power to "dichotomize" a ...
Mkanders's user avatar
  • 476
2 votes
1 answer
218 views

I'm interested in modelling the effect of shot noise on images. When taking a picture with a camera, the number of photons incident upon each pixel during the exposure time is (I believe) a ...
Yly's user avatar
  • 141
1 vote
1 answer
146 views

My problem is that I have data of sexual assertiveness (7 point Likert-scale, SAQ) and relationship satisfaction (5 point Likert-scale, RAS) but I would like to divide the sexual assertiveness ...
mse25's user avatar
  • 11
1 vote
0 answers
55 views

I have a somewhat theoretical question. I am trying to establish how closely scores across different language tests (IELTS, TOEFL, C1A, OET, DET) used in public domains match each other, given that ...
Amanda's user avatar
  • 11
0 votes
0 answers
151 views

In my company I've been noticing some binary classification modeling code that replaces bins of a continuous variable with the corresponding Weight of Evidence (WoE) of the given bin. As far as I ...
jglad's user avatar
  • 33
1 vote
1 answer
152 views

I'm trying to compare two populations with 40 samples each. For each sample, I have two measurements of angle, measured in bins of 30 degrees (1-12), and I calculated the difference between the two (e....
Anthony BH's user avatar
2 votes
1 answer
546 views

I am working on a project to determine the variables that better predict the binary outcome. I am using conditional random forest and permimp::permimp for ...
Kate's user avatar
  • 347
0 votes
0 answers
58 views

(Please note that this is all hypothetical at this point and the data specifics should not matter that much). Let's say I have a dataset where participants took a certain amount of time to complete a ...
grace.cutler's user avatar
1 vote
0 answers
36 views

It crossed my mind that when designing an experiment and you're not interested in NHST but full regression model where coefficients for treatment exposure and relevant covariates are desired, perhaps ...
jbuddy_13's user avatar
  • 3,970
1 vote
1 answer
182 views

Background I have two Empirical distributions, both derived from social media data. The first represents a broad sample of ~4.8 million posts and the number of followers each post author has. The ...
Connor's user avatar
  • 677

15 30 50 per page
1
2 3 4 5
18