Skip to main content

Questions tagged [zero-inflation]

Excessive 0's in a variable compared to a specified reference distribution. Regression approaches include zero-inflated models and hurdle (2-part) models. For count data, zero-inflated and hurdle models based on Poisson or negative binomial distributions are common (ZIP/ZINB and HP/HNB).

0 votes
0 answers
29 views

I know that in glmmTMB in R, using a hurdle model, it is possible to model the 0 vs non-0 part. For example, in my current model: ...
Bettina's user avatar
  • 107
0 votes
0 answers
44 views

I am currently working on the project where I need to assign customers across N recipes before AB testing such that KPIs for each customer are balanced across recipes (reduce pre-test bias) Dataset ...
Rishab's user avatar
  • 1
1 vote
0 answers
88 views

I used the R package glmmTMB to analyze a dataset using a binomial model and a hurdle model, then used the package ggeffects to generate predictions from both models. In glmmTMB, binomial models ...
Michaela's user avatar
  • 229
0 votes
0 answers
31 views

I'm running a hurdle model in JAGS but if randomly crashes for reasons I cannot determine. The model describes the presence absence of a species and, when the species is present, its biomass. It uses ...
Boussens-Dumon Grégoire's user avatar
3 votes
1 answer
81 views

I am trying to determine which variables affect the likelihood of lumpfish eating salmon lice and which variables predict the number of lice eaten. My data are highly zero-inflated, so I decided to ...
sloe's user avatar
  • 31
4 votes
1 answer
234 views

In a hurdle model, we have a Count part, which models the probability of a positive outcome. For instance, we can use a truncated Poisson regression (truncated because the raw Poisson regression amy ...
robertspierre's user avatar
1 vote
0 answers
72 views

I have a data set looking at the effect of 2 different treatments on shrub growth (percentage cover). This has been measured over 5 months and 3 species have established themselves. The data set of ...
Anna Roberts's user avatar
1 vote
0 answers
43 views

I'm tackling a real live problem, describe here as an analogy: problem description let's say I'm gold mining, where the problem setting is that I have to predict the gold volume given a location (on ...
kamashay's user avatar
4 votes
1 answer
77 views

Can glmmTMB can specify GPHMs? I read it can include "truncated_genpois" which I assume is a GPHM? for example, ...
Emma Neigel's user avatar
2 votes
1 answer
313 views

I am running a model to determine the probability of seedlings occurring in each sub-plot based on several responses. However, Dharma tests tell me my data is under-dispersed. I want to run a hurdle ...
Emma Neigel's user avatar
1 vote
0 answers
56 views

I have a multivariate dataset consisting of household level variables such as education, location of the household, occupation, income, consumption etc. The regressand variable is number of social ...
Abhi's user avatar
  • 11
3 votes
0 answers
64 views

I'm relatively new to data science and currently working on a project to group global cities based on exposure to various climate hazards. I've sourced climate data from GCMs participating in CMIP6 as ...
wobre's user avatar
  • 31
0 votes
0 answers
46 views

In a standard hurdle model[ [wikipedia]][1], the decision to participate would be associated to some strictly positive outcomes (eg positive count if Poisson) and non-participation would necessarily ...
Nicostat's user avatar
  • 113
0 votes
0 answers
64 views

I am working on a case-cohort (~ case-control, but putting all cases in the subcohort) study evaluating miRNA markers. The variables of interest are continuous quantitative measures of miRNA ...
Javier Hernando's user avatar

15 30 50 per page
1
2 3 4 5
44