
Questions tagged [categorical-data]

Categorical (also called nominal) data can take on a limited number of possible values, called categories. Categorical values "label"; they do not "measure". Please use the [ordinal-data] tag for discrete but ordered data types.

2 votes
1 answer
94 views

I’m trying to use the R poly() function with degree 1 to force glm to interpret a factor linearly. I’m puzzled by the fact that the size of the sample seems to increase the coefficient of the ...
Guillaume
1 vote
0 answers
23 views

I am trying to perform a correspondence analysis on a dataset of anatomical measurements of ecologically relevant features. Most of these variables are ordered factor variables representing binning of ...
user2352714
1 vote
0 answers
13 views

I'm trying to understand how three categorical variables affect several binary variables. I am roughly following these instructions. Here is what my data look like (not my real data): Binary answers ...
Hapless ankylosaur
2 votes
1 answer
314 views

I've been following the method illustrated here: Polynomial contrasts for regression, to transform the .L, .Q, .C, etc. results of a glm ordinal factor regression into the values for each of the levels ...
Guillaume
0 votes
0 answers
102 views

I'm struggling to understand the linearity assumption when running OLS with a continuous dependent variable and categorical independent variables that have been mean-encoded (simple group mean per category). ...
0 votes
0 answers
69 views

I am new to testing for differences with the chi-square test. When I make a contingency table for a chi-square test (classical and Bayesian tests), I get some phenomena that they would be ...
Student coding
2 votes
1 answer
59 views

I am trying to analyze some survey data in R but I am a bit confused about how to run the right type of analysis. In the survey of college students, the participants were put in a hypothetical ...
Alex Fischer
0 votes
0 answers
64 views

I have individual-level data with a performance measure (good/bad) and characteristic variables for the individual (e.g. gender). I usually analyse this using a chi-squared test to see if the ...
Rob Green
4 votes
1 answer
151 views

Problem in brief: I would like to generate several samples of iid categorical data. The standard approach does not work because the potential number of categories is large, and I do not want to impose ...
g g
  • 2,954
0 votes
0 answers
68 views

The free statistics package "JASP" has a data library that illustrates various tests and operations. One of them is Factor Analysis. They use the data from Spearman's 1904 "General ...
David
  • 11
0 votes
0 answers
47 views

I am running a GLM (Gaussian Family; Identity link) on some medical data. I intend to find out if the level of disease severity has any effect on task performance. A minimum reproducible example (...
AvadaMouse
6 votes
2 answers
163 views

I’ve had a reviewer suggest that I use ethnicity as a covariate in a linear regression. Some ethnic groups in the sample are small enough that I am a little worried that I will overfit if I do this. ...
Mohan
  • 1,091
2 votes
1 answer
124 views

I am trying to do a GLM with a dataset. My dataset consists of days individuals go on a social outing, and whether the outing was "better than average" (subjective). I have recorded the ...
MisterMonster314
9 votes
2 answers
291 views

I am attempting to analyse a dataset using a GLM. In this dataset I have two columns containing codes about individuals, and I am trying to infer whether an individual passes. For example: ...
MisterMonster314
4 votes
2 answers
256 views

I am working on a path analysis using lavaan(). One of my endogenous variables is an ordered factor; however, the difference between each group is not ...
Mike Thompson
