Get column index from label in a data frame

Question

Say we have the following data frame:

> df A B C 1 1 2 3 2 4 5 6 3 7 8 9

We can select column 'B' from its index:

> df[,2] [1] 2 5 8

Is there a way to get the index (2) from the column label ('B')?

See @matthewdowle's answer here for the best solution: stackoverflow.com/a/9277935/636656 — Ari B. Friedman
– Ari B. Friedman, Commented Jul 14, 2014 at 16:20

Henrik · Accepted Answer · 2010-12-13 09:47:26Z

140

you can get the index via grep and colnames:

grep("B", colnames(df)) [1] 2

or use

grep("^B$", colnames(df)) [1] 2

to only get the columns called "B" without those who contain a B e.g. "ABC".

edited Dec 13, 2010 at 9:47

answered Dec 13, 2010 at 9:35

Henrik

14.5k10 gold badges71 silver badges92 bronze badges

Sign up to request clarification or add additional context in comments.

4 Comments

IRTFM Over a year ago

Your original example's advantages could be demonstrated in code if you showed its use in something like df[ , grep("^B", colnames(df)) ], i.e, returning the dataframe columns starting with "B". Feel free to use in a further edit if you agree.

IRTFM Over a year ago

Or even df[ , grep("^[BC]", colnames(df)) ], i.e., the columns that start with either B or C.

Henrik Over a year ago

@Dwin: As @aix already said, the asker wants the index. But I also usually use grep the way you describe it.

user989762 Over a year ago

@Henrik. Thank you so much. This must be the single most useful command to work with dplyr and variables!

NPE · Accepted Answer · 2012-04-27 09:59:34Z

110

The following will do it:

which(colnames(df)=="B")

edited Apr 27, 2012 at 9:59

answered Dec 13, 2010 at 9:40

NPE

503k114 gold badges970 silver badges1k bronze badges

6 Comments

Henrik Over a year ago

The problem with grep is also the advantage, namely that it uses regular expressions (so you can search for any pattern in your colnames). To just get the colnames "B" use "^B$" as the pattern in grep. ^ is the metacharacter for the beginning and $ for the end of a string.

nico Over a year ago

You don't even need which. You can directly use df[names(df)=="B"]

NPE Over a year ago

@nico The question is to get the index of the column.

Panos Kalatzantonakis Over a year ago

"Which" worked for me in every case. I couldn't get a column with the name "fBodyAcc-meanFreq()-Z" using grep.

Steve Over a year ago

@Kabamaru: Grep will work as long as you escape the metacharacters. For the example you gave, this will work: grep("^fBodyAcc-meanFreq\$)-Z$",colnames(df)) or also grep("^fBodyAcc-meanFreq\\(\$-Z$",colnames(df)).

|

chimeric · Accepted Answer · 2017-08-24 19:51:36Z

9

I wanted to see all the indices for the colnames because I needed to do a complicated column rearrangement, so I printed the colnames as a dataframe. The rownames are the indices.

as.data.frame(colnames(df)) 1 A 2 B 3 C

answered Aug 24, 2017 at 19:51

chimeric

9051 gold badge9 silver badges14 bronze badges

2 Comments

lillemets Over a year ago

A more concise way to do this is cbind(names(df)).

Gregor Thomas Over a year ago

@lillemets if brevity is your goal, t(t(names(df))) saves you 2 characters ;)

Grant Shannon · Accepted Answer · 2019-09-27 13:18:33Z

8

Following on from chimeric's answer above:

To get ALL the column indices in the df, so i used:

which(!names(df)%in%c())

or store in a list:

indexLst<-which(!names(df)%in%c())

edited Sep 27, 2019 at 13:18

answered Jun 29, 2018 at 8:52

Grant Shannon

5,1432 gold badges51 silver badges39 bronze badges

1 Comment

Dimitrios Zacharatos Over a year ago

i think this is the best answer because it can be generalized

Dan Tarr · Accepted Answer · 2018-06-01 20:53:56Z

This seems to be an efficient way to list vars with column number:

cbind(names(df))

Output:

 [,1] [1,] "A" [2,] "B" [3,] "C"

Sometimes I like to copy variables with position into my code so I use this function:

varnums<- function(x) {w=as.data.frame(c(1:length(colnames(x))), paste0('# ',colnames(x))) names(w)= c("# Var/Pos") w} varnums(df)

Output:

# Var/Pos # A 1 # B 2 # C 3

Vesanen · Accepted Answer · 2020-03-03 12:36:24Z

2

match("B", names(df))

Can work also if you have a vector of names.

edited Mar 3, 2020 at 12:36

Vesanen

4431 gold badge6 silver badges15 bronze badges

answered Dec 9, 2019 at 23:14

James Holland

1,16411 silver badges19 bronze badges

Comments

SentientProgram · Accepted Answer · 2021-10-16 00:16:34Z

To generalize @NPE's answer slightly:

which(colnames(dat) %in% var)

where var is of the form

c("colname1","colname2",...,"colnamen")

returns the indices of whichever column names one needs.

neves · Accepted Answer · 2018-11-28 13:42:21Z

Use t function:

t(colnames(df)) [,1] [,2] [,3] [,4] [,5] [,6] [1,] "var1" "var2" "var3" "var4" "var5" "var6"

Jimmy TwoCents · Accepted Answer · 2021-01-11 20:48:15Z

0

Here is an answer that will generalize Henrik's answer.

df=data.frame(A=rnorm(100), B=rnorm(100), C=rnorm(100)) numeric_columns<-c('A', 'B', 'C') numeric_index<-sapply(1:length(numeric_columns), function(i) grep(numeric_columns[i], colnames(df)))

answered Jan 11, 2021 at 20:48

Jimmy TwoCents

1651 gold badge1 silver badge9 bronze badges

2 Comments

Gregor Thomas Over a year ago

That sapply is a long way to write match(numeric_columns, names(df)) --- unless you really need the regex power rather than exact string matching.

Jimmy TwoCents Over a year ago

thanks @GregorThomas...not super familar with match. In this case it is quite a bit shorter, but I like the sapply because it's a little more explicit what is going on...to each their own i guess (havem't benchmarked any performance differences)

Martin Gal · Accepted Answer · 2022-04-08 23:10:19Z

#I wanted the column index instead of the column name. This line of code worked for me:

which (data.frame (colnames (datE)) == colnames (datE[c(1:15)]), arr.ind = T)[,1] #with datE being a regular dataframe with 15 columns (variables) data.frame(colnames(datE)) #> colnames.datE. #> 1 Ce #> 2 Eu #> 3 La #> 4 Pr #> 5 Nd #> 6 Sm #> 7 Gd #> 8 Tb #> 9 Dy #> 10 Ho #> 11 Er #> 12 Y #> 13 Tm #> 14 Yb #> 15 Lu which(data.frame(colnames(datE))==colnames(datE[c(1:15)]),arr.ind=T)[,1] #> [1] 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15

Collectives™ on Stack Overflow

Get column index from label in a data frame

10 Answers 10

4 Comments

6 Comments

2 Comments

1 Comment

Comments

Comments

Comments

Comments

2 Comments

Comments

Linked

Hot Network Questions

Collectives™ on Stack Overflow

10 Answers 10

4 Comments

6 Comments

2 Comments

1 Comment

Comments

Comments

Comments

Comments

2 Comments

Comments

Linked

Related