Pandas equivalent of the R operator "%in%" [duplicate]

Question

What is the python equivalent of this in operator? I am trying to filter down a pandas database by having rows only remain if a column in the row has a value found in my list.

I tried using any() and am having immense difficulty with this.

That's beautiful, exactly what I was looking for. You know how hard it is to google "in" and special symbols. — wolfsatthedoor
– wolfsatthedoor, Commented Aug 8, 2014 at 15:01
I don't see the difficulty. Googling "pandas in operator" provides pandas.pydata.org/pandas-docs/stable/indexing.html as first hit and a text search of "in operator" on that page let's you immediately find what you are looking for. — K.-Michael Aye
– K.-Michael Aye, Commented Aug 8, 2014 at 21:21
I googled Python rather than pandas, I didn't know it was a Pandas specific thing. — wolfsatthedoor
– wolfsatthedoor, Commented Aug 8, 2014 at 21:28

Anoushiravan R · Accepted Answer · 2022-02-23 23:52:40Z

Pandas comparison with R docs are here.

s <- 0:4 s %in% c(2,4)

The isin method is similar to R %in% operator:

In [13]: s = pd.Series(np.arange(5),dtype=np.float32) In [14]: s.isin([2, 4]) Out[14]: 0 False 1 False 2 True 3 False 4 True dtype: bool

data_steve · Accepted Answer · 2016-07-27 15:52:23Z

FWIW: without having to call pandas, here's the answer using a for loop and list compression in pure python

x = [2, 3, 5] y = [1, 2, 3] # for loop for i in x: [].append(i in y) Out: [True, True, False] # list comprehension [i in y for i in x] Out: [True, True, False]

RiskyMaor · Accepted Answer · 2020-10-22 21:15:19Z

If you want to use only numpy without panads (like a use case I had) then you can:

import numpy as np x = np.array([1, 2, 3, 10]) y = np.array([10, 11, 2]) np.isin(y, x)

This is equivalent to:

c(10, 11, 2) %in% c(1, 2, 3, 10)

Note that the last line will work only for numpy >= 1.13.0, for older versions you'll need to use np.in1d.

stok · Accepted Answer · 2018-12-06 04:43:00Z

-2

As others indicate, in operator of base Python works well.

myList = ["a00", "b000", "c0"] "a00" in myList # True "a" in myList # False

answered Dec 6, 2018 at 4:43

stok

3754 silver badges9 bronze badges

1 Comment

russellpierce Over a year ago

But requires an atomic left side to yield results that match R's %in% in calling semantics. E.g. ["a00", "node", "c0"] in myList isn't what someone used to %in% is going to expect.

Collectives™ on Stack Overflow

Pandas equivalent of the R operator "%in%" [duplicate]

4 Answers 4

Comments

Comments

Comments

1 Comment

Linked

Hot Network Questions

Collectives™ on Stack Overflow

4 Answers 4

Comments

Comments

Comments

1 Comment

Linked

Related