How to check for NaN values

Question

float('nan') represents NaN (not a number). But how do I check for it?

For some history of NaN in Python, see PEP 754. python.org/dev/peps/pep-0754 — Craig McQueen
– Craig McQueen, Commented Jan 22, 2010 at 1:30
just for fun, NaN is a Number: isinstance(float("nan"), Number) ;-P — Michał Šrajer
– Michał Šrajer, Commented Feb 4, 2023 at 20:16

gimel · Accepted Answer · 2022-04-25 04:13:16Z

2202

Use math.isnan:

>>> import math >>> x = float('nan') >>> math.isnan(x) True

edited Apr 25, 2022 at 4:13

user3064538

answered Jun 3, 2009 at 13:24

gimel

86.9k10 gold badges80 silver badges104 bronze badges

Sign up to request clarification or add additional context in comments.

17 Comments

gimel Over a year ago

@charlie-parker : In Python3, math.isnan is still a part of the math module. docs.python.org/3/library/math.html#math.isnan . Use numpy.isnan if you wish, this answer is just a suggestion.

TMWP Over a year ago

is math.isnan preferred to np.isnan() ?

jei Aug 8 at 8:23

If your project don't use numpy yes!

petrpulc Over a year ago

@TMWP possibly... import numpy takes around 15 MB of RAM, whereas import math takes some 0,2 MB

user2357112 Over a year ago

@TMWP: If you're using NumPy, numpy.isnan is a superior choice, as it handles NumPy arrays. If you're not using NumPy, there's no benefit to taking a NumPy dependency and spending the time to load NumPy just for a NaN check (but if you're writing the kind of code that does NaN checks, it's likely you should be using NumPy).

Mike Over a year ago

@jungwook That actually doesn't work. Your expression is always false. That is, float('nan') == float('nan') returns False — which is a strange convention, but basically part of the definition of a NaN. The approach you want is actually the one posted by Chris Jester-Young, below.

|

C. K. Young · Accepted Answer · 2009-06-03 13:22:05Z

623

The usual way to test for a NaN is to see if it's equal to itself:

def isNaN(num): return num != num

answered Jun 3, 2009 at 13:22

C. K. Young

224k47 gold badges394 silver badges446 bronze badges

16 Comments

mavnn Over a year ago

Word of warning: quoting Bear's comment below "For people stuck with python <= 2.5. Nan != Nan did not work reliably. Used numpy instead." Having said that, I've not actually ever seen it fail.

djsadinoff Over a year ago

I'm sure that, given operator overloading, there are lots of ways I could confuse this function. go with math.isnan()

Gonzalo Over a year ago

Even though this works and, to a degree makes sense, I'm a human with principles and I hereby declare this as prohibited witchcraft. Please use math.isnan instead.

Tobias Geisler Over a year ago

If your input includes strings this is the correct answer. (@williamtorkington) np.isnan and math.isnan will both break in this case.

kevlarr Over a year ago

This answer is awful; it relies on nan being the only thing in the universe not equal to itself. AT THE VERY LEAST it should be return isinstance(num, float) and num != num. The overhead of verifying the type is better than the possibility of actually being wrong, which this can be.

|

mavnn · Accepted Answer · 2022-03-19 10:42:28Z

303

numpy.isnan(number) tells you if it's NaN or not.

edited Mar 19, 2022 at 10:42

user3064538

answered Jun 3, 2009 at 13:28

mavnn

9,4994 gold badges36 silver badges53 bronze badges

10 Comments

Michel Keijzers Over a year ago

Works in python version 2.7 too.

Jay Prall Over a year ago

numpy.all(numpy.isnan(data_list)) is also useful if you need to determine if all elements in the list are nan

sleblanc Over a year ago

No need for NumPy: all(map(math.isnan, [float("nan")]*5))

mavnn Over a year ago

When this answer was written 6 years ago, Python 2.5 was still in common use - and math.isnan was not part of the standard library. Now days I'm really hoping that's not the case in many places!

comte Over a year ago

note that np.isnan() doesn't handle decimal.Decimal type (as many numpy's function). math.isnan() does handle.

|

Michael M. · Accepted Answer · 2023-11-02 13:50:10Z

265

Here are three ways where you can test a variable is "NaN" or not.

import pandas as pd import numpy as np import math # For single variable all three libraries return single boolean x1 = float("nan") print(f"It's pd.isna: {pd.isna(x1)}") print(f"It's np.isnan: {np.isnan(x1)}}") print(f"It's math.isnan: {math.isnan(x1)}}")

Output:

It's pd.isna: True It's np.isnan: True It's math.isnan: True

edited Nov 2, 2023 at 13:50

Michael M.

11.2k11 gold badges22 silver badges46 bronze badges

answered Mar 3, 2019 at 8:38

M. Hamza Rajput

10.5k3 gold badges53 silver badges42 bronze badges

7 Comments

abhishake Over a year ago

pd.isna(value) saved a lot of troubles! working like a charm!

mah65 Over a year ago

pd.isnan() or pd.isna()? That is the question :D

jemand771 Over a year ago

version 3 of this answer was correct and well formatted. this one (now 7) is wrong again. rolled back as "dont want your edit" while the edits improved the answer, wtf.

Cam Over a year ago

side note I have found if not np.isnan(x): to be quite useful.

wisbucky Over a year ago

pd.isna('foo') is also the only one that can handle strings. np.isnan('foo') and math.isnan('foo') will result in TypeError exception.

|

wjandrea · Accepted Answer · 2024-05-17 17:59:41Z

64

Editor's note: The below timings are flawed, for example, they have not factored out name lookup time. See the comments.

It seems that checking if it's equal to itself (x != x) is the fastest.

import pandas as pd import numpy as np import math x = float('nan') %timeit x != x 44.8 ns ± 0.152 ns per loop (mean ± std. dev. of 7 runs, 10000000 loops each) %timeit math.isnan(x) 94.2 ns ± 0.955 ns per loop (mean ± std. dev. of 7 runs, 10000000 loops each) %timeit pd.isna(x) 281 ns ± 5.48 ns per loop (mean ± std. dev. of 7 runs, 1000000 loops each) %timeit np.isnan(x) 1.38 µs ± 15.7 ns per loop (mean ± std. dev. of 7 runs, 1000000 loops each)

edited May 17, 2024 at 17:59

wjandrea

33.8k10 gold badges69 silver badges105 bronze badges

answered Jun 3, 2020 at 11:40

Grzegorz

1,39313 silver badges13 bronze badges

6 Comments

npengra317 Over a year ago

It's worthwhile noting that this works even if infinities are in question. That is, if z = float('inf'), z != z evaluates to false.

matan h Over a year ago

in my computer z=float('inf') and then z==z give True. x=float('nan') and then x==x give False.

rvf Over a year ago

In most (if not all) cases, these speed differences will only be relevant, if repeated numerous times. Then you'll be using numpy or another tensor library, anyway.

wjandrea Over a year ago

This is a bad comparison. At this scale (nanoseconds) name and attribute lookup time are significant. If you use only local names, the difference between x != x and math.isnan(x) disappears; they're both about 35 ns on my system. You can use %timeit in cell mode to check: 1) %%timeit x = float('nan') <newline> x != x 2) %%timeit x = float('nan'); from math import isnan <newline> isnan(x)

MisterMiyagi Over a year ago

Careful: These timings only represent checking a pre-existing variable and do not generalise well. A function such as math.isnan will compete very differently when a function is actually required and x != x would need wrapping in a lambda. A numpy functionality such as numpy.isnan will compete very differently when applied to a numpy array where x != x would require iteration.

|

x0s · Accepted Answer · 2020-04-22 20:24:30Z

49

here is an answer working with:

NaN implementations respecting IEEE 754 standard
- ie: python's NaN: float('nan'), numpy.nan...
any other objects: string or whatever (does not raise exceptions if encountered)

A NaN implemented following the standard, is the only value for which the inequality comparison with itself should return True:

def is_nan(x): return (x != x)

And some examples:

import numpy as np values = [float('nan'), np.nan, 55, "string", lambda x : x] for value in values: print(f"{repr(value):<8} : {is_nan(value)}")

Output:

nan : True nan : True 55 : False 'string' : False <function <lambda> at 0x000000000927BF28> : False

edited Apr 22, 2020 at 20:24

answered May 24, 2017 at 9:40

x0s

1,90720 silver badges18 bronze badges

8 Comments

keithpjolley Over a year ago

The series I'm checking is strings with missing values are 'nans' (???) so this solution works where others failed.

user2357112 Over a year ago

numpy.nan is a regular Python float object, just like the kind returned by float('nan'). Most NaNs you encounter in NumPy will not be the numpy.nan object.

x0s Over a year ago

numpy.nan defines its NaN value on its own in the underlying library in C. It does not wrap python's NaN. But now, they both comply with IEEE 754 standard as they rely on C99 API.

x0s Over a year ago

@user2357112supportsMonica: Python and numpy NaN actually don't behave the same way: float('nan') is float('nan') (non-unique) and np.nan is np.nan (unique)

user2357112 Over a year ago

@x0s: That has nothing to do with NumPy. np.nan is a specific object, while each float('nan') call produces a new object. If you did nan = float('nan'), then you'd get nan is nan too. If you constructed an actual NumPy NaN with something like np.float64('nan'), then you'd get np.float64('nan') is not np.float64('nan') too.

|

DaveTheScientist · Accepted Answer · 2012-09-25 18:22:03Z

I actually just ran into this, but for me it was checking for nan, -inf, or inf. I just used

if float('-inf') < float(num) < float('inf'):

This is true for numbers, false for nan and both inf, and will raise an exception for things like strings or other types (which is probably a good thing). Also this does not require importing any libraries like math or numpy (numpy is so damn big it doubles the size of any compiled application).

math.isfinite was not introduced until Python 3.2, so given the answer from @DaveTheScientist was posted in 2012 it was not exactly "reinvent[ing] the wheel" - solution still stands for those working with Python 2.
This can be useful for people who need to check for NaN in a pd.eval expression. For example pd.eval(float('-inf') < float('nan') < float('inf')) will return False

Tomalak · Accepted Answer · 2009-06-03 13:24:51Z

29

math.isnan()

or compare the number to itself. NaN is always != NaN, otherwise (e.g. if it is a number) the comparison should succeed.

answered Jun 3, 2009 at 13:24

Tomalak

339k68 gold badges547 silver badges635 bronze badges

1 Comment

Bear Over a year ago

For people stuck with python <= 2.5. Nan != Nan did not work reliably. Used numpy instead.

Idok · Accepted Answer · 2012-07-04 20:15:28Z

28

Well I entered this post, because i've had some issues with the function:

math.isnan()

There are problem when you run this code:

a = "hello" math.isnan(a)

It raises exception. My solution for that is to make another check:

def is_nan(x): return isinstance(x, float) and math.isnan(x)

answered Jul 4, 2012 at 20:15

Idok

4,2925 gold badges23 silver badges18 bronze badges

5 Comments

Peter Hansen Over a year ago

It was probably downvoted because isnan() takes a float, not a string. There's nothing wrong with the function, and the problems are only in his attempted use of it. (For that particular use case his solution is valid, but it's not an answer to this question.)

Rob Over a year ago

Be careful with checking for types in this way. This will not work e.g. for numpy.float32 NaN's. Better to use a try/except construction: def is_nan(x): try: return math.isnan(x) except: return False

Brice M. Dempsey Over a year ago

NaN does not mean that a value is not a valid number. It is part of IEEE floating point representation to specify that a particular result is undefined. e.g. 0 / 0. Therefore asking if "hello" is nan is meaningless.

RAFIQ Over a year ago

this is better because NaN can land in any list of strings,ints or floats, so useful check

Cristian Garcia Over a year ago

I had to implement exactly this for handling string columns in pandas.

Josh Lee · Accepted Answer · 2010-01-26 09:10:53Z

Another method if you're stuck on <2.6, you don't have numpy, and you don't have IEEE 754 support:

def isNaN(x): return str(x) == str(1e400*0)

Erfan · Accepted Answer · 2021-06-29 08:09:10Z

Comparison pd.isna, math.isnan and np.isnan and their flexibility dealing with different type of objects.

The table below shows if the type of object can be checked with the given method:

 +------------+-----+---------+------+--------+------+ | Method | NaN | numeric | None | string | list | +------------+-----+---------+------+--------+------+ | pd.isna | yes | yes | yes | yes | yes | | math.isnan | yes | yes | no | no | no | | np.isnan | yes | yes | no | no | yes | <-- # will error on mixed type list +------------+-----+---------+------+--------+------+

`pd.isna`

The most flexible method to check for different types of missing values.

None of the answers cover the flexibility of pd.isna. While math.isnan and np.isnan will return True for NaN values, you cannot check for different type of objects like None or strings. Both methods will return an error, so checking a list with mixed types will be cumbersom. This while pd.isna is flexible and will return the correct boolean for different kind of types:

In [1]: import pandas as pd In [2]: import numpy as np In [3]: missing_values = [3, None, np.NaN, pd.NA, pd.NaT, '10'] In [4]: pd.isna(missing_values) Out[4]: array([False, True, True, True, True, False])

This!!!! I came here trying to figure out how to check for both NaN and None, which depending on user input excel sheets I could get either. If it weren't for those pesky users this would be easy!
this answer is a bit missing on the performance of the three but it is by far the most complete answer so take my upvote, I guess you could add the option to see if it's equal to itself too to compare a fourth option but seems like a lot of work.

Mauro Bianchi · Accepted Answer · 2010-06-17 08:35:39Z

With python < 2.6 I ended up with

def isNaN(x): return str(float(x)).lower() == 'nan'

This works for me with python 2.5.1 on a Solaris 5.9 box and with python 2.6.5 on Ubuntu 10

This isn't too portable, as Windows sometimes calls this -1.#IND

Mahdi · Accepted Answer · 2016-07-06 15:41:17Z

7

I am receiving the data from a web-service that sends NaN as a string 'Nan'. But there could be other sorts of string in my data as well, so a simple float(value) could throw an exception. I used the following variant of the accepted answer:

def isnan(value): try: import math return math.isnan(float(value)) except: return False

Requirement:

isnan('hello') == False isnan('NaN') == True isnan(100) == False isnan(float('nan')) = True

edited Jul 6, 2016 at 15:41

answered Jun 23, 2016 at 8:22

Mahdi

1,9121 gold badge22 silver badges38 bronze badges

5 Comments

chwi Over a year ago

or try: int(value)

Mahdi Over a year ago

@chwi so what does your suggestion tell about value being NaN or not?

chwi Over a year ago

Well, being "not a number", anything that can not be casted to an int I guess is in fact not a number, and the try statement will fail? Try, return true, except return false.

Mahdi Over a year ago

@chwi Well, taking "not a number" literally, you are right, but that's not the point here. In fact, I am looking exactly for what the semantics of NaN is (like in python what you could get from float('inf') * 0), and thus although the string 'Hello' is not a number, but it is also not NaN because NaN is still a numeric value!

Harsha Biyani Over a year ago

@chwi: You are correct, if exception handling is for specific exception. But in this answer, generic exception have been handled. So no need to check int(value) For all exception, False will be written.

siberiawolf61 · Accepted Answer · 2016-12-07 06:49:13Z

All the methods to tell if the variable is NaN or None:

None type

In [1]: from numpy import math In [2]: a = None In [3]: not a Out[3]: True In [4]: len(a or ()) == 0 Out[4]: True In [5]: a == None Out[5]: True In [6]: a is None Out[6]: True In [7]: a != a Out[7]: False In [9]: math.isnan(a) Traceback (most recent call last): File "<ipython-input-9-6d4d8c26d370>", line 1, in <module> math.isnan(a) TypeError: a float is required In [10]: len(a) == 0 Traceback (most recent call last): File "<ipython-input-10-65b72372873e>", line 1, in <module> len(a) == 0 TypeError: object of type 'NoneType' has no len()

NaN type

In [11]: b = float('nan') In [12]: b Out[12]: nan In [13]: not b Out[13]: False In [14]: b != b Out[14]: True In [15]: math.isnan(b) Out[15]: True

Valentin Goikhman · Accepted Answer · 2020-01-13 19:02:29Z

In Python 3.6 checking on a string value x math.isnan(x) and np.isnan(x) raises an error. So I can't check if the given value is NaN or not if I don't know beforehand it's a number. The following seems to solve this issue

if str(x)=='nan' and type(x)!='str': print ('NaN') else: print ('non NaN')

petezurich · Accepted Answer · 2021-10-27 12:44:20Z

How to remove NaN (float) item(s) from a list of mixed data types

If you have mixed types in an iterable, here is a solution that does not use numpy:

from math import isnan Z = ['a','b', float('NaN'), 'd', float('1.1024')] [x for x in Z if not ( type(x) == float # let's drop all float values… and isnan(x) # … but only if they are nan )]

['a', 'b', 'd', 1.1024]

Short-circuit evaluation means that isnan will not be called on values that are not of type 'float', as False and (…) quickly evaluates to False without having to evaluate the right-hand side.

J11 · Accepted Answer · 2018-07-17 04:57:43Z

For nan of type float

>>> import pandas as pd >>> value = float(nan) >>> type(value) >>> <class 'float'> >>> pd.isnull(value) True >>> >>> value = 'nan' >>> type(value) >>> <class 'str'> >>> pd.isnull(value) False

Ed Greenberg · Accepted Answer · 2024-11-02 11:37:43Z

Here it is 2024 and I've been struggling with this. What I found, based on all the suggestions and comments above:

numpy returns nan for blank excel spreadsheet cells. nan is a float.

To test for this,

 if type(activity) == float and np.isnan(activity):

It also works fine with math.isnan().

You can't use isnan unless you first test for float since running either math.isnan or numpy.isnan on another type (like str) will throw an error.

cottontail · Accepted Answer · 2023-07-13 23:29:30Z

If you want to check for values that are not NaN, then negate whatever is used to flag NaNs; pandas has its own dedicated function for flagging non-NaN values.

lst = [1, 2, float('nan')] m1 = [e == e for e in lst] # [True, True, False] m2 = [not math.isnan(e) for e in lst] # [True, True, False] m3 = ~np.isnan(lst) # array([ True, True, False]) m4 = pd.notna(lst) # array([ True, True, False])

This is especially useful if you want to filter values that are not NaN. For ndarray/Series objects, == is vectorized, so it can be used as well.

s = pd.Series(lst) arr = np.array(lst) x = s[s.notna()] y = s[s==s] # `==` is vectorized z = arr[~np.isnan(arr)] # array([1., 2.]) assert (x == y).all() and (x == z).all()

Ram Prajapati · Accepted Answer · 2024-04-18 11:55:00Z

To filter out both empty strings (''), None and NaN values in the 'num_specimen_seen' column, we can use the pd.notna() function from pandas.

import pandas as pd import numpy as np df = pd.DataFrame({ 'num_specimen_seen': [10, 2, 1, '', 34, 'aw', np.NaN, 5, '43', np.nan, 'ed', None, ''] }) for idx, row in df.iterrows(): if pd.notna(row['num_specimen_seen']) and row['num_specimen_seen'] != '': print(idx, row['num_specimen_seen'])

This code will skip both NaN and empty strings in the 'num_specimen_seen' column when iterating over the DataFrame.

Max Kleiner · Accepted Answer · 2018-07-17 13:03:05Z

for strings in panda take pd.isnull:

if not pd.isnull(atext): for word in nltk.word_tokenize(atext):

the function as feature extraction for NLTK

def act_features(atext): features = {} if not pd.isnull(atext): for word in nltk.word_tokenize(atext): if word not in default_stopwords: features['cont({})'.format(word.lower())]=True return features

Collectives™ on Stack Overflow

How to check for NaN values

21 Answers 21

17 Comments

16 Comments

10 Comments

7 Comments

6 Comments

8 Comments

2 Comments

1 Comment

5 Comments

Comments

`pd.isna`

2 Comments

1 Comment

5 Comments

Comments

Comments

Comments

Comments

Comments

Comments

Comments

2 Comments

Linked

Hot Network Questions

Collectives™ on Stack Overflow

21 Answers 21

17 Comments

16 Comments

10 Comments

7 Comments

6 Comments

8 Comments

2 Comments

1 Comment

5 Comments

Comments

2 Comments

1 Comment

5 Comments

Comments

Comments

Comments

Comments

Comments

Comments

Comments

2 Comments

Linked

Related