如何检查NaN值？

float（nan'）表示nan（不是数字）。但我该如何检查呢？

测试NaN的通常方法是查看它是否等于自身：

def isNaN(num):
    return num != num

2009-06-03 13:22:05

使用math.isnan：

>>> import math
>>> x = float('nan')
>>> math.isnan(x)
True

2009-06-03 13:24:37

math.isnan（）

或将数字与自身进行比较。NaN总是！=NaN，否则（例如，如果是数字），比较应成功。

2009-06-03 13:24:51

numpy.isnan（数字）告诉你它是不是NaN。

2009-06-03 13:28:31

另一种方法是，如果你坚持低于2.6，你没有numpy，也没有IEEE 754支持：

def isNaN(x):
    return str(x) == str(1e400*0)

2010-01-26 09:10:53

当python＜2.6时

def isNaN(x):
    return str(float(x)).lower() == 'nan'

这适用于Solaris 5.9机箱上的python 2.5.1和Ubuntu 10上的python 2.6.5

2010-06-17 08:35:39

我进入这篇文章，因为我在功能方面遇到了一些问题：

math.isnan()

运行此代码时出现问题：

a = "hello"
math.isnan(a)

它引发了异常。我的解决方案是再做一次检查：

def is_nan(x):
    return isinstance(x, float) and math.isnan(x)

2012-07-04 20:15:28

事实上我刚碰到这个，但对我来说，它是在检查nan、-inf或inf

if float('-inf') < float(num) < float('inf'):

这对于数字是正确的，对于nan和inf都是错误的，对于字符串或其他类型（这可能是一件好事）会引发异常。此外，这不需要导入任何库，如math或numpy（numpy非常大，它的大小是任何编译应用程序的两倍）。

2012-09-25 18:22:03

我正在从一个web服务接收数据，该服务将NaN作为字符串“NaN”发送。但我的数据中也可能有其他类型的字符串，所以简单的float（value）可能会引发异常。我使用了接受答案的以下变体：

def isnan(value):
  try:
      import math
      return math.isnan(float(value))
  except:
      return False

要求：

isnan('hello') == False
isnan('NaN') == True
isnan(100) == False
isnan(float('nan')) = True

2016-06-23 08:22:33

判断变量是NaN还是None的所有方法：

无类型

In [1]: from numpy import math

In [2]: a = None
In [3]: not a
Out[3]: True

In [4]: len(a or ()) == 0
Out[4]: True

In [5]: a == None
Out[5]: True

In [6]: a is None
Out[6]: True

In [7]: a != a
Out[7]: False

In [9]: math.isnan(a)
Traceback (most recent call last):
  File "<ipython-input-9-6d4d8c26d370>", line 1, in <module>
    math.isnan(a)
TypeError: a float is required

In [10]: len(a) == 0
Traceback (most recent call last):
  File "<ipython-input-10-65b72372873e>", line 1, in <module>
    len(a) == 0
TypeError: object of type 'NoneType' has no len()

NaN类型

In [11]: b = float('nan')
In [12]: b
Out[12]: nan

In [13]: not b
Out[13]: False

In [14]: b != b
Out[14]: True

In [15]: math.isnan(b)
Out[15]: True

2016-12-07 06:49:13

下面是一个答案：

符合IEEE 754标准的NaN实现例如：python的NaN:float（NaN'），numpy.NaN。。。任何其他对象：string或其他任何对象（遇到异常时不会引发异常）

按照标准实现的NaN是唯一一个与自身的不平等比较应返回True的值：

def is_nan(x):
    return (x != x)

还有一些例子：

import numpy as np
values = [float('nan'), np.nan, 55, "string", lambda x : x]
for value in values:
    print(f"{repr(value):<8} : {is_nan(value)}")

输出：

nan      : True
nan      : True
55       : False
'string' : False
<function <lambda> at 0x000000000927BF28> : False

2017-05-24 09:40:52

用于浮球类型

>>> import pandas as pd
>>> value = float(nan)
>>> type(value)
>>> <class 'float'>
>>> pd.isnull(value)
True
>>>
>>> value = 'nan'
>>> type(value)
>>> <class 'str'>
>>> pd.isnull(value)
False

2018-07-17 04:57:43

对于panda中的字符串，请使用pd.isnull：

if not pd.isnull(atext):
  for word in nltk.word_tokenize(atext):

NLTK的特征提取功能

def act_features(atext):
features = {}
if not pd.isnull(atext):
  for word in nltk.word_tokenize(atext):
    if word not in default_stopwords:
      features['cont({})'.format(word.lower())]=True
return features

2018-07-17 13:03:05

如何从混合数据类型列表中删除NaN（float）项

如果在可迭代的中有混合类型，这里有一个不使用numpy的解决方案：

from math import isnan

Z = ['a','b', float('NaN'), 'd', float('1.1024')]

[x for x in Z if not (
                      type(x) == float # let's drop all float values…
                      and isnan(x) # … but only if they are nan
                      )]

['a', 'b', 'd', 1.1024]

短路求值意味着不会对非“float”类型的值调用isnan，因为False和（…）很快求值为False，而无需对右侧求值。

2019-01-28 06:53:40

这里有三种方法可以测试变量是否为“NaN”。

import pandas as pd
import numpy as np
import math

# For single variable all three libraries return single boolean
x1 = float("nan")

print(f"It's pd.isna: {pd.isna(x1)}")
print(f"It's np.isnan: {np.isnan(x1)}}")
print(f"It's math.isnan: {math.isnan(x1)}}")

输出

It's pd.isna: True
It's np.isnan: True
It's math.isnan: True

2019-03-03 08:38:35

在Python 3.6中，检查字符串值x math.isnan（x）和np.issan（x）会引发错误。所以我无法检查给定值是否为NaN，如果我事先不知道它是一个数字。以下内容似乎解决了这个问题

if str(x)=='nan' and type(x)!='str':
    print ('NaN')
else:
    print ('non NaN')

2020-01-13 19:02:29

似乎检查它是否等于自身（x！=x）是最快的。

import pandas as pd 
import numpy as np 
import math 

x = float('nan')

%timeit x != x
44.8 ns ± 0.152 ns per loop (mean ± std. dev. of 7 runs, 10000000 loops each)

%timeit math.isnan(x)
94.2 ns ± 0.955 ns per loop (mean ± std. dev. of 7 runs, 10000000 loops each)

%timeit pd.isna(x)
281 ns ± 5.48 ns per loop (mean ± std. dev. of 7 runs, 1000000 loops each)

%timeit np.isnan(x)
1.38 µs ± 15.7 ns per loop (mean ± std. dev. of 7 runs, 1000000 loops each)

2020-06-03 11:40:47

比较pd.isna、math.isnan和np.isnan及其处理不同类型对象的灵活性。

下表显示了是否可以使用给定方法检查对象类型：


+------------+-----+---------+------+--------+------+
|   Method   | NaN | numeric | None | string | list |
+------------+-----+---------+------+--------+------+
| pd.isna    | yes | yes     | yes  | yes    | yes  |
| math.isnan | yes | yes     | no   | no     | no   |
| np.isnan   | yes | yes     | no   | no     | yes  | <-- # will error on mixed type list
+------------+-----+---------+------+--------+------+

pd.isna文件

检查不同类型缺失值的最灵活方法。

所有答案都没有涵盖pd.isna的灵活性。虽然math.isnan和np.isnan将为NaN值返回True，但您无法检查None或字符串等不同类型的对象。这两个方法都会返回错误，因此检查混合类型的列表会很麻烦。而pd.isna是灵活的，它将为不同类型返回正确的布尔值：

In [1]: import pandas as pd

In [2]: import numpy as np

In [3]: missing_values = [3, None, np.NaN, pd.NA, pd.NaT, '10']

In [4]: pd.isna(missing_values)
Out[4]: array([False,  True,  True,  True,  True, False])

2021-06-28 19:13:38

如何检查NaN值？

推荐文章

最新文章

标签