How do I check for NaN values?

float('nan') represents NaN (not a number). But how do I check for it?

Current answer

Checking whether the value is unequal to itself (x != x) appears to be the fastest approach.

import pandas as pd 
import numpy as np 
import math 

x = float('nan')

%timeit x != x
44.8 ns ± 0.152 ns per loop (mean ± std. dev. of 7 runs, 10000000 loops each)

%timeit math.isnan(x)
94.2 ns ± 0.955 ns per loop (mean ± std. dev. of 7 runs, 10000000 loops each)

%timeit pd.isna(x)
281 ns ± 5.48 ns per loop (mean ± std. dev. of 7 runs, 1000000 loops each)

%timeit np.isnan(x)
1.38 µs ± 15.7 ns per loop (mean ± std. dev. of 7 runs, 1000000 loops each)
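
The self-comparison trick works because NaN is the only float value that compares unequal to itself. The following is a minimal sketch of that idea, not part of the original answer; the helper name is_nan_by_self_compare and the sample list are made up for illustration.

```python
import math

def is_nan_by_self_compare(x):
    # Hypothetical helper: NaN is the only float value that is unequal to itself.
    return x != x

samples = [float('nan'), 0.0, -1.5, float('inf'), float('-inf')]

for value in samples:
    # Both checks should agree for every float in the list.
    assert is_nan_by_self_compare(value) == math.isnan(value)

print([is_nan_by_self_compare(v) for v in samples])
# [True, False, False, False, False]
```

One practical difference: x != x simply returns False for a string such as 'nan', whereas math.isnan('nan') raises TypeError.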

2020-06-03 11:40:47

Other answers

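For strings in pandas, use pd.isnull:

if not pd.isnull(atext):
    for word in nltk.word_tokenize(atext):
        ...  # process each token

As part of a feature extraction function for NLTK:

import nltk
import pandas as pd

def act_features(atext):
    # default_stopwords is assumed to be defined elsewhere (a collection of stop words)
    features = {}
    if not pd.isnull(atext):  # skip missing text (NaN/None)
        for word in nltk.word_tokenize(atext):
            if word not in default_stopwords:
                features['cont({})'.format(word.lower())] = True
    return features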

2018-07-17 13:03:05

numpy.isnan(number) tells you whether it is NaN.

2009-06-03 13:28:31

For float types:

>>> import pandas as pd
>>> value = float('nan')
>>> type(value)
<class 'float'>
>>> pd.isnull(value)
True
>>>
>>> value = 'nan'
>>> type(value)
<class 'str'>
>>> pd.isnull(value)
False

2018-07-17 04:57:43

How to remove NaN (float) items from a list of mixed data types

If the iterable contains mixed types, here is a solution that does not use numpy:

from math import isnan

Z = ['a','b', float('NaN'), 'd', float('1.1024')]

[x for x in Z if not (
                      type(x) == float # let's drop all float values…
                      and isnan(x) # … but only if they are nan
                      )]

['a', 'b', 'd', 1.1024]

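Short-circuit evaluation means that isnan is never called on values that are not of type float, because False and (…) evaluates to False immediately, without evaluating the right-hand side.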
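
The same filter can be wrapped in a reusable function. This is only a sketch: the name drop_nan is hypothetical, and it swaps the type(x) == float test for isinstance, which also matches subclasses of float.

```python
from math import isnan

def drop_nan(items):
    # Hypothetical helper: keep everything except float NaN values.
    # isinstance() is evaluated first, so isnan() never sees a non-float.
    return [x for x in items if not (isinstance(x, float) and isnan(x))]

mixed = ['a', 'b', float('nan'), 'd', 1.1024, None, 7]
print(drop_nan(mixed))
# ['a', 'b', 'd', 1.1024, None, 7]
```

Without the isinstance guard, isnan('a') would raise TypeError, so the order of the two conditions matters.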

2019-01-28 06:53:40

I actually just ran into this, but in my case I needed to check for nan, -inf, or inf:

if float('-inf') < float(num) < float('inf'):

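This is true for ordinary numbers and false for nan and for both infinities, and it raises an exception for things like non-numeric strings or other types (which may be a good thing). It also requires no imports of libraries such as math or numpy (numpy is so large that it doubles the size of any compiled application).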
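
A sketch of how the chained comparison might be packaged, assuming you would rather get False than an exception for unconvertible input. The function name is_finite_number and the choice to swallow TypeError and ValueError are illustrative assumptions, not part of the answer.

```python
def is_finite_number(num):
    # Hypothetical helper built on the chained comparison above.
    # Any comparison involving NaN is False, so NaN fails the check;
    # inf and -inf fail because the bounds are strict.
    try:
        return float('-inf') < float(num) < float('inf')
    except (TypeError, ValueError):
        # float() could not convert the input (e.g. 'abc' or None).
        return False

for candidate in [1.5, '3', float('nan'), float('inf'), float('-inf'), 'abc', None]:
    print(repr(candidate), is_finite_number(candidate))
```

Note that float() happily parses numeric strings such as '3', so those count as finite here. If importing math is acceptable, math.isfinite performs the same NaN and infinity check on a value that is already a number.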

2012-09-25 18:22:03
