NumPy数组不是JSON序列化的

在创建NumPy数组后，并将其保存为Django上下文变量，我在加载网页时收到以下错误:

array([   0,  239,  479,  717,  952, 1192, 1432, 1667], dtype=int64) is not JSON serializable

这是什么意思?

当前回答

下面是一个为我工作的实现，并删除了所有的nan(假设这些是简单的对象(list或dict)):

from numpy import isnan

def remove_nans(my_obj, val=None):
    if isinstance(my_obj, list):
        for i, item in enumerate(my_obj):
            if isinstance(item, list) or isinstance(item, dict):
                my_obj[i] = remove_nans(my_obj[i], val=val)

            else:
                try:
                    if isnan(item):
                        my_obj[i] = val
                except Exception:
                    pass

    elif isinstance(my_obj, dict):
        for key, item in my_obj.iteritems():
            if isinstance(item, list) or isinstance(item, dict):
                my_obj[key] = remove_nans(my_obj[key], val=val)

            else:
                try:
                    if isnan(item):
                        my_obj[key] = val
                except Exception:
                    pass

    return my_obj

2017-01-09 12:08:19

其他回答

我经常“jsonify”np.arrays。首先尝试在数组上使用".tolist()"方法，如下所示:

import numpy as np
import codecs, json 

a = np.arange(10).reshape(2,5) # a 2 by 5 array
b = a.tolist() # nested lists with same data, indices
file_path = "/path.json" ## your path variable
json.dump(b, codecs.open(file_path, 'w', encoding='utf-8'), 
          separators=(',', ':'), 
          sort_keys=True, 
          indent=4) ### this saves the array in .json format

为了“unjsonify”数组使用:

obj_text = codecs.open(file_path, 'r', encoding='utf-8').read()
b_new = json.loads(obj_text)
a_new = np.array(b_new)

2015-09-29 17:44:51

如果你在字典中嵌套了numpy数组，我发现了最好的解决方案:

import json
import numpy as np

class NumpyEncoder(json.JSONEncoder):
    """ Special json encoder for numpy types """
    def default(self, obj):
        if isinstance(obj, np.integer):
            return int(obj)
        elif isinstance(obj, np.floating):
            return float(obj)
        elif isinstance(obj, np.ndarray):
            return obj.tolist()
        return json.JSONEncoder.default(self, obj)

dumped = json.dumps(data, cls=NumpyEncoder)

with open(path, 'w') as f:
    json.dump(dumped, f)

多亏了这个家伙。

2018-04-05 16:28:16

这是一个不同的答案，但这可能有助于那些试图保存数据然后再次读取的人。有一种方法比泡菜更快更容易。我试图保存并在pickle dump中阅读它，但在阅读时有很多问题，浪费了一个小时，尽管我正在用自己的数据创建一个聊天机器人，但仍然没有找到解决方案。

Vec_x和vec_y是numpy数组:

data=[vec_x,vec_y]
hkl.dump( data, 'new_data_file.hkl' )

然后你只需读取它并执行以下操作:

data2 = hkl.load( 'new_data_file.hkl' )

2018-07-13 20:23:21

如果其他人的代码(例如模块)正在执行json.dumps()，其他答案将不起作用。这种情况经常发生，例如，web服务器自动将其返回响应转换为JSON，这意味着我们不能总是更改JSON .dump()的参数。这个答案解决了这个问题，并且基于一个(相对)新的解决方案，适用于任何第三方类(不仅仅是numpy)。

TLDR

PIP安装json_fix

import json_fix # import this anytime before the JSON.dumps gets called
import json

# create a converter
import numpy
json.fallback_table[numpy.ndarray] = lambda array: array.tolist()

# no additional arguments needed: 
json.dumps(
   dict(thing=10, nested_data=numpy.array((1,2,3)))
)
#>>> '{"thing": 10, "nested_data": [1, 2, 3]}'

2022-09-14 22:42:24

此外，还有一些关于Python中的列表与数组的非常有趣的信息~> Python列表与数组-何时使用?

可以注意到，一旦我在将数组保存到JSON文件中之前将其转换为列表，无论如何，在我现在的部署中，一旦我读取该JSON文件以供以后使用，我就可以继续以列表形式使用它(而不是将其转换回数组)。

在我看来，AND在屏幕上作为一个列表(逗号分隔)比数组(非逗号分隔)看起来更好。

使用上面的@travelingbones的.tolist()方法，我一直在使用这样的方法(捕捉一些我发现的错误):

保存字典

def writeDict(values, name):
    writeName = DIR+name+'.json'
    with open(writeName, "w") as outfile:
        json.dump(values, outfile)

读字典

def readDict(name):
    readName = DIR+name+'.json'
    try:
        with open(readName, "r") as infile:
            dictValues = json.load(infile)
            return(dictValues)
    except IOError as e:
        print(e)
        return('None')
    except ValueError as e:
        print(e)
        return('None')

希望这能有所帮助!

2016-03-18 22:03:59

NumPy数组不是JSON序列化的

推荐文章

最新文章

标签