如何添加一个新的列到现有的数据帧?

我有以下索引DataFrame命名列和行不连续的数字:

          a         b         c         d
2  0.671399  0.101208 -0.181532  0.241273
3  0.446172 -0.243316  0.051767  1.577318
5  0.614758  0.075793 -0.451460 -0.012493

我想添加一个新列，'e'，到现有的数据帧，并不想改变数据帧中的任何东西(即，新列始终具有与DataFrame相同的长度)。

0   -0.335485
1   -1.166658
2   -0.385571
dtype: float64

如何将列e添加到上面的例子中?

当前回答

如果你得到SettingWithCopyWarning，一个简单的解决方法是复制你想要添加列的数据帧。

df = df.copy()
df['col_name'] = values

2016-03-07 03:28:54

其他回答

在分配新列之前，如果已经索引了数据，则需要对索引进行排序。至少在我的情况下，我必须:

data.set_index(['index_column'], inplace=True)
"if index is unsorted, assignment of a new column will fail"        
data.sort_index(inplace = True)
data.loc['index_value1', 'column_y'] = np.random.randn(data.loc['index_value1', 'column_x'].shape[0])

2015-06-14 23:57:36

以下是我所做的…但我对熊猫和Python都很陌生，所以不能保证。

df = pd.DataFrame([[1, 2], [3, 4], [5,6]], columns=list('AB'))

newCol = [3,5,7]
newName = 'C'

values = np.insert(df.values,df.shape[1],newCol,axis=1)
header = df.columns.values.tolist()
header.append(newName)

df = pd.DataFrame(values,columns=header)

2015-10-06 01:18:52

直接通过NumPy这样做将是最有效的:

df1['e'] = np.random.randn(sLength)

注意我最初(非常老)的建议是使用map(这要慢得多):

df1['e'] = df1['a'].map(lambda x: np.random.random())

2012-09-23 19:22:27

让我补充一下，就像hum3一样，.loc没有解决SettingWithCopyWarning，我不得不求助于df.insert()。在我的例子中，假阳性是由“假”链索引dict['a']['e']生成的，其中'e'是新列，dict['a']是来自字典的数据框架。

还请注意，如果您知道自己在做什么，您可以使用切换警告 pd.options.mode。chained_assignment =无然后用这里给出的另一个解。

2015-10-22 14:21:45

如果你得到SettingWithCopyWarning，一个简单的解决方法是复制你想要添加列的数据帧。

df = df.copy()
df['col_name'] = values

2016-03-07 03:28:54

如何添加一个新的列到现有的数据帧?

推荐文章

最新文章

标签