在Pandas中重命名列名

我想从

['$a', '$b', '$c', '$d', '$e']

['a', 'b', 'c', 'd', 'e']

当前回答

您可以使用str.slice：

df.columns = df.columns.str.slice(1)

2016-01-28 17:31:39

其他回答

df.rename(index=str, columns={'A':'a', 'B':'b'})

pandas.DataFrame.rename

2018-07-19 04:50:15

重命名特定列

使用df.reame（）函数并引用要重命名的列。并非所有列都必须重命名：

df = df.rename(columns={'oldName1': 'newName1', 'oldName2': 'newName2'})
# Or rename the existing DataFrame (rather than creating a copy) 
df.rename(columns={'oldName1': 'newName1', 'oldName2': 'newName2'}, inplace=True)

最小代码示例

df = pd.DataFrame('x', index=range(3), columns=list('abcde'))
df

   a  b  c  d  e
0  x  x  x  x  x
1  x  x  x  x  x
2  x  x  x  x  x

以下方法都可以工作并产生相同的输出：

df2 = df.rename({'a': 'X', 'b': 'Y'}, axis=1)  # new method
df2 = df.rename({'a': 'X', 'b': 'Y'}, axis='columns')
df2 = df.rename(columns={'a': 'X', 'b': 'Y'})  # old method  

df2

   X  Y  c  d  e
0  x  x  x  x  x
1  x  x  x  x  x
2  x  x  x  x  x

请记住将结果指定回，因为修改不在原位。或者，指定inplace=True：

df.rename({'a': 'X', 'b': 'Y'}, axis=1, inplace=True)
df

   X  Y  c  d  e
0  x  x  x  x  x
1  x  x  x  x  x
2  x  x  x  x  x

在v0.25中，如果指定了要重命名的无效列，还可以指定errors='raise'来引发错误。请参阅v0.25 rename（）文档。

重新分配列标题

使用df.set_axis（），axis=1，inplace=False（返回副本）。

df2 = df.set_axis(['V', 'W', 'X', 'Y', 'Z'], axis=1, inplace=False)
df2

   V  W  X  Y  Z
0  x  x  x  x  x
1  x  x  x  x  x
2  x  x  x  x  x

这将返回一个副本，但您可以通过设置inplace=True来修改DataFrame（这是<=0.24版本的默认行为，但将来可能会更改）。

您也可以直接分配标题：

df.columns = ['V', 'W', 'X', 'Y', 'Z']
df

   V  W  X  Y  Z
0  x  x  x  x  x
1  x  x  x  x  x
2  x  x  x  x  x

2012-07-06 01:48:15

由于您只想删除所有列名中的$符号，因此只需执行以下操作：

df = df.rename(columns=lambda x: x.replace('$', ''))

df.rename(columns=lambda x: x.replace('$', ''), inplace=True)

2014-03-26 10:20:45

列名与系列名称

我想解释一下幕后发生的事情。

数据帧是一组系列。

序列又是numpy.array的扩展。

numpy.arrays具有属性.name。

这是系列的名称。熊猫很少尊重这个属性，但它会在某些地方停留，可以用来攻击熊猫的一些行为。

命名列列表

这里有很多答案谈到df.columns属性是一个列表，而实际上它是一个系列。这意味着它具有.name属性。

如果您决定填写列的名称“系列：

df.columns = ['column_one', 'column_two']
df.columns.names = ['name of the list of columns']
df.index.names = ['name of the index']

name of the list of columns     column_one  column_two
name of the index
0                                    4           1
1                                    5           2
2                                    6           3

请注意，索引的名称总是低一列。

挥之不去的艺术事实

.name属性有时会持续存在。如果将df.columns设置为['one'，'two']，则df.one.name将为'one'。

如果您将df.one.name设置为'three'，则df.columns仍然会给您['one'，'two']，df.one.name会给您'three]。

BUT

pd.DataFrame（df.one）将返回

因为Pandas重用已经定义的Series的.name。

多级列名

Pandas有多种方法来实现多层列名。这里面没有太多魔法，但我想在我的回答中也包括这一点，因为我没有看到任何人在这里学习这一点。

    |one            |
    |one      |two  |
0   |  4      |  1  |
1   |  5      |  2  |
2   |  6      |  3  |

通过将列设置为列表，这很容易实现，如下所示：

df.columns = [['one', 'one'], ['one', 'two']]

2016-09-29 12:30:40

# This way it will work
import pandas as pd

# Define a dictionary 
rankings = {'test': ['a'],
        'odi': ['E'],
        't20': ['P']}

# Convert the dictionary into DataFrame
rankings_pd = pd.DataFrame(rankings)

# Before renaming the columns
print(rankings_pd)

rankings_pd.rename(columns = {'test':'TEST'}, inplace = True)

2021-07-14 02:09:30

在Pandas中重命名列名

推荐文章

最新文章

标签