Renaming column names in Pandas

I want to change the column labels from

['$a', '$b', '$c', '$d', '$e']

to

['a', 'b', 'c', 'd', 'e']

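Current answer

Many pandas functions have an inplace parameter. When it is set to True, the transformation is applied directly to the DataFrame you call it on. For example: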

import pandas as pd

df = pd.DataFrame({'$a':[1,2], '$b': [3,4]})
df.rename(columns={'$a': 'a'}, inplace=True)
df.columns

>>> Index(['a', '$b'], dtype='object')

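Alternatively, there are cases where you want to preserve the original DataFrame. I often see people in this situation when creating the DataFrame is an expensive task, for example when it requires querying a Snowflake database. In that case, just make sure the inplace parameter is set to False (this is also the default, so you can simply omit it).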

df = pd.DataFrame({'$a':[1,2], '$b': [3,4]})
df2 = df.rename(columns={'$a': 'a'}, inplace=False)
df.columns

>>> Index(['$a', '$b'], dtype='object')

df2.columns

>>> Index(['a', '$b'], dtype='object')
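One consequence of the inplace behaviour described above is worth a quick sketch: with inplace=True, rename modifies the DataFrame and returns None, so assigning the result to a variable does not give you a DataFrame. The variable name result below is only for illustration.

```python
import pandas as pd

df = pd.DataFrame({'$a': [1, 2], '$b': [3, 4]})

# With inplace=True, rename changes df itself and returns None,
# so the return value should not be assigned and reused as a DataFrame.
result = df.rename(columns={'$a': 'a'}, inplace=True)

print(result)            # None
print(list(df.columns))  # ['a', '$b']
```

If you want a chained or assigned result, leave inplace at its default and keep the returned DataFrame instead.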

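If these kinds of transformations are something you do often, you could also look into some of the pandas GUI tools. I'm the creator of one called Mito. It's a spreadsheet that automatically converts your edits to Python code.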

2021-06-15 00:38:13

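Other answers

Another way to replace the original column labels is to strip the unwanted characters (here '$') from them.

This could be done by running a for loop over df.columns and appending the stripped labels to a new list that is then assigned back to df.columns.

Instead, we can do this neatly in a single statement with a list comprehension: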

df.columns = [col.strip('$') for col in df.columns]

(The strip method in Python removes the given characters from both the beginning and the end of the string.)
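Because strip works on both ends of the string, it also removes a trailing '$'. A small sketch of the difference between strip and lstrip, using made-up labels (the third one, 'texa$', is a hypothetical label with a trailing '$'):

```python
# Hypothetical labels; the last one ends with '$' instead of starting with it.
labels = ['$a', '$b', 'texa$']

# strip removes '$' from both ends of each label.
stripped_both = [label.strip('$') for label in labels]

# lstrip removes '$' only from the beginning.
stripped_left = [label.lstrip('$') for label in labels]

print(stripped_both)  # ['a', 'b', 'texa']
print(stripped_left)  # ['a', 'b', 'texa$']
```

If only a leading '$' should go, lstrip is probably the safer choice.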

2015-11-23 13:56:10

If you already have a list of the new column names, you can try this:

new_cols = ['a', 'b', 'c', 'd', 'e']
new_names_map = {df.columns[i]:new_cols[i] for i in range(len(new_cols))}

df.rename(new_names_map, axis=1, inplace=True)
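The same mapping can be sketched with zip, plus a length check so that a list of the wrong size fails loudly instead of renaming only some columns. The sample frame and the error message are assumptions made for illustration.

```python
import pandas as pd

# Hypothetical sample frame standing in for the real data.
df = pd.DataFrame({'$a': [1], '$b': [2], '$c': [3]})
new_cols = ['a', 'b', 'c']

# Guard against a list that does not match the number of columns.
if len(new_cols) != len(df.columns):
    raise ValueError("expected one new name per existing column")

# Pair each existing label with its replacement, in order.
new_names_map = dict(zip(df.columns, new_cols))
df = df.rename(columns=new_names_map)

print(list(df.columns))  # ['a', 'b', 'c']
```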

2021-06-10 03:46:32

Renaming columns in Pandas is an easy task.

df.rename(columns={'$a': 'a', '$b': 'b', '$c': 'c', '$d': 'd', '$e': 'e'}, inplace=True)
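One thing to keep in mind with a dictionary mapping: by default, rename silently ignores keys that match no column, so a typo in an old name goes unnoticed. A minimal sketch, assuming a pandas version that supports the errors parameter of rename (the '$z' key is a deliberately wrong, made-up label):

```python
import pandas as pd

df = pd.DataFrame({'$a': [1, 2], '$b': [3, 4]})

# By default, a key that matches no column is ignored without any warning.
unchanged = df.rename(columns={'$z': 'z'})
print(list(unchanged.columns))  # ['$a', '$b']

# With errors='raise', the same unknown key raises a KeyError instead.
try:
    df.rename(columns={'$z': 'z'}, errors='raise')
    raised = False
except KeyError:
    raised = True

print(raised)  # True
```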

2020-05-08 12:34:49

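If you already have the DataFrame, df.columns gives you all the labels as something you can manipulate like a list and then reassign to the DataFrame as its column names...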

old_columns = df.columns
# Build the cleaned names, then map each old name to its new name
new_columns = [col.replace("$", "") for col in old_columns]
df.rename(columns=dict(zip(old_columns, new_columns)), inplace=True)
df.head() # To validate the output

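Best way? I don't know. A way? Yes.

A better way to evaluate all the main techniques put forward in the answers to this question is shown below, using cProfile to gauge execution time (cProfile reports call counts and timings, not memory use). @kadee, @kaitlyn, and @eumiro had the functions with the fastest execution times, although all of these functions are so fast that we are comparing values that round to 0.000 and 0.001 seconds. Moral: my answer above is probably not the "best" way.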

import pandas as pd
import cProfile

old_names = ['$a', '$b', '$c', '$d', '$e']
new_names = ['a', 'b', 'c', 'd', 'e']
col_dict = {'$a': 'a', '$b': 'b', '$c': 'c', '$d': 'd', '$e': 'e'}

df = pd.DataFrame({'$a':[1, 2], '$b': [10, 20], '$c': ['bleep', 'blorp'], '$d': [1, 2], '$e': ['texa$', '']})

df.head()

def eumiro(df, nn):
    df.columns = nn
    # This direct renaming approach is duplicated in methodology in several other answers:
    return df

def lexual1(df):
    return df.rename(columns=col_dict)

def lexual2(df, col_dict):
    return df.rename(columns=col_dict, inplace=True)

def Panda_Master_Hayden(df):
    return df.rename(columns=lambda x: x[1:], inplace=True)

def paulo1(df):
    return df.rename(columns=lambda x: x.replace('$', ''))

def paulo2(df):
    return df.rename(columns=lambda x: x.replace('$', ''), inplace=True)

def migloo(df, on, nn):
    return df.rename(columns=dict(zip(on, nn)), inplace=True)

def kadee(df):
    # regex=False so that '$' is treated as a literal character, not a regex anchor
    return df.columns.str.replace('$', '', regex=False)

def awo(df):
    old_columns = df.columns
    new_columns = [col.replace("$", "") for col in old_columns]
    return df.rename(columns=dict(zip(old_columns, new_columns)), inplace=True)

def kaitlyn(df):
    df.columns = [col.strip('$') for col in df.columns]
    return df

# Note: several of these functions modify df in place,
# so later runs see columns that have already been renamed.
print('eumiro')
cProfile.run('eumiro(df, new_names)')
print('lexual1')
cProfile.run('lexual1(df)')
print('lexual2')
cProfile.run('lexual2(df, col_dict)')
print('andy hayden')
cProfile.run('Panda_Master_Hayden(df)')
print('paulo1')
cProfile.run('paulo1(df)')
print('paulo2')
cProfile.run('paulo2(df)')
print('migloo')
cProfile.run('migloo(df, old_names, new_names)')
print('kadee')
cProfile.run('kadee(df)')
print('awo')
cProfile.run('awo(df)')
print('kaitlyn')
cProfile.run('kaitlyn(df)')
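Since single calls are too fast for cProfile to separate, one option is to repeat each call many times with timeit. The following is only a sketch under a few assumptions: the helper make_df, the three function names, and the repeat count of 200 are made up for illustration, and each timing includes the cost of building a fresh frame so that every call starts from the original '$' names.

```python
import timeit

import pandas as pd


def make_df():
    # Hypothetical small frame; rebuilt for every call so names always start with '$'.
    return pd.DataFrame({'$a': [1, 2], '$b': [10, 20], '$c': ['bleep', 'blorp']})


def assign_list(df):
    # Direct assignment of a full list of new names.
    df.columns = ['a', 'b', 'c']
    return df


def rename_lambda(df):
    # rename with a function applied to every label.
    return df.rename(columns=lambda x: x.replace('$', ''))


def strip_comprehension(df):
    # List comprehension that strips '$' from each label.
    df.columns = [col.strip('$') for col in df.columns]
    return df


REPEATS = 200  # assumed repeat count, chosen arbitrarily
timings = {}
for func in (assign_list, rename_lambda, strip_comprehension):
    # Total time for REPEATS calls, frame construction included.
    timings[func.__name__] = timeit.timeit(lambda: func(make_df()), number=REPEATS)

for name, seconds in sorted(timings.items(), key=lambda item: item[1]):
    print(f"{name}: {seconds / REPEATS * 1e6:.1f} microseconds per call")
```

The absolute numbers depend on the machine and the pandas version, so treat them as a relative comparison only.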

2015-09-01 02:24:17

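In addition to the solutions already provided, you can replace all the column names while reading the file. We can use names and header=0 to do that.

First, we create a list of the names that we want to use as our column names: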

import pandas as pd

ufo_cols = ['city', 'color reported', 'shape reported', 'state', 'time']

# header=0 marks the first row as the existing header, which names then replaces
ufo = pd.read_csv('link to the file you are using', names = ufo_cols, header = 0)

In this case, all the column names will be replaced with the names in your list.
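To try this without a real file, here is a minimal sketch that reads CSV text from memory. The CSV content and the names csv_text and new_cols are assumptions made for illustration.

```python
import io

import pandas as pd

# Hypothetical CSV content whose header row uses the '$' names.
csv_text = "$a,$b\n1,3\n2,4\n"
new_cols = ['a', 'b']

# header=0 tells read_csv that the first row is a header to discard,
# and names supplies the labels to use in its place.
df = pd.read_csv(io.StringIO(csv_text), names=new_cols, header=0)

print(list(df.columns))  # ['a', 'b']
print(len(df))           # 2
```

Without header=0, the original header row would be read as a data row.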

2020-03-08 15:43:28
