用熊猫绘制相关矩阵

我有一个具有大量特征的数据集，因此分析相关矩阵变得非常困难。我想绘制一个相关矩阵，我们使用dataframe.corr()函数从pandas库中获得。pandas库是否提供了任何内置函数来绘制这个矩阵?

当前回答

为了完整起见，截至2019年底，我所知道的seaborn最简单的解决方案，如果使用Jupyter:

import seaborn as sns
sns.heatmap(dataframe.corr())

2019-11-08 08:01:37

其他回答

你可以使用来自seaborn的heatmap()来查看b/w不同特征的相关性:

import matplot.pyplot as plt
import seaborn as sns

co_matrics=dataframe.corr()
plot.figure(figsize=(15,20))
sns.heatmap(co_matrix, square=True, cbar_kws={"shrink": .5})

2021-04-24 17:58:36

试试这个函数，它也会显示相关矩阵的变量名:

def plot_corr(df,size=10):
    """Function plots a graphical correlation matrix for each pair of columns in the dataframe.

    Input:
        df: pandas DataFrame
        size: vertical and horizontal size of the plot
    """

    corr = df.corr()
    fig, ax = plt.subplots(figsize=(size, size))
    ax.matshow(corr)
    plt.xticks(range(len(corr.columns)), corr.columns)
    plt.yticks(range(len(corr.columns)), corr.columns)

2015-07-13 13:10:12

如果你的dataframe是df，你可以简单地使用:

import matplotlib.pyplot as plt
import seaborn as sns

plt.figure(figsize=(15, 10))
sns.heatmap(df.corr(), annot=True)

2019-08-15 21:06:18

你可以使用matplotlib中的pyplot.matshow():

import matplotlib.pyplot as plt

plt.matshow(dataframe.corr())
plt.show()

编辑:

在评论中有一个关于如何更改轴勾标签的请求。这是一个豪华版，它画在一个更大的图形尺寸上，有轴标签来匹配数据框架，还有一个颜色条图例来解释颜色尺度。

我包括如何调整标签的大小和旋转，我正在使用一个图形比例，使颜色条和主要图形出来的高度相同。

编辑2: 由于df.corr()方法忽略非数值列，在定义x和y标签时应该使用.select_dtypes(['number'])，以避免不必要的标签移位(包括在下面的代码中)。

f = plt.figure(figsize=(19, 15))
plt.matshow(df.corr(), fignum=f.number)
plt.xticks(range(df.select_dtypes(['number']).shape[1]), df.select_dtypes(['number']).columns, fontsize=14, rotation=45)
plt.yticks(range(df.select_dtypes(['number']).shape[1]), df.select_dtypes(['number']).columns, fontsize=14)
cb = plt.colorbar()
cb.ax.tick_params(labelsize=14)
plt.title('Correlation Matrix', fontsize=16);

2015-04-03 13:04:18

我认为有很多好的答案，但我把这个答案添加给那些需要处理特定列和显示不同情节的人。

import numpy as np
import seaborn as sns
import pandas as pd
from matplotlib import pyplot as plt

rs = np.random.RandomState(0)
df = pd.DataFrame(rs.rand(18, 18))
df= df.iloc[: , [3,4,5,6,7,8,9,10,11,12,13,14,17]].copy()
corr = df.corr()
plt.figure(figsize=(11,8))
sns.heatmap(corr, cmap="Greens",annot=True)
plt.show()

2022-01-16 04:23:21

用熊猫绘制相关矩阵

推荐文章

最新文章

标签