如何获取Pandas DataFrame的行计数？

如何获取panda数据帧df的行数？

当前回答

假设数据集是“data”，将数据集命名为“data_fr”，data_fr中的行数为“nu_rows”

#import the data frame. Extention could be different as csv,xlsx or etc.
data_fr = pd.read_csv('data.csv')

#print the number of rows
nu_rows = data_fr.shape[0]
print(nu_rows)

2021-01-02 23:04:44

其他回答

您也可以这样做：

假设df是您的数据帧。然后df.shape为您提供数据帧的形状，即（行，列）

因此，分配以下命令以获得所需的

 row = df.shape[0], col = df.shape[1]

2020-05-12 07:14:24

我从R背景来到Pandas，我发现Pandas在选择行或列时更加复杂。

我不得不与它搏斗一段时间，然后我找到了一些应对方法：

获取列数：

len(df.columns)
## Here:
# df is your data.frame
# df.columns returns a string. It contains column's titles of the df.
# Then, "len()" gets the length of it.

获取行数：

len(df.index) # It's similar.

2016-09-29 07:41:41

假设df是您的数据帧，那么：

count_row = df.shape[0]  # Gives number of rows
count_col = df.shape[1]  # Gives number of columns

或者更简洁地说，

r, c = df.shape

2016-02-20 13:30:05

找出数据帧中行数的另一种方法是pandas.Index.size，我认为这是最可读的变体。

请注意，正如我对公认答案的评论，

疑似pandas.Index.size实际上比len（df.Index）更快，但在我的计算机上告诉的是相反的情况（每个循环大约慢150 ns）。

2020-02-24 15:14:22

…建立在Jan Philip Gehrcke的答案之上。

len（df）或len（df.index）比df.shape[0]更快的原因是：

看看代码。df.shape是一个@属性，它运行两次调用len的DataFrame方法。

df.shape??
Type:        property
String form: <property object at 0x1127b33c0>
Source:
# df.shape.fget
@property
def shape(self):
    """
    Return a tuple representing the dimensionality of the DataFrame.
    """
    return len(self.index), len(self.columns)

在len（df）的罩下

df.__len__??
Signature: df.__len__()
Source:
    def __len__(self):
        """Returns length of info axis, but here we use the index """
        return len(self.index)
File:      ~/miniconda2/lib/python2.7/site-packages/pandas/core/frame.py
Type:      instancemethod

len（df.index）将比len（df）稍快，因为它少了一个函数调用，但这总是比df.shape[0]快

2017-12-07 23:37:11

如何获取Pandas DataFrame的行计数？

推荐文章

最新文章

标签