将浮点数转换为整数在熊猫?

我一直在处理从CSV导入的数据。Pandas将一些列更改为浮点数，所以现在这些列中的数字显示为浮点数!但是，我需要将它们显示为整数或不带逗号。是否有方法将它们转换为整数或不显示逗号?

当前回答

>>> import pandas as pd
>>> right = pd.DataFrame({'C': [1.002, 2.003], 'D': [1.009, 4.55], 'key': ['K0', 'K1']})
>>> print(right)
           C      D key
    0  1.002  1.009  K0
    1  2.003  4.550  K1
>>> right['C'] = right.C.astype(int)
>>> print(right)
       C      D key
    0  1  1.009  K0
    1  2  4.550  K1

2017-05-23 03:51:03

其他回答

这是一个快速的解决方案，如果你想转换更多的列的熊猫。DataFrame从浮点数到整数也考虑到你可以有NaN值的情况。

cols = ['col_1', 'col_2', 'col_3', 'col_4']
for col in cols:
   df[col] = df[col].apply(lambda x: int(x) if x == x else "")

我尝试用else x)和else None)，但结果仍然有浮点数，所以我使用else ""。

2017-06-20 11:04:33

>>> import pandas as pd
>>> right = pd.DataFrame({'C': [1.002, 2.003], 'D': [1.009, 4.55], 'key': ['K0', 'K1']})
>>> print(right)
           C      D key
    0  1.002  1.009  K0
    1  2.003  4.550  K1
>>> right['C'] = right.C.astype(int)
>>> print(right)
       C      D key
    0  1  1.009  K0
    1  2  4.550  K1

2017-05-23 03:51:03

在问题的文本中解释了数据来自csv。Só，我认为显示选项，使转换时，数据读取，而不是之后，是相关的主题。

当在数据框架中导入电子表格或csv时，“只有整数列”通常会转换为浮点数，因为excel将所有数值存储为浮点数，以及底层库的工作方式。

当使用read_excel或read_csv读取文件时，有几个选项可以避免导入后转换:

参数dtype允许传递一个包含列名和目标类型的字典，例如dtype = {"my_column": "Int64"} 参数转换器可以用来传递进行转换的函数，例如用0改变NaN。转换= {"my_column": lambda x: int(x) if x else 0} parameter convert_float将“整型浮点数转换为int(即1.0 - > 1)”，但要注意像NaN这样的极端情况。该参数仅在read_excel中有效

要在现有的数据帧中进行转换，其他注释中已经给出了几种替代方法，但由于v1.0.0 pandas有一个有趣的函数:convert_dtypes，即“使用支持pd.NA的dtypes将列转换为最佳的dtypes”。

为例:

In [3]: import numpy as np                                                                                                                                                                                         

In [4]: import pandas as pd                                                                                                                                                                                        

In [5]: df = pd.DataFrame( 
   ...:     { 
   ...:         "a": pd.Series([1, 2, 3], dtype=np.dtype("int64")), 
   ...:         "b": pd.Series([1.0, 2.0, 3.0], dtype=np.dtype("float")), 
   ...:         "c": pd.Series([1.0, np.nan, 3.0]), 
   ...:         "d": pd.Series([1, np.nan, 3]), 
   ...:     } 
   ...: )                                                                                                                                                                                                          

In [6]: df                                                                                                                                                                                                         
Out[6]: 
   a    b    c    d
0  1  1.0  1.0  1.0
1  2  2.0  NaN  NaN
2  3  3.0  3.0  3.0

In [7]: df.dtypes                                                                                                                                                                                                  
Out[7]: 
a      int64
b    float64
c    float64
d    float64
dtype: object

In [8]: converted = df.convert_dtypes()                                                                                                                                                                            

In [9]: converted.dtypes                                                                                                                                                                                           
Out[9]: 
a    Int64
b    Int64
c    Int64
d    Int64
dtype: object

In [10]: converted                                                                                                                                                                                                 
Out[10]: 
   a  b     c     d
0  1  1     1     1
1  2  2  <NA>  <NA>
2  3  3     3     3

2021-07-01 16:59:47

需要转换为int的列也可以在字典中提到，如下所示

df = df.astype({'col1': 'int', 'col2': 'int', 'col3': 'int'})

2020-06-11 07:27:38

要修改浮点数输出，可以这样做:

df= pd.DataFrame(range(5), columns=['a'])
df.a = df.a.astype(float)
df

Out[33]:

          a
0 0.0000000
1 1.0000000
2 2.0000000
3 3.0000000
4 4.0000000

pd.options.display.float_format = '{:,.0f}'.format
df

Out[35]:

   a
0  0
1  1
2  2
3  3
4  4

2014-01-22 19:01:18

将浮点数转换为整数在熊猫?

推荐文章

最新文章

标签