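I have a very simple script that just outputs some CSV-formatted data, but it has to be UTF-8. When I open the file in TextEdit, TextMate, or Dreamweaver, the UTF-8 characters display correctly, but if I open it in Excel it shows garbage like íÄ instead. Here is what I have at the top of my document: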

header("content-type:application/csv;charset=UTF-8");
header("Content-Disposition:attachment;filename=\"CHS.csv\"");

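This all seems to have the desired effect, except that Excel (Mac, 2008) does not want to import it properly. Excel has no "open as UTF-8" option or anything like it, so... I'm getting a little annoyed.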

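I can't seem to find a clear solution anywhere, even though a lot of people have the same problem. The suggestion I see most often is to include the BOM, but I can't figure out how to do that. As you can see above, I am just echoing this data; I am not writing any file. I can do that if I need to, I just haven't because there doesn't seem to be a need for it at this point. Any help?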

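Update: I tried echoing the BOM as echo pack("CCC", 0xef, 0xbb, 0xbf); which I just pulled from a site that was trying to detect the BOM. But Excel just adds those three characters to the very first cell when it imports the file, and still messes up the special characters.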
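For reference, pack("CCC", 0xef, 0xbb, 0xbf) and the string literal "\xEF\xBB\xBF" produce the same three bytes, so either spelling sends an identical BOM. A minimal sketch to confirm this (the variable names are illustrative, not from the original script):

```php
<?php
// Build the UTF-8 BOM two ways and confirm they are byte-for-byte identical.
$bomFromPack    = pack("CCC", 0xef, 0xbb, 0xbf); // three unsigned chars
$bomFromLiteral = "\xEF\xBB\xBF";                 // hex escapes in a double-quoted string

var_dump($bomFromPack === $bomFromLiteral); // bool(true)
echo bin2hex($bomFromPack), "\n";           // efbbbf
```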


Current answer

Here is how I did it (this prompts the browser to download the CSV file):

header('Content-Description: File Transfer');
header('Content-Type: application/octet-stream');
header('Content-Disposition: attachment; filename=file.csv');
header('Content-Transfer-Encoding: binary');
header('Expires: 0');
header('Cache-Control: must-revalidate, post-check=0, pre-check=0');
header('Pragma: public');
echo "\xEF\xBB\xBF"; // UTF-8 BOM
echo $csv_file_content;
exit();

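The only catch: this fixed the UTF-8 encoding problem in the CSV preview you get when you hit the space bar on a Mac, but not in Excel Mac 2008. I don't know why.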

Other answers

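You can write the 3 BOM bytes at the start of the file before exporting; this works for me. Before I did that, the system only worked on Windows and HP-UX and failed on Linux.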

FileOutputStream fStream = new FileOutputStream( f );
final byte[] bom = new byte[] { (byte) 0xEF, (byte) 0xBB, (byte) 0xBF };
fStream.write( bom ); // write the UTF-8 BOM before any text goes through the writer
OutputStreamWriter writer = new OutputStreamWriter( fStream, "UTF-8" );

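Put a UTF-8 BOM (3 bytes, hex EF BB BF) at the start of the file. Otherwise Excel will interpret the data according to your locale's default encoding (e.g. cp1252) instead of UTF-8.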
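As a rough illustration of that advice in PHP, the sketch below prepends the BOM only when the data does not already start with it. The helper name with_utf8_bom and the sample data are hypothetical, and the input is assumed to be UTF-8 already.

```php
<?php
// Hypothetical helper: prepend the UTF-8 BOM unless the data already starts with it.
function with_utf8_bom(string $csv): string
{
    $bom = "\xEF\xBB\xBF";
    if (strncmp($csv, $bom, 3) === 0) {
        return $csv; // already has a BOM; avoid adding a second one
    }
    return $bom . $csv;
}

$csv = "name,city\nJos\u{E9},M\u{E1}laga\n"; // sample UTF-8 data (José, Málaga)
echo bin2hex(substr(with_utf8_bom($csv), 0, 3)), "\n"; // efbbbf
```

Checking for an existing BOM keeps the helper safe to call more than once on the same data.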

Generating CSV file for Excel, how to have a newline inside a value

I just dealt with the same problem, and came up with two solutions.

Use the PHPExcel class as suggested by bpeterson76. Using this class generates the most widely compatible file; I was able to generate a file from UTF-8 encoded data that opened fine in Excel 2008 Mac, Excel 2007 Windows, and Google Docs. The biggest problem with PHPExcel is that it is slow and uses a lot of memory, which isn't an issue for reasonably sized files, but if your Excel/CSV file has hundreds or thousands of rows, this library becomes unusable. Here is a PHP method that takes some TSV data and outputs an Excel file to the browser. Note that it uses the Excel5 writer, which means the file should be compatible with older versions of Excel, but I no longer have access to any, so I cannot test them.

    function excel_export($tsv_data, $filename) {
        $export_data = preg_split("/\n/", $tsv_data);
        foreach($export_data as &$row) {
            $row = preg_split("/\t/", $row);
        }
        unset($row); // break the reference so reusing $row below cannot overwrite the last data row

        include("includes/PHPExcel.php");
        include('includes/PHPExcel/Writer/Excel5.php');

        $objPHPExcel = new PHPExcel();
        $objPHPExcel->setActiveSheetIndex(0);
        $sheet = $objPHPExcel->getActiveSheet();

        $row = '1';
        $col = "A";
        foreach($export_data as $row_cells) {
            if(!is_array($row_cells)) { continue; }
            foreach($row_cells as $cell) {
                $sheet->setCellValue($col.$row, $cell);
                $col++;
            }
            $row += 1;
            $col = "A";
        }

        $objWriter = new PHPExcel_Writer_Excel5($objPHPExcel);
        header('Content-Type: application/vnd.ms-excel');
        header('Content-Disposition: attachment;filename="'.$filename.'.xls"');
        header('Cache-Control: max-age=0');
        $objWriter->save('php://output');
        exit;
    }

Because of the efficiency issues with PHPExcel, I also had to figure out how to generate a UTF-8 and Excel compatible CSV or TSV file. The best I could come up with was a file that was compatible with Excel 2008 Mac and Excel 2007 PC, but not Google Docs, which is good enough for my application. I found the solution here, specifically, this answer, but you should also read the accepted answer as it explains the problem.

Here is the PHP code I used. Note that I am using TSV data (tabs as delimiters instead of commas):

    header ( 'HTTP/1.1 200 OK' );
    header ( 'Date: ' . date ( 'D M j G:i:s T Y' ) );
    header ( 'Last-Modified: ' . date ( 'D M j G:i:s T Y' ) );
    header ( 'Content-Type: application/vnd.ms-excel') ;
    header ( 'Content-Disposition: attachment;filename=export.csv' );
    print chr(255) . chr(254) . mb_convert_encoding($tsv_data, 'UTF-16LE', 'UTF-8');
    exit;

I had the same problem, and solved it like this:

    header('Content-Encoding: UTF-8');
    header('Content-Type: text/csv; charset=utf-8' );
    header(sprintf( 'Content-Disposition: attachment; filename=my-csv-%s.csv', date( 'dmY-His' ) ) );
    header('Content-Transfer-Encoding: binary');
    header('Expires: 0');
    header('Cache-Control: must-revalidate, post-check=0, pre-check=0');
    header('Pragma: public');

    $df = fopen( 'php://output', 'w' );

    //This line is important:
    fputs( $df, "\xEF\xBB\xBF" ); // UTF-8 BOM !!!!!

    foreach ( $rows as $row ) {
        fputcsv( $df, $row );
    }
    fclose($df);
    exit();

To quote a Microsoft support engineer:

Excel for Mac does not currently support UTF-8

Update, 2017: This is true of all versions of Microsoft Excel for Mac before Office 2016. Newer versions (from Office 365) do now support UTF-8.

To output UTF-8 content that Excel on both Windows and OS X can read successfully, you will need to do two things:

Make sure that you convert your UTF-8 CSV text to UTF-16LE:

    mb_convert_encoding($csv, 'UTF-16LE', 'UTF-8');

Make sure that you add the UTF-16LE byte order mark to the start of the file:

    chr(255) . chr(254)
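Put together, the two steps might look like the sketch below. The function name to_excel_utf16le is hypothetical, the input is assumed to be valid UTF-8, and the mbstring extension must be enabled.

```php
<?php
// Hypothetical helper: convert UTF-8 CSV text to UTF-16LE and prefix the
// UTF-16LE byte order mark (FF FE). Requires the mbstring extension.
function to_excel_utf16le(string $csvUtf8): string
{
    $bom = chr(255) . chr(254);
    return $bom . mb_convert_encoding($csvUtf8, 'UTF-16LE', 'UTF-8');
}

// "\u{E9}" is the letter e with an acute accent, written as an escape so the
// example does not depend on how this source file is saved.
echo bin2hex(to_excel_utf16le("\u{E9},1\n")), "\n"; // fffee9002c0031000a00
```

Each character becomes two bytes in little-endian order, which is why the accented letter shows up as e9 00 after the ff fe mark.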

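The next problem appears only with Excel on OS X (not Windows): when you open a CSV file with comma-separated values, Excel does not split the rows into columns, and all of the text ends up in the first column, commas included.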

The way to avoid this is to use tabs as the value separator.

I used this function from the PHP comments (using a tab "\t" instead of a comma) and it worked perfectly in Excel on both OS X and Windows.

Note that to fix an issue with an empty column appearing at the end of each row, I had to change the line of code that says:

    $field_cnt = count($fields);

to

    $field_cnt = count($fields)-1;
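The function referred to above is not reproduced in this thread. As an alternative sketch only, assuming PHP's built-in fputcsv is an acceptable substitute for it, the tab delimiter and the UTF-16LE conversion can be combined as follows. The name rows_to_excel_tsv is hypothetical, and the mbstring extension is required.

```php
<?php
// Hypothetical helper: build tab-separated text with the built-in fputcsv,
// then convert it to UTF-16LE and prefix the UTF-16LE BOM.
function rows_to_excel_tsv(array $rows): string
{
    $fh = fopen('php://temp', 'r+'); // in-memory scratch stream
    foreach ($rows as $row) {
        // Tab delimiter instead of the default comma; enclosure and escape
        // are spelled out and match the long-standing defaults.
        fputcsv($fh, $row, "\t", '"', '\\');
    }
    rewind($fh);
    $tsv = stream_get_contents($fh);
    fclose($fh);

    return chr(255) . chr(254) . mb_convert_encoding($tsv, 'UTF-16LE', 'UTF-8');
}
```

After sending the download headers, the result could be sent with something like echo rows_to_excel_tsv($rows); where $rows is an array of arrays holding UTF-8 strings.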

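As some of the other comments on this page say, other spreadsheet applications such as OpenOffice Calc, Apple's own Numbers, and Google Docs' spreadsheet have no problems with comma-separated UTF-8 files.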

See the table in this question for what works and what doesn't for Unicode CSV files in Excel.


As a side note, if you are using Composer, you should have a look at adding League\Csv to your requirements. League\Csv has a really nice API for building CSV files.

To use League\Csv with this method of creating CSV files, check out this example

I'm on a Mac. In my case I just had to specify the separator with "sep=;\n" and encode the file in UTF-16LE, like this:

$data = "sep=;\n" .mb_convert_encoding($data, 'UTF-16LE', 'UTF-8');