如何解析CSV数据?

我可以在哪里找到一些JavaScript代码来解析CSV数据?

当前回答

就我个人而言，我喜欢使用deno std库，因为大多数模块都与浏览器正式兼容

问题是std是typescript，但官方解决方案可能会在未来发生https://github.com/denoland/deno_std/issues/641 https://github.com/denoland/dotland/issues/1728

目前有一个积极维护的飞行转译器https://bundle.deno.dev/

你可以像这样简单地使用它

<script type="module">
import { parse } from "https://bundle.deno.dev/https://deno.land/std@0.126.0/encoding/csv.ts"
console.log(await parse("a,b,c\n1,2,3"))
</script>

2023-02-02 20:48:06

其他回答

csvToArray v1.3

一个紧凑(645字节)，但兼容的函数，将CSV字符串转换为2D数组，符合RFC4180标准。

https://code.google.com/archive/p/csv-to-array/downloads

常用用法:jQuery

 $.ajax({
        url: "test.csv",
        dataType: 'text',
        cache: false
 }).done(function(csvAsString){
        csvAsArray=csvAsString.csvToArray();
 });

常用用法:JavaScript

csvAsArray = csvAsString.csvToArray();

覆盖字段分隔符

csvAsArray = csvAsString.csvToArray("|");

覆盖记录分离器

csvAsArray = csvAsString.csvToArray("", "#");

覆盖跳过报头

csvAsArray = csvAsString.csvToArray("", "", 1);

覆盖所有

csvAsArray = csvAsString.csvToArray("|", "#", 1);

2013-04-17 10:53:49

下面是一个极其简单的CSV解析器，它处理带有逗号、新行和转义双引号的引号字段。没有分裂或正则表达式。它每次扫描输入字符串1-2个字符，并构建一个数组。

在http://jsfiddle.net/vHKYH/上进行测试。

function parseCSV(str) {
    var arr = [];
    var quote = false;  // 'true' means we're inside a quoted field

    // Iterate over each character, keep track of current row and column (of the returned array)
    for (var row = 0, col = 0, c = 0; c < str.length; c++) {
        var cc = str[c], nc = str[c+1];        // Current character, next character
        arr[row] = arr[row] || [];             // Create a new row if necessary
        arr[row][col] = arr[row][col] || '';   // Create a new column (start with empty string) if necessary

        // If the current character is a quotation mark, and we're inside a
        // quoted field, and the next character is also a quotation mark,
        // add a quotation mark to the current column and skip the next character
        if (cc == '"' && quote && nc == '"') { arr[row][col] += cc; ++c; continue; }

        // If it's just one quotation mark, begin/end quoted field
        if (cc == '"') { quote = !quote; continue; }

        // If it's a comma and we're not in a quoted field, move on to the next column
        if (cc == ',' && !quote) { ++col; continue; }

        // If it's a newline (CRLF) and we're not in a quoted field, skip the next character
        // and move on to the next row and move to column 0 of that new row
        if (cc == '\r' && nc == '\n' && !quote) { ++row; col = 0; ++c; continue; }

        // If it's a newline (LF or CR) and we're not in a quoted field,
        // move on to the next row and move to column 0 of that new row
        if (cc == '\n' && !quote) { ++row; col = 0; continue; }
        if (cc == '\r' && !quote) { ++row; col = 0; continue; }

        // Otherwise, append the current character to the current column
        arr[row][col] += cc;
    }
    return arr;
}

2013-02-20 23:22:48

就我个人而言，我喜欢使用deno std库，因为大多数模块都与浏览器正式兼容

问题是std是typescript，但官方解决方案可能会在未来发生https://github.com/denoland/deno_std/issues/641 https://github.com/denoland/dotland/issues/1728

目前有一个积极维护的飞行转译器https://bundle.deno.dev/

你可以像这样简单地使用它

<script type="module">
import { parse } from "https://bundle.deno.dev/https://deno.land/std@0.126.0/encoding/csv.ts"
console.log(await parse("a,b,c\n1,2,3"))
</script>

2023-02-02 20:48:06

正则表达式拯救你!这几行代码根据RFC 4180标准处理带有嵌入逗号、引号和换行符的正确引用字段。

function parseCsv(data, fieldSep, newLine) {
    fieldSep = fieldSep || ',';
    newLine = newLine || '\n';
    var nSep = '\x1D';
    var qSep = '\x1E';
    var cSep = '\x1F';
    var nSepRe = new RegExp(nSep, 'g');
    var qSepRe = new RegExp(qSep, 'g');
    var cSepRe = new RegExp(cSep, 'g');
    var fieldRe = new RegExp('(?<=(^|[' + fieldSep + '\\n]))"(|[\\s\\S]+?(?<![^"]"))"(?=($|[' + fieldSep + '\\n]))', 'g');
    var grid = [];
    data.replace(/\r/g, '').replace(/\n+$/, '').replace(fieldRe, function(match, p1, p2) {
        return p2.replace(/\n/g, nSep).replace(/""/g, qSep).replace(/,/g, cSep);
    }).split(/\n/).forEach(function(line) {
        var row = line.split(fieldSep).map(function(cell) {
            return cell.replace(nSepRe, newLine).replace(qSepRe, '"').replace(cSepRe, ',');
        });
        grid.push(row);
    });
    return grid;
}

const csv = 'A1,B1,C1\n"A ""2""","B, 2","C\n2"';
const separator = ',';      // field separator, default: ','
const newline = ' <br /> '; // newline representation in case a field contains newlines, default: '\n' 
var grid = parseCsv(csv, separator, newline);
// expected: [ [ 'A1', 'B1', 'C1' ], [ 'A "2"', 'B, 2', 'C <br /> 2' ] ]

您不需要像lex/yacc这样的解析器-生成器。正则表达式可以正确地处理RFC 4180，这要归功于正向向后查找、反向向后查找和正向向前查找。

克隆/下载代码https://github.com/peterthoeny/parse-csv-js

2020-02-14 04:19:11

jQuery-CSV

它是一个jQuery插件，设计用于将CSV解析为JavaScript数据的端到端解决方案。它处理RFC 4180中提出的每一个边缘情况，以及一些Excel/谷歌电子表格导出中弹出的情况(即，大多数涉及空值)，这些都是规范所缺少的。

例子:

轨道,艺术家,专辑, 危险，“巴斯塔韵脚”，“当灾难袭来”，1997年

// Calling this
music = $.csv.toArrays(csv)

// Outputs...
[
  ["track", "artist", "album", "year"],
  ["Dangerous", "Busta Rhymes", "When Disaster Strikes", "1997"]
]

console.log(music[1][2]) // Outputs: 'When Disaster Strikes'

更新:

哦，是的，我还应该提一下，它是完全可配置的。

music = $.csv.toArrays(csv, {
  delimiter: "'", // Sets a custom value delimiter character
  separator: ';', // Sets a custom field separator character
});

更新2:

它现在也可以在Node.js上使用jQuery。因此，您可以选择使用相同的库进行客户端或服务器端解析。

更新3:

自从谷歌代码关闭后，jquery-csv已经迁移到GitHub。

免责声明:我也是jQuery-CSV的作者。

2012-04-24 01:24:18

如何解析CSV数据?

推荐文章

最新文章

标签