文件中最长的一行

我正在寻找一种简单的方法来找到文件中最长行的长度。理想情况下，它应该是一个简单的bash shell命令，而不是脚本。

当前回答

我在Unix环境中，使用gzip压缩的文件，大小只有几gb。我使用一个记录长度为2052的2 GB gzip文件测试了以下命令。

zcat < gzipped文件- > |厕所

and

Zcat <gzip文件> | awk '{打印长度}' | sort -u

时间是平均的

117秒 109秒

这是我运行10次后的脚本。

START=$(date +%s) ## time of start

zcat $1 |  wc -L

END=$(date +%s) ## time of end
DIFF=$(( $END - $START ))
echo "It took $DIFF seconds"

START=$(date +%s) ## time of start

zcat $1 |  awk '{print length}' | sort -u

END=$(date +%s) ## time of end
DIFF=$(( $END - $START ))
echo "It took $DIFF seconds"

2017-04-20 21:51:34

其他回答

在perl中:

perl -ne 'print ($l = $_) if (length > length($l));' filename | tail -1

这只打印行，不打印它的长度。

2009-10-31 21:38:14

使用wc (GNU coreutils) 7.4:

wc -L filename

给:

101 filename

2010-07-07 03:33:06

如果您正在使用MacOS并得到以下错误: wc:非法选项——L你不需要安装GNU sipmly这样做。

如果你所要做的只是获取文件中最长行的字符数，并且使用OS X运行:

Awk '{打印长度}' "$file_name" | sort -rn | head -1

像这样的东西;

$file_name" $(awk '{打印长度}' "$file_name" | sort -rn | head -1)字符"

输出:

文件my_file中最长的一行有117个字符

2019-11-16 16:41:15

以上例子中被忽略的重要一点。

下面两个示例计算展开的选项卡

  wc -L  <"${SourceFile}" 
# or
  expand --tabs=8 "${SourceFile}" | awk '{ if (length($0) > max) {max = length($0)} } END { print max }'

以下2个计数为未展开的选项卡。

  expand --tabs=1 "${SourceFile}" | wc -L 
# or
  awk '{ if (length($0) > max) {max = length($0)} } END { print max }' "${SourceFile}"

              Expanded    nonexpanded
$'nn\tnn'       10            5

2012-05-01 17:10:47

以下是回答者的参考资料

cat filename | awk '{print length, $0}'|sort -nr|head -1

http://wtanaka.com/node/7719

2009-10-31 20:56:39

文件中最长的一行

推荐文章

最新文章

标签