如何从Unix上的文本文件中提取预先确定的行范围?

我有一个~23000行的SQL转储，其中包含几个数据库的数据价值。我需要提取这个文件的某个部分(即单个数据库的数据)，并将其放在一个新文件中。我知道我想要的数据的开始行号和结束行号。

谁知道一个Unix命令(或一系列命令)可以从文件中提取16224到16482行之间的所有行，然后将它们重定向到一个新文件中?

当前回答

我一直在寻找这个问题的答案，但最终我不得不编写自己的代码。以上的答案都不令人满意。假设您有一个非常大的文件，并且有一些想要打印的行号，但这些行号不是按顺序排列的。您可以执行以下操作:

我的文件比较大对于{a..k};执行echo $letter;完成| cat -n > myfile.txt

我想要的具体行号: shuf -i 1-11 -n 4 > line_numbers_I_want.txt

要打印这些行号，请执行以下操作。 awk ` {system("head myfile.txt -n " $0 " | tail -n 1")} ` line_numbers_I_want.txt

上面所做的是头n行，然后采取最后一行使用尾巴

如果您希望行号按顺序排列，首先sort (is -n numeric sort)，然后获取行。

cat line_numbers_I_want.txt | sort -n | awk '{system("head myfile.txt -n " $0 " | tail -n 1")}'

2021-02-27 03:22:52

其他回答

我们甚至可以在命令行检查:

cat filename|sed 'n1,n2!d' > abc.txt

例如:

cat foo.pl|sed '100,200!d' > abc.txt

2014-02-05 06:41:52

perl -ne 'print if 16224..16482' file.txt > new_file.txt

2008-09-17 13:43:22

我想从一个使用变量的脚本中做同样的事情，并通过在$变量周围加上引号来分隔变量名和p来实现:

sed -n "$first","$count"p imagelist.txt >"$imageblock"

我想把一个列表分成不同的文件夹，找到最初的问题和答案，这是一个有用的步骤。(分裂命令不是旧操作系统上的选项，我必须将代码移植到)。

2017-10-28 09:35:10

我编写了一个小型bash脚本，您可以从命令行运行它，只要您更新PATH以包含它的目录(或者您可以将它放在PATH中已经包含的目录中)。

用法:$ pinch filename起始行结束行

#!/bin/bash
# Display line number ranges of a file to the terminal.
# Usage: $ pinch filename start-line end-line
# By Evan J. Coon

FILENAME=$1
START=$2
END=$3

ERROR="[PINCH ERROR]"

# Check that the number of arguments is 3
if [ $# -lt 3 ]; then
    echo "$ERROR Need three arguments: Filename Start-line End-line"
    exit 1
fi

# Check that the file exists.
if [ ! -f "$FILENAME" ]; then
    echo -e "$ERROR File does not exist. \n\t$FILENAME"
    exit 1
fi

# Check that start-line is not greater than end-line
if [ "$START" -gt "$END" ]; then
    echo -e "$ERROR Start line is greater than End line."
    exit 1
fi

# Check that start-line is positive.
if [ "$START" -lt 0 ]; then
    echo -e "$ERROR Start line is less than 0."
    exit 1
fi

# Check that end-line is positive.
if [ "$END" -lt 0 ]; then
    echo -e "$ERROR End line is less than 0."
    exit 1
fi

NUMOFLINES=$(wc -l < "$FILENAME")

# Check that end-line is not greater than the number of lines in the file.
if [ "$END" -gt "$NUMOFLINES" ]; then
    echo -e "$ERROR End line is greater than number of lines in file."
    exit 1
fi

# The distance from the end of the file to end-line
ENDDIFF=$(( NUMOFLINES - END ))

# For larger files, this will run more quickly. If the distance from the
# end of the file to the end-line is less than the distance from the
# start of the file to the start-line, then start pinching from the
# bottom as opposed to the top.
if [ "$START" -lt "$ENDDIFF" ]; then
    < "$FILENAME" head -n $END | tail -n +$START
else
    < "$FILENAME" tail -n +$START | head -n $(( END-START+1 ))
fi

# Success
exit 0

2014-12-10 17:06:47

使用ruby:

ruby -ne 'puts "#{$.}: #{$_}" if $. >= 32613500 && $. <= 32614500' < GND.rdf > GND.extract.rdf

2015-05-21 12:23:02

如何从Unix上的文本文件中提取预先确定的行范围?

推荐文章

最新文章

标签