Shell命令查找两个文件中的公共行

我确信我曾经发现过一个shell命令，它可以从两个或多个文件中打印公共行。它叫什么名字?

它比diff简单多了。

当前回答

在受限版本的Linux上(比如我正在开发的QNAP (NAS)):

Comm并不存在正如@ChristopherSchultz所说，grep -f -f file1 file2可能会导致一些问题，使用grep -f -f file1 file2真的很慢(超过5分钟-没有完成-在超过20mb的文件上使用下面的方法超过2-3秒)

这就是我所做的:

sort file1 > file1.sorted
sort file2 > file2.sorted

diff file1.sorted file2.sorted | grep "<" | sed 's/^< *//' > files.diff
diff file1.sorted files.diff | grep "<" | sed 's/^< *//' > files.same.sorted

如果files.same.sorted应该与原始文件的顺序相同，那么添加这一行与file1的顺序相同:

awk 'FNR==NR {a[$0]=$0; next}; $0 in a {print a[$0]}' files.same.sorted file1 > files.same

或者，对于与file2相同的顺序:

awk 'FNR==NR {a[$0]=$0; next}; $0 in a {print a[$0]}' files.same.sorted file2 > files.same

2016-03-20 09:05:42

其他回答

rm file3.txt

cat file1.out | while read line1
do
        cat file2.out | while read line2
        do
                if [[ $line1 == $line2 ]]; then
                        echo $line1 >>file3.out
                fi
        done
done

这个应该可以了。

2013-09-01 09:34:41

如果这两个文件还没有排序，你可以使用:

comm -12 <(sort a.txt) <(sort b.txt)

它将工作，避免错误消息comm: file 2不是有序的当执行comm -12 a.t xxb .txt时。

2017-07-21 11:14:14

而

fgrep -v -f 1.txt 2.txt > 3.txt

给出了两个文件的区别(在2.txt和不在1.txt中的文件)，你可以很容易地做一个

fgrep -f 1.txt 2.txt > 3.txt

收集所有公共行，这应该为您的问题提供一个简单的解决方案。如果你已经对文件进行了排序，你仍然应该使用通信。的问候!

注意:你可以用grep -F代替fgrep。

2015-01-20 17:21:16

awk 'NR==FNR{a[$1]++;next} a[$1] ' file1 file2

2016-08-14 10:16:56