linux – Grep multiple patterns from a file and output the first 5 matches for each pattern
Published: 2020-12-14 01:06:55  Category: Linux  Source: compiled from the web
I have a question about pattern matching.

I have a file containing multiple patterns, say pattern.txt:

    Locus3039v1rpkm6.85
    Locus3041v1rpkm6.84
    Locus3042v1rpkm6.84

The file to search is file.txt:

    Locus3039v1rpkm6.85 gi|350401309|ref|XM_003486067.1| 0 10 85 328 253 8e-12 78.8
    Locus3039v1rpkm6.85 gi|350401301|ref|XM_003486066.1| 0 10 85 566 491 8e-12 78.8
    Locus3039v1rpkm6.85 gi|350401298|ref|XM_003486065.1| 0 10 85 500 425 8e-12 78.8
    Locus3039v1rpkm6.85 gi|340723355|ref|XM_003400008.1| 0 10 106 566 470 3e-11 77.0
    Locus3039v1rpkm6.85 gi|340723353|ref|XM_003400007.1| 0 10 106 496 400 3e-11 77.0
    Locus3039v1rpkm6.85 gi|359323056|ref|XM_003639939.1| 0 27 104 322 245 9e-05 55.4
    Locus3039v1rpkm6.85 gi|359323055|ref|XM_543849.4| 0 27 104 241 164 9e-05 55.4
    Locus3039v1rpkm6.85 gi|354503991|ref|XM_003514015.1| 0 27 103 335 259 0.004 50.0
    Locus3039v1rpkm6.85 gi|341599927|emb|AM412059.2| 1 63 100 1645525 1645489 6.8 39.2
    Locus3039v1rpkm6.85 gi|340003223|emb|HE572590.1| 1 63 100 1671652 1671616 6.8 39.2
    Locus3041v1rpkm6.84 gi|337757426|emb|FQ859181.1| 1 61 114 2772617 2772667 0.60 42.8
    Locus3041v1rpkm6.84 gi|159889572|gb|CP000875.1| 0 5 40 1185295 1185330 0.60 42.8
    Locus3041v1rpkm6.84 gi|158107272|gb|CP000820.1| 0 2 34 5594193 5594161 0.60 42.8
    Locus3041v1rpkm6.84 gi|156844486|ref|XM_001645256.1| 83 140 793 850 0.60 42.8
    Locus3041v1rpkm6.84 gi|339305108|gb|CP001503.2| 0 58 94 3006529 3006565 2.1 41.0
    Locus3041v1rpkm6.84 gi|247533203|gb|CP001607.1| 0 1 40 1268073 1268034 2.1 41.0
    Locus3041v1rpkm6.84 gi|367050653|ref|XM_003655658.1| 0 75 103 843 871 7.3 39.2
    Locus3041v1rpkm6.84 gi|347002178|gb|CP003012.1| 0 75 103 2986236 2986208 7.3 39.2
    Locus3043v1rpkm6.84 gi|332015867|gb|HQ658110.1| 0 9 31 4151 4129 0.49 42.8
    Locus3043v1rpkm6.84 gi|254946573|gb|CP001619.1| 1 9 43 4243052 4243019 0.49 42.8
    Locus3043v1rpkm6.84 gi|329755665|gb|JF715057.1| 0 11 42 110968 110937 1.7 41.0
    Locus3043v1rpkm6.84 gi|9937515|gb|AF294752.1| 0 48 79 2081 2050 1.7 41.0

For each pattern, I want the first 5 hits, then move on to the first 5 hits of the next pattern, and so on.
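I tried

    grep -i -m 5 -f pattern.txt file.txt > out.txt
    grep -i -f pattern.txt -m 5 file.txt > out.txt

but I only got the first five hits for the first pattern, and then it stopped. Where did I go wrong? Is there an option that does what I need?

Solution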
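Some background on why the original command stops early: as far as I know, grep's -m NUM stops reading the file after NUM selected lines in total, no matter which of the -f patterns matched them, and the position of -m on the command line makes no difference. A small sketch on toy data (the A1/B2 names and the temporary files are made up for illustration, they are not the poster's data):

```shell
#!/bin/sh
# Toy data (hypothetical): two patterns, three hits each.
tmp=$(mktemp -d)
printf 'A1\nB2\n' > "$tmp/pattern.txt"
printf 'A1 x\nA1 y\nA1 z\nB2 x\nB2 y\nB2 z\n' > "$tmp/file.txt"

# -m 2 caps the total number of selected lines, whichever pattern matched,
# so grep stops after the first two lines and never reaches the B2 rows.
combined=$(grep -i -m 2 -f "$tmp/pattern.txt" "$tmp/file.txt")
printf '%s\n' "$combined"

rm -rf "$tmp"
```

Only the two A1 rows come back, which mirrors the "first pattern only" result described in the question.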
Try this:

    for pat in $(cat pattern.txt); do grep -i -m 5 "$pat" file.txt; done > out.txt

That is: for each pattern in pattern.txt, grep the first 5 matching records.

Edit

As @dogbane pointed out in the comments, this is a UUOC (useless use of cat). Here is my improved answer:

    for pat in $(< pattern.txt); do grep -i -m 5 "$pat" file.txt; done > out.txt

See also this answer.

(Editor: 李大同)
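The loop above can also be sketched in a slightly more defensive form. This is only a sketch under some assumptions: the helper names first_n_per_pattern and first_n_by_key are made up here, the patterns are treated as literal strings (hence -F, so the "." in the IDs is not a regex wildcard), and grep supports -m (GNU and BSD grep do; it is not part of POSIX). The second function is a single-pass awk variant that further assumes each pattern is exactly the first whitespace-separated field of a row.

```shell
#!/bin/sh
# Hypothetical helper: print at most N matching lines per pattern,
# one pattern after another, in the order the patterns are listed.
first_n_per_pattern() {
    n=$1 patterns=$2 target=$3
    # IFS= and -r keep whitespace and backslashes in each pattern intact;
    # the || clause handles a last line without a trailing newline.
    while IFS= read -r pat || [ -n "$pat" ]; do
        # Skip blank lines: an empty pattern would match every line.
        [ -n "$pat" ] || continue
        # -F: literal string, -m: stop after n matches for this pattern,
        # -e: protects patterns that start with "-".
        grep -i -F -m "$n" -e "$pat" "$target"
    done < "$patterns"
}

# Hypothetical single-pass variant: assumes the pattern is the whole
# first field of each row. Output follows the order of the data file.
first_n_by_key() {
    awk -v n="$1" '
        NR == FNR { if (NF) want[tolower($1)]; next }   # load patterns
        (tolower($1) in want) && ++seen[tolower($1)] <= n
    ' "$2" "$3"
}
```

Usage would look like `first_n_per_pattern 5 pattern.txt file.txt > out.txt`. The grep loop rereads file.txt once per pattern, which is fine for small pattern lists; the awk variant reads it once, at the cost of the stricter first-field assumption.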