linux – Looping over a text file of domains with a bash script
Published: 2020-12-13 23:57:10 | Category: Linux | Source: collected from the web
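Hey guys, I wrote a script that reads the href tags of a web page, collects the links on that page, and writes them to a text file. Now I have a text file containing those links, for example: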
    http://news.bbc.co.uk/2/hi/health/default.stm
    http://news.bbc.co.uk/weather/
    http://news.bbc.co.uk/weather/forecast/8?area=London
    http://newsvote.bbc.co.uk/1/shared/fds/hi/business/market_data/overview/default.stm
    http://purl.org/dc/terms/
    http://static.bbci.co.uk/bbcdotcom/0.3.131/style/3pt_ads.css
    http://static.bbci.co.uk/frameworks/barlesque/2.8.7/desktop/3.5/style/main.css
    http://static.bbci.co.uk/frameworks/pulsesurvey/0.7.0/style/pulse.css
    http://static.bbci.co.uk/wwhomepage-3.5/1.0.48/css/bundles/ie6.css
    http://static.bbci.co.uk/wwhomepage-3.5/1.0.48/css/bundles/ie7.css
    http://static.bbci.co.uk/wwhomepage-3.5/1.0.48/css/bundles/ie8.css
    http://static.bbci.co.uk/wwhomepage-3.5/1.0.48/css/bundles/main.css
    http://static.bbci.co.uk/wwhomepage-3.5/1.0.48/img/iphone.png
    http://www.bbcamerica.com/
    http://www.bbc.com/future
    http://www.bbc.com/future/
    http://www.bbc.com/future/story/20120719-how-to-land-on-mars
    http://www.bbc.com/future/story/20120719-road-opens-for-connected-cars
    http://www.bbc.com/future/story/20120724-in-search-of-aliens
    http://www.bbc.com/news/

I'd like to be able to filter them so that I get back something like:

    http://www.bbc.com : 6
    http://static.bbci.co.uk: 15

The value on the side is the number of times the domain appears in the file. How can I achieve this in bash, given that I would have a loop going through the file? I'm new to bash shell scripting.

Solution:

    $ cut -d/ -f-3 urls.txt | sort | uniq -c
          3 http://news.bbc.co.uk
          1 http://newsvote.bbc.co.uk
          1 http://purl.org
          8 http://static.bbci.co.uk
          1 http://www.bbcamerica.com
          6 http://www.bbc.com

Here cut -d/ -f-3 keeps the first three /-separated fields of each line (the scheme, the empty field between the two slashes, and the host), sort puts identical domains next to each other, and uniq -c collapses each run and prefixes it with its count. This assumes urls.txt holds one URL per line.
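Since the question asks specifically about looping over the file, here is a minimal sketch of the same count done with an explicit bash loop and an associative array. It is only an illustration, not part of the original answer: the function name count_domains is made up for this example, it assumes bash 4 or later (associative arrays are not available in older versions), one URL per line on standard input, and that every URL contains "://".

```shell
#!/usr/bin/env bash
# count_domains (hypothetical helper name): reads one URL per line from
# standard input and prints "scheme://host : count" for each distinct domain.
# Assumes bash 4+ and that every URL contains "://".
count_domains() {
    declare -A counts            # local to the function; maps domain -> count
    local url rest domain

    while IFS= read -r url; do
        [[ -z $url ]] && continue              # skip blank lines
        rest=${url#*://}                       # drop the scheme, e.g. "www.bbc.com/news/"
        domain="${url%%://*}://${rest%%/*}"    # rebuild "http://www.bbc.com"
        counts[$domain]=$(( ${counts[$domain]:-0} + 1 ))
    done

    # Key order of an associative array is unspecified, so sort the output.
    for domain in "${!counts[@]}"; do
        printf '%s : %d\n' "$domain" "${counts[$domain]}"
    done | sort
}

# Usage: count_domains < urls.txt
```

For a one-off count the cut | sort | uniq -c pipeline is shorter and will usually be faster on large files. The loop is mainly useful if more per-URL logic needs to run in the same pass.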