加入收藏 | 设为首页 | 会员中心 | 我要投稿 李大同 (https://www.lidatong.com.cn/)- 科技、建站、经验、云计算、5G、大数据,站长网!
当前位置: 首页 > 编程开发 > Python > 正文

Python爬取猫眼电影排行

发布时间:2020-12-20 12:44:53 所属栏目:Python 来源:网络整理
导读:import requestsimport pyquerydef crawl_page(url: str) - None: headers = { 'user-agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML,like Gecko) Chrome/72.0.3626.121 Safari/537.36',} response = requests.get(url,head
import requests
import pyquery


def crawl_page(url: str) -> None:
    headers = {
        'user-agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML,like Gecko) Chrome/72.0.3626.121 Safari/537.36',}
    response = requests.get(url,headers=headers)
    parse_page(response.text)


def parse_page(source_code: str) -> None:
    html = pyquery.PyQuery(source_code)
    dd_elements = html('.board-wrapper dd')
    for dd_element in dd_elements.items():
        data = {
            '排名': dd_element.find('i.board-index').text(),'电影名': dd_element.find('a.image-link').attr('title'),'主演': dd_element.find('p.star').text().split(':')[1],'上映时间': dd_element.find('p.releasetime').text().split(':')[1],'评分': dd_element.find('p.score').text(),}
        print(data)
        save_data(data)


def save_data(data: dict) -> None:
    data = str(data)
    with open('MaoYan.txt','a+',encoding='utf8') as f:
        f.write(data+'n')
    return None


def main():
    for i in range(0,100,10):
        url = 'https://maoyan.com/board/4?offset={}'.format(i)
        crawl_page(url)


if __name__ == '__main__':
    main()

(编辑:李大同)

【声明】本站内容均来自网络,其相关言论仅代表作者个人观点,不代表本站立场。若无意侵犯到您的权利,请及时与联系站长删除相关内容!

    推荐文章
      热点阅读