电影天堂电影链接爬取

 1 import requests,re
 2 
 3 
 4 def getdetail(url):
 5 
 6     response = requests.get(url)
 7     html = response.content.decode('gbk')
 8     # 电影详情页标题
 9     movie_title_name = re.search('<h1><font color=#07519a>(.*)</f',html)
10     movie_title = movie_title_name.group(1)
11     # 电影 磁力   magnet
12     movie_magnet_url = re.search('/><a href="(.*)"><str',html)
13     # print(movie_magnet.group(1))
14     movie_magnet = movie_magnet_url.group(1)
15     # torrent种子
16     movie_torrent_url = re.search('ddf"><a href="(.*)">ft',html)
17     movie_torrent = movie_torrent_url.group(1)
18     # print(movie_torrent.group(1))
19     # 这个列表用来title
20     movie_title_list = []
21     movie_title_list.append(movie_title)
22 
23     # 这个列表两个下载的链接
24     movie_down_url = []
25     movie_down_url.append(movie_magnet)
26     movie_down_url.append(movie_torrent)
27     movie_down_url_all = []
28     movie_down_url_all.append(movie_down_url)
29 
30 
31     movie_dict = dict(zip(movie_title_list,movie_down_url_all))
32     print(movie_dict)
33 
34 
35 
36 def getpage():
37 
38     for i in range(1,178):
39         lurl = 'http://www.dytt8.net/html/gndy/dyzz/list_23_%s.html' % i
40 
41         response = requests.get(lurl)
42 
43         html = response.text
44 
45         movie_url_list = re.findall('<a href="(.*)" class="ulink"',html)
46 
47         for movie_item in movie_url_list:
48             movie_url = 'http://www.dytt8.net'+movie_item
49             getdetail(movie_url)
50 
51 
52 if __name__ == '__main__':
53     getpage()

电影天堂电影链接爬取

【JavaScript】图片加载由模糊变清晰 —— 图片优化

hexdump, hexedit 使用指南

最新文章

三星Galaxy A26首批渲染图曝光后置配备三摄相机

消息称鸿蒙智行尊界轿车命名为“S800”，采用紫色、银色双拼

蔚来宣布在阿塞拜疆开展业务，2025 年第二季度正式开启产品交付

Steam 国区 398 元起，游戏《乐高地平线大冒险》发售

变量提升和函数提升哪个优先级高(为什么低层次的变量不能使用高层次)

win解压缩怎么卸载干净

黑莓桌面管理器怎么用(黑莓桌面管理器如何导出通讯录)

关于鸟的故事（关于鸟类的绘本故事）

丝瓜水有什么功效和作用

莫理循（莫理循环拍摄凌迟）

最新评论

标签

关注我们么么哒！

电影天堂电影链接爬取

【JavaScript】图片加载由模糊变清晰 —— 图片优化

hexdump, hexedit 使用指南

最新文章

三星Galaxy A26首批渲染图曝光 后置配备三摄相机

最新评论

标签

关注我们 么么哒！

关注我们的公众号

三星Galaxy A26首批渲染图曝光后置配备三摄相机

关注我们么么哒！