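JD test account:
Account: python_afu
Password: ****
Log in to JD manually, then save the request headers.
You can use Sublime to clean up the copied content with regular expressions.
Note: if a header name starts with a colon (for example :authority), remove the colon.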
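If you would rather not do the regex cleanup by hand, the same conversion can be sketched in Python. This is only a rough sketch: `headers_from_raw` is a made-up helper name, and it assumes the headers were copied from the browser's developer tools as one `name: value` pair per line. Unlike the manual approach above, it skips the lines that start with a colon instead of stripping the colon, because those are HTTP/2 pseudo-headers rather than regular request headers.

```python
def headers_from_raw(raw: str) -> dict:
    """Turn header text copied from the browser into a dict (hypothetical helper)."""
    headers = {}
    for line in raw.splitlines():
        line = line.strip()
        if not line:
            continue
        # Lines such as ":authority: home.jd.com" are HTTP/2 pseudo-headers; skip them.
        if line.startswith(":"):
            continue
        # Split on the first colon only, so values containing ":" stay intact.
        name, sep, value = line.partition(":")
        if not sep:
            continue
        headers[name.strip().lower()] = value.strip()
    return headers


raw_text = """
:authority: home.jd.com
accept-language: zh-CN,zh;q=0.9,en;q=0.8
cache-control: max-age=0
"""
print(headers_from_raw(raw_text))
```

The resulting dict can be passed to `requests.get` through its `headers` argument.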
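The code below relies on a Cookie. Cookies expire, so you need to capture the traffic again and copy a fresh Cookie; running the code as-is will fail.
#coding=utf-8
# Simulate a login by sending a Cookie that carries login information
import requests
# 1. Build the headers of a user who has already logged in
headers = {
# Copied from the browser's :authority, :method and :scheme entries with the colon removed
'authority':'home.jd.com',
'method':'GET',
'scheme':'https',
'accept':'text/html,application/xhtml+xml,application/xml;q=0.9,image/webp,image/apng,*/*;q=0.8',
# 'br' is left out: requests can only decode Brotli responses when a brotli package is installed
'accept-encoding':'gzip, deflate',
'accept-language':'zh-CN,zh;q=0.9,en;q=0.8',
'cache-control':'max-age=0',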
'cookie':'ipLoc-djd=1-72-2799-0; shshshfpb=05305bbfd662293965660cbd39b2b4b80aeb9ab22ad43f6815b5ff7ffa; shshshfpa=ed1b53d0-8e70-3a3b-57e0-687045756170-1533028289; __jda=122270672.1535304790021629950410.1535304790.1535304790.1535304790.1; __jdc=122270672; __jdv=122270672|direct|-|none|-|1535304790021; __jdu=1535304790021629950410; wlfstk_smdl=fvn1f4pfetdqrvci1w8t51vwy86heqau; TrackID=1Sl3wiuNGyUHmjSPnTn6RFEya_FTuPI5w6IL7LX-DyPAhVQ3qSiCrnqNlsL6ruJif59-w6Uga9WgbOuVgBK1Hew; thor=AE86F17153107F9744F5268A88E4731C45C7B857C2D8B269992EBE6692E23E9291C85ED861C7C5AFB77C7B162D32C8C38D69C351EFD04EAE3D44C3A23041A41F53044C1B2EB15D9257BF58AFC98955DFCF58C03BA32643AFA645A58BDCC21EF255647E56045239BC8F64989FDB29BC626ADEE5959E868E0148B8731C94C418FB6023EAA7A57BC49C6FBEE2752C2024DB; pinId=tF0w3lsgbrKuuR-PW68-4Q; pin=python_afu; unick=python_afu; ceshi3.com=000; _tp=wmA28pFNyQS3czxsK1OIHA%3D%3D; _pst=python_afu; __jdb=122270672.3.1535304790021629950410|1.1535304790; 3AB9D23F7A4B3C9B=3JRK7JWTXAB65RWQ7VTATRTEE5FP3SSEWYN22GGU2VEDZ3SITYGMP53UFOY3XV7YVH7SF7PSJOLRKIYIWDCK6HD2N4',
'upgrade-insecure-requests':'1',
'user-agent':'Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/68.0.3440.106 Safari/537.36',
}
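# 2. Send the request with these headers (mainly the Cookie)
# 3. Request the JD home page directly; the server uses the headers (mainly the Cookie) to recognize a logged-in user and returns the corresponding page
response = requests.get("https://home.jd.com/", headers=headers)
# 4. Print the response body
print(response.text)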
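Since the Cookie has to be replaced every time it expires, it can be handy to keep it apart from the other headers. Here is a minimal sketch of that idea; `cookies_from_header` is a hypothetical helper, not part of requests, and the sample string is just a shortened piece of the Cookie above.

```python
def cookies_from_header(cookie_header: str) -> dict:
    """Split a raw Cookie header value into a name -> value dict (hypothetical helper)."""
    cookies = {}
    for pair in cookie_header.split(";"):
        # Split on the first "=" only; cookie values may contain "=" themselves.
        name, sep, value = pair.strip().partition("=")
        if sep and name:
            cookies[name] = value
    return cookies


sample = "pin=python_afu; unick=python_afu; _tp=wmA28pFNyQS3czxsK1OIHA%3D%3D"
print(cookies_from_header(sample))
```

The dict can then be passed as `requests.get(url, headers=headers, cookies=cookies)`, with the `cookie` entry removed from `headers`, so only one string needs updating after a new capture.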
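In crawling, the most typical use of Cookies is to let the site determine whether a registered user is already logged in. When logging in, the user may be asked whether to keep their information for the next visit, which simplifies later logins.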
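One rough way to notice an expired Cookie is to look at where the request ended up. The sketch below assumes that JD redirects visitors who are not logged in to a login page on `passport.jd.com`; that host name and the helper `redirected_to_login` are assumptions for illustration, so check them against what your own browser shows.

```python
from urllib.parse import urlsplit

# Assumption: anonymous requests to home.jd.com are redirected to this host.
LOGIN_HOST = "passport.jd.com"


def redirected_to_login(final_url: str, login_host: str = LOGIN_HOST) -> bool:
    """Return True if the final URL points at the (assumed) login host."""
    host = urlsplit(final_url).hostname or ""
    return host == login_host


print(redirected_to_login("https://home.jd.com/"))
print(redirected_to_login("https://passport.jd.com/new/login.aspx"))
```

With requests, `response.url` holds the final URL after any redirects, so `redirected_to_login(response.url)` can be checked before printing the page.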
GitHub repository with the source code for this blog:
Please give it a star, thanks!
