python使用selenium以及selenium-wire做质量与性能检测

2023-05-16

python天生就是适合用来做爬虫，结合selenium真是如虎添翼；

1) 安装库

pip install selenium
pip install selenium-wire

2）添加驱动，比如 chrome需要下载一个驱动，放到项目目录下或者python安装目录下，根据机器上对应的chrome版本进行下载。我是放在python3.exe的目录

下载地址：

CNPM Binaries Mirror

selenium功能比较强大，但是仍然缺少一些特性，比如需要获取每个请求的头，返回的头信息等，靠谱的方式是selenium-wire，需要注意的是：不要使用IPV6，测试发现只能使用IPV4！！！

效果如下：

比如我的需求是：测试某网页全页面加载时长，各个子元素请求时长，并且截图，测试代码如下：

import time
from PIL import Image   # pip install pillow
import json

#from selenium import webdriver
# https://pypi.org/project/selenium-wire/#response-objects
from seleniumwire import webdriver  # Import from seleniumwire
from selenium.webdriver.common.by import By
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions
from selenium.webdriver.chrome.options import Options

 
# Create a new instance of the Chrome driver
option = webdriver.ChromeOptions()
chrome_options = Options()
chrome_options.add_argument('--headless')    # 2> 添加无头参数r,一定要使用无头模式，不然截不了全页面，只能截到你电脑的高度
chrome_options.add_argument('--disable-gpu') # 3> 为了解决一些莫名其妙的问题关闭 GPU 计算
chrome_options.add_argument('--no-sandbox')  # 4> 为了解决一些莫名其妙的问题浏览器不动
chrome_options.add_argument("--user-data-dir=C:\\Users\\[user]\\AppData\\Local\\Google\\Chrome\\User Data") 
#chrome_options.add_extension("adblock_v3.6.12.crx") # 加载.crx后缀的插件

# 调用打印功能的设置，
# 打印不能使用--headless模式，必须要可见模式；打印的pDF有时格式还会乱；不如截图
settings = {
    "recentDestinations": [{
        "id": "Save as PDF",
        "origin": "local",
        "account": ""
    }],
    "selectedDestinationId": "Save as PDF",
    "version": 2,
    "isHeaderFooterEnabled": False,

    # "customMargins": {},
    #"marginsType": 2,#边距（2是最小值、0是默认）
    # "scaling": 100,
    # "scalingType": 3,
    # "scalingTypePdf": 3,
    #"isLandscapeEnabled": True,  # 若不设置该参数，默认值为纵向
    "isCssBackgroundEnabled": True,
    "mediaSize": {
        "height_microns": 297000,
        "name": "ISO_A4",
        "width_microns": 210000,
        "custom_display_name": "A4"
    },
}

chrome_options.add_argument('--enable-print-browser')
# chrome_options.add_argument('--headless') #headless模式下，浏览器窗口不可见，可提高效率
prefs = {
    'printing.print_preview_sticky_settings.appState': json.dumps(settings),
    'savefile.default_directory': 'd:\\test\\'  
}
# 此处填写你希望文件保存的路径,可填写your file path默认下载地址

chrome_options.add_argument('--kiosk-printing')  # 静默打印，无需用户点击打印页面的确定按钮
chrome_options.add_experimental_option('prefs', prefs)

##
option.add_experimental_option('excludeSwitches', ['enable-logging'])
driver = webdriver.Chrome(chrome_options=chrome_options)

# 窗口最大化
driver.maximize_window()


# 访问页面
driver.get('https://weibo.com')

# 记录全页面中成功与失败的请求数，并记录出错使用的时长

n1 = 0
n2 = 0
for request in driver.requests:
    if request.response:
        if request.response.status_code == 200:
            n1 += 1
        else:
            n2 += 1
            print(
                request.url,
                request.response.status_code,
                request.response.headers['Content-Type'] )
            # print(request.headers)
            # print(request.response.headers)
            # print(request.date)
            # print(request.response.date)
            delta = round(( request.response.date - request.date).microseconds/1000000, 2)
            print("cost ", delta, "s")
print("%d,  %d" % (n1, n2))
#driver.webDriverWait()
#driver.implicitly_wait(10)

element = WebDriverWait(driver, 10).until(expected_conditions.presence_of_element_located((By.ID, "app")))

try:
    # 模拟人滚动滚动条,处理图片懒加载问题
    k = 1
    js_height = "return document.body.clientHeight"
    height = driver.execute_script(js_height)
    while True:
        if k * 500 < height:
            js_move = "window.scrollTo(0,{})".format(k * 500)
            print(js_move)
            driver.execute_script(js_move)
            time.sleep(0.2)
            height = driver.execute_script(js_height)
            k += 1
        else:
            break

    time.sleep(1)

    # 7>  # 直接截图截不全，调取最大网页截图
    width = driver.execute_script(
        "return Math.max(document.body.scrollWidth, document.body.offsetWidth, document.documentElement.clientWidth, document.documentElement.scrollWidth, document.documentElement.offsetWidth);")
    height = driver.execute_script(
        "return Math.max(document.body.scrollHeight, document.body.offsetHeight, document.documentElement.clientHeight, document.documentElement.scrollHeight, document.documentElement.offsetHeight);")
    print(width, height)
    # 将浏览器的宽高设置成刚刚获取的宽高
    driver.set_window_size(width + 100, height + 100)
    time.sleep(1)
    png_path = "d:\\test\\" + '{}.png'.format('xx网址截图')

    # 截图并关掉浏览器
    driver.save_screenshot(png_path)
    driver.get_screenshot_as_file("d:\\test\\selenium.png")
    #driver.execute_script('document.title="test.pdf";window.print();')
    
    # png转pdf
    # image1 = Image.open(png_path)
    # im1 = image1.convert('RGB')
    # pdf_path = png_path.replace('.png', '.pdf')
    # im1.save(pdf_path)

except Exception as e:
    pass

driver.close()

后记：

在linux下，默认chrome是启动不了的，需要更改/usr/bin/google-chrome脚本，但是这样会造成selenium无法正常工作，

需要指定程序的绝对路径：

from seleniumwire import webdriver  # Import from seleniumwire
from selenium.webdriver.common.by import By
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions
from selenium.webdriver.chrome.options import Options
from selenium.webdriver.chrome.service import Service

 
# Create a new instance of the Chrome driver
options = webdriver.ChromeOptions()
options.add_argument("--disable-dev-shm-usage"); 
options.add_argument("start-maximized"); 
options.add_argument("disable-infobars"); 
options.add_argument("--disable-extensions")
options.add_argument("--disable-gpu"); 
options.add_argument("--no-sandbox");
options.add_argument("--user-data-dir=/root/chrome/data")
# 指定chrome的路径
options.binary_location = "/opt/google/chrome/chrome"  


s = Service("/usr/bin/chromedriver")

driver = webdriver.Chrome(service=s, options=options)

# 窗口最大化
driver.maximize_window()


# 访问页面
driver.get('https://mail.qq.com')

这样就OK了。

后记2：

这个组件使用Selenium和MitmProxy两个组件来做信息检测，

也就是说自己加了一个中间人代理，通过代理将数据拦截下来并记录到内存或者目录；

文档链接：

Event Hooks & API

本文内容由网友自发贡献，版权归原作者所有，本站不承担相应法律责任。如您发现有涉嫌抄袭侵权的内容，请联系:hwhale#tublm.com(使用前将#替换为@)

python使用selenium以及selenium-wire做质量与性能检测的相关文章

到底什么是Unikernel？

本文转载至 xff1a http dockone io article 855 utm source 61 tuicool amp utm medium 61 referral 编者的话本文介绍了一种新的应用虚拟化技术 xff0c 它让应
xauth: “timeout in locking authority file /home/<user>/.Xauthority”?

本文转载至 xff1a http unix stackexchange com questions 215558 why am i getting this message from xauth timeout in locking aut
小技巧：检查你本地及公共 IP 地址

本文转载至 xff1a https linux cn article 8207 1 html utm source 61 rss amp utm medium 61 rss 你本地的 IP 地址 xff1a 192 168 1 100 上面
Inside Real-Time Linux

本文转载于 xff1a https www linux com news event elce 2017 2 inside real time linux Real time Linux has come a long way in the
[小技巧] vim中使用cscope时不区别大小写

cscope 有 C 这么一个选项 C Ignore letter case when searching vim 里使用 cscope 不区别大小写可以使用下面一个技巧 xff1a set csprg 61 usr bin ra csco
PWM占空比和电机转速有什么线性关系

可以看电机拖动一书 xff0c 里面讲了电机的建模由于PWM波频率很高 xff0c 一般认为接在电机两端的电压平均值有如下关系 xff1a 假如占空比为a xff0c 驱动板供电电压为U xff0c 则电机两端电压Ud 61 a U 对于
SIFT特征点提取及描述论文算法详解

SIFT特征点提取及描述论文算法详解 1 尺度空间极值检测 Scale space extrema detection 1 1 尺度空间和极值1 2 DoG和LoG的关系1 3 构建高斯尺度差分空间Tips 2 极值点定位 Keypoint
国科大计算机视觉20-21考题

国科大计算机视觉20 21考题 SIFT检测及描述流程 xff08 20分 xff09 相机成像模型 xff08 16分 xff09 两视图的稀疏重建 xff08 16分 xff09 LM算法流程 xff08 16分 xff09 PCA的思
Ubuntu18.04关闭内核自动更新安装之前版本

Ubuntu18 04关闭内核自动更新安装之前版本回退的原因 xff0c 上一周安装了Ubuntu18 04双系统 xff0c 主机型号是外星人 Asura R6 xff0c 安装完毕后可以正常进入Ubuntu xff0c 但是关机的时候
Windows10配置MongoDB

Windows10安装MongoDB并配置 1 安装2 安装完成后启动服务器2 1 一次性启动2 2 设置为服务 xff0c 开机自启动 3 添加环境变量 xff0c 方便在cmd任何目录中直接启动参考链接 xff1a https www
Ubuntu18.04编译ORB-SLAM3及遇到的一些问题

测试环境 xff1a 系统 xff1a Ubuntu18 04Eigen 3 3 4 查看Eigen3版本的方法Pangolin 0 6OpenCV 3 4 14ROS Melodic 一安装依赖 ORB SLAM的各项依赖里OpenCV
Python multiprocessing多进程编程，进程间通信，psutil监控进程状态并通过电子邮件告警

python多进程编程进程监测一 mutiprocessng多进程编程和通信二进程监测分析三 Python邮件发送功能四完整代码运行结果 xff1a 服务器上的web后端经常需要同时运行多个进程 xff0c 各个进程之间需要交换数
Supervisor服务器进程监测

服务器上的应用程序有时候会莫名其妙地挂掉 xff0c 如果我们经常去登录服务器看是不是程序挂了 xff0c 挂了再拉起 xff0c 那样是非常耗时和麻烦的事情后来我们通过使用 supervisor 去守护启动 xff0c 实现方法如下一
Ubuntu18.04手动安装NVIDIA驱动

Ubuntu18 04手动安装NVIDIA驱动 1 下载驱动查看系统推荐的驱动版本 xff0c 官网下载对应的run文件 NVIDIA驱动下载的高级搜索 xff1a https www nvidia cn Download Find as
ORB-SLAM2 编译记录

ORB SLAM2编译记录由于之前已经编译过ORB SLAM3 xff0c 大部分库都已经配置好了 xff0c 这次主要只了处理两个错误 1 error usleep is not declared in this scope xxx x
视觉SLAM十四讲 Ubuntu20.04 Pangolin 环境配置

视觉SLAM十四讲 Ubuntu20 04 Pangolin 环境配置一 github下载源代码选择0 5版本的 xff0c 要不然版本装高了编译ORB SLAM2会遇到问题二报错及处理 error AV PIX FMT XVMC
Trilateration三边测量定位算法

http www justinablog com archives 1066 基本原理 Trilateration xff08 三边测量 xff09 是一种常用的定位算法 xff1a 已知三点位置 x1 y1 x2 y2 x3 y3 已知未
编译VINS_Mono报错： Project ‘cv_bridge‘ specifies ‘/usr/include/opencv‘ as an include dir, which is not f

编译VINS Mono报错 xff1a CMake Error at opt ros melodic share cv bridge cmake cv bridgeConfig cmake 113 Project cv bridge spe
《视觉SLAM十四讲》中SE(3)指数映射和左雅克比矩阵的推导

高博的书上给出了 S O 3 SO 3 S O 3 的指数映射推导 xff0c 但对于
Python sum()函数

Python里的sum函数语法例子1 列表中的元素为数字 xff1a 2 列表中的元素为字符串 xff1a 3 列表中元素为列表语法 sum iterable start 参数1 iterable xff0c 一个可迭代对象 xff0c

随机推荐

Ubuntu20.04安装tensorflow2.8.0+CUDA11.4

Ubuntu20 04安装tensorflow2 8 0 43 CUDA11 4 1 创建虚拟环境2 安装tensorflow3 安装CUDA4 安装cuDNN4 1 手动安装4 2 deb安装包安装 5 测试需要事先安装好Anacond
关于/etc/ld.so.conf.d/和环境变量设置

关于 etc ld so conf 和环境变量LD LIBRARY PATH 1 动态可执行程序和静态可执行程序2 动态链接库的搜索2 1 查询程序依赖的动态链接库2 2 动态装入器 xff08 dynamic loader xff09 2
Pytorch检查CUDA和cudnn是否可用及其版本

Pytorch检查CUDA和cudnn版本检查CUDA检查cudnn 命令行终端启动python 检查CUDA span class token operator gt gt span span class token operator
Ubuntu 18.04 ROS Melodic中调用支持Python3的cv_bridge

Ubuntu 18 04 ROS Melodic中调用支持Python3的cv bridge 0 背景1 编译自己的cv bridge功能包 Python 3 7 11 2 更新当前shell的环境变量3 附录Why use source
VIm自动生成python的文件头

VIm自动生成python的文件头我实现的效果如图所示 xff1a 思路是在vimrc配置文件中写相关的函数 xff0c 代码在下面贴出按 wq保存退出以后 xff0c 会自动更新上次修改时间 34 新建py文件时插入文件头 autoc
使用Dokcer配置Tensorflow-1.15环境并使用VSCode开发

使用Dokcer配置Tensorflow 1 15环境目前学术界大部分深度学习的开源代码都是基于Pytorch的 xff0c 但还有少部分工作或者以前的工作是基于Tensorflow 1 x的 xff0c 由于tensorflow的版本和
使用VNC可视化Docker容器

使用VNC可视化Docker容器 0 前言环境 xff1a 1 容器端配置1 1 启动Docker容器1 2 安装x111 3 安装桌面环境1 4 安装tightvncserver 2 配置VNC Server2 1 首先停止刚刚新建的虚拟
STM32 串口ISP下载方式解读

xfeff xfeff http blog sina com cn s blog b09739ab0102v4rm html Flash Loader Demonstrator 下载工具的安装 1 xff0e 硬件的连接和设置串口ISP
with异常处理

class A 39 39 39 此类的对象可以用 xff57 xff49 xff54 xff48 语句进行管理 39 39 39 def enter self print 34 已经进入with语句 34 return self def
telegram android 源码分析（一）自动设置代理

比如自动设置mtproxy代理 xff0c 冗长的代码我们怎么去找 xff1f 1 xff09 首先我们发现点代理链接能弹对话框 xff0c 们可以在strings xml中搜索得到 xff1a lt string name 61 34 U
NS3 的 ipv4-static-routing-test-suite 源码分析

下面进行源码注释 xff1a End to end tests for Ipv4 static routing include 34 ns3 boolean h 34 include 34 ns3 config h 34 include 3
c语言向上取整计算方法

用整数N 除以 M xff0c 要求向上取整数 1 xff09 int n 61 N 43 M 1 M xff1b 简化后就是 xff1a 2 xff09 int n 61 N 1 M 43 1 xff1b 最笨的办法 3 int n 61
比std::qsort还快的快速排序（1千万整数1.7秒）——（快速排序栈溢出与递归优化）

前几天发现老外的开源项目中事件队列中用的就是std qsort排序 xff0c 后续插入时候使用了堆方式快速排序实际应用中是比堆排序要快的 xff0c 这主要是因为硬件层次会对数据执行高速缓存 xff0c 数据使用一二三级高速缓存比访问内
C#使用ProtoBuf

1 Google ProtoBuf 经过测试 xff0c protobuf比json存储效率还是要高 xff0c 即时号称最快的fastjson也没有protobuf快 xff0c 这里为了使用 c 做一个客户端兼容 xff0c 所以也需要
多线程如何实现高性能计数器（无锁）

多线程协作免不了使用计数器 xff0c 通常的代码 xff0c c 43 43 一般会使用锁 xff0c 或者原子变量操作 xff1a std mutex mutexCounter int count void add std lock g
ubuntu18/20 下如何生成core文件

ubuntu18 20 下如何生成core文件一设置原理 xff1a https blog csdn net Sunnyside article details 118439302 原来在ubuntu14 ubuntu16上只需要一步
c++的字节序与符号位的问题

看这样一道题 xff1a include lt stdio h gt int main void int w h int i 61 0xa1b2c3d4 char p 61 char amp i for int j 61 0 j lt 4
docker镜像之带vnc的ubuntu

docker镜像之带vnc图形界面ubuntu 前言 xff1a 为了在图形界面中使用firefox xff0c 需要找一个带rdp或者vnc的ubuntu xff0c 最好是gnome的界面 xff0c 折腾了3天 xff0c 终于找
STM32中，关于中断函数调用全局变量的问题

xfeff xfeff https blog csdn net leo liu006 article details 79334905 首先是问题的描述 xff1a 硬件单片机型号 xff0c STM32F103VET6 xff0c IDE
python使用selenium以及selenium-wire做质量与性能检测

python天生就是适合用来做爬虫 xff0c 结合selenium真是如虎添翼 xff1b 1 安装库 pip install selenium pip install selenium wire 2 xff09 添加驱动 xff0c 比

python使用selenium以及selenium-wire做质量与性能检测

python使用selenium以及selenium-wire做质量与性能检测 的相关文章

随机推荐

热门标签

python使用selenium以及selenium-wire做质量与性能检测的相关文章