How to get text from a <p> tag using XPath, Selenium, and Python

2023-12-03

I need to capture one line of text from a <p> using XPath. I want to store the text Content-type: text/plain; charset=us-ascii in a Python variable, but I get the following error:

selenium.common.exceptions.WebDriverException: Message: TypeError: Expected an element or WindowProxy, got: [object Text] {}

Here is the code I tried:

import selenium.webdriver as webdriver

browser = webdriver.Firefox()
browser.get('https://www.w3.org/Protocols/rfc1341/7_1_Text.html')

foo = browser.find_element_by_xpath('/html/body/p[5]/text()')
print(foo)

<h1>7.1  The Text Content-Type</h1>
<p>
The text Content-Type is intended for sending material which
is  principally textual in form.  It is the default Content-
Type.  A "charset" parameter may be  used  to  indicate  the
character set of the body text.  The primary subtype of text
is "plain".  This indicates plain (unformatted)  text.   The
default  Content-Type  for  Internet  mail  is  "text/plain;
charset=us-ascii".
<p>
Beyond plain text, there are many formats  for  representing
what might be known as "extended text" -- text with embedded
formatting and  presentation  information.   An  interesting
characteristic of many such representations is that they are
to some extent  readable  even  without  the  software  that
interprets  them.   It is useful, then, to distinguish them,
at the highest level, from such unreadable data  as  images,
audio,  or  text  represented in an unreadable form.  In the
absence  of  appropriate  interpretation  software,  it   is
reasonable to show subtypes of text to the user, while it is
not reasonable to do so with most nontextual data.
<p>
Such formatted textual  data  should  be  represented  using
subtypes  of text.  Plausible subtypes of text are typically
given by the common name of the representation format, e.g.,
"text/richtext".
<p>
<h3>7.1.1     The charset parameter</h3>
<p>
A critical parameter that may be specified in  the  Content-
Type  field  for  text  data  is the character set.  This is
specified with a "charset" parameter, as in:
<p>
     Content-type: text/plain; charset=us-ascii
<p>
Unlike some  other  parameter  values,  the  values  of  the
charset  parameter  are  NOT  case  sensitive.   The default
character set, which must be assumed in  the  absence  of  a
charset parameter, is US-ASCII.

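To print the text Content-type: text/plain; charset=us-ascii, wait for the element with WebDriverWait and visibility_of_element_located(). You can use either of the following locator strategies: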

Using XPATH and the text attribute:

driver.get("https://www.w3.org/Protocols/rfc1341/7_1_Text.html")
print(WebDriverWait(driver, 20).until(EC.visibility_of_element_located((By.XPATH, "//h3[contains(., 'The charset parameter')]//following-sibling::p[2]"))).text)

Using XPATH and get_attribute():

driver.get("https://www.w3.org/Protocols/rfc1341/7_1_Text.html")
print(WebDriverWait(driver, 20).until(EC.visibility_of_element_located((By.XPATH, "//h3[contains(., 'The charset parameter')]//following-sibling::p[2]"))).get_attribute("innerHTML"))

Console output:

Content-type: text/plain; charset=us-ascii
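
Both variants work because the XPath selects the <p> element itself, and the text is read from that element afterwards. The locator in the question ends in text(), which selects a text node. Selenium can only return elements, hence "Expected an element or WindowProxy, got: [object Text]". A minimal sketch of that fix follows. The helper names element_xpath and read_paragraph are hypothetical and not part of Selenium.

```python
def element_xpath(xpath: str) -> str:
    """Drop a trailing /text() step so the XPath selects the element itself."""
    suffix = "/text()"
    return xpath[: -len(suffix)] if xpath.endswith(suffix) else xpath


def read_paragraph(driver, xpath: str) -> str:
    """Locate the element (not its text node) and return its visible text.

    `driver` is assumed to be an already started Selenium WebDriver.
    """
    # Imported here so element_xpath can be used without Selenium installed.
    from selenium.webdriver.common.by import By

    # find_element(By.XPATH, ...) is the Selenium 4 form; the older
    # find_element_by_xpath helper was removed in later 4.x releases.
    element = driver.find_element(By.XPATH, element_xpath(xpath))
    return element.text
```

With a running driver, read_paragraph(driver, '/html/body/p[5]/text()') would query /html/body/p[5]. Whether p[5] is the intended paragraph depends on the page's actual markup.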

Note: you have to add the following imports:

from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.common.by import By
from selenium.webdriver.support import expected_conditions as EC
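
Putting the imports and the wait together, the text can be stored in a variable as the question asks. The sketch below is one possible arrangement, not a definitive implementation. The names CHARSET_XPATH, normalize_whitespace, and fetch_content_type_line are hypothetical. The XPath uses a single slash before following-sibling, which is assumed to select the same paragraph on this page. The whitespace cleanup is included because innerHTML returns the raw markup, including the indentation and line breaks around the text.

```python
PAGE_URL = "https://www.w3.org/Protocols/rfc1341/7_1_Text.html"
# Second <p> after the "The charset parameter" heading (assumed page layout).
CHARSET_XPATH = "//h3[contains(., 'The charset parameter')]/following-sibling::p[2]"


def normalize_whitespace(raw: str) -> str:
    """Collapse runs of spaces and newlines and trim both ends."""
    return " ".join(raw.split())


def fetch_content_type_line(timeout: int = 20) -> str:
    """Open the page in Firefox, wait for the paragraph, return its text."""
    # Imports sit inside the function only so that normalize_whitespace can
    # be exercised without Selenium installed; at module level they behave
    # the same.
    from selenium import webdriver
    from selenium.webdriver.common.by import By
    from selenium.webdriver.support import expected_conditions as EC
    from selenium.webdriver.support.ui import WebDriverWait

    driver = webdriver.Firefox()
    try:
        driver.get(PAGE_URL)
        element = WebDriverWait(driver, timeout).until(
            EC.visibility_of_element_located((By.XPATH, CHARSET_XPATH))
        )
        # innerHTML keeps the source indentation, so clean it up.
        return normalize_whitespace(element.get_attribute("innerHTML"))
    finally:
        # Always close the browser, even if the wait times out.
        driver.quit()
```

Calling content_type_line = fetch_content_type_line() then leaves the paragraph text in an ordinary Python string.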


python

selenium

seleniumwebdriver

xpath

getattribute
