Python内置函数(47)——open

2023-11-12

英文文档：

open(file, mode='r', buffering=-1, encoding=None, errors=None, newline=None, closefd=True, opener=None)

Open file and return a corresponding file object. If the file cannot be opened, an OSError is raised.

file is either a string or bytes object giving the pathname (absolute or relative to the current working directory) of the file to be opened or an integer file descriptor of the file to be wrapped. (If a file descriptor is given, it is closed when the returned I/O object is closed, unless closefd is set to False.)

mode is an optional string that specifies the mode in which the file is opened. It defaults to 'r' which means open for reading in text mode. Other common values are 'w' for writing (truncating the file if it already exists), 'x' for exclusive creation and 'a' for appending (which on some Unix systems, means that all writes append to the end of the file regardless of the current seek position). In text mode, if encoding is not specified the encoding used is platform dependent: locale.getpreferredencoding(False) is called to get the current locale encoding. (For reading and writing raw bytes use binary mode and leave encoding unspecified.) The available modes are:

Character	Meaning
`'r'`	open for reading (default)
`'w'`	open for writing, truncating the file first
`'x'`	open for exclusive creation, failing if the file already exists
`'a'`	open for writing, appending to the end of the file if it exists
`'b'`	binary mode
`'t'`	text mode (default)
`'+'`	open a disk file for updating (reading and writing)
`'U'`	universal newlines mode (deprecated)

The default mode is 'r' (open for reading text, synonym of 'rt'). For binary read-write access, the mode 'w+b' opens and truncates the file to 0 bytes. 'r+b' opens the file without truncation.

As mentioned in the Overview, Python distinguishes between binary and text I/O. Files opened in binary mode (including 'b' in the mode argument) return contents as bytes objects without any decoding. In text mode (the default, or when 't' is included in the mode argument), the contents of the file are returned as str, the bytes having been first decoded using a platform-dependent encoding or using the specified encoding if given.

Note

Python doesn’t depend on the underlying operating system’s notion of text files; all the processing is done by Python itself, and is therefore platform-independent.

buffering is an optional integer used to set the buffering policy. Pass 0 to switch buffering off (only allowed in binary mode), 1 to select line buffering (only usable in text mode), and an integer > 1 to indicate the size in bytes of a fixed-size chunk buffer. When no buffering argument is given, the default buffering policy works as follows:

Binary files are buffered in fixed-size chunks; the size of the buffer is chosen using a heuristic trying to determine the underlying device’s “block size” and falling back on io.DEFAULT_BUFFER_SIZE. On many systems, the buffer will typically be 4096 or 8192 bytes long.
“Interactive” text files (files for which isatty() returns True) use line buffering. Other text files use the policy described above for binary files.

encoding is the name of the encoding used to decode or encode the file. This should only be used in text mode. The default encoding is platform dependent (whatever locale.getpreferredencoding() returns), but any text encoding supported by Python can be used. See the codecs module for the list of supported encodings.

errors is an optional string that specifies how encoding and decoding errors are to be handled–this cannot be used in binary mode. A variety of standard error handlers are available (listed under Error Handlers), though any error handling name that has been registered with codecs.register_error() is also valid. The standard names include:

'strict' to raise a ValueError exception if there is an encoding error. The default value of None has the same effect.
'ignore' ignores errors. Note that ignoring encoding errors can lead to data loss.
'replace' causes a replacement marker (such as '?') to be inserted where there is malformed data.
'surrogateescape' will represent any incorrect bytes as code points in the Unicode Private Use Area ranging from U+DC80 to U+DCFF. These private code points will then be turned back into the same bytes when the surrogateescape error handler is used when writing data. This is useful for processing files in an unknown encoding.
'xmlcharrefreplace' is only supported when writing to a file. Characters not supported by the encoding are replaced with the appropriate XML character reference &#nnn;.
'backslashreplace' replaces malformed data by Python’s backslashed escape sequences.
'namereplace' (also only supported when writing) replaces unsupported characters with \N{...} escape sequences.

newline controls how universal newlines mode works (it only applies to text mode). It can be None, '', '\n', '\r', and '\r\n'. It works as follows:

When reading input from the stream, if newline is None, universal newlines mode is enabled. Lines in the input can end in '\n', '\r', or '\r\n', and these are translated into '\n' before being returned to the caller. If it is '', universal newlines mode is enabled, but line endings are returned to the caller untranslated. If it has any of the other legal values, input lines are only terminated by the given string, and the line ending is returned to the caller untranslated.
When writing output to the stream, if newline is None, any '\n' characters written are translated to the system default line separator, os.linesep. If newline is '' or '\n', no translation takes place. If newline is any of the other legal values, any '\n' characters written are translated to the given string.

If closefd is False and a file descriptor rather than a filename was given, the underlying file descriptor will be kept open when the file is closed. If a filename is given closefd must be True (the default) otherwise an error will be raised.

A custom opener can be used by passing a callable as opener. The underlying file descriptor for the file object is then obtained by calling opener with (file, flags). opener must return an open file descriptor (passing os.open as opener results in functionality similar to passing None).

说明：

　　1. 函数功能打开一个文件，返回一个文件读写对象，然后可以对文件进行相应读写操作。

　　2. file参数表示的需要打开文件的相对路径(当前工作目录)或者一个绝对路径，当传入路径不存在此文件会报错。或者传入文件的句柄。

>>> a = open('test.txt') # 相对路径
>>> a
<_io.TextIOWrapper name='test.txt' mode='r' encoding='cp936'>
>>> a.close()

>>> a = open(r'D:\Python\Python35-32\test.txt') # 绝对路径
>>> a
<_io.TextIOWrapper name='D:\\Python\\Python35-32\\test.txt' mode='r' encoding='cp936'>

　　3. mode参数表示打开文件的模式，常见的打开模式有如下几种，实际调用的时候可以根据情况进行组合。

　　　　'r'：以只读模式打开（缺省模式）（必须保证文件存在）
　　　　'w'：以只写模式打开。若文件存在，则会自动清空文件，然后重新创建；若文件不存在，则新建文件。使用这个模式必须要保证文件所在目录存在，文件可以不存在。该模式下不能使用read*()方法

　　　　'a'：以追加模式打开。若文件存在，则会追加到文件的末尾；若文件不存在，则新建文件。该模式不能使用read*()方法。

　　下面四个模式要和上面的模式组合使用
　　　　'b'：以二进制模式打开

　　　　't'：以文本模式打开（缺省模式）
　　　　'+'：以读写模式打开
　　　　'U'：以通用换行符模式打开

　　常见的mode组合

　　　　'r'或'rt'：默认模式，文本读模式
　　　　'w'或'wt'：以文本写模式打开（打开前文件会被清空）
　　　　'rb'：以二进制读模式打开
　　　　'ab'：以二进制追加模式打开
　　　　'wb'：以二进制写模式打开（打开前文件会被清空）
　　　　'r+'：以文本读写模式打开，可以写到文件任何位置；默认写的指针开始指在文件开头, 因此会覆写文件
　　　　'w+'：以文本读写模式打开（打开前文件会被清空）。可以使用read*()
　　　　'a+'：以文本读写模式打开（写只能写在文件末尾）。可以使用read*()
　　　　'rb+'：以二进制读写模式打开
　　　　'wb+'：以二进制读写模式打开（打开前文件会被清空）
　　　　'ab+'：以二进制读写模式打开

# t为文本读写，b为二进制读写
>>> a = open('test.txt','rt')
>>> a.read()
'some text'
>>> a = open('test.txt','rb')
>>> a.read()
b'some text'

# r为只读，不能写入；w为只写，不能读取
>>> a = open('test.txt','rt')
>>> a.write('more text')
Traceback (most recent call last):
  File "<pyshell#67>", line 1, in <module>
    a.write('more text')
io.UnsupportedOperation: write
>>> a = open('test.txt','wt')
>>> a.read()
Traceback (most recent call last):
  File "<pyshell#69>", line 1, in <module>
    a.read()
io.UnsupportedOperation: not readable

#其它不一一举例了

　　4. buffering表示文件在读取操作时使用的缓冲策略。

　　　　　　0：代表buffer关闭（只适用于二进制模式）
　　　　　　1：代表line buffer（只适用于文本模式）
　　　　　　>1：表示初始化的buffer大小

　　5. encoding参数表示读写文件时所使用的的文件编码格式。

　　假设现在test.txt文件以utf-8编码存储了一下文本：

>>> a = open('test.txt','rt') # 未正确指定编码，有可能报错
>>> a.read()
Traceback (most recent call last):
  File "<pyshell#87>", line 1, in <module>
    a.read()
UnicodeDecodeError: 'gbk' codec can't decode byte 0xac in position 8: illegal multibyte sequence

>>> a = open('test.txt','rt',encoding = 'utf-8')
>>> a.read()
'我是第1行文本，我将被显示在屏幕\n我是第2行文本，我将被显示在屏幕\n我是第3行文本，我将被显示在屏幕'
>>>

　　6. errors参数表示读写文件时碰到错误的报错级别。

　　常见的报错基本有：

'strict' 严格级别，字符编码有报错即抛出异常，也是默认的级别，errors参数值传入None按此级别处理.
'ignore' 忽略级别，字符编码有错，忽略掉.
'replace' 替换级别，字符编码有错的，替换成？.

>>> a = open('test.txt','rt',encoding = 'utf-8')
>>> a.read()
'我是第1行文本，我将被显示在屏幕\n我是第2行文本，我将被显示在屏幕\n我是第3行文本，我将被显示在屏幕'
>>> a = open('test.txt','rt')
>>> a.read()
Traceback (most recent call last):
  File "<pyshell#91>", line 1, in <module>
    a.read()
UnicodeDecodeError: 'gbk' codec can't decode byte 0xac in position 8: illegal multibyte sequence
>>> a = open('test.txt','rt',errors = 'ignore' )
>>> a.read()
'鎴戞槸绗1琛屾枃鏈锛屾垜灏嗚鏄剧ず鍦ㄥ睆骞\n鎴戞槸绗2琛屾枃鏈锛屾垜灏嗚鏄剧ず鍦ㄥ睆骞\n鎴戞槸绗3琛屾枃鏈锛屾垜灏嗚鏄剧ず鍦ㄥ睆骞'
>>> a = open('test.txt','rt',errors = 'replace' )
>>> a.read()
'鎴戞槸绗�1琛屾枃鏈�锛屾垜灏嗚��鏄剧ず鍦ㄥ睆骞�\n鎴戞槸绗�2琛屾枃鏈�锛屾垜灏嗚��鏄剧ず鍦ㄥ睆骞�\n鎴戞槸绗�3琛屾枃鏈�锛屾垜灏嗚��鏄剧ず鍦ㄥ睆骞�'

　　7. newline表示用于区分换行符(只对文本模式有效，可以取的值有None,'\n','\r','','\r\n')

>>> a = open('test.txt','rt',encoding = 'utf-8',newline = '\r')
>>> a.readline()
'我是第1行文本，我将被显示在屏幕\r'
>>> a = open('test.txt','rt',encoding = 'utf-8',newline = '\n')
>>> a.readline()
'我是第1行文本，我将被显示在屏幕\r\n'

　　8. closefd表示传入的file参数类型（缺省为True），传入文件路径时一定为True，传入文件句柄则为False。

>>> a = open('test.txt','rt',encoding = 'utf-8',newline = '\n',closefd = False)
Traceback (most recent call last):
  File "<pyshell#115>", line 1, in <module>
    a = open('test.txt','rt',encoding = 'utf-8',newline = '\n',closefd = False)
ValueError: Cannot use closefd=False with file name
>>> a = open('test.txt','rt',encoding = 'utf-8',newline = '\n',closefd = True)

转载于:https://www.cnblogs.com/sesshoumaru/p/6047046.html

本文内容由网友自发贡献，版权归原作者所有，本站不承担相应法律责任。如您发现有涉嫌抄袭侵权的内容，请联系:hwhale#tublm.com(使用前将#替换为@)

python

Python内置函数(47)——open 的相关文章

(discord.py) 尝试更改成员角色时，“用户”对象没有属性“角色”

因此我正在尝试编写一个机器人让某人在命令中指定的主持人指定的一段时间内暂停角色我知道该变量称为小时即使它目前以秒为单位我稍后会解决这个问题基本上它是由主持人在消息暂停 personmention numberofhours
InterfaceError：连接已关闭（使用 django + celery + Scrapy）

当我在 Celery 任务中使用 Scrapy 解析函数有时可能需要 10 分钟时我得到了这个信息我用姜戈 1 6 5 django celery 3 1 16 芹菜 3 1 16 psycopg2 2 5 5 我也使用了psyc
Python PAM 模块的安全问题？

我有兴趣编写一个 PAM 模块该模块将利用流行的 Unix 登录身份验证机制我过去的大部分编程经验都是使用 Python 进行的并且我正在交互的系统已经有一个 Python API 我用谷歌搜索发现pam python http pa
DreamPie 不适用于 Python 3.2

我最喜欢的 Python shell 是DreamPie http dreampie sourceforge net 我想将它与 Python 3 2 一起使用我使用了添加解释器 DreamPie 应用程序并添加了 Python 3 2
导入错误：没有名为 _ssl 的模块

带 Python 2 7 的 Ubuntu Maverick 我不知道如何解决以下导入错误 gt gt gt import ssl Traceback most recent call last File
Flask 和 uWSGI - 无法加载应用程序 0 (mountpoint='')（找不到可调用或导入错误）

当我尝试使用 uWSGI 启动 Flask 时出现以下错误我是这样开始的 gt cd gt root localhost uwsgi socket 127 0 0 1 6000 file path to folder run py ca
如何等到 Excel 计算公式后再继续 win32com

我有一个 win32com Python 脚本它将多个 Excel 文件合并到电子表格中并将其另存为 PDF 现在的工作原理是输出几乎都是 NAME 因为文件是在计算 Excel 文件内容之前输出的这可能需要一分钟如何强制工作簿计算值
SQL Alchemy 中的 NULL 安全不等式比较？

目前我知道如何表达 NULL 安全的唯一方法 SQL Alchemy 中的比较其中与 NULL 条目的比较计算结果为 True 而不是 NULL 是 or field None field value 有没有办法在 SQL Alchem
如何使用 Scrapy 从网站获取所有纯文本？

我希望在 HTML 呈现后可以从网站上看到所有文本我正在使用 Scrapy 框架使用 Python 工作和xpath body text 我能够获取它但是带有 HTML 标签而且我只想要文本有什么解决办法吗最简单的选择是ext
安装后 Anaconda 提示损坏

我刚刚安装张量流GPU创建单独的后环境按照以下指示here https github com antoniosehk keras tensorflow windows installation 但是安装后当我关闭提示窗口并打开新航站楼弹出
IRichBolt 在storm-1.0.0 和 pyleus-0.3.0 上运行拓扑时出错

我正在运行风暴拓扑 pyleus verbose local xyz topology jar using storm 1 0 0 pyleus 0 3 0 centos 6 6并得到错误线程 main java lang NoClass
表达式中的 Python 'in' 关键字与 for 循环中的比较 [重复]

这个问题在这里已经有答案了我明白什么是in运算符在此代码中执行的操作 some list 1 2 3 4 5 print 2 in some list 我也明白i将采用此代码中列表的每个值 for i in 1 2 3 4 5 print
如何将 numpy.matrix 提高到非整数幂？

The 运算符为numpy matrix不支持非整数幂 gt gt gt m matrix 1 0 0 5 0 5 gt gt gt m 2 5 TypeError exponent must be an integer 我想要的是 oct
Python - 按月对日期进行分组

这是一个简单的问题起初我认为很简单而忽略了它一个小时过去了我不太确定所以我有一个Python列表datetime对象我想用图表来表示它们 x 值是年份和月份 y 值是此列表中本月发生的日期对象的数量也许一个例子可以更好地证明这
如何在 Django 中使用并发进程记录到单个文件而不使用独占锁

给定一个在多个服务器上同时执行的 Django 应用程序该应用程序如何记录到单个共享日志文件在网络共享中而不保持该文件以独占模式永久打开当您想要利用日志流时这种情况适用于 Windows Azure 网站上托管的 Django 应
设置 torch.gather(...) 调用的结果

我有一个形状为 n x m 的 2D pytorch 张量我想使用索引列表来索引第二个维度可以使用 torch gather 完成然后然后还设置新值到索引的结果 Example data torch tensor 0 1 2 3 4
如何从没有结尾的管道中读取 python 中的 stdin

当管道来自打开时不知道正确的名称我无法从 python 中的标准输入或管道读取数据文件我有作为例子管道测试 py import sys import time k 0 try for line in sys stdin k k
在python中，如何仅搜索所选子字符串之前的一个单词

给定文本文件中的长行列表我只想返回紧邻其前面的子字符串例如单词狗描述狗的单词例如假设有这些行包含狗 hotdog big dog is dogged dog spy with my dog brown dogs 在这种情况下期望
您可以在 Python 类型注释中指定方差吗？

你能发现下面代码中的错误吗米皮不能 from typing import Dict Any def add items d Dict str Any gt None d foo 5 d Dict str str add items d f
Python - 字典和列表相交

给定以下数据结构找出这两种数据结构共有的交集键的最有效方法是什么 dict1 2A 3A 4B list1 2A 4B Expected output 2A 4B 如果这也能产生更快的输出我可以将列表不是 dict1 组织到任何其他数

随机推荐

Java动态代理一——动态类Proxy的使用

1 什么是动态代理答动态代理可以提供对另一个对象的访问同时隐藏实际对象的具体事实代理一般会实现它所表示的实际对象的接口代理可以访问实际对象但是延迟实现实际对象的部分功能实际对象实现系统的实际功能代理对象对客户隐藏了实际对象
CSS选择器总结

元素选择器作用通过元素选择器可以选择页面中的所有元素语法标签名如下选中所有的P标签 p color red font size 40px ID选择器作用通过元素ID属性值选中唯一的一个元素语法 id属性值如下选中ID为
数据结构（2.1）——时间复杂度和空间复杂度计算

前言 1 因为上一篇博客数据结构 2 算法对于时间复杂度和空间复杂度计算的讲解太少所以我在次增加多个案例讲解 2 上一篇已经详细介绍了为什么我们的算法要使用复杂度这一个概念因此我这一篇将重点介绍复杂度如何进行计算时间复杂度计算
使用ulisesbocchio对spring-boot项目properties配置文件信息加密

2019独角兽企业重金招聘Python工程师标准 gt gt gt Spring boot项目中properties文件中的密码明文不太安全所以想到给明文加密了解了一下有一个依赖工具可以实现这个功能 Ulisesbocchio插件 1
【机器学习】使用scikit-learn实现多元线性回归（10min阅读时长）

Multiple Linear Regression 多元线性回归之前有一篇简单线性回归的文章大家感兴趣可以看看使用scikit learn实现简单线性回归 Objectives 目标看完这篇文章将会 1 使用scikit lea
勇士屠熊，绿军射鹿，夕阳西下，人群散尽，唯有烈火燎原势不可挡

SpringBoot的日志一了解日志 1 什么是日志 2 日志的作用二自定义打印日志 1 实现步骤 2 日志的格式说明三日志级别 1 了解日志级别 2 配置日志级别四日志持久化五使用lombok进行日志输出 1 步骤 2
zerotier搭建moon模式

最近发现zerotier内网穿透在和家里nas存储交互网速好像不怎么样于是想搞个moon看看是不是会有所改善先决条件建议有一台云服务器很多童鞋说要钱刚刚白piao了一百度云的服务器一年只要38RMB 配置CentOS7 9 1C
编译cryptopp库

1 下载源码网址 https github com golang crypto git 2 打开里面的cryptest sln 如下图 3 打开后如下图所示 4 接着邮件crptlib属性修改内容如下所示 release版本改为如下对
【知识点】eval() 的用法

目录一基本知识二具体实例三项目应用总结一基本知识返回传入字符的表达式的结果即将字符串当成有效的表达式进行运算求值并返回结果从某种意义上说 eval就是实现list dict tuple和 str 之间的相互转换
cookie、session以及token的定义、区别、使用环境

Cookie Cookie 的工作原理由于 HTTP 是一种无状态的协议服务器单从网络连接上无从知道客户身份怎么办呢就给客户端们颁发一个通行证吧每人一个无论谁访问都必须携带自己通行证这样服务器就能从通行证上确认客户身份了这就
一个按键控制数码管的开和关_按键控制数码管显示

功能按键查询控制数码管显示的数据定时器中断控制数码管扫描显示所用器件 STC12C5A32S2 include config h define uint unsigned int define uchar unsigned char
2021-08-12 一阶系统的频率响应低通滤波器
深入浅出PID控制算法（三）————增量式与位置式PID算法的C语言实现与电机控制经验总结

前文对PID算法离散化和增量式PID算法原理进行来探索之后又使用Matlab进行了仿真实验对PID三个参数又有了更深入的认识接下来我们来使用C语言进行PID算法实现并且结合控制电机的项目来深入学习 1 PID 算法C 语言原代码先
[BJDCTF2020]EasySearch1

BJDCTF2020 EasySearch1 0x01漏洞类型打开题目如图所示还是对CTF套路不太熟悉拿到这种就以为是sql注入启动sqlmap就一顿操作都大了搞竞赛还来得及吗参考别人的wp后知道是源码泄露这里就不给服务器
QT中监控全局键盘鼠标事件

先介绍一下在单一Widget等控件中监听鼠标键盘事件的代码 void mouseMoveEvent QMouseEvent event void mouseReleaseEvent QMouseEvent event void keyPre
CNN代码系列之训练源文件及头文件（二）

本博客为CNN卷积代码系列之训练源文件及头文件注意本博客是系列博客请链接上一博客http blog csdn net samylee article details 69325368 训练主程序中的头文件 funset hpp ifn
半路出家OCR后成领域专家，白翔：计算机视觉科研没有捷径

极市学者专访第三期听大牛说说计算机视觉那些事儿 AI派在读学生小姐姐Beyonce Java实战项目练习群长按识别下方二维码按需求添加扫码添加Beyonce小姐姐扫码关注进Java学习大礼包本次极市学者访谈我们非常荣幸地邀
WebSSH2 界面ssh

工具 Virtual Machines14 1 系统环境 CentOS 7 64位 2个 IP 192 168 163 138 IP 192 168 163 141 概述在138系统中安装部署WebSSH服务通过浏览器 http Web
[SLAM四元数基础系列一] 四元数定义 Hamilton vs JPL

四元数定义 Hamilton vs JPL 简介四种区分方式 Hamilton vs JPL 引用不管是卡尔曼滤波或者BA优化形式的SLAM或者VIO系统中都需要用到单位四元数 Quaternion 来表示旋转主要是单位四元数表示旋
Python内置函数(47)——open

英文文档 open file mode r buffering 1 encoding None errors None newline None closefd True opener None Open file and return a

Python内置函数(47)——open

Python内置函数(47)——open 的相关文章

随机推荐

热门标签