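I. Functions
1. Introduction
A function is an organized, reusable block of code that performs a single task or a group of related tasks. Functions make a program more modular and increase code reuse.
2. Structure
A function definition begins with the def keyword, followed by the function name and parentheses ().
Parameters are declared inside the parentheses, and arguments are passed inside them when the function is called.
The first statement of the function body may optionally be a documentation string (docstring) that describes the function.
Function body: the body starts after the colon and must be indented.
End of the function: return [expression] exits the function and optionally returns a value to the caller. A return without an expression returns None.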
Example:

    def functionname(parameters):
        "Function description; this string becomes the function's __doc__"
        function_body
        return [expression]
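3. A simple call
Using the example above, calling the function is straightforward:

    functionname(arguments)

Note that defining a function does not run its body. The body runs only when the function is called.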
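The points above (docstring, return value, and the fact that a body runs only on a call) can be sketched as follows. The function names greet and log_only are made up for illustration.

```python
def greet(name):
    """Return a greeting for name."""
    print("greet() body is running")
    return "Hello, " + name


def log_only(message):
    """Print a message; there is no return statement."""
    print("log:", message)


# Defining the functions above printed nothing; the bodies run only on a call.
result = greet("wzg")
print(result)

# A function that ends without `return <value>` hands back None.
nothing = log_only("done")
print(nothing is None)
```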
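II. Components of a Function
2.1 Naming functions
Function names should be lowercase, with words separated by underscores where that improves readability, for example myfunction or my_example_function.
The naming conventions recommended by Guido, the creator of Python, include the following: module and package names use lowercase letters, with words separated by underscores;
class names start with a capital letter and capitalize the first letter of each word;
global and class constants are written entirely in uppercase, with words separated by underscores; all other variables are written in lowercase, with words separated by underscores.
Any of the above that is internal (non-public) is named with a leading underscore.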
2.2 Function parameters
Function arguments fall into four categories: positional (ordinary) arguments, default arguments, keyword (named) arguments, and variable-length arguments.
2.2.1 Positional arguments

    def functionname(name, age):
        print("I'm %s, age %s" % (name, age))

    functionname("wzg", 18)
    # out: I'm wzg, age 18

2.2.2 Default arguments

    def functionname(name, age=18):
        print("I'm %s, age %s" % (name, age))

    functionname("wzg")
    # out: I'm wzg, age 18

2.2.3 Keyword arguments

    def functionname(name, age=18):
        print("I'm %s, age %s" % (name, age))

    functionname(age=18, name="wzg")
    # out: I'm wzg, age 18
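2.2.4 Variable-length arguments
2.2.4.1 Variable-length arguments with *

    def function_name(*args):
        print(args, type(args))

    function_name(1, 2, 3)
    # out: (1, 2, 3) <class 'tuple'>

    function_name((1, 2, 3))
    # out: ((1, 2, 3),) <class 'tuple'>

As the example shows, all positional arguments passed in are packed into a single tuple, i.e. args = (...). A tuple passed as one argument therefore becomes a single element of args.

    def function_name(*args):
        print(args, type(args))

    function_name(*(1, 2, 3))
    # out: (1, 2, 3) <class 'tuple'>

    function_name(*'wzg')
    # out: ('w', 'z', 'g') <class 'tuple'>

As this example shows, prefixing an argument with * in the call unpacks it: each element of the iterable is passed as a separate positional argument and ends up as one element of the tuple. For a string, each character becomes one element.
2.2.4.2 Variable-length arguments with **

    def function_name(**args):
        print(args, type(args))

    function_name(name='wzg')
    # out: {'name': 'wzg'} <class 'dict'>

    dic = {"k1": "v1", "k2": "v2"}
    function_name(**dic)
    # out: {'k1': 'v1', 'k2': 'v2'} <class 'dict'>

    function_name(d1=dic)
    # out: {'d1': {'k1': 'v1', 'k2': 'v2'}} <class 'dict'>

As the example shows, a ** parameter collects all keyword arguments into a dict. Prefixing a dict with ** in the call unpacks it into individual keyword arguments.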
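The two forms are often combined. The sketch below, using the hypothetical functions describe and call_with_log, shows a required parameter together with *args and **kwargs, and how a wrapper can forward whatever it received.

```python
def describe(name, *args, **kwargs):
    """Hypothetical function: one required parameter plus extras."""
    # args is a tuple of the extra positional arguments,
    # kwargs is a dict of the extra keyword arguments.
    return name, args, kwargs


def call_with_log(func, *args, **kwargs):
    """Forward whatever was received to func unchanged."""
    print("calling", func.__name__)
    return func(*args, **kwargs)


print(describe("wzg", 18, "beijing", job="dev"))
print(call_with_log(describe, "wzg", 18, job="dev"))
```

In a definition, * and ** collect arguments; in a call, they spread them out again, which is what lets call_with_log stay independent of the signature of func.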
2.3 Function docstrings
Python has a distinctive way of documenting code: documentation strings (docstrings). A docstring is a string that is the first statement in a package, module, class, or function. It can be extracted automatically through the object's __doc__ attribute and is used by pydoc. Consider the following code:

    def function_name(big_table, keys, other_silly_variable=None):
        """Fetches rows from a Bigtable.

        Retrieves rows pertaining to the given keys from the Table instance
        represented by big_table. Silly things may happen if
        other_silly_variable is not None.

        Args:
            big_table: An open Bigtable Table instance.
            keys: A sequence of strings representing the key of each table row
                to fetch.
            other_silly_variable: Another optional variable, that has a much
                longer name than the other args, and which does nothing.

        Returns:
            A dict mapping keys to the corresponding table row data
            fetched. Each row is represented as a tuple of strings. For
            example:

            {'Serak': ('Rigel VII', 'Preparer'),
             'Zim': ('Irk', 'Invader'),
             'Lrrr': ('Omicron Persei 8', 'Emperor')}

            If a key from the keys argument is missing from the dictionary,
            then that row was not found in the table.

        Raises:
            IOError: An error occurred accessing the bigtable.Table object.
        """
        function_body
        return [expression]

Docstring example
As the example shows, a function docstring contains the following parts:
1. A summary of what the function does
2. A description of the input parameters
3. A description of the output/return value
4. A description of the exceptions raised
5. Anything else worth noting
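To make the __doc__ mechanism concrete, here is a minimal sketch. The function add is invented for the example; inspect.getdoc is the standard-library helper that returns the docstring with its indentation cleaned up.

```python
import inspect


def add(a, b):
    """Return the sum of a and b.

    Args:
        a: first number.
        b: second number.

    Returns:
        The value a + b.
    """
    return a + b


# The raw docstring is stored on the function object.
print(add.__doc__.splitlines()[0])

# inspect.getdoc() returns the same text with the indentation cleaned up.
print(inspect.getdoc(add))
```

Running help(add) in an interactive session displays the same text.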
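2.4 Function body
The function body is where the code logic is implemented.
2.5 Return value
The return value is optional: a function can return the value of any expression, including a data structure. If nothing is returned explicitly, the function returns None.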
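A short sketch of these cases, using two made-up functions: one returns several values at once (as a tuple), the other returns early in one branch and implicitly returns None otherwise.

```python
def min_max(values):
    """Return the smallest and largest item as a tuple."""
    return min(values), max(values)   # the comma builds a tuple


def find_first_negative(values):
    """Return the first negative item, or None if there is none."""
    for v in values:
        if v < 0:
            return v                  # return exits the function immediately
    # falling off the end returns None implicitly


low, high = min_max([3, 1, 4, 1, 5])
print(low, high)
print(find_first_negative([2, -7, -1]))
print(find_first_negative([2, 7]))
```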
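III. Types of Functions
Functions can be roughly divided into the following types:
Built-in functions
User-defined functions
Anonymous functions
3.1 Built-in functions (__builtins__)
The official Python 3.5 documentation provides the current list of built-in functions.
Compared with Python 2.7:
Added: ascii(), bytes(), exec() (in Python 2.7, exec was a statement rather than a function, and bytes was only an alias for str)
Removed: basestring(), cmp(), execfile(), file(), long(), raw_input(), reduce(), reload(), unichr(), unicode(), xrange()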
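Most of the removed built-ins have a direct Python 3 counterpart. The sketch below shows some common replacements; it is not an exhaustive migration guide, and the cmp helper is a hand-written stand-in rather than a library function.

```python
import functools
import importlib
import json

# reduce() moved to the functools module.
total = functools.reduce(lambda acc, n: acc + n, [1, 2, 3, 4])
print(total)

# xrange() is gone; range() is lazy in Python 3.
print(list(range(3)))

# unichr() and unicode() are covered by chr() and str().
print(chr(97), str(42))

# long() is covered by int(), which has arbitrary precision.
big = int("123456789012345678901234567890") + 1
print(big)


# cmp() has no built-in replacement; this helper is a common stand-in.
def cmp(a, b):
    return (a > b) - (a < b)


print(cmp(1, 2), cmp(2, 2), cmp(3, 2))

# reload() moved to importlib; raw_input() became input().
importlib.reload(json)
```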
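3.2 Commonly used built-in functions
Function | Purpose |
---|---|
all(iterable) | Returns True if every element of the iterable is true. As a special case, returns True for an empty iterable. |
any(iterable) | Returns True if at least one element of the iterable is true. As a special case, returns False for an empty iterable. |
bool([x]) | Converts x to a Boolean value |
ascii() | Calls the object's __repr__ method. Like repr(), it returns a printable string representation of the object, but non-ASCII characters are escaped with \x, \u, or \U. It is equivalent to repr() in Python 2. |
abs(x) | Returns the absolute value of x |
pow(x, y[, z]) | Returns x to the power y; if z is given, returns the result modulo z |
oct(x) | Converts the integer x to an octal string |
hex(x) | Converts the integer x to a hexadecimal string |
bin(x) | Converts the integer x to a binary string |
bytes("string to convert", encoding="encoding") | Converts a string to bytes |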
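The following sketch exercises each function in the table once. The sample values are arbitrary.

```python
print(all([1, "a", [0]]), all([]))        # every element true; empty -> True
print(any([0, "", None]), any([]))        # no element true; empty -> False
print(bool(0), bool("text"))
print(abs(-3.5))
print(pow(2, 10), pow(2, 10, 1000))       # 2**10 and (2**10) % 1000
print(oct(8), hex(255), bin(5))
print(bytes("中文", encoding="utf-8"))     # str -> bytes
print(ascii("中文"))                       # non-ASCII shown as \u escapes
```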
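3.3 User-defined functions
Most of the functions we use day to day, and all the functions we create during development, are user-defined functions. They greatly improve code reuse and readability.
Creating and calling user-defined functions has already been described with examples above, so it is not repeated here.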
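3.4 Anonymous functions
Python uses lambda to create anonymous functions.
A lambda is just an expression, so its function body is much simpler than that of a def.
The body of a lambda is a single expression, not a block of code, so only limited logic can be packed into it.
A lambda has its own local namespace for its parameters. Like a function defined with def, it can also read names from enclosing functions and from the global namespace.
Although a lambda fits on a single line, it is not the same as an inline function in C or C++, whose purpose is to avoid the overhead of a function call for small functions and so run faster.
3.4.1 Syntax for creating an anonymous function
lambda [arg1 [, arg2, ... argn]]: expression
3.4.2 Examples

    >>> lambda x: x + 1            # one parameter
    >>> lambda x, y, z: x + y + z  # several parameters
    >>> lambda x, y=3: x * y       # default values are allowed, but parameters with defaults must come last

    a = lambda x, y=3: x * y
    print(a(4))
    # out: 12

    print(a(2, 4))
    # out: 8

    b = lambda x, y, z: x * y * z
    print(b(1, 2, 3))
    # out: 6

lambda examples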
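In practice a lambda is most often passed straight to another function. The sketch below (sample data and names are made up) uses one as a sort key, and also shows that a lambda follows the ordinary name-lookup rules: it can read a global name and a variable of an enclosing function.

```python
people = [("wzg", 18), ("eric", 25), ("boss", 21)]   # sample data (made up)

# A lambda is often passed directly as the key function.
by_age = sorted(people, key=lambda person: person[1])
print(by_age)

# A lambda can read a global name, just like a def function.
bonus = 10
add_bonus = lambda score: score + bonus
print(add_bonus(5))


# It can also read a variable of the enclosing function (a closure).
def make_multiplier(factor):
    return lambda x: x * factor


triple = make_multiplier(3)
print(triple(7))
```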
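IV. Scope
Python has four kinds of scope:
L: local, the local scope, i.e. variables defined inside a function;
E: enclosing, the local scope of an enclosing (outer) function, i.e. the function that contains the current one; it is not global;
G: global, i.e. variables defined at module level;
B: built-in, i.e. names from the built-in module, such as int and bytearray.
Names are looked up in this order: local scope > enclosing scope > global scope of the current module > built-in scope. This is the LEGB rule.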
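The LEGB order can be sketched with one name defined at several levels. All function and variable names here are invented for the demonstration.

```python
label = "global"              # G: module-level name


def outer():
    label = "enclosing"       # E: local to outer(), enclosing for inner()

    def inner():
        label = "local"       # L: local to inner()
        return label

    def inner_no_local():
        return label          # not local, so found in outer(): E

    return inner(), inner_no_local()


def reads_global():
    return label              # no L or E binding, so found in the module: G


def reads_builtin():
    return len("abc")         # len is not defined in this file: B


print(outer())
print(reads_global())
print(reads_builtin())
```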
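In Python, scope is defined by functions; there is no block-level scope, much like variables declared with var in JavaScript. "Block" here refers to statements such as if, while, and for. To understand Python scope in depth, keep these points in mind:
Python has a scope chain. Names are looked up from the innermost scope outward, and a NameError ("name ... is not defined") is raised if the name is not found anywhere.
A function's scope, and therefore its scope chain, is fixed by where the function is defined, before it is ever executed.
A function's body does not run until the function is called. The value of a name taken from an outer scope is therefore determined when the function actually runs, not when it was defined.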
The code below illustrates these points:

    name = 'boss'

    def f1():
        print(name)

    def f2():
        name = 'eric'
        return f1

    ret = f2()
    ret()
    # out: boss

    # -------------------------
    name = 'boss'

    def f1():
        print(name)

    def f2():
        name = 'eric'
        f1()

    f2()
    # out: boss

Scope chain example

    li = []
    for i in range(10):
        def f1():
            return i
        li.append(f1)

    print(li[0]())  # out: 9
    print(li[1]())  # out: 9
    print(li[2]())  # out: 9
    print(li[3]())  # out: 9

    # -------------------------------- what matters is whether the code has run yet
    li = []
    for i in range(10):
        def f1(x=i):
            return x
        li.append(f1)

    print(li[0]())  # out: 0
    print(li[1]())  # out: 1
    print(li[2]())  # out: 2
    print(li[3]())  # out: 3

A function body does not run before the function is called: example 1

    li = [lambda: x for x in range(10)]

    ret = li[0]()
    print(ret)  # out: 9
    ret = li[1]()
    print(ret)  # out: 9
    ret = li[2]()
    print(ret)  # out: 9

A function body does not run before the function is called: example 2
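Two of the earlier points, the absence of block-level scope and the error raised when a lookup fails, can be sketched as follows. The function names are hypothetical, and undefined_name is deliberately left undefined.

```python
def no_block_scope(flag):
    if flag:
        found = "set inside if"      # no new scope is created by the if block
    for index in range(3):
        pass
    # Both names are still visible here, after their blocks have ended.
    return found, index


def lookup_fails():
    try:
        return undefined_name        # not in L, E, G, or B
    except NameError as exc:
        return str(exc)


print(no_block_scope(True))
print(lookup_fails())
```

Note that no_block_scope(False) would fail, because found is only bound when the if branch actually runs.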
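V. File Operations
Working with a file generally involves three steps:
1. Open the file
2. Operate on the file
3. Close the file (no explicit call is needed when a with statement is used)
5.1 Opening a file
To open a file, you specify its path and the mode in which to open it. Opening the file returns a file handle, through which all later operations on the file are performed. Files are normally opened with the open() function; its source documents the available modes: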
    def open(file, mode='r', buffering=None, encoding=None, errors=None, newline=None, closefd=True): # known special case of open
        r"""
        Open file and return a stream. Raise IOError upon failure.

        file is either a text or byte string giving the name (and the path
        if the file isn't in the current working directory) of the file to
        be opened or an integer file descriptor of the file to be
        wrapped. (If a file descriptor is given, it is closed when the
        returned I/O object is closed, unless closefd is set to False.)

        mode is an optional string that specifies the mode in which the file
        is opened. It defaults to 'r' which means open for reading in text
        mode. Other common values are 'w' for writing (truncating the file if
        it already exists), 'x' for creating and writing to a new file, and
        'a' for appending (which on some Unix systems, means that all writes
        append to the end of the file regardless of the current seek position).
        In text mode, if encoding is not specified the encoding used is platform
        dependent: locale.getpreferredencoding(False) is called to get the
        current locale encoding. (For reading and writing raw bytes use binary
        mode and leave encoding unspecified.) The available modes are:

        ========= ===============================================================
        Character Meaning
        --------- ---------------------------------------------------------------
        'r'       open for reading (default)
        'w'       open for writing, truncating the file first
        'x'       create a new file and open it for writing (raises an error if the file exists)
        'a'       open for writing, appending to the end of the file if it exists
        'b'       binary mode
        't'       text mode (default)
        '+'       open a disk file for updating (reading and writing)
        'U'       universal newline mode (deprecated)
        ========= ===============================================================

        The default mode is 'rt' (open for reading text). For binary random
        access, the mode 'w+b' opens and truncates the file to 0 bytes, while
        'r+b' opens the file without truncation. The 'x' mode implies 'w' and
        raises an `FileExistsError` if the file already exists.

        Python distinguishes between files opened in binary and text modes,
        even when the underlying operating system doesn't. Files opened in
        binary mode (appending 'b' to the mode argument) return contents as
        bytes objects without any decoding. In text mode (the default, or when
        't' is appended to the mode argument), the contents of the file are
        returned as strings, the bytes having been first decoded using a
        platform-dependent encoding or using the specified encoding if given.

        'U' mode is deprecated and will raise an exception in future versions
        of Python. It has no effect in Python 3. Use newline to control
        universal newlines mode.

        buffering is an optional integer used to set the buffering policy.
        Pass 0 to switch buffering off (only allowed in binary mode), 1 to select
        line buffering (only usable in text mode), and an integer > 1 to indicate
        the size of a fixed-size chunk buffer. When no buffering argument is
        given, the default buffering policy works as follows:

        * Binary files are buffered in fixed-size chunks; the size of the buffer
          is chosen using a heuristic trying to determine the underlying device's
          "block size" and falling back on `io.DEFAULT_BUFFER_SIZE`.
          On many systems, the buffer will typically be 4096 or 8192 bytes long.

        * "Interactive" text files (files for which isatty() returns True)
          use line buffering. Other text files use the policy described above
          for binary files.

        encoding is the name of the encoding used to decode or encode the
        file. This should only be used in text mode. The default encoding is
        platform dependent, but any encoding supported by Python can be
        passed. See the codecs module for the list of supported encodings.

        errors is an optional string that specifies how encoding errors are to
        be handled---this argument should not be used in binary mode. Pass
        'strict' to raise a ValueError exception if there is an encoding error
        (the default of None has the same effect), or pass 'ignore' to ignore
        errors. (Note that ignoring encoding errors can lead to data loss.)
        See the documentation for codecs.register or run 'help(codecs.Codec)'
        for a list of the permitted encoding error strings.

        newline controls how universal newlines works (it only applies to text
        mode). It can be None, '', '\n', '\r', and '\r\n'. It works as
        follows:

        * On input, if newline is None, universal newlines mode is
          enabled. Lines in the input can end in '\n', '\r', or '\r\n', and
          these are translated into '\n' before being returned to the
          caller. If it is '', universal newline mode is enabled, but line
          endings are returned to the caller untranslated. If it has any of
          the other legal values, input lines are only terminated by the given
          string, and the line ending is returned to the caller untranslated.

        * On output, if newline is None, any '\n' characters written are
          translated to the system default line separator, os.linesep. If
          newline is '' or '\n', no translation takes place. If newline is any
          of the other legal values, any '\n' characters written are translated
          to the given string.

        If closefd is False, the underlying file descriptor will be kept open
        when the file is closed. This does not work when a file name is given
        and must be True in that case.

        A custom opener can be used by passing a callable as *opener*. The
        underlying file descriptor for the file object is then obtained by
        calling *opener* with (*file*, *flags*). *opener* must return an open
        file descriptor (passing os.open as *opener* results in functionality
        similar to passing None).

        open() returns a file object whose type depends on the mode, and
        through which the standard file operations such as reading and writing
        are performed. When open() is used to open a file in a text mode ('w',
        'r', 'wt', 'rt', etc.), it returns a TextIOWrapper. When used to open
        a file in a binary mode, the returned class varies: in read binary
        mode, it returns a BufferedReader; in write binary and append binary
        modes, it returns a BufferedWriter, and in read/write mode, it returns
        a BufferedRandom.

        It is also possible to use a string or bytearray as a file for both
        reading and writing. For strings StringIO can be used like a file
        opened in a text mode, and for bytes a BytesIO can be used like a file
        opened in a binary mode.
        """
        pass

Source of the open function (Python 3.5.1)
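Mode | Description |
---|---|
r | Opens the file read-only. The file pointer is placed at the beginning of the file. This is the default mode. |
rb | Opens the file read-only in binary format. The file pointer is placed at the beginning of the file. |
r+ | Opens the file for reading and writing. The file pointer is placed at the beginning of the file. |
rb+ | Opens the file for reading and writing in binary format. The file pointer is placed at the beginning of the file. |
w | Opens the file for writing only. If the file already exists, its contents are truncated and overwritten. If it does not exist, a new file is created. |
wb | Opens the file for writing only in binary format. If the file already exists, its contents are truncated and overwritten. If it does not exist, a new file is created. |
w+ | Opens the file for reading and writing. If the file already exists, its contents are truncated and overwritten. If it does not exist, a new file is created. This mode is rarely used. |
wb+ | Opens the file for reading and writing in binary format. If the file already exists, its contents are truncated and overwritten. If it does not exist, a new file is created. |
a | Opens the file for appending. If the file already exists, the file pointer is placed at the end of the file, so new content is written after the existing content. If it does not exist, a new file is created for writing. |
ab | Opens the file for appending in binary format. If the file already exists, the file pointer is placed at the end of the file, so new content is written after the existing content. If it does not exist, a new file is created for writing. |
a+ | Opens the file for reading and appending. If the file already exists, the file pointer is placed at the end of the file and writes are appended. If it does not exist, a new file is created for reading and writing. |
ab+ | Opens the file for reading and appending in binary format. If the file already exists, the file pointer is placed at the end of the file. If it does not exist, a new file is created for reading and writing. |
x | Raises an error (FileExistsError) if the file already exists; otherwise creates the file and opens it for writing. |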
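The behavior of several modes from the table can be checked with a short experiment. This is a sketch that works in a temporary directory; the file name demo.txt is made up.

```python
import os
import tempfile

# Work in a throwaway directory; the file name is made up for the demo.
with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "demo.txt")

    with open(path, "x", encoding="utf-8") as f:    # create; fails if it exists
        f.write("first\n")

    x_failed = False
    try:
        open(path, "x", encoding="utf-8")           # the file exists now
    except FileExistsError:
        x_failed = True

    with open(path, "a", encoding="utf-8") as f:    # append at the end
        f.write("second\n")

    with open(path, "r", encoding="utf-8") as f:    # read-only
        after_append = f.read()

    with open(path, "w", encoding="utf-8") as f:    # truncate, then write
        f.write("replaced\n")

    with open(path, "r+", encoding="utf-8") as f:   # read and write, no truncation
        after_write = f.read()

print(x_failed, repr(after_append), repr(after_write))
```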
5.2 Operating on a file
Once you have a file handle, you can perform the following operations on the file through it:
    class TextIOWrapper(_TextIOBase):
        r"""
        Character and line based layer over a BufferedIOBase object, buffer.

        encoding gives the name of the encoding that the stream will be
        decoded or encoded with. It defaults to locale.getpreferredencoding(False).

        errors determines the strictness of encoding and decoding (see
        help(codecs.Codec) or the documentation for codecs.register) and
        defaults to "strict".

        newline controls how line endings are handled. It can be None, '',
        '\n', '\r', and '\r\n'. It works as follows:

        * On input, if newline is None, universal newlines mode is
          enabled. Lines in the input can end in '\n', '\r', or '\r\n', and
          these are translated into '\n' before being returned to the
          caller. If it is '', universal newline mode is enabled, but line
          endings are returned to the caller untranslated. If it has any of
          the other legal values, input lines are only terminated by the given
          string, and the line ending is returned to the caller untranslated.

        * On output, if newline is None, any '\n' characters written are
          translated to the system default line separator, os.linesep. If
          newline is '' or '\n', no translation takes place. If newline is any
          of the other legal values, any '\n' characters written are translated
          to the given string.

        If line_buffering is True, a call to flush is implied when a call to
        write contains a newline character.
        """
        def close(self, *args, **kwargs): # real signature unknown
            """Close the file handle."""
            pass

        def detach(self, *args, **kwargs): # real signature unknown
            """Detach the text encoding layer and return the underlying binary buffer."""
            pass

        def fileno(self, *args, **kwargs): # real signature unknown
            """Return the integer file descriptor that the underlying implementation uses for operating-system I/O."""
            pass

        def flush(self, *args, **kwargs): # real signature unknown
            """Flush the buffered data to the file."""
            pass

        def isatty(self, *args, **kwargs): # real signature unknown
            """Return whether the file is connected to a tty device."""
            pass

        def read(self, *args, **kwargs): # real signature unknown
            """Read the contents of the file (everything up to the end by default)."""
            pass

        def readable(self, *args, **kwargs): # real signature unknown
            """Return whether the file can be read."""
            pass

        def readline(self, *args, **kwargs): # real signature unknown
            """Read a single line from the file."""
            pass

        def seek(self, *args, **kwargs): # real signature unknown
            """Set the position of the file pointer."""
            pass

        def seekable(self, *args, **kwargs): # real signature unknown
            """Return whether the file pointer can be moved."""
            pass

        def tell(self, *args, **kwargs): # real signature unknown
            """Return the current position of the file pointer."""
            pass

        def truncate(self, *args, **kwargs): # real signature unknown
            """Truncate the file at the current pointer position, keeping only the content before it."""
            pass

        def writable(self, *args, **kwargs): # real signature unknown
            """Return whether the file can be written."""
            pass

        def write(self, *args, **kwargs): # real signature unknown
            """Write a string to the file."""
            pass

        def __getstate__(self, *args, **kwargs): # real signature unknown
            pass

        def __init__(self, *args, **kwargs): # real signature unknown
            pass

        @staticmethod # known case of __new__
        def __new__(*args, **kwargs): # real signature unknown
            """ Create and return a new object. See help(type) for accurate signature. """
            pass

        def __next__(self, *args, **kwargs): # real signature unknown
            """ Implement next(self). """
            pass

        def __repr__(self, *args, **kwargs): # real signature unknown
            """ Return repr(self). """
            pass

        buffer = property(lambda self: object(), lambda self, v: None, lambda self: None) # default

        closed = property(lambda self: object(), lambda self, v: None, lambda self: None) # default

        encoding = property(lambda self: object(), lambda self, v: None, lambda self: None) # default

        errors = property(lambda self: object(), lambda self, v: None, lambda self: None) # default

        line_buffering = property(lambda self: object(), lambda self, v: None, lambda self: None) # default

        name = property(lambda self: object(), lambda self, v: None, lambda self: None) # default

        newlines = property(lambda self: object(), lambda self, v: None, lambda self: None) # default

        _CHUNK_SIZE = property(lambda self: object(), lambda self, v: None, lambda self: None) # default

        _finalizing = property(lambda self: object(), lambda self, v: None, lambda self: None) # default

Python 3.5.1 source
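A few of the methods listed above can be tried on a real file. The sketch below is only an illustration: it works in a temporary directory, the file name ops.txt is made up, and the value returned by tell() in text mode should be treated as an opaque position rather than a character count.

```python
import os
import tempfile

with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "ops.txt")          # hypothetical file name

    with open(path, "w+", encoding="utf-8", newline="\n") as f:
        f.write("hello\nworld\n")                # write a string
        f.flush()                                # push buffered data to the file
        f.seek(0)                                # move the pointer to the start
        first_line = f.readline()                # read one line
        rest = f.read()                          # read everything that is left
        can_read, can_write = f.readable(), f.writable()

        f.seek(0)
        f.truncate(5)                            # keep only the first 5 bytes
        f.seek(0)
        after_truncate = f.read()
        position = f.tell()                      # current pointer position

print(repr(first_line), repr(rest), can_read, can_write)
print(repr(after_truncate), position)
```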
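5.3 Closing a file
If you open a file with a bare call to open(), remember to call close() on it when you are done.
If you open the file with a with statement, an explicit close() is not needed: once the body of the with statement finishes, close() is called automatically on the file handle. The syntax is:

    with open('db1', 'r', encoding="utf-8") as f1:
        for line in f1:
            print(line)

To open two files at the same time, use:

    with open('file1', 'r', encoding="utf-8") as f1, open("file2", 'w', encoding="utf-8") as f2:
        pass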
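To confirm that the with statement really closes the handles, the sketch below copies one file into another and then inspects the closed attribute. It runs in a temporary directory; file1 and file2 are created there only for the demonstration.

```python
import os
import tempfile

with tempfile.TemporaryDirectory() as tmp:
    src = os.path.join(tmp, "file1")     # hypothetical file names
    dst = os.path.join(tmp, "file2")

    with open(src, "w", encoding="utf-8") as f:
        f.write("line 1\nline 2\n")

    # Open both files in one with statement and copy line by line.
    with open(src, "r", encoding="utf-8") as f1, \
            open(dst, "w", encoding="utf-8") as f2:
        for line in f1:
            f2.write(line.upper())

    # Both handles were closed automatically when the block ended,
    # even though close() was never called explicitly.
    closed_flags = (f1.closed, f2.closed)

    with open(dst, "r", encoding="utf-8") as f:
        copied = f.read()

print(closed_flags, repr(copied))
```

The files are closed even if the body of the with statement raises an exception, which is the main reason to prefer this form over a manual close().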
This concludes the chapter summary.