3. Data model¶
3.1. Objects, values and types¶
Objects are Python’s abstraction for data. All data in a Python program is represented by objects or by relations between objects. (In a sense, and in conformance to Von Neumann’s model of a “stored program computer”, code is also represented by objects.)
Every object has an identity, a type and a value. An object’s identity never
changes once it has been created; you may think of it as the object’s address in
memory. The is
operator compares the identity of two objects; the
id()
function returns an integer representing its identity.
CPython implementation detail: For CPython, id(x)
is the memory address where x
is stored.
An object’s type determines the operations that the object supports (e.g., “does
it have a length?”) and also defines the possible values for objects of that
type. The type()
function returns an object’s type (which is an object
itself). Like its identity, an object’s type is also unchangeable.
[1]
The value of some objects can change. Objects whose value can change are said to be mutable; objects whose value is unchangeable once they are created are called immutable. (The value of an immutable container object that contains a reference to a mutable object can change when the latter’s value is changed; however the container is still considered immutable, because the collection of objects it contains cannot be changed. So, immutability is not strictly the same as having an unchangeable value, it is more subtle.) An object’s mutability is determined by its type; for instance, numbers, strings and tuples are immutable, while dictionaries and lists are mutable.
Objects are never explicitly destroyed; however, when they become unreachable they may be garbage-collected. An implementation is allowed to postpone garbage collection or omit it altogether — it is a matter of implementation quality how garbage collection is implemented, as long as no objects are collected that are still reachable.
CPython implementation detail: CPython currently uses a reference-counting scheme with (optional) delayed
detection of cyclically linked garbage, which collects most objects as soon
as they become unreachable, but is not guaranteed to collect garbage
containing circular references. See the documentation of the gc
module for information on controlling the collection of cyclic garbage.
Other implementations act differently and CPython may change.
Do not depend on immediate finalization of objects when they become
unreachable (so you should always close files explicitly).
Note that the use of the implementation’s tracing or debugging facilities may
keep objects alive that would normally be collectable. Also note that catching
an exception with a try
…except
statement may keep
objects alive.
Some objects contain references to “external” resources such as open files or
windows. It is understood that these resources are freed when the object is
garbage-collected, but since garbage collection is not guaranteed to happen,
such objects also provide an explicit way to release the external resource,
usually a close()
method. Programs are strongly recommended to explicitly
close such objects. The try
…finally
statement
and the with
statement provide convenient ways to do this.
Some objects contain references to other objects; these are called containers. Examples of containers are tuples, lists and dictionaries. The references are part of a container’s value. In most cases, when we talk about the value of a container, we imply the values, not the identities of the contained objects; however, when we talk about the mutability of a container, only the identities of the immediately contained objects are implied. So, if an immutable container (like a tuple) contains a reference to a mutable object, its value changes if that mutable object is changed.
Types affect almost all aspects of object behavior. Even the importance of
object identity is affected in some sense: for immutable types, operations that
compute new values may actually return a reference to any existing object with
the same type and value, while for mutable objects this is not allowed.
For example, after a = 1; b = 1
, a and b may or may not refer to
the same object with the value one, depending on the implementation.
This is because int
is an immutable type, so the reference to 1
can be reused. This behaviour depends on the implementation used, so should
not be relied upon, but is something to be aware of when making use of object
identity tests.
However, after c = []; d = []
, c and d are guaranteed to refer to two
different, unique, newly created empty lists. (Note that e = f = []
assigns
the same object to both e and f.)
3.2. The standard type hierarchy¶
Below is a list of the types that are built into Python. Extension modules (written in C, Java, or other languages, depending on the implementation) can define additional types. Future versions of Python may add types to the type hierarchy (e.g., rational numbers, efficiently stored arrays of integers, etc.), although such additions will often be provided via the standard library instead.
Some of the type descriptions below contain a paragraph listing ‘special attributes.’ These are attributes that provide access to the implementation and are not intended for general use. Their definition may change in the future.
3.2.1. None¶
This type has a single value. There is a single object with this value. This
object is accessed through the built-in name None
. It is used to signify the
absence of a value in many situations, e.g., it is returned from functions that
don’t explicitly return anything. Its truth value is false.
3.2.2. NotImplemented¶
This type has a single value. There is a single object with this value. This
object is accessed through the built-in name NotImplemented
. Numeric methods
and rich comparison methods should return this value if they do not implement the
operation for the operands provided. (The interpreter will then try the
reflected operation, or some other fallback, depending on the operator.) It
should not be evaluated in a boolean context.
See Implementing the arithmetic operations for more details.
Changed in version 3.9: Evaluating NotImplemented
in a boolean context is deprecated. While
it currently evaluates as true, it will emit a DeprecationWarning
.
It will raise a TypeError
in a future version of Python.
3.2.3. Ellipsis¶
This type has a single value. There is a single object with this value. This
object is accessed through the literal ...
or the built-in name
Ellipsis
. Its truth value is true.
3.2.4. numbers.Number
¶
These are created by numeric literals and returned as results by arithmetic operators and arithmetic built-in functions. Numeric objects are immutable; once created their value never changes. Python numbers are of course strongly related to mathematical numbers, but subject to the limitations of numerical representation in computers.
The string representations of the numeric classes, computed by
__repr__()
and __str__()
, have the following
properties:
They are valid numeric literals which, when passed to their class constructor, produce an object having the value of the original numeric.
The representation is in base 10, when possible.
Leading zeros, possibly excepting a single zero before a decimal point, are not shown.
Trailing zeros, possibly excepting a single zero after a decimal point, are not shown.
A sign is shown only when the number is negative.
Python distinguishes between integers, floating-point numbers, and complex numbers:
3.2.4.1. numbers.Integral
¶
These represent elements from the mathematical set of integers (positive and negative).
Note
The rules for integer representation are intended to give the most meaningful interpretation of shift and mask operations involving negative integers.
There are two types of integers:
- Integers (
int
) These represent numbers in an unlimited range, subject to available (virtual) memory only. For the purpose of shift and mask operations, a binary representation is assumed, and negative numbers are represented in a variant of 2’s complement which gives the illusion of an infinite string of sign bits extending to the left.
- Booleans (
bool
) These represent the truth values False and True. The two objects representing the values
False
andTrue
are the only Boolean objects. The Boolean type is a subtype of the integer type, and Boolean values behave like the values 0 and 1, respectively, in almost all contexts, the exception being that when converted to a string, the strings"False"
or"True"
are returned, respectively.
3.2.4.2. numbers.Real
(float
)¶
These represent machine-level double precision floating-point numbers. You are at the mercy of the underlying machine architecture (and C or Java implementation) for the accepted range and handling of overflow. Python does not support single-precision floating-point numbers; the savings in processor and memory usage that are usually the reason for using these are dwarfed by the overhead of using objects in Python, so there is no reason to complicate the language with two kinds of floating-point numbers.
3.2.4.3. numbers.Complex
(complex
)¶
These represent complex numbers as a pair of machine-level double precision
floating-point numbers. The same caveats apply as for floating-point numbers.
The real and imaginary parts of a complex number z
can be retrieved through
the read-only attributes z.real
and z.imag
.
3.2.5. Sequences¶
These represent finite ordered sets indexed by non-negative numbers. The
built-in function len()
returns the number of items of a sequence. When
the length of a sequence is n, the index set contains the numbers 0, 1,
…, n-1. Item i of sequence a is selected by a[i]
. Some sequences,
including built-in sequences, interpret negative subscripts by adding the
sequence length. For example, a[-2]
equals a[n-2]
, the second to last
item of sequence a with length n
.
Sequences also support slicing: a[i:j]
selects all items with index k such
that i <=
k <
j. When used as an expression, a slice is a
sequence of the same type. The comment above about negative indexes also applies
to negative slice positions.
Some sequences also support “extended slicing” with a third “step” parameter:
a[i:j:k]
selects all items of a with index x where x = i + n*k
, n
>=
0
and i <=
x <
j.
Sequences are distinguished according to their mutability:
3.2.5.1. Immutable sequences¶
An object of an immutable sequence type cannot change once it is created. (If the object contains references to other objects, these other objects may be mutable and may be changed; however, the collection of objects directly referenced by an immutable object cannot change.)
The following types are immutable sequences:
- Strings
A string is a sequence of values that represent Unicode code points. All the code points in the range
U+0000 - U+10FFFF
can be represented in a string. Python doesn’t have a char type; instead, every code point in the string is represented as a string object with length1
. The built-in functionord()
converts a code point from its string form to an integer in the range0 - 10FFFF
;chr()
converts an integer in the range0 - 10FFFF
to the corresponding length1
string object.str.encode()
can be used to convert astr
tobytes
using the given text encoding, andbytes.decode()
can be used to achieve the opposite.- Tuples
The items of a tuple are arbitrary Python objects. Tuples of two or more items are formed by comma-separated lists of expressions. A tuple of one item (a ‘singleton’) can be formed by affixing a comma to an expression (an expression by itself does not create a tuple, since parentheses must be usable for grouping of expressions). An empty tuple can be formed by an empty pair of parentheses.
- Bytes
A bytes object is an immutable array. The items are 8-bit bytes, represented by integers in the range 0 <= x < 256. Bytes literals (like
b'abc'
) and the built-inbytes()
constructor can be used to create bytes objects. Also, bytes objects can be decoded to strings via thedecode()
method.
3.2.5.2. Mutable sequences¶
Mutable sequences can be changed after they are created. The subscription and
slicing notations can be used as the target of assignment and del
(delete) statements.
Note
The collections
and array
module provide
additional examples of mutable sequence types.
There are currently two intrinsic mutable sequence types:
- Lists
The items of a list are arbitrary Python objects. Lists are formed by placing a comma-separated list of expressions in square brackets. (Note that there are no special cases needed to form lists of length 0 or 1.)
- Byte Arrays
A bytearray object is a mutable array. They are created by the built-in
bytearray()
constructor. Aside from being mutable (and hence unhashable), byte arrays otherwise provide the same interface and functionality as immutablebytes
objects.
3.2.6. Set types¶
These represent unordered, finite sets of unique, immutable objects. As such,
they cannot be indexed by any subscript. However, they can be iterated over, and
the built-in function len()
returns the number of items in a set. Common
uses for sets are fast membership testing, removing duplicates from a sequence,
and computing mathematical operations such as intersection, union, difference,
and symmetric difference.
For set elements, the same immutability rules apply as for dictionary keys. Note
that numeric types obey the normal rules for numeric comparison: if two numbers
compare equal (e.g., 1
and 1.0
), only one of them can be contained in a
set.
There are currently two intrinsic set types:
- Sets
These represent a mutable set. They are created by the built-in
set()
constructor and can be modified afterwards by several methods, such asadd()
.- Frozen sets
These represent an immutable set. They are created by the built-in
frozenset()
constructor. As a frozenset is immutable and hashable, it can be used again as an element of another set, or as a dictionary key.
3.2.7. Mappings¶
These represent finite sets of objects indexed by arbitrary index sets. The
subscript notation a[k]
selects the item indexed by k
from the mapping
a
; this can be used in expressions and as the target of assignments or
del
statements. The built-in function len()
returns the number
of items in a mapping.
There is currently a single intrinsic mapping type:
3.2.7.1. Dictionaries¶
These represent finite sets of objects indexed by nearly arbitrary values. The
only types of values not acceptable as keys are values containing lists or
dictionaries or other mutable types that are compared by value rather than by
object identity, the reason being that the efficient implementation of
dictionaries requires a key’s hash value to remain constant. Numeric types used
for keys obey the normal rules for numeric comparison: if two numbers compare
equal (e.g., 1
and 1.0
) then they can be used interchangeably to index
the same dictionary entry.
Dictionaries preserve insertion order, meaning that keys will be produced in the same order they were added sequentially over the dictionary. Replacing an existing key does not change the order, however removing a key and re-inserting it will add it to the end instead of keeping its old place.
Dictionaries are mutable; they can be created by the {}
notation (see
section Dictionary displays).
The extension modules dbm.ndbm
and dbm.gnu
provide
additional examples of mapping types, as does the collections
module.
Changed in version 3.7: Dictionaries did not preserve insertion order in versions of Python before 3.6. In CPython 3.6, insertion order was preserved, but it was considered an implementation detail at that time rather than a language guarantee.
3.2.8. Callable types¶
These are the types to which the function call operation (see section Calls) can be applied:
3.2.8.1. User-defined functions¶
A user-defined function object is created by a function definition (see section Function definitions). It should be called with an argument list containing the same number of items as the function’s formal parameter list.
3.2.8.1.1. Special read-only attributes¶
Attribute |
Meaning |
---|---|
|
A reference to the |
|
A cell object has the attribute |
3.2.8.1.2. Special writable attributes¶
Most of these attributes check the type of the assigned value:
Attribute |
Meaning |
---|---|
|
The function’s documentation string, or |
|
The function’s name.
See also: |
|
The function’s qualified name.
See also: Added in version 3.3. |
|
The name of the module the function was defined in,
or |
|
A |
|
The code object representing the compiled function body. |
|
The namespace supporting arbitrary function attributes.
See also: |
|
A |
|
A |
|
A Added in version 3.12. |
Function objects also support getting and setting arbitrary attributes, which can be used, for example, to attach metadata to functions. Regular attribute dot-notation is used to get and set such attributes.
CPython implementation detail: CPython’s current implementation only supports function attributes on user-defined functions. Function attributes on built-in functions may be supported in the future.
Additional information about a function’s definition can be retrieved from its
code object
(accessible via the __code__
attribute).
3.2.8.2. Instance methods¶
An instance method object combines a class, a class instance and any callable object (normally a user-defined function).
Special read-only attributes:
|
Refers to the class instance object to which the method is bound |
|
Refers to the original function object |
|
The method’s documentation
(same as |
|
The name of the method
(same as |
|
The name of the module the method was defined in, or |
Methods also support accessing (but not setting) the arbitrary function attributes on the underlying function object.
User-defined method objects may be created when getting an attribute of a
class (perhaps via an instance of that class), if that attribute is a
user-defined function object or a
classmethod
object.
When an instance method object is created by retrieving a user-defined
function object from a class via one of its
instances, its __self__
attribute is the instance, and the
method object is said to be bound. The new method’s __func__
attribute is the original function object.
When an instance method object is created by retrieving a classmethod
object from a class or instance, its __self__
attribute is the
class itself, and its __func__
attribute is the function object
underlying the class method.
When an instance method object is called, the underlying function
(__func__
) is called, inserting the class instance
(__self__
) in front of the argument list. For instance, when
C
is a class which contains a definition for a function
f()
, and x
is an instance of C
, calling x.f(1)
is
equivalent to calling C.f(x, 1)
.
When an instance method object is derived from a classmethod
object, the
“class instance” stored in __self__
will actually be the class
itself, so that calling either x.f(1)
or C.f(1)
is equivalent to
calling f(C,1)
where f
is the underlying function.
It is important to note that user-defined functions which are attributes of a class instance are not converted to bound methods; this only happens when the function is an attribute of the class.
3.2.8.3. Generator functions¶
A function or method which uses the yield
statement (see section
The yield statement) is called a generator function. Such a function, when
called, always returns an iterator object which can be used to
execute the body of the function: calling the iterator’s
iterator.__next__()
method will cause the function to execute until
it provides a value using the yield
statement. When the
function executes a return
statement or falls off the end, a
StopIteration
exception is raised and the iterator will have
reached the end of the set of values to be returned.
3.2.8.4. Coroutine functions¶
A function or method which is defined using async def
is called
a coroutine function. Such a function, when called, returns a
coroutine object. It may contain await
expressions,
as well as async with
and async for
statements. See
also the Coroutine Objects section.
3.2.8.5. Asynchronous generator functions¶
A function or method which is defined using async def
and
which uses the yield
statement is called a
asynchronous generator function. Such a function, when called,
returns an asynchronous iterator object which can be used in an
async for
statement to execute the body of the function.
Calling the asynchronous iterator’s
aiterator.__anext__
method
will return an awaitable which when awaited
will execute until it provides a value using the yield
expression. When the function executes an empty return
statement or falls off the end, a StopAsyncIteration
exception
is raised and the asynchronous iterator will have reached the end of
the set of values to be yielded.
3.2.8.6. Built-in functions¶
A built-in function object is a wrapper around a C function. Examples of
built-in functions are len()
and math.sin()
(math
is a
standard built-in module). The number and type of the arguments are
determined by the C function. Special read-only attributes:
__doc__
is the function’s documentation string, orNone
if unavailable. Seefunction.__doc__
.__name__
is the function’s name. Seefunction.__name__
.__self__
is set toNone
(but see the next item).__module__
is the name of the module the function was defined in orNone
if unavailable. Seefunction.__module__
.
3.2.8.7. Built-in methods¶
This is really a different disguise of a built-in function, this time containing
an object passed to the C function as an implicit extra argument. An example of
a built-in method is alist.append()
, assuming alist is a list object. In
this case, the special read-only attribute __self__
is set to the object
denoted by alist. (The attribute has the same semantics as it does with
other instance methods
.)
3.2.8.8. Classes¶
Classes are callable. These objects normally act as factories for new
instances of themselves, but variations are possible for class types that
override __new__()
. The arguments of the call are passed to
__new__()
and, in the typical case, to __init__()
to
initialize the new instance.
3.2.8.9. Class Instances¶
Instances of arbitrary classes can be made callable by defining a
__call__()
method in their class.
3.2.9. Modules¶
Modules are a basic organizational unit of Python code, and are created by
the import system as invoked either by the
import
statement, or by calling
functions such as importlib.import_module()
and built-in
__import__()
. A module object has a namespace implemented by a
dictionary
object (this is the dictionary referenced by the
__globals__
attribute of functions defined in the module). Attribute references are
translated to lookups in this dictionary, e.g., m.x
is equivalent to
m.__dict__["x"]
. A module object does not contain the code object used
to initialize the module (since it isn’t needed once the initialization is
done).
Attribute assignment updates the module’s namespace dictionary, e.g.,
m.x = 1
is equivalent to m.__dict__["x"] = 1
.
3.2.9.2. Other writable attributes on module objects¶
As well as the import-related attributes listed above, module objects also have the following writable attributes:
- module.__doc__¶
The module’s documentation string, or
None
if unavailable. See also:__doc__ attributes
.
- module.__annotations__¶
A dictionary containing variable annotations collected during module body execution. For best practices on working with
__annotations__
, please see Annotations Best Practices.
3.2.9.3. Module dictionaries¶
Module objects also have the following special read-only attribute:
- module.__dict__¶
The module’s namespace as a dictionary object. Uniquely among the attributes listed here,
__dict__
cannot be accessed as a global variable from within a module; it can only be accessed as an attribute on module objects.CPython implementation detail: Because of the way CPython clears module dictionaries, the module dictionary will be cleared when the module falls out of scope even if the dictionary still has live references. To avoid this, copy the dictionary or keep the module around while using its dictionary directly.
3.2.10. Custom classes¶
Custom class types are typically created by class definitions (see section
Class definitions). A class has a namespace implemented by a dictionary object.
Class attribute references are translated to lookups in this dictionary, e.g.,
C.x
is translated to C.__dict__["x"]
(although there are a number of
hooks which allow for other means of locating attributes). When the attribute
name is not found there, the attribute search continues in the base classes.
This search of the base classes uses the C3 method resolution order which
behaves correctly even in the presence of ‘diamond’ inheritance structures
where there are multiple inheritance paths leading back to a common ancestor.
Additional details on the C3 MRO used by Python can be found at
The Python 2.3 Method Resolution Order.
When a class attribute reference (for class C
, say) would yield a
class method object, it is transformed into an instance method object whose
__self__
attribute is C
.
When it would yield a staticmethod
object,
it is transformed into the object wrapped by the static method
object. See section Implementing Descriptors for another way in which attributes
retrieved from a class may differ from those actually contained in its
__dict__
.
Class attribute assignments update the class’s dictionary, never the dictionary of a base class.
A class object can be called (see above) to yield a class instance (see below).
3.2.10.1. Special attributes¶
Attribute |
Meaning |
---|---|
|
The class’s name.
See also: |
|
The class’s qualified name.
See also: |
|
The name of the module in which the class was defined. |
|
A |
|
A |
|
The class’s documentation string, or |
|
A dictionary containing
variable annotations
collected during class body execution. For best practices on working
with Caution Accessing the |
|
A Added in version 3.12. |
|
A Added in version 3.13. |
|
The line number of the first line of the class definition,
including decorators.
Setting the Added in version 3.13. |
|
The |
3.2.10.2. Special methods¶
In addition to the special attributes described above, all Python classes also have the following two methods available:
- type.mro()¶
This method can be overridden by a metaclass to customize the method resolution order for its instances. It is called at class instantiation, and its result is stored in
__mro__
.
- type.__subclasses__()¶
Each class keeps a list of weak references to its immediate subclasses. This method returns a list of all those references still alive. The list is in definition order. Example:
>>> class A: pass >>> class B(A): pass >>> A.__subclasses__() [<class 'B'>]
3.2.11. Class instances¶
A class instance is created by calling a class object (see above). A class
instance has a namespace implemented as a dictionary which is the first place
in which attribute references are searched. When an attribute is not found
there, and the instance’s class has an attribute by that name, the search
continues with the class attributes. If a class attribute is found that is a
user-defined function object, it is transformed into an instance method
object whose __self__
attribute is the instance. Static method and
class method objects are also transformed; see above under “Classes”. See
section Implementing Descriptors for another way in which attributes of a class
retrieved via its instances may differ from the objects actually stored in
the class’s __dict__
. If no class attribute is found, and the
object’s class has a __getattr__()
method, that is called to satisfy
the lookup.
Attribute assignments and deletions update the instance’s dictionary, never a
class’s dictionary. If the class has a __setattr__()
or
__delattr__()
method, this is called instead of updating the instance
dictionary directly.
Class instances can pretend to be numbers, sequences, or mappings if they have methods with certain special names. See section Special method names.
3.2.11.1. Special attributes¶
- object.__class__¶
The class to which a class instance belongs.
3.2.12. I/O objects (also known as file objects)¶
A file object represents an open file. Various shortcuts are
available to create file objects: the open()
built-in function, and
also os.popen()
, os.fdopen()
, and the
makefile()
method of socket objects (and perhaps by
other functions or methods provided by extension modules).
The objects sys.stdin
, sys.stdout
and sys.stderr
are
initialized to file objects corresponding to the interpreter’s standard
input, output and error streams; they are all open in text mode and
therefore follow the interface defined by the io.TextIOBase
abstract class.
3.2.13. Internal types¶
A few types used internally by the interpreter are exposed to the user. Their definitions may change with future versions of the interpreter, but they are mentioned here for completeness.
3.2.13.1. Code objects¶
Code objects represent byte-compiled executable Python code, or bytecode. The difference between a code object and a function object is that the function object contains an explicit reference to the function’s globals (the module in which it was defined), while a code object contains no context; also the default argument values are stored in the function object, not in the code object (because they represent values calculated at run-time). Unlike function objects, code objects are immutable and contain no references (directly or indirectly) to mutable objects.
3.2.13.1.1. Special read-only attributes¶
|
The function name |
|
The fully qualified function name Added in version 3.11. |
|
The total number of positional parameters (including positional-only parameters and parameters with default values) that the function has |
|
The number of positional-only parameters (including arguments with default values) that the function has |
|
The number of keyword-only parameters (including arguments with default values) that the function has |
|
The number of local variables used by the function (including parameters) |
|
A |
|
A |
|
A Note: references to global and builtin names are not included. |
|
A string representing the sequence of bytecode instructions in the function |
|
A |
|
A |
|
The name of the file from which the code was compiled |
|
The line number of the first line of the function |
|
A string encoding the mapping from bytecode offsets to line numbers. For details, see the source code of the interpreter. Deprecated since version 3.12: This attribute of code objects is deprecated, and may be removed in Python 3.15. |
|
The required stack size of the code object |
|
An |
The following flag bits are defined for co_flags
:
bit 0x04
is set if
the function uses the *arguments
syntax to accept an arbitrary number of
positional arguments; bit 0x08
is set if the function uses the
**keywords
syntax to accept arbitrary keyword arguments; bit 0x20
is set
if the function is a generator. See Code Objects Bit Flags for details
on the semantics of each flags that might be present.
Future feature declarations (for example, from __future__ import division
) also use bits
in co_flags
to indicate whether a code object was compiled with a
particular feature enabled. See compiler_flag
.
Other bits in co_flags
are reserved for internal use.
If a code object represents a function, the first item in
co_consts
is
the documentation string of the function, or None
if undefined.
3.2.13.1.2. Methods on code objects¶
- codeobject.co_positions()¶
Returns an iterable over the source code positions of each bytecode instruction in the code object.
The iterator returns
tuple
s containing the(start_line, end_line, start_column, end_column)
. The i-th tuple corresponds to the position of the source code that compiled to the i-th code unit. Column information is 0-indexed utf-8 byte offsets on the given source line.This positional information can be missing. A non-exhaustive lists of cases where this may happen:
Running the interpreter with
-X
no_debug_ranges
.Loading a pyc file compiled while using
-X
no_debug_ranges
.Position tuples corresponding to artificial instructions.
Line and column numbers that can’t be represented due to implementation specific limitations.
When this occurs, some or all of the tuple elements can be
None
.Added in version 3.11.
Note
This feature requires storing column positions in code objects which may result in a small increase of disk usage of compiled Python files or interpreter memory usage. To avoid storing the extra information and/or deactivate printing the extra traceback information, the
-X
no_debug_ranges
command line flag or thePYTHONNODEBUGRANGES
environment variable can be used.
- codeobject.co_lines()¶
Returns an iterator that yields information about successive ranges of bytecodes. Each item yielded is a
(start, end, lineno)
tuple
:start
(anint
) represents the offset (inclusive) of the start of the bytecode rangeend
(anint
) represents the offset (exclusive) of the end of the bytecode rangelineno
is anint
representing the line number of the bytecode range, orNone
if the bytecodes in the given range have no line number
The items yielded will have the following properties:
The first range yielded will have a
start
of 0.The
(start, end)
ranges will be non-decreasing and consecutive. That is, for any pair oftuple
s, thestart
of the second will be equal to theend
of the first.No range will be backwards:
end >= start
for all triples.The last
tuple
yielded will haveend
equal to the size of the bytecode.
Zero-width ranges, where
start == end
, are allowed. Zero-width ranges are used for lines that are present in the source code, but have been eliminated by the bytecode compiler.Added in version 3.10.
See also
- PEP 626 - Precise line numbers for debugging and other tools.
The PEP that introduced the
co_lines()
method.
- codeobject.replace(**kwargs)¶
Return a copy of the code object with new values for the specified fields.
Code objects are also supported by the generic function
copy.replace()
.Added in version 3.8.
3.2.13.2. Frame objects¶
Frame objects represent execution frames. They may occur in traceback objects, and are also passed to registered trace functions.
3.2.13.2.1. Special read-only attributes¶
|
Points to the previous stack frame (towards the caller),
or |
|
The code object being executed in this frame.
Accessing this attribute raises an auditing event
|
|
The mapping used by the frame to look up local variables. If the frame refers to an optimized scope, this may return a write-through proxy object. Changed in version 3.13: Return a proxy for optimized scopes. |
|
The dictionary used by the frame to look up global variables |
|
The dictionary used by the frame to look up built-in (intrinsic) names |
|
The “precise instruction” of the frame object (this is an index into the bytecode string of the code object) |
3.2.13.2.2. Special writable attributes¶
|
If not |
|
Set this attribute to |
|
Set this attribute to |
|
The current line number of the frame – writing to this from within a trace function jumps to the given line (only for the bottom-most frame). A debugger can implement a Jump command (aka Set Next Statement) by writing to this attribute. |
3.2.13.2.3. Frame object methods¶
Frame objects support one method:
- frame.clear()¶
This method clears all references to local variables held by the frame. Also, if the frame belonged to a generator, the generator is finalized. This helps break reference cycles involving frame objects (for example when catching an exception and storing its traceback for later use).
RuntimeError
is raised if the frame is currently executing or suspended.Added in version 3.4.
Changed in version 3.13: Attempting to clear a suspended frame raises
RuntimeError
(as has always been the case for executing frames).
3.2.13.3. Traceback objects¶
Traceback objects represent the stack trace of an exception.
A traceback object
is implicitly created when an exception occurs, and may also be explicitly
created by calling types.TracebackType
.
Changed in version 3.7: Traceback objects can now be explicitly instantiated from Python code.
For implicitly created tracebacks, when the search for an exception handler
unwinds the execution stack, at each unwound level a traceback object is
inserted in front of the current traceback. When an exception handler is
entered, the stack trace is made available to the program. (See section
The try statement.) It is accessible as the third item of the
tuple returned by sys.exc_info()
, and as the
__traceback__
attribute
of the caught exception.
When the program contains no suitable
handler, the stack trace is written (nicely formatted) to the standard error
stream; if the interpreter is interactive, it is also made available to the user
as sys.last_traceback
.
For explicitly created tracebacks, it is up to the creator of the traceback
to determine how the tb_next
attributes should be linked to
form a full stack trace.
Special read-only attributes:
|
Points to the execution frame of the current level. Accessing this attribute raises an
auditing event |
|
Gives the line number where the exception occurred |
|
Indicates the “precise instruction”. |
The line number and last instruction in the traceback may differ from the
line number of its frame object if the exception
occurred in a
try
statement with no matching except clause or with a
finally
clause.
- traceback.tb_next¶
The special writable attribute
tb_next
is the next level in the stack trace (towards the frame where the exception occurred), orNone
if there is no next level.Changed in version 3.7: This attribute is now writable
3.2.13.4. Slice objects¶
Slice objects are used to represent slices for
__getitem__()
methods. They are also created by the built-in slice()
function.
Special read-only attributes: start
is the lower bound;
stop
is the upper bound; step
is the step
value; each is None
if omitted. These attributes can have any type.
Slice objects support one method:
- slice.indices(self, length)¶
This method takes a single integer argument length and computes information about the slice that the slice object would describe if applied to a sequence of length items. It returns a tuple of three integers; respectively these are the start and stop indices and the step or stride length of the slice. Missing or out-of-bounds indices are handled in a manner consistent with regular slices.
3.2.13.5. Static method objects¶
Static method objects provide a way of defeating the transformation of function
objects to method objects described above. A static method object is a wrapper
around any other object, usually a user-defined method object. When a static
method object is retrieved from a class or a class instance, the object actually
returned is the wrapped object, which is not subject to any further
transformation. Static method objects are also callable. Static method
objects are created by the built-in staticmethod()
constructor.
3.2.13.6. Class method objects¶
A class method object, like a static method object, is a wrapper around another
object that alters the way in which that object is retrieved from classes and
class instances. The behaviour of class method objects upon such retrieval is
described above, under “instance methods”. Class method objects are created
by the built-in classmethod()
constructor.
3.3. Special method names¶
A class can implement certain operations that are invoked by special syntax
(such as arithmetic operations or subscripting and slicing) by defining methods
with special names. This is Python’s approach to operator overloading,
allowing classes to define their own behavior with respect to language
operators. For instance, if a class defines a method named
__getitem__()
,
and x
is an instance of this class, then x[i]
is roughly equivalent
to type(x).__getitem__(x, i)
. Except where mentioned, attempts to execute an
operation raise an exception when no appropriate method is defined (typically
AttributeError
or TypeError
).
Setting a special method to None
indicates that the corresponding
operation is not available. For example, if a class sets
__iter__()
to None
, the class is not iterable, so calling
iter()
on its instances will raise a TypeError
(without
falling back to __getitem__()
). [2]
When implementing a class that emulates any built-in type, it is important that
the emulation only be implemented to the degree that it makes sense for the
object being modelled. For example, some sequences may work well with retrieval
of individual elements, but extracting a slice may not make sense. (One example
of this is the NodeList
interface in the W3C’s Document
Object Model.)
3.3.1. Basic customization¶
- object.__new__(cls[, ...])¶
Called to create a new instance of class cls.
__new__()
is a static method (special-cased so you need not declare it as such) that takes the class of which an instance was requested as its first argument. The remaining arguments are those passed to the object constructor expression (the call to the class). The return value of__new__()
should be the new object instance (usually an instance of cls).Typical implementations create a new instance of the class by invoking the superclass’s
__new__()
method usingsuper().__new__(cls[, ...])
with appropriate arguments and then modifying the newly created instance as necessary before returning it.If
__new__()
is invoked during object construction and it returns an instance of cls, then the new instance’s__init__()
method will be invoked like__init__(self[, ...])
, where self is the new instance and the remaining arguments are the same as were passed to the object constructor.If
__new__()
does not return an instance of cls, then the new instance’s__init__()
method will not be invoked.__new__()
is intended mainly to allow subclasses of immutable types (like int, str, or tuple) to customize instance creation. It is also commonly overridden in custom metaclasses in order to customize class creation.
- object.__init__(self[, ...])¶
Called after the instance has been created (by
__new__()
), but before it is returned to the caller. The arguments are those passed to the class constructor expression. If a base class has an__init__()
method, the derived class’s__init__()
method, if any, must explicitly call it to ensure proper initialization of the base class part of the instance; for example:super().__init__([args...])
.Because
__new__()
and__init__()
work together in constructing objects (__new__()
to create it, and__init__()
to customize it), no non-None
value may be returned by__init__()
; doing so will cause aTypeError
to be raised at runtime.
- object.__del__(self)¶
Called when the instance is about to be destroyed. This is also called a finalizer or (improperly) a destructor. If a base class has a
__del__()
method, the derived class’s__del__()
method, if any, must explicitly call it to ensure proper deletion of the base class part of the instance.It is possible (though not recommended!) for the
__del__()
method to postpone destruction of the instance by creating a new reference to it. This is called object resurrection. It is implementation-dependent whether__del__()
is called a second time when a resurrected object is about to be destroyed; the current CPython implementation only calls it once.It is not guaranteed that
__del__()
methods are called for objects that still exist when the interpreter exits.weakref.finalize
provides a straightforward way to register a cleanup function to be called when an object is garbage collected.Note
del x
doesn’t directly callx.__del__()
— the former decrements the reference count forx
by one, and the latter is only called whenx
’s reference count reaches zero.CPython implementation detail: It is possible for a reference cycle to prevent the reference count of an object from going to zero. In this case, the cycle will be later detected and deleted by the cyclic garbage collector. A common cause of reference cycles is when an exception has been caught in a local variable. The frame’s locals then reference the exception, which references its own traceback, which references the locals of all frames caught in the traceback.
See also
Documentation for the
gc
module.Warning
Due to the precarious circumstances under which
__del__()
methods are invoked, exceptions that occur during their execution are ignored, and a warning is printed tosys.stderr
instead. In particular:__del__()
can be invoked when arbitrary code is being executed, including from any arbitrary thread. If__del__()
needs to take a lock or invoke any other blocking resource, it may deadlock as the resource may already be taken by the code that gets interrupted to execute__del__()
.__del__()
can be executed during interpreter shutdown. As a consequence, the global variables it needs to access (including other modules) may already have been deleted or set toNone
. Python guarantees that globals whose name begins with a single underscore are deleted from their module before other globals are deleted; if no other references to such globals exist, this may help in assuring that imported modules are still available at the time when the__del__()
method is called.
- object.__repr__(self)¶
Called by the
repr()
built-in function to compute the “official” string representation of an object. If at all possible, this should look like a valid Python expression that could be used to recreate an object with the same value (given an appropriate environment). If this is not possible, a string of the form<...some useful description...>
should be returned. The return value must be a string object. If a class defines__repr__()
but not__str__()
, then__repr__()
is also used when an “informal” string representation of instances of that class is required.This is typically used for debugging, so it is important that the representation is information-rich and unambiguous. A default implementation is provided by the
object
class itself.
- object.__str__(self)¶
Called by
str(object)
, the default__format__()
implementation, and the built-in functionprint()
, to compute the “informal” or nicely printable string representation of an object. The return value must be a str object.This method differs from