2018-06-30 09:44:59 +03:00
|
|
|
.. _file-handling:
|
|
|
|
|
2016-11-14 18:00:17 +03:00
|
|
|
File Handling in Pillow
|
|
|
|
=======================
|
|
|
|
|
2019-03-22 15:57:05 +03:00
|
|
|
When opening a file as an image, Pillow requires a filename, ``pathlib.Path``
|
|
|
|
object, or a file-like object. Pillow uses the filename or ``Path`` to open a
|
|
|
|
file, so for the rest of this article, they will all be treated as a file-like
|
|
|
|
object.
|
2016-11-14 18:00:17 +03:00
|
|
|
|
|
|
|
The first four of these items are equivalent, the last is dangerous
|
|
|
|
and may fail::
|
|
|
|
|
|
|
|
from PIL import Image
|
|
|
|
import io
|
|
|
|
import pathlib
|
2018-01-27 09:04:46 +03:00
|
|
|
|
2016-11-14 18:00:17 +03:00
|
|
|
im = Image.open('test.jpg')
|
|
|
|
|
|
|
|
im2 = Image.open(pathlib.Path('test.jpg'))
|
|
|
|
|
|
|
|
f = open('test.jpg', 'rb')
|
|
|
|
im3 = Image.open(f)
|
2018-01-27 09:04:46 +03:00
|
|
|
|
2016-11-14 18:00:17 +03:00
|
|
|
with open('test.jpg', 'rb') as f:
|
|
|
|
im4 = Image.open(io.BytesIO(f.read()))
|
|
|
|
|
|
|
|
# Dangerous FAIL:
|
|
|
|
with open('test.jpg', 'rb') as f:
|
|
|
|
im5 = Image.open(f)
|
|
|
|
im5.load() # FAILS, closed file
|
|
|
|
|
2018-12-11 16:50:32 +03:00
|
|
|
If a filename or a path-like object is passed to Pillow, then the resulting
|
2018-12-11 06:39:10 +03:00
|
|
|
file object opened by Pillow may also be closed by Pillow after the
|
|
|
|
``Image.Image.load()`` method is called, provided the associated image does not
|
|
|
|
have multiple frames.
|
2016-11-14 18:00:17 +03:00
|
|
|
|
|
|
|
Pillow cannot in general close and reopen a file, so any access to
|
2018-01-27 09:04:46 +03:00
|
|
|
that file needs to be prior to the close.
|
2016-11-14 18:00:17 +03:00
|
|
|
|
|
|
|
Issues
|
|
|
|
------
|
|
|
|
|
|
|
|
* Using the file context manager to provide a file-like object to
|
|
|
|
Pillow is dangerous unless the context of the image is limited to
|
2018-01-27 09:04:46 +03:00
|
|
|
the context of the file.
|
2016-11-14 18:00:17 +03:00
|
|
|
|
|
|
|
Image Lifecycle
|
|
|
|
---------------
|
|
|
|
|
2019-03-22 15:57:05 +03:00
|
|
|
* ``Image.open()`` Filenames and ``Path`` objects are opened as a file.
|
|
|
|
Metadata is read from the open file. The file is left open for further usage.
|
2016-11-14 18:00:17 +03:00
|
|
|
|
2018-12-11 06:39:10 +03:00
|
|
|
* ``Image.Image.load()`` When the pixel data from the image is
|
2016-11-14 18:00:17 +03:00
|
|
|
required, ``load()`` is called. The current frame is read into
|
|
|
|
memory. The image can now be used independently of the underlying
|
2018-01-27 09:04:46 +03:00
|
|
|
image file.
|
2016-11-14 18:00:17 +03:00
|
|
|
|
2019-03-22 15:57:05 +03:00
|
|
|
If a filename or a ``Path`` object was passed to ``Image.open()``, then the
|
|
|
|
file object was opened by Pillow and is considered to be used exclusively by
|
|
|
|
Pillow. So if the image is a single-frame image, the file will be closed in
|
|
|
|
this method after the frame is read. If the image is a multi-frame image,
|
|
|
|
(e.g. multipage TIFF and animated GIF) the image file is left open so that
|
|
|
|
``Image.Image.seek()`` can load the appropriate frame.
|
2016-11-14 18:00:17 +03:00
|
|
|
|
2019-03-22 15:57:05 +03:00
|
|
|
* ``Image.Image.close()`` Closes the file and destroys the core image object.
|
|
|
|
This is used in the Pillow context manager support. e.g.::
|
2016-11-14 18:00:17 +03:00
|
|
|
|
|
|
|
with Image.open('test.jpg') as img:
|
2018-01-27 09:04:46 +03:00
|
|
|
... # image operations here.
|
2016-11-14 18:00:17 +03:00
|
|
|
|
|
|
|
|
2018-12-11 16:53:31 +03:00
|
|
|
The lifecycle of a single-frame image is relatively simple. The file
|
2016-11-14 18:00:17 +03:00
|
|
|
must remain open until the ``load()`` or ``close()`` function is
|
2018-01-27 09:04:46 +03:00
|
|
|
called.
|
2016-11-14 18:00:17 +03:00
|
|
|
|
|
|
|
Multi-frame images are more complicated. The ``load()`` method is not
|
2018-12-11 06:39:10 +03:00
|
|
|
a terminal method, so it should not close the underlying file. In general,
|
|
|
|
Pillow does not know if there are going to be any requests for additional
|
2018-01-27 09:04:46 +03:00
|
|
|
data until the caller has explicitly closed the image.
|
2016-11-14 18:00:17 +03:00
|
|
|
|
|
|
|
|
|
|
|
Complications
|
|
|
|
-------------
|
|
|
|
|
2019-03-22 15:57:05 +03:00
|
|
|
* ``TiffImagePlugin`` has some code to pass the underlying file descriptor into
|
|
|
|
libtiff (if working on an actual file). Since libtiff closes the file
|
|
|
|
descriptor internally, it is duplicated prior to passing it into libtiff.
|
2016-11-14 18:00:17 +03:00
|
|
|
|
|
|
|
* I don't think that there's any way to make this safe without
|
|
|
|
changing the lazy loading::
|
|
|
|
|
|
|
|
# Dangerous FAIL:
|
|
|
|
with open('test.jpg', 'rb') as f:
|
|
|
|
im5 = Image.open(f)
|
|
|
|
im5.load() # FAILS, closed file
|
|
|
|
|
|
|
|
|
|
|
|
Proposed File Handling
|
|
|
|
----------------------
|
|
|
|
|
|
|
|
* ``Image.Image.load()`` should close the image file, unless there are
|
|
|
|
multiple frames.
|
|
|
|
|
2018-01-27 09:04:46 +03:00
|
|
|
* ``Image.Image.seek()`` should never close the image file.
|
2016-11-14 18:00:17 +03:00
|
|
|
|
|
|
|
* Users of the library should call ``Image.Image.close()`` on any
|
2018-01-27 09:04:46 +03:00
|
|
|
multi-frame image to ensure that the underlying file is closed.
|