Python parser for Apache/nginx-style HTML directory listing.
Python parser for Apache/nginx-style HTML directory listing
.. code-block:: python
import htmllistparse cwd, listing = htmllistparse.fetch_listing(some_url, timeout=30)
where cwd is the current directory, listing is a list of FileEntry named tuples:
name: File name, str. Have a trailing / if it's a directory.modified: Last modification time, time.struct_time or None. Timezone is not known.size: File size, int or None. May be estimated from the prefix, such as "K", "M".description: File description, file type, or any other things found. str as HTML, or None.Supports:
<pre>-style index<table>-style index<ul>-style.. note::
Please wrap the functions in a general try... except block. It may throw exceptions unexpectedly.
Reinvented HTTP Filesystem.
::
usage: rehttpfs.py [-h] [-o OPTIONS] [-t TIMEOUT] [-u USER_AGENT] [-v] [-d] url mountpoint
Mount HTML directory listings.
positional arguments: url URL to mount mountpoint filesystem mount point
optional arguments: -h, --help show this help message and exit -o OPTIONS comma separated FUSE options -t TIMEOUT, --timeout TIMEOUT HTTP request timeout -u USER_AGENT, --user-agent USER_AGENT HTTP User-Agent -v, --verbose enable debug logging -d, --daemon run in background