A Python package that finds repetitive format patterns in HTML pages and uses those patterns to extract information from them. The idea is that a page containing some kind of list will show a repetitive pattern that is visible to the human eye (the page format).
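
The idea described above can be illustrated with a minimal sketch. This is not HtmlList's actual API or algorithm; it is an assumed, simplified approach built only on the standard library's `html.parser`. The names `Node`, `TreeBuilder`, and `find_repeated_items` are hypothetical. The sketch builds a tag tree, fingerprints each element by its tag structure, and treats the largest group of sibling elements sharing one fingerprint as the repeated list pattern.

```python
from collections import Counter
from html.parser import HTMLParser

# Elements that never have a closing tag, so the parser must not descend into them.
VOID_TAGS = {"br", "hr", "img", "input", "meta", "link"}


class Node:
    """Hypothetical tree node: a tag plus mixed text/element content."""

    def __init__(self, tag, parent=None):
        self.tag = tag
        self.parent = parent
        self.content = []  # items are either str (text) or Node (child element)

    def children(self):
        return [item for item in self.content if isinstance(item, Node)]

    def signature(self):
        # Structural fingerprint: the tag plus the fingerprints of child elements.
        return (self.tag, tuple(c.signature() for c in self.children()))

    def all_text(self):
        parts = [i if isinstance(i, str) else i.all_text() for i in self.content]
        return " ".join(p for p in parts if p)


class TreeBuilder(HTMLParser):
    """Builds a simple element tree from HTML (assumption: roughly well-formed input)."""

    def __init__(self):
        super().__init__()
        self.root = Node("root")
        self.current = self.root

    def handle_starttag(self, tag, attrs):
        node = Node(tag, self.current)
        self.current.content.append(node)
        if tag not in VOID_TAGS:
            self.current = node

    def handle_endtag(self, tag):
        # Walk up to the nearest open element with this tag and close it.
        node = self.current
        while node is not None and node.tag != tag:
            node = node.parent
        if node is not None and node.parent is not None:
            self.current = node.parent

    def handle_data(self, data):
        if data.strip():
            self.current.content.append(data.strip())


def find_repeated_items(html):
    """Return the text of the largest group of structurally identical siblings."""
    builder = TreeBuilder()
    builder.feed(html)
    builder.close()
    best = []
    stack = [builder.root]
    while stack:
        node = stack.pop()
        kids = node.children()
        stack.extend(kids)
        groups = Counter(k.signature() for k in kids)
        if not groups:
            continue
        sig, count = groups.most_common(1)[0]
        # A pattern must occur at least twice to count as repetitive.
        if count >= 2 and count > len(best):
            best = [k for k in kids if k.signature() == sig]
    return [item.all_text() for item in best]


SAMPLE = """
<html><body><h1>Books</h1>
<ul>
<li><a href="/a">Alpha</a> <span>10</span></li>
<li><a href="/b">Beta</a> <span>12</span></li>
<li><a href="/c">Gamma</a> <span>9</span></li>
</ul>
<p>Footer</p></body></html>
"""

if __name__ == "__main__":
    for row in find_repeated_items(SAMPLE):
        print(row)
```

Matching on exact structural fingerprints is deliberately strict: real pages often have list items that differ slightly, so a practical tool would likely need a more tolerant comparison than this sketch uses.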

License

GNU General Public License version 3.0 (GPLv3)

Additional Project Details

Intended Audience

Developers

Programming Language

Python

Related Categories

Python HTML XHTML, Python Information Analysis Software, Python Libraries

Registered

2009-06-16