Projects / HTML::TableExtract

HTML::TableExtract

HTML::TableExtract is a Perl module that simplifies the extraction of information from tables within HTML documents. Tables, no matter how nested or clustered, can be targeted symbolically with column headers or by more specific depth and count information.

Tags
Licenses
Operating Systems
Implementation

Recent releases

  •  25 Feb 2006 09:01

    Release Notes: A subtable slicing bug and an hrow() attachment bug were fixed. Tests were added.

    •  21 Oct 2005 21:55

      Release Notes: Tightens up element interactions in TREE() mode when examining rows, columns, cells, etc. Was running into trouble with dereferencing scalars vs objects. The space() H::TE::T method has been documented, and tests have been added. POD tests have been added. There are documentation updates and fixes.

      •  26 Feb 2005 11:13

        Release Notes: Tables can now be selected by table tag attributes. The lineage() method now returns row and column information as well as depth and count for each ancestor (a potential backwards incompatibility exists - entries are now 4 element arrays rather than 2). Header matching and column retention enhancements were made. Old-style procedures were deprecated in preparation for them to become methods. Various bugfixes were made.

        Screenshot

        Project Spotlight

        OpenStack4j

        A Fluent OpenStack client API for Java.

        Screenshot

        Project Spotlight

        TurnKey TWiki Appliance

        A TWiki appliance that is easy to use and lightweight.