Anthracite is a collection of web-mining power tools combined in an easy-to-use graphical environment. It lets users quickly extract data from Internet sources, modify it to suit their needs, and export it to templates or databases, for example to produce RSS feeds.
| Tags | Text Processing, Markup, XML, HTML/XHTML, Filters, Internet, Web, Indexing/Search, Information Management, Workflow Frameworks, Metadata/Semantic Models |
|---|---|
| Operating Systems | Mac OS X |
| Implementation | Objective-C |
Recent releases


Release Notes: A problem with the license key manager was fixed.


Release Notes: This release has new XSLT and Filter Tag processors, a new solution "Web History to Tag Cloud", and a handful of other updates, enhancements, and fixes.


Release Notes: This release increases the document canvas size to 2K square for working with more objects and adds an overview inspector window for navigating the larger document. It adds per-source User-Agent settings, stores options in an external file, includes several more text encoding options, and improves plugin support. It also ships with the first available plugin, which enables the use of Safari History files.


Release Notes: This release fixes a user-reported issue with parenthesized back-references in regular expression find/replace, and adds new sample documents showing how to convert RSS feeds to CSV for use with databases.
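
For readers unfamiliar with the RSS-to-CSV conversion mentioned above, the general idea can be sketched in a few lines of Python. This is not Anthracite's implementation or one of its sample documents. It is an independent illustration using only the standard library, and the sample feed, the `FIELDS` tuple, and the `rss_to_csv` function are all hypothetical names chosen for this example.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Hypothetical sample feed; a real feed would be downloaded or read from disk.
SAMPLE_RSS = """<?xml version="1.0"?>
<rss version="2.0">
  <channel>
    <title>Example Feed</title>
    <item>
      <title>First headline</title>
      <link>http://example.com/1</link>
      <pubDate>Mon, 01 Jan 2007 00:00:00 GMT</pubDate>
    </item>
    <item>
      <title>Second, with a comma</title>
      <link>http://example.com/2</link>
      <pubDate>Tue, 02 Jan 2007 00:00:00 GMT</pubDate>
    </item>
  </channel>
</rss>
"""

# Assumed column choice: three common RSS 2.0 item elements.
FIELDS = ("title", "link", "pubDate")


def rss_to_csv(rss_text):
    """Return CSV text with a header row and one row per RSS <item>."""
    root = ET.fromstring(rss_text)
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(FIELDS)
    for item in root.iter("item"):
        # Missing elements become empty cells instead of raising an error.
        writer.writerow([(item.findtext(name) or "").strip() for name in FIELDS])
    return out.getvalue()


if __name__ == "__main__":
    print(rss_to_csv(SAMPLE_RSS), end="")
```

The `csv` module quotes any value that contains a comma, so headlines such as the second one survive a round trip into a database import.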


Release Notes: This release adds a ready-to-run solution that converts SEC filings into the hCard microformat, as well as an example of integrating Python scripts and speaking RSS headlines. It also fixes two user-facing issues: one related to MySQL file descriptor resource starvation, and one in the Column Excerpt processor when using negative indexes.