RSS 1 project tagged "Extract"

Download Website Updated 11 Nov 2013 jsoup

Screenshot
Pop 188.64
Vit 14.26

jsoup is a Java library for working with real-world HTML. It can parse HTML from a URL, file, or string. It can find and extract data, using DOM traversal or CSS selectors. The HTML elements, attributes, and text can be manipulated. It can clean user-submitted content against a safe white-list. jsoup is designed to deal with all varieties of HTML found in the wild, from pristine and validating to invalid tag-soup; jsoup will create a sensible parse tree.

Screenshot

Project Spotlight

LanguageTool

A style and grammar checker for English, Polish, German, and other languages

Screenshot

Project Spotlight

skalibs

Public domain general-purpose libraries.