30 projects tagged "Parser"

jsoup (updated 11 Nov 2013)

Pop 188.71
Vit 14.18

jsoup is a Java library for working with real-world HTML. It can parse HTML from a URL, file, or string. It can find and extract data using DOM traversal or CSS selectors. It can manipulate HTML elements, attributes, and text. It can clean user-submitted content against a safe whitelist. jsoup is designed to deal with all varieties of HTML found in the wild, from pristine and validating to invalid tag soup; in every case, jsoup will create a sensible parse tree.
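To give a rough feel for what tolerant parsing of tag soup involves, here is a minimal sketch using Python's standard `html.parser` module. It is not jsoup's API (jsoup is a Java library), and it only collects link targets rather than building a parse tree; the `LinkCollector` class and the sample markup are invented for illustration.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collects href values from <a> tags, even in malformed markup."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs for the tag.
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)


# Deliberately broken input: unclosed <b> and <a>, an unquoted attribute,
# and a </p> that closes nothing cleanly.
soup = '<p>Unclosed paragraph <a href="/one">one<b>bold <a href=two.html>two</p>'

collector = LinkCollector()
collector.feed(soup)
collector.close()
print(collector.links)
```

The parser does not raise on the unbalanced tags; it simply reports the start tags it sees, which is enough for simple extraction tasks.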

LEPL (updated 21 Mar 2011)

Pop 122.24
Vit 7.35

LEPL is a recursive descent parser library written in Python. It is based on parser combinator libraries popular in functional programming, but also exploits Python language features. Operators provide a friendly syntax, and the consistent use of generators supports full backtracking and resource management. Backtracking implies that a wide variety of grammars are supported; appropriate memoisation ensures that even left-recursive grammars terminate.
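The idea of using generators for backtracking can be sketched in a few lines of plain Python. This is not LEPL's API; the helper names (`literal`, `seq`, `alt`, `parse_all`) are hypothetical, and the sketch leaves out memoisation, so it would not terminate on left-recursive grammars.

```python
from typing import Callable, Iterator, List, Optional, Tuple

# A parser takes (text, position) and lazily yields every possible
# (results, new_position) pair. Yielding nothing means failure.
Parser = Callable[[str, int], Iterator[Tuple[List[str], int]]]


def literal(s: str) -> Parser:
    """Match the exact string s at the current position."""
    def parse(text: str, pos: int):
        if text.startswith(s, pos):
            yield [s], pos + len(s)
    return parse


def alt(*parsers: Parser) -> Parser:
    """Try each alternative in order, yielding all of their matches."""
    def parse(text: str, pos: int):
        for p in parsers:
            yield from p(text, pos)
    return parse


def seq(*parsers: Parser) -> Parser:
    """Match the parsers one after another, backtracking on failure."""
    def parse(text: str, pos: int):
        def step(i: int, pos: int, acc: List[str]):
            if i == len(parsers):
                yield acc, pos
                return
            # If a later parser fails, this loop resumes and asks
            # parser i for its next alternative: that is the backtracking.
            for result, nxt in parsers[i](text, pos):
                yield from step(i + 1, nxt, acc + result)
        yield from step(0, pos, [])
    return parse


def parse_all(parser: Parser, text: str) -> Optional[List[str]]:
    """Return the first parse that consumes the whole input, else None."""
    for result, pos in parser(text, 0):
        if pos == len(text):
            return result
    return None


# Grammar: ("a" | "ab") "c"
grammar = seq(alt(literal("a"), literal("ab")), literal("c"))
print(parse_all(grammar, "abc"))  # the "a" branch fails at "b", so "ab" is tried
```

Because each parser is a generator, alternatives are only computed when an earlier choice turns out to be wrong, which is what keeps full backtracking affordable.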

listparser (updated 16 Dec 2012)

Pop 85.43
Vit 7.84

listparser is a Python library that parses subscription lists (also called reading lists) and returns all of the feeds, subscription lists, and "opportunity" URLs that it finds. It supports OPML, RDF+FOAF, and the iGoogle exported settings format.
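As an illustration of what extracting feeds from an OPML subscription list involves, the following sketch uses only the standard library's `xml.etree.ElementTree`. It is not listparser's API; the `feeds_from_opml` function and the sample document are assumptions made for this example, and real-world lists are often malformed in ways this sketch does not handle.

```python
import xml.etree.ElementTree as ET

# A small, well-formed OPML document invented for this example.
SAMPLE_OPML = """<opml version="2.0">
  <head><title>Example subscriptions</title></head>
  <body>
    <outline text="News">
      <outline type="rss" text="Example Feed" xmlUrl="http://example.com/feed.xml"/>
    </outline>
    <outline type="rss" text="Another Feed" xmlUrl="http://example.org/rss"/>
  </body>
</opml>"""


def feeds_from_opml(opml_text):
    """Return (title, url) pairs for every outline that carries an xmlUrl."""
    root = ET.fromstring(opml_text)
    feeds = []
    # iter() walks nested outlines too, so folders are searched recursively.
    for outline in root.iter("outline"):
        url = outline.get("xmlUrl")
        if url:
            feeds.append((outline.get("text", ""), url))
    return feeds


feeds = feeds_from_opml(SAMPLE_OPML)
for title, url in feeds:
    print(title, url)
```

Outlines without an `xmlUrl` attribute, such as the "News" folder here, are treated as grouping nodes and skipped.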

JWPL (updated 15 Aug 2012)

Pop 60.45
Vit 1.98

JWPL is a language-independent, database-driven, high-performance Wikipedia API that provides structured access to information nuggets like redirects, categories, articles, and link structure. It contains three components. The first is a MediaWiki markup parser that can be used to further analyze the contents of a Wikipedia page, or standalone with other text. The second is TimeMachine, which reconstructs a snapshot of Wikipedia from a specific date, or multiple snapshots from a time span. The third is RevisionMachine, which offers efficient access to the history of articles using a dedicated storage format that decreases storage space by 98%. This enables random access to the whole revision history without requiring several terabytes of storage for a single Wikipedia dump.

feedparser (updated 09 Dec 2012)

Pop 59.58
Vit 3.78

feedparser is a Python library that parses feeds. It supports the Atom, RDF, RSS, and CDF feed formats.

gradle-sablecc-plugin (updated 14 Jan 2014)

Pop 58.24
Vit 1.00

gradle-sablecc-plugin is a Gradle plugin that creates parsers using SableCC. SableCC supports automatic CST-to-AST transformation, emits all the visitor patterns and analysis helpers you will likely ever need, and is LR, not LL(k). Many example grammars are available for modern languages; the author of this plugin has written dozens.

YAJL (updated 02 Apr 2009)

Pop 57.18
Vit 42.96

YAJL (Yet Another JSON Library) is a small event-driven (SAX-style) JSON parser written in ANSI C, and a small validating JSON generator. It is highly portable, independent of data representation, fast, and tiny. It generates verbose error messages that include the context in which the error occurs in the input text, and it can parse JSON data incrementally off a stream.

The Lean Mean C++ Option Parser (updated 19 May 2012)

Pop 49.42
Vit 2.16

The Lean Mean C++ Option Parser handles program arguments (argc, argv). It supports the short and long option formats of getopt(), getopt_long(), and getopt_long_only(), but has a more convenient interface. It is a freestanding, header-only library with no dependencies, not even libc or STL. It comes with a usage message formatter which supports column alignment and line wrapping, making it ideal for localized messages with different lengths.
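For readers unfamiliar with the getopt() and getopt_long() option formats mentioned above, the sketch below shows them using Python's standard `getopt` module, which follows the same conventions. It does not show the Lean Mean C++ Option Parser's own interface; the option names (`-v`/`--verbose`, `-o`/`--output`) and the `parse_args` helper are invented for this example.

```python
import getopt


def parse_args(argv):
    """Parse a getopt-style argument list into settings and positionals."""
    # "vo:" declares the short options: -v takes no value, -o requires one.
    # The long-option list mirrors that: "verbose" is a flag,
    # "output=" requires a value (--output=FILE or --output FILE).
    opts, positional = getopt.getopt(argv, "vo:", ["verbose", "output="])

    settings = {"verbose": False, "output": None}
    for name, value in opts:
        if name in ("-v", "--verbose"):
            settings["verbose"] = True
        elif name in ("-o", "--output"):
            # A later occurrence overrides an earlier one.
            settings["output"] = value
    return settings, positional


settings, positional = parse_args(
    ["-v", "-o", "a.out", "--output=b.out", "main.c"]
)
print(settings, positional)
```

Parsing stops at the first non-option argument, so `main.c` is returned as a positional argument rather than being treated as an option.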

Gelatin (updated 20 Jan 2010)

Pop 48.74
Vit 1.00

Gelatin is a simple and fast program for transforming text to structured formats such as XML, JSON, or YAML. It is a combined lexer, parser, and output generator.

arg (updated 07 May 2010)

Pop 43.47
Vit 1.00

arg is a C++ command-line parser. Its goal is to minimize the coding effort needed to add command-line parameter processing to C++ programs.
