RSS 44 projects tagged "Parser"

Download Website Updated 28 Jan 2013 jsoup

Screenshot
Pop 187.00
Vit 14.49

jsoup is a Java library for working with real-world HTML. It can parse HTML from a URL, file, or string. It can find and extract data, using DOM traversal or CSS selectors. The HTML elements, attributes, and text can be manipulated. It can clean user-submitted content against a safe white-list. jsoup is designed to deal with all varieties of HTML found in the wild, from pristine and validating to invalid tag-soup; jsoup will create a sensible parse tree.

Download No website Updated 17 May 2013 any-dl

Screenshot
Pop 121.01
Vit 7.44

any-dl is a generic video downloader tool that uses a domain specific language to describe how to download videos from each video site.

Download Website Updated 21 Mar 2011 LEPL

Screenshot
Pop 118.71
Vit 7.96

LEPL is a recursive descent parser library written in Python. It is based on parser combinator libraries popular in functional programming, but also exploits Python language features. Operators provide a friendly syntax, and the consistent use of generators supports full backtracking and resource management. Backtracking implies that a wide variety of grammars are supported; appropriate memoisation ensures that even left-recursive grammars terminate.

Download Website Updated 06 Apr 2013 lihata

Screenshot
Pop 112.78
Vit 1.86

lihata is a compact textual language which can represent a tree of lists, hashes, and tables. The syntax tries to be minimal and flexible to allow formatting a lihata file to fit the context it represents. The source release contains an event and DoM parser and helper functions for maintaining lihata trees. lihata is a convenient language for both simple and complex configuration files and text representation of data files.

Download Website Updated 16 Dec 2012 listparser

Screenshot
Pop 106.40
Vit 12.47

listparser is a Python library that parses subscription lists (also called reading lists) and returns all of the feeds, subscription lists, and "opportunity" URLs that it finds. It supports OPML, RDF+FOAF, and the iGoogle exported settings format.

Download No website Updated 02 Apr 2013 cardme

Screenshot
Pop 105.24
Vit 14.03

cardme is a Java library implementation of RFC 2426, VCard. It provides Java applications with a way to read and write from and to the VCard file format. The project's goals are to provide a flexible and easy to use library with excellent documentation.

No download No website Updated 04 May 2013 pyC11

Screenshot
Pop 97.94
Vit 1.00

pyC11 is a grammar to parse programs in the C programming language following ISO/IEC 9899:2011. It is written using pyPEG, a parsing framework for Python. The grammar supports Python 2.7 and 3.x. The test bench requires py.test.

Download Website Updated 09 Dec 2012 feedparser

Screenshot
Pop 89.16
Vit 5.58

feedparser is a Python library that parses feeds. It supports the Atom, RDF, RSS, and CDF feed formats.

Download Website Updated 29 Nov 2011 Apache OpenNLP

Screenshot
Pop 84.29
Vit 1.54

Apache OpenNLP is a machine learning based toolkit for the processing of natural language text. It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution. These tasks are usually required to build more advanced text processing services.

No download Website Updated 19 May 2012 The Lean Mean C++ Option Parser

Screenshot
Pop 61.14
Vit 2.30

The Lean Mean C++ Option Parser handles program arguments (argc, argv). It supports the short and long option formats of getopt(), getopt_long(), and getopt_long_only(), but has a more convenient interface. It is a freestanding, header-only library with no dependencies, not even libc or STL. It comes with a usage message formatter which supports column alignment and line wrapping, making it ideal for localized messages with different lengths.

Screenshot

Project Spotlight

R2CORBA

CORBA for Ruby.

Screenshot

Project Spotlight

NuttX

A standards compliant RTOS with a small footprint.