51 projects tagged "Parser"

No download No website Updated 15 Aug 2012 JWPL

Screenshot
Pop 59.30
Vit 1.96

JWPL is a language independent, database-driven, high performance Wikipedia API that provides structured access to information nuggets like redirects, categories, articles, and link structure. It contains a Mediawiki Markup parser that can be used to further analyze the contents of a Wikipedia page or standalone with other text, TimeMachine, which reconstructs a snapshot of Wikipedia from a specific date, or multiple snapshots from a time span, and RevisionMachine, which offers efficient access to the history of articles using a dedicated storage format which decreases storage space by 98%. This enables random access to the whole revision history without requiring several terabytes of storage for a single Wikipedia dump.

Download Website Updated 10 Jul 2012 csvgrep

Screenshot
Pop 15.49
Vit 26.62

csvgrep is a commandline program which enables users to execute searches on text-delimited files using a rudimentary query language. Its query language is bound to simplicity and expressivity, to be easily comprehensible. It aims at replacing both grep and awk when you are challenged to retrieve information from a text-delimited file based on the content of a specific field (or column). You can get what you want using the semantic already in the file’s underlying structure.

No download Website Updated 05 Jul 2012 UniCC

Screenshot
Pop 23.45
Vit 26.72

UniCC, (Universal Compiler-Compiler) is a powerful LALR(1) parser generator and language development system for computer professionals. It serves as an all-round design and build tool assisting compiler writers in any parsing-related task, including production quality compiler construction and the implementation of domain specific languages. It unifies an integrated generator for lexical analyzers and a powerful LALR(1) parser generator into one software solution. The programming interface is a rich, extendable, and innovative BNF-based grammar definition language for expressing context-free grammars.

No download Website Updated 19 May 2012 The Lean Mean C++ Option Parser

Screenshot
Pop 45.56
Vit 2.15

The Lean Mean C++ Option Parser handles program arguments (argc, argv). It supports the short and long option formats of getopt(), getopt_long(), and getopt_long_only(), but has a more convenient interface. It is a freestanding, header-only library with no dependencies, not even libc or STL. It comes with a usage message formatter which supports column alignment and line wrapping, making it ideal for localized messages with different lengths.

No download No website Updated 17 May 2012 Piglet

Screenshot
Pop 19.36
Vit 27.60

Piglet is a tool for parsing and lexing text for the .NET framework. The purpose of Piglet is to provide an easy-to-use tool for parsing text which can be easily included in any .NET project as a single assembly. In contrast to most parser generators, Piglet provides a fluent interface which enables you to express your grammar in a syntax which is accessible for users with no prior experience of parser generators. Piglet generates efficient, type safe, and reentrant LALR(1) parsers at runtime, which saves you from having a pre-compile step to generate your parsing tables. It also includes a lexical scanner generator which can be used independently of the parser generator.

No download No website Updated 25 Mar 2012 XPaF

Screenshot
Pop 20.00
Vit 28.55

XPath-based Parsing Framework (XPaF) is a simple and fast parsing framework which makes it easy to extract relations (subject-predicate-object triples) from HTML and XML documents.

Download Website Updated 29 Nov 2011 Apache OpenNLP

Screenshot
Pop 84.10
Vit 1.49

Apache OpenNLP is a machine learning based toolkit for the processing of natural language text. It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution. These tasks are usually required to build more advanced text processing services.

No download Website Updated 08 Oct 2011 Serd

Screenshot
Pop 16.58
Vit 31.36

Serd is a lightweight C library for RDF syntax which supports reading and writing Turtle and NTriples.

Download No website Updated 28 Sep 2011 NGEN XDCC Packlist Parser

Screenshot
Pop 17.44
Vit 31.54

NGEN XDCC Packlist Parser is an XDCC packlist parser which integrates into Apache through mod_cgi and is easy to set up and configure.

Download No website Updated 15 Sep 2011 Yap4j

Screenshot
Pop 23.07
Vit 31.74

Yap4j is the simplest library for parsing CSV files in Java. It deserializes CSV files into a list of POJOs using a set of Java annotations, while allowing you to specify Object-CSV mappings. It automatically converts to and from a wide range of data types, and includes support for types from popular libraries such as Joda Time, and support for custom record delimiters.

Screenshot

Project Spotlight

aicwl

An Ada library of industrial control widgets.

Screenshot

Project Spotlight

Chicken

A Scheme to C compiler.