RSS 48 projects tagged "Parser"

No download No website Updated 15 Aug 2012 JWPL

Screenshot
Pop 61.14
Vit 1.98

JWPL is a language independent, database-driven, high performance Wikipedia API that provides structured access to information nuggets like redirects, categories, articles, and link structure. It contains a Mediawiki Markup parser that can be used to further analyze the contents of a Wikipedia page or standalone with other text, TimeMachine, which reconstructs a snapshot of Wikipedia from a specific date, or multiple snapshots from a time span, and RevisionMachine, which offers efficient access to the history of articles using a dedicated storage format which decreases storage space by 98%. This enables random access to the whole revision history without requiring several terabytes of storage for a single Wikipedia dump.

No download No website Updated 25 Mar 2012 XPaF

Screenshot
Pop 21.49
Vit 27.50

XPath-based Parsing Framework (XPaF) is a simple and fast parsing framework which makes it easy to extract relations (subject-predicate-object triples) from HTML and XML documents.

No download No website Updated 17 May 2012 Piglet

Screenshot
Pop 20.32
Vit 26.51

Piglet is a tool for parsing and lexing text for the .NET framework. The purpose of Piglet is to provide an easy-to-use tool for parsing text which can be easily included in any .NET project as a single assembly. In contrast to most parser generators, Piglet provides a fluent interface which enables you to express your grammar in a syntax which is accessible for users with no prior experience of parser generators. Piglet generates efficient, type safe, and reentrant LALR(1) parsers at runtime, which saves you from having a pre-compile step to generate your parsing tables. It also includes a lexical scanner generator which can be used independently of the parser generator.

No download Website Updated 05 Jul 2012 UniCC

Screenshot
Pop 23.15
Vit 25.59

UniCC, (Universal Compiler-Compiler) is a powerful LALR(1) parser generator and language development system for computer professionals. It serves as an all-round design and build tool assisting compiler writers in any parsing-related task, including production quality compiler construction and the implementation of domain specific languages. It unifies an integrated generator for lexical analyzers and a powerful LALR(1) parser generator into one software solution. The programming interface is a rich, extendable, and innovative BNF-based grammar definition language for expressing context-free grammars.

Download Website Updated 10 Jul 2012 csvgrep

Screenshot
Pop 15.84
Vit 25.49

csvgrep is a commandline program which enables users to execute searches on text-delimited files using a rudimentary query language. Its query language is bound to simplicity and expressivity, to be easily comprehensible. It aims at replacing both grep and awk when you are challenged to retrieve information from a text-delimited file based on the content of a specific field (or column). You can get what you want using the semantic already in the file’s underlying structure.

Download No website Updated 31 Oct 2012 MightyString

Screenshot
Pop 16.67
Vit 25.36

MightyString adds array functionality and other tools for Ruby strings, including matching, indexing, substitution, and deletion. MightyString::HTML.strip_html provides more ideal HTML-to-ASCII formatting output. This is an advanced block "filtering" module. It works very well, with extremely rare cases which fall through its fingers.

No download No website Updated 04 Sep 2012 EXIP

Screenshot
Pop 32.31
Vit 1.00

EXIP provides a C library for the parsing and serialization of Efficient XML Interchange (EXI) format streams. The focus is portability and efficiency for embedded systems development. The project was started at the EISLAB research group in the Department of Computer Science, Electrical and Space Engineering, Luleå University of Technology, and is part of research efforts to bring resource-constrained embedded devices, such as wireless sensor nodes, closer to the enterprise business processes taking place in processing, manufacturing, and communication industries.

Download No website Updated 21 Feb 2014 any-dl

Screenshot
Pop 253.54
Vit 6.22

any-dl is a generic video downloader tool that uses a domain specific language to describe how to download videos from each video site.

Download Website Updated 12 Feb 2013 PHP Markdown and BBCode Parser

Screenshot
Pop 16.70
Vit 20.80

PHP Markdown and BBCode Parser is a package that converts Markdown and BBCode to HTML. A base class is also capable of sanitizing the text by removing JavaScript tags.

Download Website Updated 12 Feb 2013 HtmlCleaner

Screenshot
Pop 16.97
Vit 20.77

HtmlCleaner is an HTML parser. HTML found on the Web is usually dirty, ill-formed, and unsuitable for further processing. For any serious consumption of such documents, it is necessary to first clean up the mess and bring order to the tags, attributes, and ordinary text. For a given HTML document, HtmlCleaner reorders individual elements and produces well-formed XML. By default, it follows rules similar to those which most Web browsers use to create a Document Object Model. However, the user may provide custom tag and rule sets for tag filtering and balancing.

Screenshot

Project Spotlight

Catharsis.NET.Web.Widgets

An ASP.NET MVC tag library with social media widgets.

Screenshot

Project Spotlight

coreBOS

A business empowering tool and adaptable software program.