62 projects tagged "Text Processing"

pyratemp (updated 18 Sep 2013)
Pop 71.87
Vit 8.55

pyratemp is probably one of the smallest complete template engines for Python, at about 500 lines of code. Its templates use a very small set of special syntax, which reduces complexity and the likelihood of bugs and leads to an easy-to-use, intuitive interface. It uses embedded Python expressions (in a "sandbox"), is well documented, has full Unicode support, and produces very good error messages, which is especially useful when creating new templates.
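To illustrate the general idea of embedded Python expressions in a template, here is a minimal sketch. It is not pyratemp's implementation or API: the `{{ expr }}` placeholder syntax and the `render` function are hypothetical names chosen for this example, and the restricted `eval` only hints at what a "sandbox" does rather than providing real isolation.

```python
import re

# Hypothetical placeholder syntax for this sketch only; not pyratemp's own.
PLACEHOLDER = re.compile(r"\{\{(.+?)\}\}")

def render(template, **data):
    """Replace each {{ expr }} with the value of the Python expression."""
    def substitute(match):
        expr = match.group(1).strip()
        # An empty __builtins__ mapping hides open(), __import__(), and so on.
        # This is NOT a real security boundary, only an illustration.
        value = eval(expr, {"__builtins__": {}}, dict(data))
        return str(value)
    return PLACEHOLDER.sub(substitute, template)

print(render("Hello {{ name.upper() }}, you have {{ count + 1 }} items.",
             name="ada", count=2))
# prints: Hello ADA, you have 3 items.
```

Keeping the special syntax down to a single placeholder form is what keeps a toy like this short; a real engine adds control structures, escaping, and error reporting on top.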

Xapian and Omega (updated 28 Jun 2012)
Pop 404.90
Vit 16.30

Xapian is a search engine library, scalable to collections containing hundreds of millions of documents. It's written in C++ with bindings for Perl, Python, PHP, Java, Tcl, C#, Ruby, and Lua. It is a highly adaptable toolkit that allows developers to easily add advanced indexing and search facilities to their own applications. It supports the Probabilistic Information Retrieval model as well as a rich set of boolean query operators. Omega is a Web search application built upon the Xapian library. It can index a Web server's document tree (including HTML, PDF, OpenOffice, MS Word/Excel/PowerPoint/Works, WordPerfect, RTF, PS, etc.), or data exported from arbitrary sources (e.g. SQL databases).

python-djvulibre (updated 19 Jun 2012)
Pop 57.76
Vit 7.19

python-djvulibre is a set of Python bindings for the DjVuLibre library, an open source implementation of DjVu.

python-libcommon (updated 12 Jun 2012, no download)
Pop 57.58
Vit 6.65

python-libcommon provides Map, Set, and Graph data types for Python. The Graph type can handle directed or undirected labeled graphs, and graph structure can be described as XML output.

PyBison (updated 09 Jun 2012, no download)
Pop 42.00
Vit 1.00

PyBison is a sophisticated yet easy-to-use parser creation toolkit for Python that interfaces directly to Bison (yacc)-based parsers. It provides full LALR(1) grammar support, covering everything from simple parsing tasks to writing compilers for high-level languages. Parser code is automatically generated from rules within user-created Parser classes (written in Python), and then compiled, yacc'ed, and linked into a shared library, which is loaded into the running process. All this happens automatically. When the parser runs, it connects directly with the yyparse() routine and receives event callbacks as parse targets are reached.

The XSLT C library for GNOME (updated 09 Jun 2012)
Pop 365.54
Vit 18.78

Libxslt is a C library for GNOME which allows developers to work with XSLT. It is based on libxml for XML parsing, tree manipulation, and XPath support. Also included is 'xsltproc', a command line XSLT processor. The library is written in plain C, making as few assumptions as possible, and sticking closely to ANSI C/POSIX for easy embedding. It should work on Linux, Unix, and Windows. Though not designed primarily with performance in mind, libxslt seems to be a relatively fast processor. It also includes full support for the EXSLT set of extension functions as well as some common extensions present in other XSLT engines.

PyLucene (updated 30 Oct 2011)
Pop 115.15
Vit 9.76

PyLucene is a Python extension for accessing Java Lucene from Python. Its goal is to allow use of Lucene's text indexing and searching capabilities from Python. It is designed to be API compatible with the latest version of Java Lucene.

LEPL (updated 21 Mar 2011)
Pop 121.75
Vit 7.36

LEPL is a recursive descent parser library written in Python. It is based on parser combinator libraries popular in functional programming, but also exploits Python language features. Operators provide a friendly syntax, and the consistent use of generators supports full backtracking and resource management. Backtracking implies that a wide variety of grammars are supported; appropriate memoisation ensures that even left-recursive grammars terminate.
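The link between generators and backtracking can be sketched in plain Python. This is not LEPL's API: `literal`, `either`, `sequence`, and `parse_all` are hypothetical names for this example, and the sketch leaves out memoisation, so it would not handle left-recursive grammars.

```python
def literal(text):
    """Matcher for a fixed string; yields (value, next_position) on success."""
    def match(stream, pos):
        if stream.startswith(text, pos):
            yield text, pos + len(text)
    return match

def either(*matchers):
    """Yield the results of every alternative, in order."""
    def match(stream, pos):
        for m in matchers:
            yield from m(stream, pos)
    return match

def sequence(first, second):
    """Match first, then second; retry later results of first if second fails."""
    def match(stream, pos):
        for left, mid in first(stream, pos):
            for right, end in second(stream, mid):
                yield (left, right), end
    return match

def parse_all(matcher, stream):
    """Return the first result that consumes the whole input, else None."""
    for result, end in matcher(stream, 0):
        if end == len(stream):
            return result
    return None

grammar = sequence(either(literal("a"), literal("ab")), literal("bc"))
print(parse_all(grammar, "abbc"))
# prints: ('ab', 'bc')
```

For the input "abbc", the alternative "a" matches first but the following "bc" does not, so the generator simply moves on to the alternative "ab". No explicit backtracking code is needed, because each matcher lazily offers all of its possible results.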

RXPY (updated 29 Jun 2010, no download)
Pop 29.80
Vit 1.00

RXPY is a pure-Python regular expression library, compatible with the re package. RXPY can be used as a test-bed for new matching algorithms, since the parser and compatibility components are reusable.
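For reference, the interface in question looks like this in the standard library. The sketch uses only Python's built-in `re` module; whether RXPY accepts these exact calls unchanged, and under which import name, is an assumption not confirmed by the description above.

```python
import re

# Named groups, finditer, and sub are typical re-package features that a
# compatible library would be expected to support.
pattern = re.compile(r"(?P<key>\w+)=(?P<value>\d+)")
text = "pop=71 vit=8"

pairs = [(m.group("key"), int(m.group("value"))) for m in pattern.finditer(text)]
masked = pattern.sub(r"\g<key>=?", text)

print(pairs)   # prints: [('pop', 71), ('vit', 8)]
print(masked)  # prints: pop=? vit=?
```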

Karaoke con SMIL (updated 30 May 2010)
Pop 15.75
Vit 1.00

Karaoke con SMIL is a project to synchronize audio, text, and images and export the final result to SMIL, which can be played in multimedia players such as Real Player, Helix Player, Ambulant Player, etc. on several devices (computers, mobile phones, PDAs, etc.). In short, this application synchronizes the multimedia elements in a way similar to karaoke.
