2131 projects tagged "Text Processing"

Download Website Updated 16 Mar 2008 MyHeadlines

Screenshot
Pop 125.86
Vit 6.62

MyHeadlines is module that adds syndicated headline functionality to any PHP and MySQL-based website. Your users may subscribe to multiple RSS feeds from a fully categorized database of over 1,000 sources. It was previously a PHPNuke/PostNuke Addon, but can now be integrated with any Web site.

Download Website Updated 30 Jun 2012 Raptor RDF Syntax Library

Screenshot
Pop 178.49
Vit 17.85

Raptor is a C library providing a set of parsers and serializers for Resource Description Framework (RDF) triples by parsing syntaxes into RDF triples and serializing triples into a syntax. The parsers support RDF/XML, N-Triples, GRDDL, and Turtle, and via RSS tag soup: XML RSS, Atom 0.3, and Atom 1.0. The serializers support RDF/XML (3 flavours), Turtle, DOT, N-Triples, RSS 1.0, and Atom 1.0. Raptor handles RDF/XML as used by RDF applications such as RSS 1.0, FOAF, DOAP, Dublin Core, and OWL. It can use either expat or libxml2 for XML parsing, libcurl when available for URI retrieval, and is portable to many POSIX systems.

Download Website Updated 06 Jun 2001 Vim LaTeX Macros

Screenshot
Pop 24.74
Vit 68.99

Vim LaTeX Macros is a set of macros meant to make writing LaTeX with Vim more convenient.

No download Website Updated 03 Oct 2012 XML parser class

Screenshot
Pop 105.88
Vit 7.31

XML parser class is a PHP class that parses arbitrary XML input and builds an array with the structure of all tag and data elements. Optionally it can keep track of the positions of each element to locate elements that may be contextually in error. Supports a parsed file cache to minimize the overhead of parsing the same file repeatedly. Optimized parsing of simplified XML (SML) formats ignoring the tag attributes.

Download Website Updated 06 Mar 2008 Mguesser

Screenshot
Pop 30.13
Vit 2.49

Mguesser is a tool to guess a text's character set and language. It is a standalone part of the mnoGoSearch engine. More than 100 various character set and language combinations are supported.

Download Website Updated 06 Sep 2001 XML-Lit

Screenshot
Pop 30.98
Vit 1.43

XML-Lit is a simple program to perform very basic literate programming with any XML-based markup language. It uses James Clark's Expat XML parser to weave (convert to a form suitable for processing) and tangle (extract the source code from) your XML documents. It has only been tested with DocBook at the moment, but there is no reason why it should not work with any arbitrary XML markup.

Download Website Updated 05 Mar 2013 XIST

Screenshot
Pop 140.71
Vit 18.52

XIST is an extensible HTML and XML generator. It is also an XML parser with a very simple and Python-esque tree API. Every XML element type corresponds to a Python class, and these Python classes provide a conversion method to transform the XML tree (e.g. into HTML). XIST can be considered 'object-oriented XSLT'. XIST also includes a cross-platform templating language, Oracle utilities, and various other tools.

Download Website Updated 24 Jun 2001 LinkMaster

Screenshot
Pop 32.62
Vit 68.86

LinkMaster is a method of linking data between different applications on Palm devices. There are not many applications that support this method, but the list is growing. Even without special application support, it tracks recently-used programs and bookmarks for quick access.

Download Website Updated 09 Feb 2010 Enca

Screenshot
Pop 307.99
Vit 9.62

Enca detects the encoding of text files, on the basis of knowledge of their language. It can also convert them to other encodings, allowing you to recode files without knowing their current encoding. It supports most of Central and East European languages, and a few Unicode variants, independently on language.

Download Website Updated 30 Jun 2012 GNU Source-highlight

Screenshot
Pop 231.74
Vit 16.46

GNU Source-highlight produces a document with syntax highlighting when given a source file. It handles many languages, e.g., Java, C/C++, Prolog, Perl, PHP3, Python, Flex, HTML, and other formats, e.g., ChangeLog and log files, as source languages and HTML, XHTML, DocBook, ANSI color escapes, LaTeX, and Texinfo as output formats. Input and output formats can be specified with a regular expression-oriented syntax.

Screenshot

Project Spotlight

Cerridwen

Accurate solar system data for everyone.

Screenshot

Project Spotlight

Toxic

A general purpose template engine.