2138 projects tagged "Text Processing"
Libxslt is a C library for GNOME that allows developers to work with XSLT. It is based on libxml for XML parsing, tree manipulation, and XPath support. Also included is 'xsltproc', a command-line XSLT processor. The library is written in plain C, making as few assumptions as possible and sticking closely to ANSI C/POSIX for easy embedding. It should work on Linux, Unix, and Windows. Though not designed primarily with performance in mind, libxslt seems to be a relatively fast processor. It also includes full support for the EXSLT set of extension functions as well as some common extensions present in other XSLT engines.
FIGlet is a program for making large letters out of ordinary text. It prints its input using large characters made up of ordinary screen characters. FIGlet output is generally reminiscent of the sort of "signatures" many people like to put at the end of email and UseNet messages. It is also reminiscent of the output of some banner programs, although it is oriented normally, not sideways.
xsltester is an applet for testing XSL files. It can execute and judge test routines written in XSL templates without a separate XML input file, so developers have to create only one testing XSL file per test. It contains another applet, xslviewer, which can display an API document for an XSL file. It displays lists of templates (elements), named templates (functions and subroutines), and parameters, and optionally displays test results for each.
libSieve is an implementation of the Sieve mail sorting language originally developed for the Cyrus mail system. libSieve helps to bring mail sorting functionality into your mail server application without the need to reinvent the wheel. As a library, it is not intended for end users expecting a ready-to-run program.
QOF (Query Object Framework) provides a set of C language utilities for performing generic, structured, complex queries on data held by a set of C/C++ objects. This framework is unique in that it does not require SQL or any database at all to perform the query. Thus, it allows programmers to add query support to their applications without having to hook into an SQL database.
xslbook is an XSLT applet for displaying a book-style document in a Web browser that supports XSLT. xslbook transforms user-created XML files into HTML documents using the XSLT engine built into the Web browser, so documents can be displayed directly without other transformation tools. It offers automatic numbering for chapters and block elements, automatic generation of a table of contents and navigation links, a data operating function for other XML files, keyword search, and so on. It is a sub-project of xslet.
GNU Libidn is an implementation of the Stringprep, Punycode, and IDNA specifications defined by the IETF Internationalized Domain Names (IDN) working group. It is used to prepare internationalized strings (such as domain name labels, usernames, and passwords) in order to increase the likelihood that string input and string comparison work in ways that make sense for typical users throughout the world. The library contains a generic Stringprep implementation that does Unicode 3.2 NFKC normalization, mapping and prohibition of characters, and bidirectional character handling. Profiles for iSCSI, Kerberos 5, Nameprep, SASL, and XMPP are included. Punycode and ASCII Compatible Encoding (ACE) via IDNA are supported.
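To make the Punycode and ACE terms above more concrete, here is a minimal sketch that uses Python's standard library rather than Libidn itself. The helper names to_ace and from_ace are hypothetical, and the NFKC-plus-lowercase step only approximates Nameprep (Python's Unicode tables are newer than Unicode 3.2, and the prohibition and bidirectional checks are omitted).

```python
import unicodedata


def to_ace(label: str) -> str:
    """Convert one domain label to an ASCII Compatible Encoding (ACE) form.

    Hypothetical helper, not part of Libidn. Only an approximation of
    Nameprep: it normalizes with NFKC and lowercases, nothing more.
    """
    normalized = unicodedata.normalize("NFKC", label).lower()
    if normalized.isascii():
        # Pure ASCII labels need no Punycode encoding.
        return normalized
    # Non-ASCII labels are Punycode-encoded and given the IDNA "xn--" prefix.
    return "xn--" + normalized.encode("punycode").decode("ascii")


def from_ace(label: str) -> str:
    """Decode an ACE label back to Unicode; other labels pass through."""
    if label.lower().startswith("xn--"):
        return label[4:].encode("ascii").decode("punycode")
    return label


if __name__ == "__main__":
    print(to_ace("Bücher"))
    print(from_ace("xn--bcher-kva"))
```

An application that needs the full Stringprep profiles (prohibited characters, bidirectional rules, Unicode 3.2 tables) would call a complete implementation such as Libidn instead of this simplified pair of functions.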
SDoP (Simple DocBook Processor) reads a DocBook XML file, processes it into typeset pages, and outputs the result as PostScript (which can easily be converted to a PDF). It is "simple" because it supports only a subset of DocBook, and also because it does not make use of a DTD or stylesheets or any other heavyweight apparatus. It is a single program. SDoP is used to format the Exim reference manual.
Search::Xapian is a Perl XS frontend to the Xapian C++ search library. It is a fairly complete wrapper: most features of the Xapian library are made available for use from Perl. Xapian is a highly adaptable toolkit that allows developers to easily add advanced indexing and search facilities to their own applications. It supports the Probabilistic Information Retrieval model as well as a rich set of boolean query operators. It's fast and scalable to hundreds of millions of documents.