169 projects tagged "Text Processing"
Recoll is a personal full text desktop search tool based on Xapian. It provides an easy to use, feature-rich, easy administration interface with a Qt-based GUI. Text, HTML, PDF, PostScript, MS Word, OpenOffice, Wordperfect, KWord, Abiword, maildir, and mailbox mail folder formats are supported, along with their compressed versions and quite a few others. Powerful query facilities are provided. Multiple character sets are supported, and internal processing and storage uses Unicode UTF-8. Stemming is performed at query time and the stemming language can be switched after indexing.
Lover is a network communication framework that uses its own network message protocol, called PASP (Pure And Simple Protocol). The Java source code includes an example of how to pass data over the network through a testing port. The syntax of the message format is based on key-value pairs.
EZ Reusable Objects (EZRO) is a Web application that can be used by non-technical staff to manage content as "objects." Content objects containing text, video, and audio can be shared, modified, and re-styled to appear via a traditional Web site, an on-line course, an innovative "Coach," or as a community of interest site. It is highly scalable and can be used for public Web sites, secure environments, and private intra/extranets.
MyAtom Writer is a graphical tool for creating standard Atom 1.0 documents that embed links to FOAF and OTER documents. A Description component is used to specify descriptions of documents, information regarding the author, and content types. The content type is used to specify whether the Atom document contains links to FOAF and OTER documents. The Entry component is used to enter documents in XML, XHTML, HTML, or text format.
WAscii is a Web frontend intended to display an AsciiDoc documentation repository. It allows you to search and browse your documentation files and automatically converts AsciiDoc to HTML, PDF, and ODF documents. It is intended to work directly from a subversion repository containing your AsciiDoc files.
Barcode Writer in Pure Postscript implements the printing of many barcode formats entirely within PostScript, so that the process of converting the input string into the printed output is performed by the printer or print system. The project supports all major barcode formats including: EAN-13 (JAN-13), EAN-8 (JAN-8), UPC-A, UPC-E, EAN-5 & EAN-2 (EAN/UPC add-ons), ISBN (including legacy ISBN), ISMN (including legacy ISMN), ISSN, Code 128 (A, B & C), GS1-128, SSCC-18 (EAN-18, NVE), EAN-14, Code 39, Code 39 Extended, Code 93, Code 93 Extended, Code 32 (Italian Pharmacode), Pharmazentralnummer (PZN), Interleaved 2 of 5, ITF-14 (UPC SCS), GS1 DataBar (Omnidirectional, Stacked, Stacked Omnidirectional, Limited, Expanded, Expanded Stacked), Code 2 of 5 (Industrial, IATA, Matrix, Datalogic & COOP), Code 11 (USD-8), BC412, Codabar (NW-7), Pharmacode (including two-track), MSI, Plessey, Telepen, Channel Code, PosiCode, PDF417, Data Matrix (ECC200), QR Code (including Micro QR Code), and more.
Project35 is an application suite that allows users to generate data entry forms from XML schema. Application designers use a Configuration Tool to associate records and record fields defined in the schema with application properties that include features such as: validation services, controlled vocabulary services, general plugins, and various aspects of look-and-feel.
TWiki is a flexible, powerful, and simple Web based collaboration platform. It is suitable for dynamic intranets and knowledge bases, and for sharing and managing documents and collaborative projects. It resembles a normal Web site, but every page can be changed from a browser. It features automatic link generation, full text search, group authorization, Web forms, reporting, change notification, file attachments, revision control of pages and attachments, a modular templating system with skins, hierarchical navigation based on the topic parenting feature, and more. Plugins can be used to enhance the program and build groupware applications.
Silva is a CMS for organizations that manage multiple or complex Web sites. Content is stored in clean XML, independent of layout and presentation. Features include versioning, a workflow system, an integral visual editor, content reuse, sophisticated access control, multi-site management, extensive import/export facilities, fine-grained templating, and hi-res image storage and manipulation. Silva is built on top of the Zope Web application platform.