2131 projects tagged "Text Processing"

Download Website Updated 27 Mar 2014 GroupServer

Screenshot
Pop 273.04
Vit 31.71

GroupServer is a Web-based mailing list manager designed for large sites. It provides email interaction like a traditional mailing list manager but also supports reading, searching, and posting of messages and files via the Web. Users have forum-style profiles, and can manage their email addresses and other settings using the same Web interface. It has supports features such as Atom feeds, a basic CMS, statistics, multiple verified addresses per user, and bounce detection, and is able to be heavily customized.

Download Website Updated 26 Mar 2014 Flat File Extractor

Screenshot
Pop 368.33
Vit 30.09

ffe is a flat file extractor. It can be used for reading different flat file structures and displaying them in different formats. ffe can read fixed length and separated text files and fixed length binary files. It is a command line tool developed under GNU/Linux. The main areas of use are extracting particular fields or records from a flat file, converting data from one format to an other, e.g. from CSV to fixed length, verifying a flat file structure, as a testing tool for flat file development, and displaying flat file content in human readable form.

Download Website Updated 11 Mar 2014 htmLawed

Screenshot
Pop 195.08
Vit 25.03

htmLawed is a PHP script that makes input text more secure, HTML standards-compliant, and suitable in general from the viewpoint of a Web-page administrator, for use in the body of HTML 4 or XHTML 1 or 1.1 documents. It is a customizable HTML/XHTML filter, processor, purifier, and sanitizer. It can ensure that HTML tags are balanced and properly nested tags, neutralize code that may be used for cross-site scripting (XSS) attacks, and limit the allowed HTML elements, tags, attributes, or URL protocols.

Download Website Updated 04 Mar 2014 Sanzang

Screenshot
Pop 126.58
Vit 7.39

Sanzang is a compact and simple cross-platform machine translation system. It is especially useful for translating from the CJK languages (Chinese, Japanese, and Korean), and it is very suitable for working with ancient and otherwise difficult texts. Unlike most other machine translation systems, Sanzang is small and approachable. Any user can develop his or her own translation rules, and these rules are simply stored in a text file and applied at runtime.

Download Website Updated 03 Mar 2014 tex-upmethodology

Screenshot
Pop 146.55
Vit 27.20

tex-upmethodology provides a complete set of LaTeX styles that permit you to write documents according to a UP-based methodology. Its major features are document history, task management, design and specification documentation, and helping tools. tex-upmethodology is officially supported by CTAN.

Download Website Updated 28 Feb 2014 DocBook Doclet

Screenshot
Pop 358.23
Vit 68.39

DocBook Doclet is a javadoc doclet that creates DocBook XML and UML class diagrams from Javadoc.

Download Website Updated 23 Feb 2014 loook

Screenshot
Pop 100.32
Vit 13.31

Loook searches for text strings in OpenOffice.org (and StarOffice 6.0 or later) files. AND, OR, and phrase searches are supported. It doesn't create an index, but searching should be fast enough, unless you have very many files.

No download Website Updated 21 Feb 2014 ExactScan

Screenshot
Pop 135.83
Vit 26.19

ExactScan is a versatile document capture application for home offices and workgroups. It is designed from the ground up for high-speed document scanners and can easily handle hundreds of images per minute, including duplex scans. Included functionality reaches from managing, sorting, and editing singles pages to writing multi- as well as single-page PDF files including JPEG compression and TIFF, JPEG, JPEG2000, and PNG bitmap files. ExactScan allows performing state of the art image processing including automatic cropping, deskewing, dynamic thresholding for perfect black and white documents, and descreening print rasters.

No download Website Updated 17 Feb 2014 iText

Screenshot
Pop 535.72
Vit 50.31

iText is a library that contains classes to generate and manipulate documents in the Portable Document Format (PDF). Document manipulation includes splitting, merging, and filling out forms (AcroForms, static and dynamic XFA forms).

No download Website Updated 27 Jan 2014 Apache MetaModel (incubation)

Screenshot
Pop 134.99
Vit 21.19

With MetaModel, you use a type-safe SQL-like API for querying any datastore. It is a data access framework providing a common interface for exploration and querying of different types of datastores. It isn't a data mapping framework. Instead, it emphasizes abstraction of metadata and the ability to add data sources at runtime, making MetaModel great for generic data processing applications, but less so for applications modeled around a particular domain.

Screenshot

Project Spotlight

phpMyAdmin

A tool that handles the basic administration of MySQL over the Web.

Screenshot

Project Spotlight

Collax V-Cube+

Virtualization and HA Management of virtual machines and embedded HA Storage.