2131 projects tagged "Text Processing"

Sagasu (updated 25 Nov 2012)

Popularity: 99.24
Vitality: 12.82

Sagasu is a GNOME tool to find strings in multiple files. The user specifies the search directory and the set of files to be searched. Double-clicking on a search result launches a user command that can, for example, load the file in an editor at the appropriate line. The search can optionally ignore CVS directories. Sagasu is a Japanese word that means "to search."

GeSHi (updated 23 Nov 2012, no website listed)

Popularity: 160.28
Vitality: 18.65

GeSHi is a generic syntax highlighter for PHP that takes any source code and highlights it in XHTML and CSS. It features case-sensitive or insensitive highlighting, auto-caps/non-caps of any keyword, an unlimited scope for styling, the use of CSS in which almost any aspect of the source can be highlighted, the use of CSS classes to massively reduce the amount of output code, function-to-URL capabilities, line numbering, and much more. Over 100 languages are supported, including Java, C, PHP, HTML, CSS, SQL, Pascal, C++, XML, ASP, and ASM.

boxes (updated 23 Nov 2012)

Popularity: 126.85
Vitality: 6.87

Boxes is a text filter that can draw any kind of box around its input text. Box design choices range from simple boxes to complex ASCII art. A box can also be removed and repaired, even if it has been badly damaged by editing of the text inside. Since the generated boxes may be open on any side, the program can also be used to create regional comments in any programming language. New box designs of all sorts can easily be added and shared by appending to a free-format configuration file. In addition to being a command-line tool, Boxes integrates well with any text editor that supports filters.
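
To illustrate the basic idea of a box-drawing text filter, here is a minimal Python sketch. It is not the Boxes program or its design format; the function name `draw_box` and the fixed `+`, `-`, and `|` border characters are assumptions made for this example only.

```python
def draw_box(text):
    """Return the text surrounded by a simple ASCII box (illustrative only)."""
    # An empty input still yields an (empty) box rather than an error.
    lines = text.splitlines() or [""]
    width = max(len(line) for line in lines)
    border = "+" + "-" * (width + 2) + "+"
    # Pad every line to the same width so the right edge lines up.
    body = ["| " + line.ljust(width) + " |" for line in lines]
    return "\n".join([border, *body, border])
```

Calling `draw_box("hi\nthere")` returns a four-line string with the two input lines framed by a border. A real filter would read the text from stdin and write the result to stdout, so that an editor can pipe a selection through it.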

AsmXml (updated 11 Nov 2012)

Popularity: 72.91
Vitality: 7.45

AsmXml is a very fast XML parser and decoder for x86 platforms. It is written in pure assembler and supports only a subset of the XML 1.0 specification.

cb2Bib (updated 05 Nov 2012)

Popularity: 145.35
Vitality: 21.05

The cb2Bib is a tool for rapidly extracting bibliographic references from email alerts, journal Web pages, and PDF files. It facilitates the capture of single references from unformatted and nonstandard sources. Output references are written in BibTeX. Article files can be easily linked and renamed by dragging them onto the cb2Bib window. Additionally, it permits editing and browsing BibTeX files, citing references, searching references and the full contents of the referenced documents, inserting bibliographic metadata into documents, and writing short notes that interrelate several references.

Jericho HTML Parser (updated 31 Oct 2012)

Popularity: 118.49
Vitality: 10.09

Jericho HTML Parser is a Java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognized or invalid HTML. It also provides high-level HTML form manipulation functions.

Daniels Colorize.pl (updated 26 Oct 2012)

Popularity: 33.84
Vitality: 2.52

Colorize.pl is a short script that reads from stdin and writes to stdout. Rows that match a user's search strings will be colorized with user-defined colors. Command line options are available. Colorization is done via ANSI escape codes.
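
The stdin-to-stdout colorizing technique described above can be sketched in a few lines of Python. This is not the Colorize.pl script itself (which is written in Perl); the `RULES` table, the search patterns, and the function name `colorize_line` are hypothetical choices for illustration. The escape sequences are standard ANSI SGR color codes.

```python
import re
import sys

RESET = "\033[0m"

# Hypothetical pattern-to-color table: ANSI SGR 31 is red, 33 is yellow.
RULES = [
    (re.compile(r"error", re.IGNORECASE), "\033[31m"),
    (re.compile(r"warn", re.IGNORECASE), "\033[33m"),
]

def colorize_line(line, rules=RULES):
    """Wrap the whole row in the color of the first rule that matches it."""
    text = line.rstrip("\n")
    for pattern, color in rules:
        if pattern.search(text):
            return f"{color}{text}{RESET}"
    return text  # rows without a match pass through unchanged

def main():
    # Act as a filter: read rows from stdin, write them to stdout.
    for line in sys.stdin:
        print(colorize_line(line))

if __name__ == "__main__":
    main()
```

Saved under an assumed name such as `colorize_sketch.py`, it could be used as `tail -f app.log | python colorize_sketch.py`, provided the terminal interprets ANSI escape codes.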

Mono Project (updated 25 Oct 2012)

Popularity: 326.16
Vitality: 16.52

Mono Project is an Open Source implementation of the various ECMA and .NET framework technologies for Unix, Mac OS X, and Windows. The project includes a compiler, a class library, and a CLI runtime engine.

Invenio (updated 22 Oct 2012)

Popularity: 79.40
Vitality: 7.48

Invenio (formerly CDSware) is a suite of applications that provides the framework and tools for building and managing an autonomous digital library server. It complies with the Open Archives Initiative metadata harvesting protocol (OAI-PMH) and uses MARC 21 as its underlying bibliographic standard. Its flexibility and performance make it a comprehensive solution for the management of document repositories of moderate to large size.

Grutatxt (updated 11 Oct 2012)

Popularity: 108.39
Vitality: 8.97

Grutatxt is a plain text to HTML (and other formats) converter. It converts subtle text markup for lists, bold, italics, tables, and headings into the corresponding HTML, troff, man page, or LaTeX markup, so that the source text files remain readable.
