AutoLaTeX is a tool for managing small to large LaTeX documents. It detects which files which are used to build the document (included TeX files, BibTeX, figures, etc.), and launches the various different tools (latex, bibtex, makeindex) when the sources files have been changed. It provides translation rules which automatically generate figures in EPS, PNG, or PDF formats from different types of sources (dia, xfig, svg, astah, source code, etc.) AutoLaTeX also provides graphical user interfaces, a plugin for the editors Gedit and Sublime Text, and a standalone Gtk application.
docx2txt is a tool that attempts to generate equivalent text files from Microsoft .docx documents, preserving some formatting and document information (which MS text conversion drops) along with appropriate character conversions for a good (ASCII) text experience. It is a platform independent solution consisting of (core) Perl and (wrapper) Unix/Windows shell scripts and a configuration file to control the output text appearance to fair extent. It can very conveniently be used to build a Web based docx document conversion service. Some Makefiles and Windows batch files are provided for easy installation of the scripts. With unzippers like CakeCmd that can deal with corrupt Zip archives, this tool can extract text from corrupt docx documents in many cases, where MS word processor fails to even open them.
Barcode Writer in Pure Postscript implements the printing of many barcode formats entirely within PostScript, so that the process of converting the input string into the printed output is performed by the printer or print system. The project supports all major barcode formats including: EAN-13 (JAN-13), EAN-8 (JAN-8), UPC-A, UPC-E, EAN-5 & EAN-2 (EAN/UPC add-ons), ISBN (including legacy ISBN), ISMN (including legacy ISMN), ISSN, Code 128 (A, B & C), GS1-128, SSCC-18 (EAN-18, NVE), EAN-14, Code 39, Code 39 Extended, Code 93, Code 93 Extended, Code 32 (Italian Pharmacode), Pharmazentralnummer (PZN), Interleaved 2 of 5, ITF-14 (UPC SCS), GS1 DataBar (Omnidirectional, Stacked, Stacked Omnidirectional, Limited, Expanded, Expanded Stacked), Code 2 of 5 (Industrial, IATA, Matrix, Datalogic & COOP), Code 11 (USD-8), BC412, Codabar (NW-7), Pharmacode (including two-track), MSI, Plessey, Telepen, Channel Code, PosiCode, PDF417, Data Matrix (ECC200), QR Code (including Micro QR Code), and more.
doclifter helps with lifting documents with nroff markup to XML-DocBook. Lifting documents from presentation level to semantic level is hard, and a really good job requires human polishing. This tool aims to do everything that can be mechanized, and to preserve any troff-level information that might have structural implications in XML comments. TBL tables are translated into DocBook table markup, PIC into SVG, and EQN into MathML (relying on pic2svg and GNU eqn for the last two).
PSX is a PHP framework for creating RESTful APIs. It helps you to build clean URLs serving Web standard formats like JSON, XML, Atom, and RSS. It includes a handler system that abstracts away SQL queries from domain logic, a routing system that executes correct controller method for the location of the controller and the method annotation, and a flexible data system that converts database records into formats like JSON, XML, Atom, and RSS. A lightweight DI container handles dependencies. The controller supports request and response filters that can modify the HTTP request or response, and filters are provided for Basic and Oauth authentication.
htmLawed is a PHP script that makes input text more secure, HTML standards-compliant, and suitable in general from the viewpoint of a Web-page administrator, for use in the body of HTML 4 or XHTML 1 or 1.1 documents. It is a customizable HTML/XHTML filter, processor, purifier, and sanitizer. It can ensure that HTML tags are balanced and properly nested tags, neutralize code that may be used for cross-site scripting (XSS) attacks, and limit the allowed HTML elements, tags, attributes, or URL protocols.
Mini-XML is a small XML parsing library that you can use to read XML and XML-like data files in your application without requiring large non-standard libraries. It only requires an ANSI C compatible compiler (GCC works, as do most vendors' ANSI C compilers) and a "make" program. It supports reading of UTF-8 and UTF-16 and writing of UTF-8 encoded XML strings and files, and provides a hierarchical view of the file via a linked-list tree structure of typed nodes and functions for managing, traversing, indexing, and searching the tree.
queXC is a Web-based data cleaning and coding/classification system that takes a data file (such as data collected from a questionnaire) and cleans the text input fields by spacing them and spell checking them. It allows operators to code text fields to existing coding schemes, or to create a coding scheme on the fly. Multiple operators can code and clean simultaneously, with the ability to assign operators to do particular codes. The queXC system includes some coding schemes created from ABS (Australian Bureau of Statistics) data. It can be used as an open source replacement for Nvivo in some situations.