GPP is a general-purpose preprocessor with customizable syntax, suitable for a wide range of preprocessing tasks. Its independence from any programming language makes it much more versatile than cpp, while its syntax is lighter and more flexible than that of m4. The syntax is fully customizable, which makes it possible to process text files, HTML, or source code in a variety of languages equally efficiently.
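To make the idea of customizable macro syntax concrete, here is a minimal Python sketch. It is not GPP's implementation or its actual syntax: the `make_expander` helper, the delimiter choices, and the macro names are all hypothetical, and the sketch only handles argument-less macros.

```python
import re

def make_expander(prefix, suffix):
    """Build an expander for macro calls written as prefix + NAME + suffix.

    Hypothetical helper: the delimiters are chosen by the caller, which is
    the sense in which the syntax is "customizable" in this sketch.
    """
    pattern = re.compile(re.escape(prefix) + r"(\w+)" + re.escape(suffix))

    def expand(text, macros):
        # Replace each known macro call; leave unknown names untouched.
        return pattern.sub(lambda m: macros.get(m.group(1), m.group(0)), text)

    return expand

macros = {"TITLE": "Release notes", "VERSION": "2.0"}

# The same macro table used with two different surface syntaxes.
html_style = make_expander("<#", ">")
tex_style = make_expander("\\", "{}")

html_out = html_style("<h1><#TITLE></h1>", macros)
tex_out = tex_style(r"\TITLE{} v\VERSION{}", macros)
```

Because the delimiters are data rather than part of the expander, the same macro definitions can be applied to HTML-like and TeX-like input without changing the expansion logic.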
img2pdf is a simple command-line utility to convert a list of PNG, TIFF, or JPEG images to a single PDF document, one image per page. It was designed specifically to convert scanned pages to PDF format, but it will accept images of any aspect ratio and scale them to full page size (which, in general, will not maintain the aspect ratio). The hard work is performed by the Panda PDF library and its dependencies.
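The remark about aspect ratio follows from simple arithmetic: filling the page means scaling width and height independently. The Python sketch below illustrates this; it is not img2pdf's code, and the A4 page size in PDF points and the function names are assumptions made for the example.

```python
# Assumed page size: A4 in PDF points (1 pt = 1/72 inch), rounded.
PAGE_W, PAGE_H = 595.0, 842.0

def fill_page_scale(img_w, img_h, page_w=PAGE_W, page_h=PAGE_H):
    """Return (sx, sy), the independent horizontal and vertical scale factors
    needed to stretch an img_w x img_h image over the whole page."""
    return page_w / img_w, page_h / img_h

def keeps_aspect_ratio(img_w, img_h, page_w=PAGE_W, page_h=PAGE_H, tol=1e-6):
    """True only when both scale factors agree, i.e. the image already has
    the same proportions as the page."""
    sx, sy = fill_page_scale(img_w, img_h, page_w, page_h)
    return abs(sx - sy) <= tol * max(sx, sy)

# A scan with page proportions is scaled uniformly ...
scan = fill_page_scale(1190, 1684)
# ... while a square image is stretched more vertically than horizontally.
square = fill_page_scale(1000, 1000)
```

Only images whose width-to-height ratio matches the page come out undistorted; anything else is stretched along one axis.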
OpenXAdES enables people to work with legally binding digital signatures (primarily creating and verifying them). Legislation often defines a set of requirements that legal digital signature technologies and infrastructures must comply with, and OpenXAdES aims to meet many, if not all, of these requirements across different jurisdictions.
Ellogon is a multi-lingual, cross-platform, general-purpose language engineering environment, developed to aid both researchers in computational linguistics and companies that produce and deliver language engineering systems. As a language engineering platform, it offers an extensive set of facilities, including tools for processing and visualising textual/HTML/XML data and associated linguistic information, support for lexical resources (such as creating and embedding lexicons), and tools for creating annotated corpora, accessing databases, comparing annotated data, and transforming linguistic information into vectors for use with various machine learning algorithms.
nexB OpenAssets is a tool for inventorying, managing, and monitoring applications, software, hardware, networks, and generally any IT asset. It is designed so that system administrators, IT staff, and finance staff can determine what they have, how it is configured, what it is used for, and how much it is being used, so that informed decisions can be made. It complements existing network management software, integrates with a growing number of protocols and tools, and features agentless discovery and inventory, configuration management including dependencies and correlation, monitoring, and reporting. It makes extensive and innovative use of XML, XPath, and XQuery.
pHoToMoLo is a toolset for managing JPEG images in JFIF or Exif format. Photomolo generates a static HTML 4.01- and CSS 1-based multi-resolution image gallery for a directory tree containing images. Exifiron losslessly compresses and corrects the orientation of Exif JPEG images. Jpegnail removes or embeds thumbnails. Jpegcom lets you edit embedded image comments using any text editor and image viewer. A shell script frontend for Exifiron is included for copying images from removable storage, such as USB drives or CompactFlash cards.
libiptcdata is a C library for manipulating the International Press Telecommunications Council (IPTC) metadata stored within multimedia files such as images. This metadata can include captions and keywords, often used by popular photo management applications. The library provides routines for parsing, viewing, modifying, and saving this metadata. The libiptcdata package also includes Python bindings and a command-line utility, iptc, for viewing and editing IPTC data in JPEG files.
fakedbfs is a suite of tools and a library for cataloguing files. Anything that is a file can be catalogued along with metadata. The metadata is then searchable through various querying facilities. It runs entirely in userland and was inspired by BFS, which indexes only metadata and not file contents.
Apophenia is an open statistical library. It provides functions on the same level as those of the typical stats package (such as OLS, probit, or singular value decomposition) but doesn't tie the user to an ad hoc language or environment. The core functions are written in C, but a Python interface is included. It supports SQLite and MySQL databases, which greatly raises the upper limit on the size of manageable data sets. Since the library is significantly faster than most stats packages, computationally intensive procedures like maximum likelihood estimation and Monte Carlo routines are practical to implement.
JSON::XS implements JSON for Perl. Unlike other modules, its primary goal is to encode to syntactically correct JSON and to flag invalid JSON while decoding. It ensures round-trip integrity of data types while being intuitive to use. Currently the fastest JSON encoder available for Perl, it supports a variety of format options, such as single-line, ASCII-only, or pretty-printed output, and can be tuned for speed or memory usage. It comes with a wealth of documentation describing usage and implementation details.
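The ideas of output format options, round-trip integrity, and strict decoding can be illustrated with a short sketch. Note that this uses Python's standard `json` module as a stand-in, not JSON::XS or its Perl API; the sample record and the `is_valid_json` helper are made up for the example.

```python
import json

# Sample data with a non-ASCII character and mixed value types.
record = {"name": "caf\u00e9", "sizes": [1, 2.5], "ok": True, "none": None}

# Single-line output: no whitespace between tokens, non-ASCII kept as-is.
single_line = json.dumps(record, separators=(",", ":"), ensure_ascii=False)

# ASCII-only output: non-ASCII characters become \uXXXX escapes.
ascii_only = json.dumps(record, ensure_ascii=True)

# Pretty-printed output: indented, one member per line.
pretty = json.dumps(record, indent=2, ensure_ascii=False)

def is_valid_json(text):
    """Hypothetical helper: report whether text parses as JSON."""
    try:
        json.loads(text)
    except json.JSONDecodeError:
        return False
    return True

# Round trip: decoding any of the encodings yields the original values,
# with integers, floats, booleans, and null keeping their types.
decoded = json.loads(single_line)
```

All three encodings represent the same data; the options change only the textual form, so decoding each of them gives back an equal structure.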