SHOGUN is a machine learning toolbox whose focus is on large scale kernel methods and especially on Support Vector Machines (SVM). It provides a generic SVM object interfacing to several different SVM implementations, all making use of the same underlying, efficient kernel implementations. Apart from SVMs and regression, SHOGUN also features a number of linear methods like Linear Discriminant Analysis (LDA), Linear Programming Machine (LPM), (Kernel) Perceptrons, and algorithms to train hidden Markov models. SHOGUN can be used from within C++, Matlab, R, Octave, and Python.
pmcyg is a tool for creating customized collections of Cygwin packages together with the Cygwin installer. This enables self-contained distributions to be created without having to mirror the entire set of Cygwin packages. For example, it can create a single CD-ROM or DVD from which a personalized version of Cygwin can be installed on machines not connected to the Web. A basic installer can be created in less than 25MB; a full set of Cygwin packages might fill a DVD-ROM; or a selection of packages can be constructed to suit your chosen installation medium. pmcyg will then download these and all their dependencies automatically. pmcyg can be used either as a command-line script or via a simple graphical interface.
Freesteam is a library that allows you to calculate water and steam properties using the industry-standard IAPWS-IF97 steam tables. It is a library of C++ classes which can be integrated into your code. Plug-ins to allow freesteam to be integrated with spreadsheets and other applications are under development. The included unit tests assure compliance with published property values. IAPWS-95 (scientific use) correlations are also included and used as part of this validation.
Ecasound is a software package designed for multitrack audio processing. It can be used for simple tasks like audio playback, recording, and format conversions, as well as for multitrack effect processing, mixing, recording and signal recycling. It supports a wide range of audio inputs, outputs, and effect algorithms. Effects and audio objects can be combined in various ways, and their parameters can be controlled by operator objects like oscillators and MIDI-CCs. A versatile console-mode user interface is included in the package.
GetData is a library that provides an API to interface with dirfile databases. The dirfile database format is designed to provide a fast, scalable format for storing and reading binary, synchronously-sampled, time-ordered data. GetData was originally written for the BOOMERanG and BLAST experiments as a data format suitable for use for both quick-look and data reduction. It is now used by many other cosmological and astrophysical experiments including ACT, Planck, Spider, Keck, as well as other projects.
Urwid is a Python library for making text console applications. It has many features including fluid interface resizing, support for UTF-8 and CJK encodings, standard and custom text layout modes, simple markup for setting text attributes, and a powerful, dynamic list box that handles a mix of widget types. It is flexible, modular, and leaves the developer in control.
Backshift is a deduplicating (variable-sized, content-based blocks), compressing (xz or bz2) backup program. Full saves and incrementals are pretty indistinct other than the amount of data transmitted, somewhat like with "rsync --link-dest" but without the huge number of hardlinks. It also de-duplicates large file content at a granularity of about 2 megabytes on average; there tends to be a unique copy of each file with size less than around 2 megabytes on average.
GRASS (the Geographic Resources Analysis Support System) is a software raster- and vector-based GIS (Geographic Information System), image processing system, graphics production system, and spatial modeling system. It contains many modules for raster data manipulation, vector data manipulation, rendering images on the monitor or paper, multispectral image geocoding and processing, point data management and general data management. It also has tools for interfacing with digitizers, scanners, and the PostgreSQL, DBF, and ODBC connected databases. GRASS operates on all common operating systems.