GRASS (the Geographic Resources Analysis Support System) is a software raster- and vector-based GIS (Geographic Information System), image processing system, graphics production system, and spatial modeling system. It contains many modules for raster data manipulation, vector data manipulation, rendering images on the monitor or paper, multispectral image geocoding and processing, point data management and general data management. It also has tools for interfacing with digitizers, scanners, and the PostgreSQL, DBF, and ODBC connected databases. GRASS operates on all common operating systems.
The libmba package is a collection of mostly independent C modules potentially useful to any project. There are the usual ADTs including a linkedlist, hashmap, pool, stack, and varray, a flexible memory allocator, CSV parser, path canonicalization routine, I18N text abstraction, configuration file module, portable semaphores, condition variables, and more. The code is designed so that individual modules can be integrated into existing codebases rather than requiring the user to commit to the entire library. The code has no typedefs, few comments, and extensive man pages and HTML documentation.
RapidMiner (formerly YALE) is a flexible Java environment for knowledge discovery in databases, machine learning, and data mining. Many nestable learning and preprocessing operators (including Weka) are provided. It features an XML-based graphical user interface, a plugin mechanism, and high-dimensional plotting, and provides an easy-to-use extension mechanism that makes it possible to integrate new operators and adapt the system to your personal requirements. A command line version is also included.
Berkeley DB XML is a native XML database engine for use within your product. Made available as a C++ library with language bindings for Java, Perl, Python, PHP, and Tcl, it integrates directly into your application (it is not a standalone database server). It provides XQuery access into a database of document containers. XML documents are stored and indexed in their native format using Berkeley DB as the transactional database engine.
The Chemistry Development Kit (CDK) is a library of Java classes for chemo-, bioinformatics, computational chemistry, and chemometrics. It provides important algorithms like substructure search, SMILES, Gasteiger charges, QSAR descriptor calculation, 3D structure generation, 2D layout and rendering, many IO formats, atom typing, and more.
Ciao is a complete Prolog system subsuming ISO-Prolog with a novel modular design which allows both restricting and extending the language. Ciao extensions currently include feature terms (records), higher-order, functions, constraints, objects, persistent predicates, a good base for distributed execution (agents), and concurrency. Libraries also support WWW programming, sockets, and external interfaces (C, Java, TCL/Tk, relational databases, etc.). An Emacs-based environment, a stand-alone compiler, and a toplevel shell are also provided.
ROOT is an OO framework for large-scale scientific data analysis and data mining. It contains an efficient hierarchical OO database, a C++ interpreter, advanced statistical analysis, visualization, introspection, documentation, networking, and GUI classes. The command/scripting language is C++, and large scripts can be compiled and dynamically linked in. Using the PROOF (Parallel ROOT Facility) extension, large databases can be analyzed in parallel. The system runs on all known POSIX platforms, Windows, and MacOS X.