Xapian is a search engine library, scalable to collections containing hundreds of millions of documents. It's written in C++ with bindings for Perl, Python, PHP, Java, Tcl, C#, Ruby, and Lua. It is a highly adaptable toolkit that allows developers to easily add advanced indexing and search facilities to their own applications. It supports the Probabilistic Information Retrieval model and also a rich set of boolean query operators. Omega is a Web search application built upon the Xapian library. It can index a Web server's document tree (including HTML, PDF, OpenOffice, MS Word/Excel/PowerPoint/Works, WordPerfect, RTF, PS, etc.), or data exported from arbitrary sources (e.g. SQL databases).
Berkeley DB XML is a native XML database engine for use within your product. Made available as a C++ library with language bindings for Java, Perl, Python, PHP, and Tcl, it integrates directly into your application (it is not a standalone database server). It provides XQuery access into a database of document containers. XML documents are stored and indexed in their native format using Berkeley DB as the transactional database engine.
Astro::Aladin is an object-oriented interface to the resources provided by the CDS Aladin project. The Aladin Java application is driven in script mode through an anonymous pipe, while the module provides high-level access to resources and hides the mechanics of the data access.
ACDK is a development framework with goals similar to those of Microsoft's .NET or Sun's ONE platform, but it uses C++ as its core implementation language. It implements the standard library packages, including acdk::lang, acdk::lang::reflect, acdk::util, acdk::io, acdk::text (including regexpr), acdk::net, acdk::sql, acdk::xml, and more. Flexible allocator/garbage collection, threading, and Unicode are implemented in the core of ACDK. Extensions make C++ objects available for reflection, serialization, aspect-oriented class attributes, and [D]ynamic [M]ethod [I]nvocation. This DMI acts as a universal object-oriented call interface to connect C++ with scripting languages (Java, Perl, Tcl, Python, Lisp, Visual Basic, and VBScript) and standard component technologies (CORBA and COM).
OpenIsis provides a library and tools to access CDS/ISIS databases, which are mostly used for bibliographic data (ISO2709, Z39.50) but are also well suited to other catalogues thanks to their very flexible indexing mechanism. It is based on a C library, with bindings for Perl and Java (Tcl to come).
A Life Science Identifier (LSID) Resolver is a software system that implements an agreed LSID resolution protocol, allowing higher-level software to locate and access the data uniquely named by any LSID URN. This project is the first implementation of a Web service-based LSID URN resolution protocol, and includes Java J2EE, Perl, and Win32 COM/C++ client and server stacks, example programs, and tutorials on how to establish an LSID resolver for any organization.
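As a rough illustration of what "uniquely named by an LSID URN" means, the sketch below splits an LSID into its parts, assuming the commonly documented layout urn:lsid:authority:namespace:object with an optional trailing revision. It is not taken from this project's client stacks: the names `Lsid` and `parse_lsid` and the identifier under example.org are hypothetical, and a real resolver would go on to look up the authority and fetch data, which is not shown here.

```python
from typing import NamedTuple, Optional


class Lsid(NamedTuple):
    """Components of an LSID URN (hypothetical helper type)."""
    authority: str
    namespace: str
    object_id: str
    revision: Optional[str]


def parse_lsid(urn: str) -> Lsid:
    """Split 'urn:lsid:authority:namespace:object[:revision]' into parts.

    Assumes the colon-separated layout described in the lead-in; raises
    ValueError for anything that does not match it.
    """
    parts = urn.split(":")
    if len(parts) not in (5, 6):
        raise ValueError(f"expected 5 or 6 colon-separated fields: {urn!r}")
    # The 'urn' and 'lsid' prefixes are compared case-insensitively.
    if parts[0].lower() != "urn" or parts[1].lower() != "lsid":
        raise ValueError(f"not an LSID URN: {urn!r}")
    authority, namespace, object_id = parts[2:5]
    if not (authority and namespace and object_id):
        raise ValueError(f"empty authority, namespace, or object: {urn!r}")
    # Treat a missing or empty sixth field as "no revision".
    revision = parts[5] if len(parts) == 6 and parts[5] else None
    return Lsid(authority, namespace, object_id, revision)


if __name__ == "__main__":
    # Hypothetical identifier used only for illustration.
    print(parse_lsid("urn:lsid:example.org:specimens:12345:2"))
```

The authority field is the part a resolver would use to find the service responsible for the identifier; the remaining fields name the object within that authority.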