Berkeley DB XML is a native XML database engine for use within your product. Made available as a C++ library with language bindings for Java, Perl, Python, PHP, and Tcl, it integrates directly into your application (it is not a standalone database server). It provides XQuery access into a database of document containers. XML documents are stored and indexed in their native format using Berkeley DB as the transactional database engine.
The IberAgents platform is a framework for component development in Java. It has facilities for Web service communication (SOAP), persistence to database or files, and interaction via the Web; thus providing an end-to-end solution for data management. It uses its own SOAP library, so it does not depend on Axis or Apache Soap.
JavaGit is a Java API that provides access to git repositories. The goal is to bring the power of git to the Java world as an API that is intuitive for developers new to git and developers who are veteran git users. It is engineered to provide the developer with access to the raw git commands through a command API as well as an object API designed to represent the .git repository, the working tree, and other familiar git concepts. JavaGit uses the git binaries installed on the host machine to provide git functionality, and has been designed to easily accommodate additional methods of access to git repositories.
K.E.T.T.L.E (Kettle ETTL Environment) is a meta- data driven ETL (Extraction, Transformation, Transportation, and Loading) tool. This means that no code has to be written to perform complex data transformations. It is possible to create plugins to do custom transformations or access proprietary data sources. Kettle supports most databases on the market, and has native support for slowly changing dimensions on most database platforms.
PDFTextStream is a PDF text and metadata extraction library available for Java and .NET. It supports all versions of the PDF document specification (including v1.7, used by Acrobat 8, 9, and X), extraction of text encoded using double-byte character sets (including Chinese, Japanese, and Korean), decryption of documents encrypted using 40-bit, 128-bit, 256-bit, and variable bit length ciphers, and extraction of all document metadata provided by PDF documents (including form data, bookmarks, and annotations). Easy integration with Jakarta Lucene is included, as well as interactive form update capability.
Red-Piranha is a search system that can actually learn what you are looking for. It can be used as a Web page, command line, or XML-WebService, so it will work with most languages, including Java, Perl, C#/.NET, and PHP. It includes learning abilities for the Desktop/Internet search functionality. All feedback from the user is stored in (editable) XML and RDF, and is used by the system to improve the quality of searches.
StelsCSV is a JDBC driver that allows performing SQL statements and other JDBC operations upon text files (comma separated, tab-separated, fixed length, etc.). Using this driver, users can easily create a simple database consisting of plain text files. The driver can be used for writing data importing programs and migration tools. It supports most keywords of ANSI SQL92, table joins, INSERT, UPDATE, and DELETE statements, data types, aggregate, converting, string, and user-defined SQL functions.