Berkeley DB XML is a native XML database engine for use within your product. Made available as a C++ library with language bindings for Java, Perl, Python, PHP, and Tcl, it integrates directly into your application (it is not a standalone database server). It provides XQuery access into a database of document containers. XML documents are stored and indexed in their native format using Berkeley DB as the transactional database engine.
The IberAgents platform is a framework for component development in Java. It has facilities for Web service communication (SOAP), persistence to database or files, and interaction via the Web; thus providing an end-to-end solution for data management. It uses its own SOAP library, so it does not depend on Axis or Apache Soap.
PDFTextStream is a PDF text and metadata extraction library available for Java and .NET. It supports all versions of the PDF document specification (including v1.7, used by Acrobat 8, 9, and X), extraction of text encoded using double-byte character sets (including Chinese, Japanese, and Korean), decryption of documents encrypted using 40-bit, 128-bit, 256-bit, and variable bit length ciphers, and extraction of all document metadata provided by PDF documents (including form data, bookmarks, and annotations). Easy integration with Jakarta Lucene is included, as well as interactive form update capability.
The zen Platform is a Java development environment for J2SE/J2EE. It eases and accelerates the Java application development by visual methods (RAD). Complete applications can be modelled in an intuitive fashion without the need for technical expertise. The complexity of Java and J2EE programming is tremendously hidden - reducing the learning curve for Java novices. Java experts are by no means restricted - they may apply their knowledge without limitations. Application flexibility is a core feature of the platform, which is delivered as an Eclipse plugin.
StelsCSV is a JDBC driver that allows performing SQL statements and other JDBC operations upon text files (comma separated, tab-separated, fixed length, etc.). Using this driver, users can easily create a simple database consisting of plain text files. The driver can be used for writing data importing programs and migration tools. It supports most keywords of ANSI SQL92, table joins, INSERT, UPDATE, and DELETE statements, data types, aggregate, converting, string, and user-defined SQL functions.
Red-Piranha is a search system that can actually learn what you are looking for. It can be used as a Web page, command line, or XML-WebService, so it will work with most languages, including Java, Perl, C#/.NET, and PHP. It includes learning abilities for the Desktop/Internet search functionality. All feedback from the user is stored in (editable) XML and RDF, and is used by the system to improve the quality of searches.
Isobel is a framework to build complex information retrieval and analysis systems. Isobel can be functionally divided in two subsytems, Isobel Gatherer (the crawling and filtering subsystem) and Isobel Analyzer (the analysis subsystem). The two subsytems can also be used separately. Isobel Gatherer offers ready-to-use services like content fetching, scheduling, document format conversion, Hyperlink graph storage and analysis, content storage and indexing. A programmer may easily add new services. Isobel Analyzer uses the IBM UIMA architecture to reuse the analysis components developed for this architecture.
K.E.T.T.L.E (Kettle ETTL Environment) is a meta- data driven ETL (Extraction, Transformation, Transportation, and Loading) tool. This means that no code has to be written to perform complex data transformations. It is possible to create plugins to do custom transformations or access proprietary data sources. Kettle supports most databases on the market, and has native support for slowly changing dimensions on most database platforms.