PDFTextStream is a PDF text and metadata extraction library available for Java and .NET. It supports all versions of the PDF document specification (including v1.7, used by Acrobat 8, 9, and X), extraction of text encoded using double-byte character sets (including Chinese, Japanese, and Korean), decryption of documents encrypted using 40-bit, 128-bit, 256-bit, and variable bit length ciphers, and extraction of all document metadata provided by PDF documents (including form data, bookmarks, and annotations). Easy integration with Jakarta Lucene is included, as well as interactive form update capability.
Berkeley DB XML is a native XML database engine for use within your product. Made available as a C++ library with language bindings for Java, Perl, Python, PHP, and Tcl, it integrates directly into your application (it is not a standalone database server). It provides XQuery access into a database of document containers. XML documents are stored and indexed in their native format using Berkeley DB as the transactional database engine.
K.E.T.T.L.E (Kettle ETTL Environment) is a meta- data driven ETL (Extraction, Transformation, Transportation, and Loading) tool. This means that no code has to be written to perform complex data transformations. It is possible to create plugins to do custom transformations or access proprietary data sources. Kettle supports most databases on the market, and has native support for slowly changing dimensions on most database platforms.
JavaGit is a Java API that provides access to git repositories. The goal is to bring the power of git to the Java world as an API that is intuitive for developers new to git and developers who are veteran git users. It is engineered to provide the developer with access to the raw git commands through a command API as well as an object API designed to represent the .git repository, the working tree, and other familiar git concepts. JavaGit uses the git binaries installed on the host machine to provide git functionality, and has been designed to easily accommodate additional methods of access to git repositories.
The zen Platform is a Java development environment for J2SE/J2EE. It eases and accelerates the Java application development by visual methods (RAD). Complete applications can be modelled in an intuitive fashion without the need for technical expertise. The complexity of Java and J2EE programming is tremendously hidden - reducing the learning curve for Java novices. Java experts are by no means restricted - they may apply their knowledge without limitations. Application flexibility is a core feature of the platform, which is delivered as an Eclipse plugin.
Red-Piranha is a search system that can actually learn what you are looking for. It can be used as a Web page, command line, or XML-WebService, so it will work with most languages, including Java, Perl, C#/.NET, and PHP. It includes learning abilities for the Desktop/Internet search functionality. All feedback from the user is stored in (editable) XML and RDF, and is used by the system to improve the quality of searches.
The IberAgents platform is a framework for component development in Java. It has facilities for Web service communication (SOAP), persistence to database or files, and interaction via the Web; thus providing an end-to-end solution for data management. It uses its own SOAP library, so it does not depend on Axis or Apache Soap.