The Informa RSS Library provides a convenient Java API for handling news channels and metadata about them. Different syntax formats (like RSS 0.91, 1.0 [RDF], 2.0, and Atom 0.3) for feeds are supported. There is also basic support for channel information descriptions (OPML). A full-text engine (Lucene) can be used for indexing and searching the news items. Two backends for storing the data are currently provided: In-Memory and Hibernate (which allows you to persist news items into allmost any JDBC compliant database).
TMAPI is a programming interface for accessing and manipulating data held in a topic map. The TMAPI specification defines a set of core interfaces which must be implemented by a compliant application, as well as a set of additional interfaces which may be implemented by a compliant application or which may be built upon the core interfaces. The goal is to define a common programming interface for all topic map processors, enabling portable code and a shorter learning curve for developers moving between topic map processers.
Doodle is a desktop search engine for Linux. It searches your hard drive for files using pattern matching on meta-data. It extracts file-format specific meta-data using libextractor and builds a suffix tree to index the files. The index can then be searched rapidly. It is similar to locate, but can take advantage of information such as ID3 tags. It is possible to do full-text indexing using the appropriate libextractor plugins. It also supports using FAM to keep the database up-to-date.
QML (Quantity Modeling Language) is a "thing"-based language for scientific and mathematical data modeling. Each "thing" is a quantity which may be associated with either a structure or physical phenomena. Quantities, in turn, may hold other Quantities or values (numbers or strings). Higher-level data models, which associate or define meanings to various quantities (such as velocity or position), can be built from QML quantities. The higher-level data model (XML) schema that inherits from QML may be understood, and its instance documents may be parsed into QML documents and objects by the QMLReader.
Red-Piranha is a search system that can actually learn what you are looking for. It can be used as a Web page, command line, or XML-WebService, so it will work with most languages, including Java, Perl, C#/.NET, and PHP. It includes learning abilities for the Desktop/Internet search functionality. All feedback from the user is stored in (editable) XML and RDF, and is used by the system to improve the quality of searches.
K.E.T.T.L.E (Kettle ETTL Environment) is a meta- data driven ETL (Extraction, Transformation, Transportation, and Loading) tool. This means that no code has to be written to perform complex data transformations. It is possible to create plugins to do custom transformations or access proprietary data sources. Kettle supports most databases on the market, and has native support for slowly changing dimensions on most database platforms.
[fleXive] is a Java EE 5 content repository aiming to support upcoming industry standards like CMIS. It strives to provide a holistic approach by offering a comprehensive set of tools and building blocks for building content-centric Web applications around a [fleXive] content repository. It speeds up development by easing many tedious and repetitive programming tasks and helping to keep your application(s) flexible during the development cycle and in production. It concentrates on enterprise-scale content modeling, storage, and retrieval, and includes comprehensive JSF support for displaying and manipulating these contents in (Web) applications. Key features include persistence, security, versioning, multi-language support, and scripting.