Compass is a Java framework which makes it simple to map your Java object model into a search engine. It is built on top of the Lucene search engine. Compass features declarative mapping technology OSEM (similar to O/R database mapping), transaction management, Google-like query syntax, externalization of common metadata, and much much more.
Isobel is a framework to build complex information retrieval and analysis systems. Isobel can be functionally divided in two subsytems, Isobel Gatherer (the crawling and filtering subsystem) and Isobel Analyzer (the analysis subsystem). The two subsytems can also be used separately. Isobel Gatherer offers ready-to-use services like content fetching, scheduling, document format conversion, Hyperlink graph storage and analysis, content storage and indexing. A programmer may easily add new services. Isobel Analyzer uses the IBM UIMA architecture to reuse the analysis components developed for this architecture.
Nutch is highly scalable Web searching software which builds on top of Apache Hadoop and Lucene Java. Key features include a Web crawler, indexer, crawl management tools, parsers for HTML, PDF, DOC, and several other document formats, and an expandable architecture that allows you to plug in additional functionality such as document parsers, custom scoring algorithms, custom content parsers, protocols, and more.
Solr is an enterprise search platform from the Apache Lucene project. Its major features include powerful full-text search, hit highlighting, faceted search, dynamic clustering, database integration, and rich document (e.g. Word and PDF) handling. Solr is highly scalable, providing distributed search and index replication, and it powers the search and navigation features of many of the world's largest internet sites. Solr is written in Java and runs as a standalone full-text search server within a servlet container such as Tomcat. Solr uses the Lucene Java search library at its core for full-text indexing and search, and has REST-like HTTP/XML and JSON APIs that make it easy to use from virtually any programming language. Solr's powerful external configuration allows it to be tailored to almost any type of application without Java coding, and it has an extensive plugin architecture when more advanced customization is required.
Ojax provides an OAI-PMH harvester to harvest metadata from multiple repositories and builds a single searchable Lucene index of this data. OJAX provides a dynamic user experience enabled by Ajax and full OpenSearch integration with the IE 7 and FireFox 2 search fields. It supports OpenSearch Discovery, OpenSearch Atom results, and OpenSearch Search Suggestions. Loosely coupled Web Services and OpenSearch Atom standard allow integration with a range of VLEs, CMSs, institutional repositories, and portals.