OpenSearchServer is a stable, high-performance search engine and a suite of high-powered full text search algorithms. Documents can be indexed in sixteen languages. Multi-lingual analyzers slice sentences into words, then run lemmatisation algorithms on words based on the document's language. Numerous document formats are supported, such as XML, HTML/XHTML, PDF, Word, PowerPoint, RTF, OpenOffice, plain text, MP3/4, Ogg, FLAC, etc. The Web interface, built around the Zkoss framework, provides an easy way to manage OSS. The integration is fast using the PHP client or the API (XML over HTTP). The crawlers of OpenSearchServer go through Web sites, file systems, and databases to rapidly and easily build your index.
The KiWi core system is a flexible platform for building different kinds of semantic social software applications on top (currently the Semantic Wiki and the TagIT application). It provides all the core services required in such applications, like editing and tagging, the storage of content and associated meta-data, its own triple store, transactions and versioning over content and meta-data, a linked open data server, and many small features semantic social software developers will like (like convenience services for working with ontologies or SKOS thesauruses, etc.).
Jumper provides an enterprise bookmarking engine for tagging and linking data objects. It lets you search and share high-value data across remote locations using tag metadata (expanded tag fields) to capture knowledge about data in remote data stores. It collects these tag profiles in a knowledge base where user-created tag profiles identify quality data resources, user-contributed tag information adds real-world knowledge about the data resources, and user-created reviews sort out the worthy resources from the inadequate. Other users can search for this data. In addition, they can directly contribute what they know about this data to the knowledge base. It allows the participants to act as a filter for what is valuable and build upon mainstream pursuits, but also uncovers valuable data hidden at the edge.
SitemapGen4j is a Java library to generate XML sitemaps. It supports gzipped output, sitemap validation, and sitemap index generation. It can also generate Google-specific sitemaps, such as Mobile sitemaps, Geo sitemaps, Code Search sitemaps, Google News sitemaps, and Video sitemaps.
LogicalDOC is a Web-based document management system that is easy to use and learn. Its architecture leverages best-of-breed Java technology to achieve a powerful and flexible solution. It supports its users with a powerful search engine (Lucene), Web service interface (JAX-WS via CXF) compatible with .NET and PHP, versioning, annotation on documents, a WebDAV interface, importing and exporting from .zip files. Documents can be organized into hierarchical folders, searched using the integrated search engine, or browsed by Tag. The system is extensible thanks to the technologies used (Spring-Hibernate) and its plugin architecture.
AzTSearch is a torrent search plugin for Azureus/Vuze that allows for quick and easy location of torrent resources across the Web. It uses existing torrent trackers as the source for this information. Currently, nine popular torrent trackers/engines are supported, including The Pirate Bay, Demonoid, Mininova, BTJunkie, IsoHunt, SUMOTorrent, BitTorrentMonster, Monova, and Fenopy. New torrent trackers can be added easily within the existing framework.
DoCASU is a Rich Internet Application (RIA) for the Alfresco Enterprise Content Management platform. With DoCASU, Alfresco users have a simplified and easy to use solution solution to access, search, and manage documents. DoCASU is based on Alfresco Web Scripts and Ext JS, and showcases what can be done with state-of-the-art frontend components. It can easily be extended and adapted to meet specific needs.
Sesat is a search middleware with federation capabilities and a built-in search portal framework. It makes it easy to build applications that look for information in many different places simultaneously. It can connect to almost any kind of data source that can be accessed using Java - databases, search indexes, files, back office systems, Web services, and ESBs.