Xapian is a search engine library, scalable to collections containing hundreds of millions of documents. It's written in C++ with bindings for Perl, Python, PHP, Java, Tcl, C#, Ruby, and Lua. It is a highly adaptable toolkit that allows developers to easily add advanced indexing and search facilities to their own applications. It supports the Probabilistic Information Retrieval model and also a rich set of boolean query operators. Omega is a Web search application built upon the Xapian library. It can index a Web server's document tree (including HTML, PDF, OpenOffice, MS Word/Excel/PowerPoint/Works, WordPerfect, RTF, PS, etc.), or data exported from arbitrary sources (e.g. SQL databases).
ExactScan is a versatile document capture application for home offices and workgroups. It is designed from the ground up for high-speed document scanners and can easily handle hundreds of images per minute, including duplex scans. Its functionality ranges from managing, sorting, and editing single pages to writing multi-page and single-page PDF files (with JPEG compression) as well as TIFF, JPEG, JPEG2000, and PNG bitmap files. ExactScan performs state-of-the-art image processing, including automatic cropping, deskewing, dynamic thresholding for perfect black-and-white documents, and descreening of print rasters.
Emdros is a corpus query system for storing and searching linguistically annotated text. It is very generic, supporting almost any kind of annotation from almost any linguistic theory. All linguistic levels of analysis are supported, including phonology, morphology, the lexical level, syntax, and discourse. The core libraries act as a middleware layer between a client and an underlying SQL database. MySQL, PostgreSQL, and SQLite are supported.
nexB OpenAssets is a tool for inventorying, managing, and monitoring applications, software, hardware, networks, and generally any IT asset. It is designed so that system administrators, IT, and finance can determine what they have, how it is configured, what it is used for, and how much it is being used, so that informed decisions can be made. It complements existing network management software, integrates with a growing number of protocols and tools, and features no-agent discovery and inventory, configuration management including dependencies and correlation, monitoring, and reporting. It makes extensive and innovative use of XML, XPath, and XQuery.
KTML is a browser-based WYSIWYG HTML editor that facilitates easy content formatting using its advanced table and image manipulation features. Unlike other online editors, it supports most platforms and configurations, providing users with advanced text and paragraph formatting capabilities as well as CSS support and easy HTML tag selection. It also features advanced image manipulation, an effective spell checking module, and a Server Behavior for Dreamweaver MX.
edit-on Pro is a cross-platform, in-browser, WYSIWYG editor Java applet which enables XHTML content authoring with XML markup. The editor is compact, powerful and requires no special libraries or client plugins. It includes CSS support, table editing, a spelling checker, multi-language support, and features an API that allows full customisation and seamless integration into Web-based applications. It perfectly complements content management systems, e-learning, knowledge management systems, and CRM. A free trial version is available for download and includes complete developer samples and a comprehensive integration manual.
QOF (Query Object Framework) provides a set of C language utilities for performing generic, structured, complex queries on data held by a set of C/C++ objects. This framework is unique in that it does not require SQL or any database at all to perform the query. Thus, it allows programmers to add query support to their applications without having to hook into an SQL database.