Terrier is software for the rapid development of Web, intranet, and desktop search engines. More generally, it is a modular platform for building large-scale information retrieval applications, providing indexing and probabilistic retrieval functionalities. It comes with a desktop search application.
Barcode Writer in Pure Postscript implements the printing of many barcode formats entirely within PostScript, so that the process of converting the input string into the printed output is performed by the printer or print system. The project supports all major barcode formats including: EAN-13 (JAN-13), EAN-8 (JAN-8), UPC-A, UPC-E, EAN-5 & EAN-2 (EAN/UPC add-ons), ISBN (including legacy ISBN), ISMN (including legacy ISMN), ISSN, Code 128 (A, B & C), GS1-128, SSCC-18 (EAN-18, NVE), EAN-14, Code 39, Code 39 Extended, Code 93, Code 93 Extended, Code 32 (Italian Pharmacode), Pharmazentralnummer (PZN), Interleaved 2 of 5, ITF-14 (UPC SCS), GS1 DataBar (Omnidirectional, Stacked, Stacked Omnidirectional, Limited, Expanded, Expanded Stacked), Code 2 of 5 (Industrial, IATA, Matrix, Datalogic & COOP), Code 11 (USD-8), BC412, Codabar (NW-7), Pharmacode (including two-track), MSI, Plessey, Telepen, Channel Code, PosiCode, PDF417, Data Matrix (ECC200), QR Code (including Micro QR Code), and more.
Tulip is an information visualization framework dedicated to the analysis and visualization of relational data. Tulip aims to provide the developer with a complete library, supporting the design of interactive information visualization applications for relational data that can be tailored to the problems being faced. Written in C++, the framework enables the development of algorithms, visual encodings, interaction techniques, data models, and domain-specific visualizations. One of the goals of Tulip is to facilitate the reuse of components, and it allows developers to focus on programming their application. This development pipeline makes the framework efficient for research prototyping as well as the development of end-user applications. The framework also provides a complete software for visual analysis of relational data having attributes.
With MetaModel, you use a type-safe SQL-like API for querying any datastore. It is a data access framework providing a common interface for exploration and querying of different types of datastores. It isn't a data mapping framework. Instead, it emphasizes abstraction of metadata and the ability to add data sources at runtime, making MetaModel great for generic data processing applications, but less so for applications modeled around a particular domain.
DotNetWikiBot Framework is a full-featured client API with a console interface that allows you to build programs and Web robots easily to manage information on MediaWiki-powered sites. DotNetWikiBot Framework is intended to help with many complicated and routine tasks of wiki site development and maintenance. Any .NET language can be used to access DotNetWikiBot library functions. Only minimal programming skills are required to make bots with DotNetWikiBot Framework.
Foswiki is wiki software, supporting the editing of Web pages in an ordinary Web browser by end users. What makes Foswiki special is that it supports the embedding of active and passive macros that enhance the page content (e.g. with global or dynamic information) and allow end-users to build applications that store and process data in a structured manner.
StelsCSV is a JDBC driver that allows performing SQL statements and other JDBC operations upon text files (comma separated, tab-separated, fixed length, etc.). Using this driver, users can easily create a simple database consisting of plain text files. The driver can be used for writing data importing programs and migration tools. It supports most keywords of ANSI SQL92, table joins, INSERT, UPDATE, and DELETE statements, data types, aggregate, converting, string, and user-defined SQL functions.
Config-Model provides a framework for editing and validating the content of any configuration file or data. With a configuration model (expressed in a data structure), Config-Model provides a user interface and a tool to validate configuration. An optional graphical (Perl/Tk) or curses interface can be used to edit configuration data that will be validated according to the user-provided model. Config-Model includes a model example for fstab and a small fstab demo.
PDFTextStream is a PDF text and metadata extraction library available for Java and .NET. It supports all versions of the PDF document specification (including v1.7, used by Acrobat 8, 9, and X), extraction of text encoded using double-byte character sets (including Chinese, Japanese, and Korean), decryption of documents encrypted using 40-bit, 128-bit, 256-bit, and variable bit length ciphers, and extraction of all document metadata provided by PDF documents (including form data, bookmarks, and annotations). Easy integration with Jakarta Lucene is included, as well as interactive form update capability.
Xapian is a search engine library, scalable to collections containing hundreds of millions of documents. It's written in C++ with bindings for Perl, Python, PHP, Java, Tcl, C#, Ruby, and Lua. It is a highly adaptable toolkit that allows developers to easily add advanced indexing and search facilities to their own applications. It supports the Probabilistic Information Retrieval model and also a rich set of boolean query operators. Omega is a Web search application built upon the Xapian library. It can index a Web server's document tree (including HTML, PDF, OpenOffice, MS Word/Excel/Powerpoint/Works, WordPerfect, RTF, PS, etc.), or data exported from arbitrary sources (e.g. SQL databases).