RSS 20 projects tagged "Indexing"

Download Website Updated 06 Jan 2014 HTMLDOC

Screenshot
Pop 748.74
Vit 39.67

HTMLDOC converts HTML files and Web pages into indexed HTML, PostScript, and PDF files suitable for online viewing and printing. It can be used as a standalone GUI application, in a batch document processing environment, as a Web-based report generation application, or in embedded environments to support printing of HTML content. It runs on all Unix platforms as well as Mac OS X and Windows 2000 and higher.

Download Website Updated 15 Mar 2006 SWISH++

Screenshot
Pop 239.94
Vit 9.25

SWISH++ is a Unix-based file indexing and searching engine (typically used to index and search files on web sites). It was based on SWISH-E although SWISH++ is a complete rewrite. SWISH++ is at least 10 times faster and can handle much larger numbers of files. Additionally, it has unique features such as selective non-indexing, on-the-fly filters, user-selectable stemming, and more.

Download Website Updated 16 Mar 2005 Radsearch

Screenshot
Pop 26.31
Vit 2.06

Radsearch is a text utility used to retrieve records from a text file or list of text files, given a keyword and delimiter. This utility was written with the specific purpose of allowing quick retrieval of all login and logout records for a particular user in Radiusd log files.

Download Website Updated 04 Jul 2011 Emdros

Screenshot
Pop 156.81
Vit 16.80

Emdros is a corpus query system for storing and searching linguistically annotated text. It is very generic, supporting almost any kind of annotation from almost any linguistic theory. All linguistic levels of analysis are supported, including phonology, morphology, the lexical level, syntax, and discourse. The core libraries act as a middleware layer between a client and an underlying SQL database. MySQL, PostgreSQL, and SQLite are supported.

No download Website Updated 19 May 2002 SODA Native XML Database System

Screenshot
Pop 31.08
Vit 1.42

The SODA Native XML Database System is a native XML database that provides efficient management of large amounts of XML data. It is based on a multi-user, client-server architecture with a generic query processing layer that can easily support different query languages. In this lightweight version, user- defined indexes and query optimizations have been removed, however full transaction support (commits and rollbacks) and crash recovery are available.

Download Website Updated 05 Mar 2007 QDBM: Quick DataBase Manager

Screenshot
Pop 188.98
Vit 12.02

QDBM is an embedded database library compatible with GDBM and NDBM. It features hash database and B+ tree database and is developed referring to GDBM for the purpose of the following three points: higher processing speed, smaller size of a database file, and simpler API.

Download Website Updated 28 Jun 2012 Xapian and Omega

Screenshot
Pop 403.99
Vit 16.29

Xapian is a search engine library, scalable to collections containing hundreds of millions of documents. It's written in C++ with bindings for Perl, Python, PHP, Java, Tcl, C#, Ruby, and Lua. It is a highly adaptable toolkit that allows developers to easily add advanced indexing and search facilities to their own applications. It supports the Probabilistic Information Retrieval model and also a rich set of boolean query operators. Omega is a Web search application built upon the Xapian library. It can index a Web server's document tree (including HTML, PDF, OpenOffice, MS Word/Excel/Powerpoint/Works, WordPerfect, RTF, PS, etc.), or data exported from arbitrary sources (e.g. SQL databases).

Download Website Updated 15 May 2004 StringSearch

Screenshot
Pop 61.16
Vit 1.75

The StringSearch library provides implementations of algorithms of the Boyer-Moore family and the Shift-Or (bit-parallel) family, for use in Java programs that need fast string searching algorithms.

Download Website Updated 15 Mar 2005 Ellogon

Screenshot
Pop 52.31
Vit 1.82

Ellogon is a multi-lingual, cross-platform, general-purpose language engineering environment, developed in order to aid both researchers who are doing research in computational linguistics, as well as companies who produce and deliver language engineering systems. As a language engineering platform, it offers an extensive set of facilities, including tools for processing and visualising textual/HTML/XML data and associated linguistic information, support for lexical resources (like creating and embedding lexicons), tools for creating annotated corpora, accessing databases, comparing annotated data, or transforming linguistic information into vectors for use with various machine learning algorithms.

Download Website Updated 27 Apr 2005 POPsearch

Screenshot
Pop 70.89
Vit 4.94

POPsearch is a desktop search engine that is designed to help you easily find information on your computer. With features that other search engines don't have,it lets you index your entire collection of email messages and files. As information is indexed, it is immediately available for analysis from any Web browser. When POPsearch is configured correctly, you can also access your data remotely with RSS feeds, email feeds, or from any computer that has a Web browser.

Screenshot

Project Spotlight

Qtractor

An Audio/MIDI multi-track sequencer.

Screenshot

Project Spotlight

Frams' Shell Tools

Shell tools to make Unix everyday life more comfortable.