45 projects tagged "Indexing"

SWISH++ (updated 15 Mar 2006)

Pop 238.55
Vit 9.25

SWISH++ is a Unix-based file indexing and searching engine, typically used to index and search files on web sites. Although originally based on SWISH-E, SWISH++ is a complete rewrite that is at least 10 times faster and can handle much larger numbers of files. It also has unique features such as selective non-indexing, on-the-fly filters, and user-selectable stemming.

WebGlimpse (updated 29 Oct 2008)

Pop 173.51
Vit 11.33

WebGlimpse is a scalable, feature-rich search engine for indexing your Web site or any collection of local and remote sites you choose. Features include customizable output formats, custom ranking and ordering of hits, fuzzy matching, boolean queries, a Web administration interface for multiple archives, query logging, and result caching. Localized search interfaces are provided in multiple languages, including Spanish, German, French, Italian, Norwegian, Finnish, Russian, and Hebrew. It supports third-party filters for indexing PDF, Word, and Excel files. It is free for academic and most nonprofit users.

YASE (updated 13 Apr 2009)

Pop 80.41
Vit 3.30

YASE is a text indexing and retrieval system for indexing a document collection with little setup. All words are indexed and can optionally be stemmed. The query tool supports searching for all or any of the query terms and can rank results by relevance using the cosine measure.
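As a rough illustration of ranking by the cosine measure, here is a minimal Python sketch. It is not YASE's implementation: the function names, the whitespace tokenizer, and the use of raw term counts as vector weights (with no stemming) are all assumptions made for this example.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: lowercase and split on whitespace; no stemming.
    return text.lower().split()


def cosine(a, b):
    # Cosine of the angle between two term-count vectors (Counters).
    dot = sum(count * b[term] for term, count in a.items() if term in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank(query, docs):
    # docs maps a document name to its text (hypothetical structure).
    # Returns the names of matching documents, most similar first.
    q = Counter(tokenize(query))
    scored = [(cosine(q, Counter(tokenize(text))), name)
              for name, text in docs.items()]
    scored.sort(key=lambda pair: -pair[0])
    return [name for score, name in scored if score > 0]


docs = {
    "a": "indexing text files",
    "b": "cooking pasta recipes",
    "c": "text text retrieval",
}
print(rank("text retrieval", docs))
```

Documents that share no terms with the query score zero and are dropped. A real system would usually weight terms (for example by inverse document frequency) rather than use raw counts.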

XMLDB (updated 22 Dec 2001)

Pop 53.76
Vit 1.77

XMLDB uses an RDBMS to persist arbitrary XML documents. Due to its storage mechanism, searching for and recalling documents is extremely quick. You can also perform XSL transformations on documents with surprising speed. The library can be used in any program to store libxml2 documents. A PHP module is also included, making XMLDB a complete three-tier Web application development suite.

Radsearch (updated 16 Mar 2005)

Pop 26.61
Vit 2.06

Radsearch is a text utility that retrieves records from a text file or a list of text files, given a keyword and a delimiter. It was written specifically to allow quick retrieval of all login and logout records for a particular user in Radiusd log files.
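The keyword-and-delimiter idea can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not Radsearch's actual code or interface: the function name `find_records` is hypothetical, records are assumed to be separated by a delimiter string (a blank line by default), and the sample log is made up.

```python
def find_records(text, keyword, delimiter="\n\n"):
    # Split the text into records on the delimiter and keep the
    # records that contain the keyword (plain substring match).
    return [record.strip()
            for record in text.split(delimiter)
            if keyword in record]


# Made-up sample in the style of an accounting log.
sample_log = (
    'User-Name = "alice"\nAcct-Status-Type = Start\n\n'
    'User-Name = "bob"\nAcct-Status-Type = Start\n\n'
    'User-Name = "alice"\nAcct-Status-Type = Stop\n'
)

for record in find_records(sample_log, '"alice"'):
    print(record)
    print("--")
```

Quoting the keyword (`'"alice"'`) avoids matching longer user names that merely contain the same letters.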

X-Hive/DB (updated 28 Nov 2005; no download)

Pop 82.26
Vit 3.71

X-Hive/DB is a powerful native XML database designed for software developers who require advanced XML data processing and storage functionality within their applications. The comprehensive X-Hive/DB Java API contains methods for storing, querying, retrieving, transforming, and publishing XML data. X-Hive/DB supports all major W3C standards, such as XQuery, XPath, DOM, XPointer, XML Schemas, and more.

Emdros (updated 04 Jul 2011)

Pop 156.81
Vit 16.78

Emdros is a corpus query system for storing and searching linguistically annotated text. It is very generic, supporting almost any kind of annotation from almost any linguistic theory. All linguistic levels of analysis are supported, including phonology, morphology, the lexical level, syntax, and discourse. The core libraries act as a middleware layer between a client and an underlying SQL database. MySQL, PostgreSQL, and SQLite are supported.

Zoogle (updated 16 Apr 2002)

Pop 29.39
Vit 1.00

Zoogle is a Z39.50 gateway to the Google web index. With this gateway you can search the Google index with any Z39.50 client. It is based upon Google's official search API, the popular YAZ toolkit, and the Perl module Net::Z3950::SimpleServer.

SODA Native XML Database System (updated 19 May 2002; no download)

Pop 31.27
Vit 1.42

The SODA Native XML Database System is a native XML database that provides efficient management of large amounts of XML data. It is based on a multi-user, client-server architecture with a generic query processing layer that can easily support different query languages. In this lightweight version, user-defined indexes and query optimizations have been removed; however, full transaction support (commits and rollbacks) and crash recovery are available.

GNU libextractor (updated 23 Dec 2013)

Pop 482.37
Vit 50.55

libextractor is a library for extracting metadata from files of arbitrary type. It is designed to use helper libraries to perform the actual extraction, and to be trivially extensible by linking against external extractors for additional file types. The goal is to provide developers of file-sharing networks, file managers, and WWW-indexing bots with a universal library for obtaining metadata about files. It includes a shell command and bindings for Java (JNI) and Python.
