RSS 35 projects tagged "Information Management"

Download Website Updated 04 Jul 2011 Emdros

Screenshot
Pop 158.09
Vit 16.82

Emdros is a corpus query system for storing and searching linguistically annotated text. It is very generic, supporting almost any kind of annotation from almost any linguistic theory. All linguistic levels of analysis are supported, including phonology, morphology, the lexical level, syntax, and discourse. The core libraries act as a middleware layer between a client and an underlying SQL database. MySQL, PostgreSQL, and SQLite are supported.

No download No website Updated 21 Feb 2005 Zoe Intertwingle

Screenshot
Pop 165.46
Vit 7.12

Zoe is a Web based email client with a built in SMTP and POP3 server and Google-like search functionality that lives on your desktop. It is written in Java and uses Lucene technology to provided instant searching and threading of your email messages.

Download Website Updated 29 Jun 2004 Comment

Screenshot
Pop 45.30
Vit 1.85

Comment is a command line directory context note taker. Notes are stored in both the local directory and each users home. It was developed as a low impact tool for retaining flyaway information that is often needed at a later date. The dual storage system provides convenient access to prior notes, and all notes are stored in plain-text format.

Download Website Updated 08 Aug 2003 Hog Bay Notebook

Screenshot
Pop 13.89
Vit 1.76

Hog Bay Notebook provides a central location for all of your miscellaneous information. It allows you to search your data and keep it well organized.

Download Website Updated 20 Jul 2003 SearchAssist

Screenshot
Pop 27.48
Vit 1.42

SearchAssist is a simple but practical search engine application that uses a ternary search tree. It uses Java's dynamic loading feature to make the search engine highly customizable, and uses takes Mozilla bookmarks as input. A Swing UI allows users to enter search words and view the results.

Download Website Updated 28 Jun 2012 Xapian and Omega

Screenshot
Pop 402.28
Vit 16.31

Xapian is a search engine library, scalable to collections containing hundreds of millions of documents. It's written in C++ with bindings for Perl, Python, PHP, Java, Tcl, C#, Ruby, and Lua. It is a highly adaptable toolkit that allows developers to easily add advanced indexing and search facilities to their own applications. It supports the Probabilistic Information Retrieval model and also a rich set of boolean query operators. Omega is a Web search application built upon the Xapian library. It can index a Web server's document tree (including HTML, PDF, OpenOffice, MS Word/Excel/Powerpoint/Works, WordPerfect, RTF, PS, etc.), or data exported from arbitrary sources (e.g. SQL databases).

Download Website Updated 18 Oct 2009 isbnsearch

Screenshot
Pop 117.89
Vit 2.98

isbnsearch provides a simple method for retrieving information about any book using only an ISBN or EAN barcode. It is intended to provide assistance for online libraries, user groups, or individual users, and is designed in such a way to provide a distributed ISBN database query system. Users can choose to view the summary information (author, title, publisher, date, edition, subject, ISBN) as HTML, XML, or a pre-formatted SQL statement.

Download Website Updated 14 Mar 2004 The Lucene Application Layer

Screenshot
Pop 27.39
Vit 1.00

LUALA is an acronym for LUcene Application LAyer. It is an intermediate level API for document indexing and searching. It uses the low-level API of Lucene.

Download Website Updated 15 Mar 2005 Ellogon

Screenshot
Pop 52.99
Vit 1.82

Ellogon is a multi-lingual, cross-platform, general-purpose language engineering environment, developed in order to aid both researchers who are doing research in computational linguistics, as well as companies who produce and deliver language engineering systems. As a language engineering platform, it offers an extensive set of facilities, including tools for processing and visualising textual/HTML/XML data and associated linguistic information, support for lexical resources (like creating and embedding lexicons), tools for creating annotated corpora, accessing databases, comparing annotated data, or transforming linguistic information into vectors for use with various machine learning algorithms.

Download No website Updated 28 Jan 2006 The Revisionist

Screenshot
Pop 42.05
Vit 1.56

The Revisionist is a tool for extracting and indexing hidden metadata (such as deleted or modified text) from large collections of MS Word files. It can operate whole Web sites or SMB or NFS directories. It is handy for pen-testing, or it can be used just to spot embarrassing secrets.

Screenshot

Project Spotlight

pstoedit

A converter from Postscript(TM) and PDF to other vector graphic formats.

Screenshot

Project Spotlight

JS-Collider

An event-driven Java network (NIO) framework.