RSS 6 projects tagged "Indexing"

Download Website Updated 15 Mar 2005 Ellogon

Screenshot
Pop 65.50
Vit 1.83

Ellogon is a multi-lingual, cross-platform, general-purpose language engineering environment, developed in order to aid both researchers who are doing research in computational linguistics, as well as companies who produce and deliver language engineering systems. As a language engineering platform, it offers an extensive set of facilities, including tools for processing and visualising textual/HTML/XML data and associated linguistic information, support for lexical resources (like creating and embedding lexicons), tools for creating annotated corpora, accessing databases, comparing annotated data, or transforming linguistic information into vectors for use with various machine learning algorithms.

Download Website Updated 17 Jan 2007 ddc-concordance

Screenshot
Pop 36.88
Vit 2.31

ddc-concordance is a search engine for linguists. It lets you search for words or sequences of words together with morphological patterns. It was created to help linguists find a particular collocation or word in a given context.

Download Website Updated 22 Mar 2006 hyperjournal

Screenshot
Pop 48.58
Vit 2.14

hyperjournal facilitates the administration of academic journals on the Web. It is designed according to an intuitive and elegant layout and permits the installation, personalization, and administration of a dedicated Web site without the need for special IT competence.

No download Website Updated 29 Nov 2005 isobel

Screenshot
Pop 27.42
Vit 1.00

Isobel is a framework to build complex information retrieval and analysis systems. Isobel can be functionally divided in two subsytems, Isobel Gatherer (the crawling and filtering subsystem) and Isobel Analyzer (the analysis subsystem). The two subsytems can also be used separately. Isobel Gatherer offers ready-to-use services like content fetching, scheduling, document format conversion, Hyperlink graph storage and analysis, content storage and indexing. A programmer may easily add new services. Isobel Analyzer uses the IBM UIMA architecture to reuse the analysis components developed for this architecture.

Download Website Updated 17 May 2007 Document clustering

Screenshot
Pop 44.54
Vit 1.01

Document clustering is a data mining suite to cluster a document set. This set of tools was implemented from a series of papers: "Clustering Web Pages Semantically using Combinatorial Topology", "Data mining using granular computing", and "A fast association rule algorithm based on bitmap and granular computing".

Download Website Updated 22 Oct 2012 Invenio

Screenshot
Pop 102.04
Vit 11.74

Invenio (formerly CDSware) is a suite of applications that provides the framework and tools for building and managing an autonomous digital library server. It complies with the Open Archives Initiative metadata harvesting protocol (OAI-PMH) and uses MARC 21 as its underlying bibliographic standard. Its flexibility and performance make it a comprehensive solution for the management of document repositories of moderate to large size.

Screenshot

Project Spotlight

Mount-gtk

A front end for udisks and mount.

Screenshot

Project Spotlight

lookbusy

A synthetic system load generator.