RSS 8 projects tagged "Indexing/Search"

Download Website Updated 08 Oct 2003 LinkGrammar-WN

Screenshot
Pop 49.09
Vit 1.00

LinkGrammar-WN is a lexicon expansion for the Link Grammar Parser. The Link Grammar Parser is a syntactic parser of the English language that is capable of handling a wide variety of syntactic constructions and is considered quite robust. The LinkGrammar-WN project aims to import lexical information from WordNet in an effort to increase the size of the LGP lexicon. This project is of interest to anyone interested in NLP (natural language parsing) of English text.

Download Website Updated 05 Oct 2013 Apache Lucene

Screenshot
Pop 258.07
Vit 21.35

Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java. It is suitable for nearly any application that requires full-text search, especially cross-platform.

Download Website Updated 06 Oct 2004 Dowser

Screenshot
Pop 48.34
Vit 2.24

Dowser is a Web research and archiving tool that clusters results from search engines, associates words that appear in previous searches, and keeps a local cache of all the results you click on in a searchable database along with summaries and links to related information. It helps you to keep track of what you find, with no advertising.

No download Website Updated 29 Nov 2005 isobel

Screenshot
Pop 25.22
Vit 1.00

Isobel is a framework to build complex information retrieval and analysis systems. Isobel can be functionally divided in two subsytems, Isobel Gatherer (the crawling and filtering subsystem) and Isobel Analyzer (the analysis subsystem). The two subsytems can also be used separately. Isobel Gatherer offers ready-to-use services like content fetching, scheduling, document format conversion, Hyperlink graph storage and analysis, content storage and indexing. A programmer may easily add new services. Isobel Analyzer uses the IBM UIMA architecture to reuse the analysis components developed for this architecture.

No download Website Updated 26 Jan 2006 Verticrawl Seek Site Search

Screenshot
Pop 19.75
Vit 1.00

Verticrawl Seek Site Search is a search engine technology for making powerful, fast, and customizable search solutions. It features parsing of multiple document formats, an admin interface, compatibility with sitemaps, and a search interface for HTML, XML, and PHP.

Download Website Updated 15 Aug 2006 JBootCat

Screenshot
Pop 18.65
Vit 1.00

JBootCat is an implemention of the BootCat scripts for acquiring corpora from the Internet, which is of interest to linguists and translators. The main goal is to encapsulate the BootCat functionality within a user-friendly desktop application.

No download Website Updated 05 Oct 2013 Apache Solr

Screenshot
Pop 164.27
Vit 13.48

Solr is an enterprise search platform from the Apache Lucene project. Its major features include powerful full-text search, hit highlighting, faceted search, dynamic clustering, database integration, and rich document (e.g. Word and PDF) handling. Solr is highly scalable, providing distributed search and index replication, and it powers the search and navigation features of many of the world's largest internet sites. Solr is written in Java and runs as a standalone full-text search server within a servlet container such as Tomcat. Solr uses the Lucene Java search library at its core for full-text indexing and search, and has REST-like HTTP/XML and JSON APIs that make it easy to use from virtually any programming language. Solr's powerful external configuration allows it to be tailored to almost any type of application without Java coding, and it has an extensive plugin architecture when more advanced customization is required.

Download Website Updated 07 Oct 2011 OpenEphyra

Screenshot
Pop 48.89
Vit 2.70

OpenEphyra is a question answering (QA) system. It retrieves answers to natural language questions from the Web and other sources. OpenEphyra comes with implementations of algorithms that proved effective in Carnegie Mellon's Ephyra system, which participated in the TREC evaluations. It is platform independent and can be set up in just a few minutes. The goal of this project is to give researchers the opportunity to develop new QA techniques without worrying about the end-to-end system.

Screenshot

Project Spotlight

CoreTML framework

A tool allowing the developer to create user-configurable source code templates.

Screenshot

Project Spotlight

Alaya Webdav Server

A simple WebDAV 1.0 server.