RSS 3 projects tagged "Windows"

Download Website Updated 16 Apr 2009 CLucene

Screenshot
Pop 72.97
Vit 2.54

CLucene is a C++ port of Lucene, a high-performance, full-featured text search engine. It is, however, faster than Lucene as it is written in C++.

No download Website Updated 29 Nov 2005 isobel

Screenshot
Pop 24.00
Vit 1.00

Isobel is a framework to build complex information retrieval and analysis systems. Isobel can be functionally divided in two subsytems, Isobel Gatherer (the crawling and filtering subsystem) and Isobel Analyzer (the analysis subsystem). The two subsytems can also be used separately. Isobel Gatherer offers ready-to-use services like content fetching, scheduling, document format conversion, Hyperlink graph storage and analysis, content storage and indexing. A programmer may easily add new services. Isobel Analyzer uses the IBM UIMA architecture to reuse the analysis components developed for this architecture.

Download Website Updated 29 Mar 2009 Apache Nutch

Screenshot
Pop 49.75
Vit 2.47

Nutch is highly scalable Web searching software which builds on top of Apache Hadoop and Lucene Java. Key features include a Web crawler, indexer, crawl management tools, parsers for HTML, PDF, DOC, and several other document formats, and an expandable architecture that allows you to plug in additional functionality such as document parsers, custom scoring algorithms, custom content parsers, protocols, and more.

Screenshot

Project Spotlight

webon

A Web content management system.

Screenshot

Project Spotlight

BirdFont

A font editor which can create TTF, EOT, and SVG fonts.