RSS 18 projects tagged "Apache 2.0"

Download Website Updated 22 Jul 2012 Apache Lucene

Screenshot
Pop 223.66
Vit 15.60

Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java. It is suitable for nearly any application that requires full-text search, especially cross-platform.

Download Website Updated 29 Jan 2005 ProteomeCommons.org IO Framework

Screenshot
Pop 25.02
Vit 1.42

ProteomeCommons.org IO Framework is a proper Java framework for handling spectra and peak lists. The framework can read and write to a number of different spectra and peak list formats, and it provides a simple, intuitive Java object model for working with spectra or peak lists. All classes support two methods of handling peak list and spectrum data: in-memory or stream. The goal of this framework is to support all the popular MS and MSMS data formats, and to eliminate any time or effort involved in figuring out how to read and write peak list or spectrum files.

Download Website Updated 29 Sep 2008 JPdfUnit

Screenshot
Pop 41.42
Vit 3.44

JPdfUnit integrates PDFBox as a PDF API with the JUnit framework for the testing of PDF documents (so JPdfUnit is a high level API). Simple, ready-to-use assertions help to compare the expected data to the concrete data of the PDF document.

No download Website Updated 29 Nov 2005 isobel

Screenshot
Pop 27.06
Vit 1.00

Isobel is a framework to build complex information retrieval and analysis systems. Isobel can be functionally divided in two subsytems, Isobel Gatherer (the crawling and filtering subsystem) and Isobel Analyzer (the analysis subsystem). The two subsytems can also be used separately. Isobel Gatherer offers ready-to-use services like content fetching, scheduling, document format conversion, Hyperlink graph storage and analysis, content storage and indexing. A programmer may easily add new services. Isobel Analyzer uses the IBM UIMA architecture to reuse the analysis components developed for this architecture.

Download Website Updated 15 Jun 2012 WeOCR

Screenshot
Pop 151.58
Vit 7.43

WeOCR is a platform for Web-enabled OCR (Optical Character Reader/Recognition) systems. It enables people to use character recognition over networks. A WeOCR server receives document images from users, recognizes text in the images, and returns recognition results to the users. WeOCR does not have its own character recognition engine. Instead, it is intended to accommodate various existing character recognition engines.

No download Website Updated 22 Jul 2012 Apache Solr

Screenshot
Pop 139.21
Vit 9.30

Solr is an enterprise search platform from the Apache Lucene project. Its major features include powerful full-text search, hit highlighting, faceted search, dynamic clustering, database integration, and rich document (e.g. Word and PDF) handling. Solr is highly scalable, providing distributed search and index replication, and it powers the search and navigation features of many of the world's largest internet sites. Solr is written in Java and runs as a standalone full-text search server within a servlet container such as Tomcat. Solr uses the Lucene Java search library at its core for full-text indexing and search, and has REST-like HTTP/XML and JSON APIs that make it easy to use from virtually any programming language. Solr's powerful external configuration allows it to be tailored to almost any type of application without Java coding, and it has an extensive plugin architecture when more advanced customization is required.

No download Website Updated 20 Mar 2007 jSlovo

Screenshot
Pop 41.17
Vit 1.00

jSlovo is a fast database engine with a GUI that was designed for free dictionaries. It can create a file-based database from a text file and then be used to search it for particular words. It can scan any large number of file-based databases and the size of the databases is not limited. HTML tags can be used in the text files and for cross-references.

No download Website Updated 29 Jan 2012 jcpp

Screenshot
Pop 66.00
Vit 6.49

JCPP is a complete, compliant, standalone, pure Java implementation of the C preprocessor. It is intended to be of use to people writing C-style compilers in Java using tools like sablecc, antlr, JLex, CUP, and so forth. It has been used to successfully preprocess much of the source code of the GNU C library.

No download Website Updated 09 Sep 2007 TiTLi

Screenshot
Pop 22.72
Vit 1.00

TiTLi is a Google-like search tool for relational databases . It builds on top of Apache Lucene to provide an API and a GWT-based UI for searching multiple databases from various vendors simultaneously. It is very fast due to indexing, and the database is queried only when a record is chosen.

Download Website Updated 10 Jun 2008 SCAN

Screenshot
Pop 61.43
Vit 2.62

SCAN is a personal information retrieval framework, combining search, text analysis, tagging, and metadata functions for document collections management. SCAN is a component-based software using a number of plugins for specific features. The basic SCAN platform can be easily extended with plugins for different document formats and document location types.

Screenshot

Project Spotlight

NetIRC2

A easy-to-use .NET IRC client library.

Screenshot

Project Spotlight

CloverETL

A Java framework for building data integration and ETL applications.