Projects / jWeb1T

jWeb1T

jWeb1T is an Java tool for efficiently searching n-gram data in the Web 1T 5-gram corpus format. It is based on a binary search algorithm that finds the n-grams and returns their frequency counts in logarithmic time. As the corpus is stored in many files, a simple index is used to retrieve the files containing the n-grams.

Tags
Licenses
Operating Systems
Implementation

Recent releases

  •  14 Feb 2012 23:39

    Release Notes: Cleaned up code, some bugfixes, and performance improvements.

    Screenshot

    Project Spotlight

    OpenStack4j

    A Fluent OpenStack client API for Java.

    Screenshot

    Project Spotlight

    TurnKey TWiki Appliance

    A TWiki appliance that is easy to use and lightweight.