Grok


Grok is a library of Java components for natural language processing tasks. It includes several preprocessing components, chart parsing, a large categorial grammar for English (induced from the Penn Treebank), and some knowledge representation components (basic coreference, salience tracking, etc.). Several of the components implement interfaces from the Quipu OpenNLP API. A companion kit provides a GUI for the components.


Recent releases

  •  05 Dec 2001 16:39

    Release Notes: A whole bunch of stuff was cut out of this release. The preprocessing components now use the new XML structure of NLPDocuments appropriately.

  •  26 Nov 2001 19:10

    Release Notes: Improvements to the Grokling frontend, resurrection of parsing, updates for changes to the unification architecture of OpenNLP, use of XML in the preprocessing components that conforms with JDOM, many improvements to the sentence detection code, and new samples in the source tree.

  •  21 Nov 2001 11:27

    Release Notes: Fixed a problem in the build.xml file that corrupted the maxent models when the Grok jar was created. The infrastructure for creating and releasing the Grok homepage was improved.

  •  16 Oct 2001 17:05

    Release Notes: Improvements were made to the sentence detection code. All maxent models were converted to the new single-file format. The part-of-speech tagging model is trained on much more data and uses a number of word stems to avoid some errors. Grok was updated for changes in the OpenNLP API. Grok now uses Ant to build. The directory structure has been changed considerably to facilitate the creation of distribution files and to ensure that source code and supporting libraries are provided with distributions. Note: The lexicon system is broken in this version, so it cannot be used to parse.

  •  07 Jun 2000 02:34

    Release Notes: Initial release.

