Pymur provides Python bindings to the C++-based Lemur Toolkit, an open-source toolkit designed to facilitate research in language modeling and information retrieval. Lemur supports a wide range of industrial and research language applications, such as ad-hoc retrieval, site search, and text mining.
Puggle is a desktop search engine that provides full-text search over files, folders, music, photos, Web pages, and other data stored locally on your computer. Puggle can create multiple indices, each with its own configuration. For example, you may keep one index for your music collection and another for your documents. Each index can be used on demand, simply by loading it. Furthermore, Puggle supports indexing of portable devices, such as USB flash drives or external hard disks. The index is stored on the device itself using relative paths, so you can search the data quickly from any computer.
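To illustrate why relative paths make an index portable, here is a minimal Python sketch. It is not Puggle's implementation or file format: the file name `portable_index.json` and the functions `build_index` and `search` are hypothetical, and the "index" is only a word-to-files map over `.txt` files. The point is that paths are stored relative to the device root and resolved against the current mount point at search time.

```python
from __future__ import annotations

import json
import tempfile
from pathlib import Path

# Hypothetical index file name, chosen for this sketch only.
INDEX_NAME = "portable_index.json"


def build_index(root: Path) -> Path:
    """Index every .txt file under root, storing paths relative to root."""
    index: dict[str, list[str]] = {}
    for path in sorted(root.rglob("*.txt")):
        # Store "docs/note.txt", never "/media/usb0/docs/note.txt".
        rel = path.relative_to(root).as_posix()
        for word in path.read_text(encoding="utf-8").lower().split():
            files = index.setdefault(word, [])
            if rel not in files:
                files.append(rel)
    index_path = root / INDEX_NAME
    index_path.write_text(json.dumps(index), encoding="utf-8")
    return index_path


def search(root: Path, word: str) -> list[Path]:
    """Look up a word and resolve hits against the current mount point."""
    index = json.loads((root / INDEX_NAME).read_text(encoding="utf-8"))
    return [root / rel for rel in index.get(word.lower(), [])]
```

Because nothing in the stored index mentions the mount point, calling `search` with a different `root` (the same device plugged into another computer) still returns valid paths without rebuilding the index.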
Jipes is a Java library that allows you to efficiently compute audio features. Possible uses for these features are general music information retrieval (MIR) applications or, more specifically, personal music software. Unlike many other digital signal processing (DSP) libraries or frameworks, Jipes is not meant for real-time processing of a single audio stream that is manipulated and eventually played back. Instead, it focuses on efficiently executing multiple processing pipelines that transform a signal into a feature or feature set. Because Jipes focuses on features rather than raw, untyped signal streams, it supports rich types where other frameworks offer only arrays of raw data. Also, through Java generics, many core interfaces and classes can be typed to whatever class suits your purpose. However, Jipes also comes with useful predefined types and support for raw arrays.
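The idea of typed pipelines that turn a signal into features can be sketched as follows. This is a Python analogy using `typing.Generic`, not the Jipes API: `AudioFrame`, `Pipeline`, `then`, `extract`, and the feature functions are all hypothetical names introduced for illustration. It shows a rich frame type in place of a bare array, pipeline steps whose input and output types are tracked, and several pipelines run over the same frame.

```python
from __future__ import annotations

import math
from dataclasses import dataclass
from typing import Callable, Generic, TypeVar

I = TypeVar("I")  # pipeline input type
O = TypeVar("O")  # pipeline output type
N = TypeVar("N")  # output type of an appended step


@dataclass(frozen=True)
class AudioFrame:
    """A typed chunk of signal: samples plus metadata a raw array would lose."""
    samples: tuple[float, ...]
    sample_rate: int


@dataclass(frozen=True)
class Pipeline(Generic[I, O]):
    """Wraps a function from I to O; `then` appends a further typed step."""
    run: Callable[[I], O]

    def then(self, step: Callable[[O], N]) -> Pipeline[I, N]:
        return Pipeline(lambda value: step(self.run(value)))


def rms(frame: AudioFrame) -> float:
    """Root mean square amplitude of the frame."""
    return math.sqrt(sum(s * s for s in frame.samples) / len(frame.samples))


def to_decibels(value: float) -> float:
    """Convert a linear amplitude to decibels relative to 1.0."""
    return 20.0 * math.log10(value) if value > 0 else float("-inf")


def zero_crossings(frame: AudioFrame) -> int:
    """Count sign changes between neighbouring samples."""
    pairs = zip(frame.samples, frame.samples[1:])
    return sum(1 for a, b in pairs if (a < 0) != (b < 0))


def extract(frame: AudioFrame, pipelines: dict[str, Pipeline]) -> dict[str, object]:
    """Run several named pipelines over the same frame."""
    return {name: pipeline.run(frame) for name, pipeline in pipelines.items()}
```

A pipeline such as `Pipeline(rms).then(to_decibels)` maps an `AudioFrame` to a `float`, and a static type checker can flag a step whose input type does not match the previous step's output. This is roughly the benefit the description attributes to generics in Jipes.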