cpdetector is a small yet clever framework for codepage detection that integrates different strategies. It may be used as a library for third party software that accesses textual data over network. It also includes a best-practice implementation in form of a command line tool that allows sorting and transforming large collections of documents based on their codepage. Available strategies include: jchardet (exclusion, frequency analysis, and guessing), detection of the HTML charset property, and detection of the XML encoding declaration.
Terrier is software for the rapid development of Web, intranet, and desktop search engines. More generally, it is a modular platform for building large-scale information retrieval applications, providing indexing and probabilistic retrieval functionalities. It comes with a desktop search application.
SlimSearch is a quick and easy search extension for Firefox. It allows you to perform searches on text selected with the mouse by using the contextual (right-click) menu. You can do normal searches, dictionary searches, address searches, Froogle, and many other kinds. Google is extensively used for most of the searches, but other specialized search engines are used too. The focus is on simplicity of utilization: less is more.