The 64 Studio Platform Development Kit (PDK) is a version control system for GNU/Linux distributions, allowing the creation and management of many different projects, based on Debian and Ubuntu sources. PDK is written in Python, and the source code is well commented and contains documented examples.
ACOPOST is a set of freely available POS taggers modeled after well-known techniques. The programs are written in C (aiming for extreme portability and code correctness/safety) and run under various Unix flavors (and probably even under Windows). ACOPOST currently consists of four taggers that are based on different frameworks: Maximum Entropy Tagger (MET), Trigram Tagger (T3, based on Hidden Markov Models), Error-driven Transformation-based Tagger (TBT or Brill Tagger), and Example-based tagger (ET).
Ailurus is an application which tells its users about tricks for enhancing their use of Ubuntu Linux. It puts tricks in tool-tip text and a "Tip of the day" window. It also displays information about the system's BIOS, motherboard, CPU, and battery. It has an interface for changing some GNOME settings. It can install and remove some applications which are not provided in the official Ubuntu apt repository. It can detect the speed of apt mirrors and find the fastest one. It can enable and disable some third party repositories.
Airs is a tool that can periodically check for new TV episodes online on a few of the popular Web sites. It will present to you if and when there are new episodes available. It supports lifetime management for the downloaded episode information where you can track which episodes you've already seen, which you are still retrieving, and which ones you still have to find. With the integrated Web server you can point and click on the episodes and watch them in your favorite player.
Algorithm Study provides tools and resources to augment the traditional study of algorithms. It includes implementations of common and less-common algorithms in a variety of languages and visualization tools to help in gaining a deeper understanding of the algorithms. The algorithm implementations are each accompanied by a discussion of the asymptotic ("big O") run time and memory limits of the algorithm. Some implementations include discussion of how the algorithm or data structure is commonly used and comparisons with similar algorithms or data structures. All implementations have test cases that exercise their functionality. The visualization tool, Algorithm Visualizer, displays what happens as various algorithms do their work.
Algraeph is a tool for manual alignment of linguistic graphs, such as phrase structure trees or dependency structures, where each node corresponds to a subsequence of the analyzed input sentence. It allows you to express the similarity between two graphs by aligning their nodes and attaching relation labels to these alignments. Graphs are read from one or more graphbanks (or treebanks) in the GraphML or Alpino formats. Alignment relations are user-defined and are stored in a simple XML format, which can be used for further processing. The resulting parallel graph corpus is a useful data set for many tasks in computational linguistics and natural language processing.