NetCrawler is the frontend to a Web crawling system. This command line application will download all of the pages within a domain, and then parse and process all of the relative content (Images, Text, Audio, Video), saving this content within an XML document for later processing. It is definitely alpha quality, but has been used quite extensively.
EZ Reusable Objects (EZRO) is a Web application that can be used by non-technical staff to manage content as "objects." Content objects containing text, video, and audio can be shared, modified, and re-styled to appear via a traditional Web site, an on-line course, an innovative "Coach," or as a community of interest site. It is highly scalable and can be used for public Web sites, secure environments, and private intra/extranets.
GroupServer is a Web-based mailing list manager designed for large sites. It provides email interaction like a traditional mailing list manager but also supports reading, searching, and posting of messages and files via the Web. Users have forum-style profiles, and can manage their email addresses and other settings using the same Web interface. It has supports features such as Atom feeds, a basic CMS, statistics, multiple verified addresses per user, and bounce detection, and is able to be heavily customized.
doclifter helps with lifting documents with nroff markup to XML-DocBook. Lifting documents from presentation level to semantic level is hard, and a really good job requires human polishing. This tool aims to do everything that can be mechanized, and to preserve any troff-level information that might have structural implications in XML comments. TBL tables are translated into DocBook table markup, PIC into SVG, and EQN into MathML (relying on pic2svg and GNU eqn for the last two).
BlindTux is a musical quiz game. The goal is to identify the artist, title, or year of a song by choosing an answer from among 4. The maximum number of players is 4. For now, it is played with the keyboard, but soon players will be able to choose with controls they want: keyboard, gamepad, or maybe buzzer.
GetYourIDX Python Library is a library that implements the GetYourIDX API in Python. GetYourIDX provides IDX data services to real estate agents and brokers. IDX technology gives participants in multiple listing services (MLS) the essential tools needed to display each others' clients on their prospective Internet sites. Under the IDX real estate system, brokers agree to display each others listings on the Internet and are provided the option of what listings they want displayed.
The Chameleon Problem Solving Environment is based on the CPACS data format and allows third party application coupling for multidisciplinary simulation. It comprises a set of libraries and integration components for the commercial integration environment ModelCenter or the open source alternative RCE (http://www.rcenvironment.de/). It was initially developed from the German Aerospace Center (DLR).
TIXI is a fast and simple XML interface library for applications written in C, C++, Fortran, Java, and Python. Although simplified and somewhat restricted compared to a fully-fledged XML processing library, it can create documents, create and delete nodes, and add and remove element attributes. Routines for reading and writing text nodes and nodes holding integer and floating point numbers are included, along with routines that process aggregates of these simple types for the processing of geometric data, multidimensional arrays, or arrays of vectors.
QuickSummary is a plugin for Mozilla Thunderbird which parses text and highlights the lines the user selects as being the most important. For example, if the user defines three important sentences, the first appears highlighted in green, the second in yellow, and the third in red.