eXist is a native XML database featuring efficient, index-based XPath query processing, extensions for keyword search, XUpdate support, and tight integration with existing XML development tools. The database is lightweight and may be easily deployed in a number of ways, running either as a stand-alone server process, inside a servlet engine, or directly embedded into an application.
Load is a utility for Web application and SOAP-based Web services performance and scalability testing. It features a scripting language and a library of test objects for the creation of intelligent agents that drive the Web application or SOAP-based Web service. Running hundreds of agents concurrently shows how your software performs in production environments. Load is a Java application that runs on Linux, Solaris, NT, Win2000, and Macintosh. While Load continues to be maintained, the next generation is the TestMaker program.
Local Domain Search Engine is a distributed search engine. Nodes index local documents, and multiple nodes can be clustered to bigger nodes. There is a plugin mechanism to support new file formats (currently text, HTML, PDF, Zip, and GZip are supported). Files can be gathered via HTTP and from file systems, but this is extensible. Retrieval is done with a fault tolerant algorithm, so misspelled words can be found. Querying is possible with an applet, an application, a servlet, or via an XML interface.
XSet is a performance oriented main memory XML database and query engine. XSet is meant to be a low level application component that enables applications to have high speed access to stored metadata in the form of XML documents. In order to achieve the highest possible performance, XSet does not support the notion of transactions. Compared to its industry and research counterparts, XSet is miniscule. It uses a simple subset query model which is surprisingly complete for its current set of target applications. Examples of XSet applications include metadata file systems, searchable email clients, user preference registries, service discovery services, and online customized content portals. Performance evaluations have shown XSet to scale logarithmically to dataset size, and can handle hundreds of request per second on off the shelf desktop machines.
WebSuck goes through a Web page, following links and making a list of the datafiles encountered along the way. It is useful for such tasks as downloading large image galleries without clicking all the links yourself. It can output a file list in a format appropriate for wget, and another for GetRight. It can be used either via a Swing GUI or in console mode.
SERanks is a Java application that obtains the rankings of search expressions from search engines. It allows you to specify the search engines that are of relevance to you, offers lots of options for configuring the output and for governing the ranking process, and allows you to export the results to spreadsheet applications.
HtmlRipper is a Java package that enables dynamic data to be extracted from Web pages, using pre-defined rule sets. It allows multiple data sets to be combined into a single dynamic web page, and is ideal for the creation of data mining, page analysis, Web page filtering, and article clipping software. The package includes a sample rules-enabled browser and rules editor.
Historical Event Markup and Linking Project (Heml) provides an XML schema for historical events and a Java Web app which transforms conforming documents into hyperlinked timelines, maps and tables. It aims to provide a most information-rich interchange format for historical data, and thus add a historical component to the growing movement for a 'Semantic Web.'