XMLBeans is a technology for accessing XML by binding it to Java types. XMLBeans provides several ways to get at the XML. You can do this through XML schema that has been compiled to generate Java types that represent schema types. In this way, you can access instances of the schema through JavaBeans-style accessors after the fashion of "getFoo" and "setFoo". The XMLBeans API also allows you to reflect into the XML schema itself through an XML Schema Object model. There is a cursor model through which you can traverse the full XML infoset. There is support for XML DOM.
BitNami Opina Stack is an easy-to-install distribution of the Opina poll management software. It includes pre-configured, ready-to-run versions of Apache-Tomcat, MySQL, and JDK (optional), so users can get an Opina installation up and running in minutes after answering a few questions. Linux, Windows, and OS X are supported.
AppFuse is an application for "kickstarting" Web application development. It allows you to execute a single ant task to create a skeleton application based on Struts, Spring, and Hibernate running on Tomcat and MySQL. It uses Ant, XDoclet, Spring, Hibernate (or iBATIS), JUnit, jMock, StrutsTestCase, Canoo's WebTest, Struts Menu, Display Tag Library, OSCache, JSTL, and Struts. (Spring MVC, WebWork, Tapestry and JSF are also options.)
The Pegasus Workflow Management System encompasses a set of technologies which help workflow-based applications execute in a number of different environments, including desktops, campus clusters, grids, and clouds. It bridges the scientific domain and the execution environment by automatically mapping high-level workflow descriptions onto distributed resources. It automatically locates the necessary input data and computational resources necessary for workflow execution. It enables scientists to construct workflows in abstract terms without worrying about the details of the underlying execution environment or the particulars of the low-level specifications required by the middleware (Condor, Globus, or Amazon EC2). It bridges the current cyberinfrastructure by effectively coordinating multiple distributed resources.
Python Web Graph Generator is a threaded Web graph (Power law random graph) generator. It can generate a synthetic Web graph of about one million nodes in a few minutes on a desktop machine. It supports both directed and undirected graphs. It implements a threaded variant of the RMAT algorithm. A little tweak can produce graphs representing social networks or community networks. It can also output connected components in a graph.
GWiki is an embeddable Wiki engine. It allows users to edit rich text fragments in an application you develop. It supports I18N files and mulitimedia content. Gwiki can be extended with macros written in Java or Groovy. It has a powerful right and role system, which can be fed from external sources such as LDAP, and allows access to business entitiels from wiki pages. It comes with Wicket integration out-of-the-box. Wicket-HTML-Fragments can be held within GWiki, so these pages can be managed by the Wiki System, while Wicket handles the application logic. GWiki can be deployed as a servlet on every servlet container. GWiki can use a filesystem, a Zip archive, or a database for its content storage.
Nutch is highly scalable Web searching software which builds on top of Apache Hadoop and Lucene Java. Key features include a Web crawler, indexer, crawl management tools, parsers for HTML, PDF, DOC, and several other document formats, and an expandable architecture that allows you to plug in additional functionality such as document parsers, custom scoring algorithms, custom content parsers, protocols, and more.