BioJava aims to provide a comprehensive set of Java components for the rapid development of bioinformatics applications. It contains interfaces for representing Sequences, Features, and other important bioinformatics concepts. It can also read and write sequence data in a variety of common formats and communicate with Ensembl databases and with DAS and BioCORBA servers.
Genpak is a small set of utilities designed to process DNA, RNA, and protein sequences in a very Un*x-like manner: Genpak programs can be combined using pipes and redirections and are easily incorporated into CGI scripts. The utilities include programs for calculating GC content and Tm, translating DNA/RNA sequences into protein sequences, quick sequence retrieval, random sequence generation, and finding promoter sequences using a Hertz matrix.
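As a rough illustration of the kind of calculation a GC-content and Tm utility performs, here is a minimal Python sketch. It is not Genpak's code: the function names are hypothetical, and the Tm estimate uses the simple Wallace rule (2 °C per A/T, 4 °C per G/C), which is an assumption here since the listing does not say which formula Genpak applies.

```python
from typing import Iterable, Iterator


def gc_content(seq: str) -> float:
    """Return the fraction of G and C among the A, C, G, T and U bases in seq."""
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    total = gc + seq.count("A") + seq.count("T") + seq.count("U")
    # Avoid dividing by zero for empty or non-nucleotide input.
    return gc / total if total else 0.0


def wallace_tm(seq: str) -> int:
    """Estimate Tm in degrees Celsius with the Wallace rule (short DNA oligos only)."""
    seq = seq.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


def summarize(lines: Iterable[str]) -> Iterator[str]:
    """Yield 'GC<TAB>Tm' for each sequence line, skipping blanks and FASTA headers."""
    for line in lines:
        seq = line.strip()
        if seq and not seq.startswith(">"):
            yield f"{gc_content(seq):.3f}\t{wallace_tm(seq)}"
```

Passing `sys.stdin` to `summarize` and printing each result would turn the sketch into a filter that can sit in a pipeline, in the spirit of the pipe-friendly design described above.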
Weka is a collection of machine learning algorithms for solving real-world data mining problems. It is written in Java and runs on almost any platform. The algorithms can either be applied directly to a dataset or called from your own Java code. Weka is also well-suited for developing new machine learning schemes. The development version contains a GUI with visualization tools and direct database access.
Arka provides a GUI for the gp package of command-line utilities for manipulation and display of DNA/RNA/protein sequences, the WU-BLAST and FASTA program families, and additional graphical tools for various aspects of sequence analysis (e.g., GC plots and 3D graphs). It provides editable and saveable windows for standard input, output, and error, and a dialog for any command-line program with specifications in a configuration file.
FlipDCD is a small utility for reversing the endianness of binary DCD trajectory files from CHARMM and NAMD. This is useful when simulations run on one architecture and the results are visualized or analyzed on another. FixDCD is a tiny utility that modifies the header of an X-PLOR DCD file so that programs expecting CHARMM DCD files can read it, at the cost of losing the timestep size value stored in the header.
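The idea behind an endianness flip can be sketched in a few lines of Python. This is not FlipDCD's implementation. The function names are hypothetical, and the sketch assumes the common DCD layout in which the file opens with a 4-byte record-length marker whose value is 84. A complete tool would also have to walk the record structure, since not every field in a DCD file is necessarily 4 bytes wide.

```python
import struct


def detect_byte_order(first4: bytes) -> str:
    """Return '<' (little-endian) or '>' (big-endian) for a DCD file.

    Assumption: the first 4 bytes hold the integer 84, the length of the
    first header record.
    """
    if struct.unpack("<i", first4)[0] == 84:
        return "<"
    if struct.unpack(">i", first4)[0] == 84:
        return ">"
    raise ValueError("first record marker is not 84; not a recognizable DCD file")


def swap_words(data: bytes, width: int = 4) -> bytes:
    """Reverse the byte order of every fixed-width word in data."""
    if len(data) % width:
        raise ValueError("data length is not a multiple of the word width")
    return b"".join(data[i:i + width][::-1] for i in range(0, len(data), width))
```

For example, `swap_words(struct.pack("<i", 84))` yields the same bytes as `struct.pack(">i", 84)`, which is the transformation applied to each 4-byte integer or float when converting a file between architectures.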