BioJava aims to provide a comprehensive set of Java components for the rapid development of applications in Bioinformatics. It contains interfaces for representing Sequences, Features, and other important bioinformatics concepts. It can also read and write sequence data in a variety of common formats and communicate with Ensembl databases and with DAS and BioCorba servers.
Genpak is a small set of utilities designated to process DNA, RNA, and protein sequences in a very Un*x-like manner. This way, Genpak programs can be combined using pipes and redirections, as well as easily incorporated into CGI scripts. The utilities include a program for calculating GC content, Tm, translating DNA/RNA sequences into protein sequences, quick sequence retrieval, random sequence generation, and finding promoter sequences using a Hertz matrix.
R is a language and environment for statistical computing and graphics. It is similar to S, which was developed at Bell Laboratories by John Chambers et al. It provides a wide variety of statistical and graphical techniques (linear and nonlinear modelling, statistical tests, time series analysis, classification, clustering, etc.). R is designed as a true computer language with control-flow constructions for iteration and alternation, and it allows users to add additional functionality by defining new functions. For computationally intensive tasks, Fortran and C code can be linked and called at run time.
VMD (Visual Molecular Dynamics) is designed for the visualization and analysis of biological systems such as proteins, nucleic acids, lipid bilayer assemblies, etc. It may be used to view more general molecules, as VMD can read standard Protein Data Bank (PDB) files and display the contained structure. VMD provides a wide variety of methods for rendering and coloring a molecule: simple points and lines, CPK spheres and cylinders, licorice bonds, backbone tubes and ribbons, cartoon drawings, and others. VMD can be used to animate and analyze the trajectory of a molecular dynamics (MD) simulation. In particular, VMD can act as a graphical front end for an external MD program by displaying and animating a molecule undergoing simulation on a remote computer. VMD uses OpenGL and OpenGL Programmable Shading Language for high performance rendering of large structures.
Weka is a collection of machine learning algorithms for solving real-world data mining problems. It is written in Java and runs on almost any platform. The algorithms can either be applied directly to a dataset or called from your own Java code. Weka is also well-suited for developing new machine learning schemes. The development version contains a GUI with visualization tools and direct database access.
Arka provides a nice GUI for the gp package of command-line utilities for manipulation and display of DNA/RNA/protein sequences, the WU-BLAST and FASTA program families, and additional graphical tools for various aspects of sequence analysis (e.g., GC plots and 3D graphs). It provides editable and saveable windows for standard input, output and error, and a dialog for any command-line program with specifications in a configuration file.