226 projects tagged "Bioinformatics"
EAsea Specification of Evolutionary Algorithms (EASEA), is a high-level language dedicated to the specification of evolutionary algorithms. The language and compiler are quite mature. EASEA compiles .ez specification files into C++ or Java object files, using existing evolutionary libraries. Supported C++ libraries currently are GALib or EO.
Genpak is a small set of utilities designated to process DNA, RNA, and protein sequences in a very Un*x-like manner. This way, Genpak programs can be combined using pipes and redirections, as well as easily incorporated into CGI scripts. The utilities include a program for calculating GC content, Tm, translating DNA/RNA sequences into protein sequences, quick sequence retrieval, random sequence generation, and finding promoter sequences using a Hertz matrix.
Arka provides a nice GUI for the gp package of command-line utilities for manipulation and display of DNA/RNA/protein sequences, the WU-BLAST and FASTA program families, and additional graphical tools for various aspects of sequence analysis (e.g., GC plots and 3D graphs). It provides editable and saveable windows for standard input, output and error, and a dialog for any command-line program with specifications in a configuration file.
Wat2ions is complementary to DelPhi, the renowned solver of the Poisson-Boltzmann equation. The electrostatic potential map pre-computed by DelPhi is used to replace a certain number of the water molecules of the bath, surrounding a solvated macromolecule, with ions. The potential on the oxygens of the water molecules is estimated using a 3D cubic spline interpolation of the pre-computed potential. The waters with the largest potential are replaced with chlorides, and the waters with the smallest potential, with sodii. No two ions are placed closer than a certain distance to each other. The coordinates of the macromolecule, the water bath (except the replaced waters), and the placed ions are output to a new PDB file.
SODIUM places the required number of sodium ions around a (simulated) system of electric charges, e.g., the atoms of a biological macromolecule (protein, DNA, protein/DNA complex). The ions are placed in the nodes of a cubic grid in which the electrostatic energy achieves the smallest values. The energy is re-computed after placement of each ion. A simple Coulombic formula is used for the energy. The coordinates of the placed ions are printed out in the PDB format for further usage. Trivial modifications to the program should allow the placement of any combination of multivalent ions of different charges.
PDBCat can be used to manipulate and process PDB files using commonly available text-processing tools such as Perl, awk, etc. The Brookhaven Protein Data Bank stores atomic coordinate information for protein structures in a column-based format which is designed to be read easily read by FORTRAN programs. PDBCat converts the ATOM and HETATM records of PDB files from this column-based format to a field-based one that is more easily processed by standard Unix tools.
Mol_Volume calculates the volume of a macromolecule by a method somewhat akin to the Monte Carlo method, namely, by measuring how many vertices of a dense regular grid happen to be within the probe radius of the molecule's atoms. The volume is then calculated as V = V_grid * N_near / N_total = N_near * V_per_node.
FlipDCD is a small utility for reversing the endianism of binary DCD trajectory files from Charmm and NAMD. This can be useful when running simulations on one architecture and visualizing or analyzing the results on another. FixDCD is a tiny utility to modify the header of an X-PLOR DCD file to make it readable by programs expecting Charmm DCD files, at the expense of a Timestep size value in the header.
MatDCD is a Matlab package for reading/writing DCD-formatted molecular dynamics trajectory files. It supports Charmm and xplor-format DCD files, it can read both big- and little-endian storage formats, and it can specify which atom indices to load without having to load the entire file.