5 projects tagged "Bioinformatics"
PubCrawler is a script that provides an "alerting" service by scanning daily updates to the NCBI Medline (PubMed) and GenBank databases. When run daily, it keeps scientists informed of the current contents of Medline and GenBank by listing new database entries that match their research interests.
SENTENSA Knowledge Miner is a platform-independent tool for searching any text. SENTENSA uses robust methods of indexing and searching text, drawing on more than 20 years of experience in information retrieval. SENTENSA products offer advanced text retrieval for large databases, aiming to make searches for key information fast and effective. You can index on one platform and query on another.
The Full-text Index Data structure library, libfid for short, is a portable software library for accessing indexed data through a simple C interface. Among other things, it implements functions for reading indexed data from files and for performing common operations such as fast string matching. Easy alphabet handling for mapping between printable and binary alphabets is integrated from the ground up. Currently, the enhanced suffix array is the only full-text index data structure supported. A very simple program for constructing enhanced suffix arrays is included.
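To illustrate the two ideas named above, alphabet mapping and string matching over a suffix array, here is a minimal Python sketch. It is not libfid's C interface: the names `DNA_ALPHABET`, `encode`, `build_suffix_array`, and `find_occurrences` are hypothetical, and the suffix array is built by naive sorting purely for readability.

```python
from typing import List

# Assumed printable alphabet for this illustration (not taken from libfid).
DNA_ALPHABET = "ACGT"


def encode(text: str, alphabet: str = DNA_ALPHABET) -> List[int]:
    """Map printable symbols to small integer codes (a 'binary' alphabet)."""
    code = {ch: i for i, ch in enumerate(alphabet)}
    return [code[ch] for ch in text]


def build_suffix_array(seq: List[int]) -> List[int]:
    """Naive construction: sort suffix start positions lexicographically."""
    return sorted(range(len(seq)), key=lambda i: seq[i:])


def find_occurrences(seq: List[int], sa: List[int], pattern: List[int]) -> List[int]:
    """Binary-search the suffix array for all suffixes starting with pattern."""
    m = len(pattern)

    # Lower bound: first suffix whose length-m prefix is >= pattern.
    lo, hi = 0, len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if seq[sa[mid]:sa[mid] + m] < pattern:
            lo = mid + 1
        else:
            hi = mid
    start = lo

    # Upper bound: first suffix whose length-m prefix is > pattern.
    hi = len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if seq[sa[mid]:sa[mid] + m] <= pattern:
            lo = mid + 1
        else:
            hi = mid

    # Suffix-array order is lexicographic; return text positions in order.
    return sorted(sa[start:lo])


if __name__ == "__main__":
    text = encode("ACGTACGA")
    sa = build_suffix_array(text)
    print(find_occurrences(text, sa, encode("ACG")))
```

Each lookup costs O(m log n) symbol comparisons for a pattern of length m in a text of length n, which is why a prebuilt index pays off when the same text is queried many times.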
mkESA is a program for constructing enhanced suffix arrays (ESAs) from biological sequence data. The program is based on an implementation of Manzini's lightweight Deep-Shallow algorithm and can use multiple CPUs/cores for extra performance. The generated output is compatible with the output of mkvtree from the Vmatch package.
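For readers unfamiliar with the data structure, the following Python sketch shows the two core tables of an enhanced suffix array: the suffix array itself and the longest-common-prefix (LCP) table. It does not reproduce mkESA's Deep-Shallow implementation or its mkvtree-compatible file output. The suffix array is built by naive sorting, the LCP table uses Kasai's linear-time algorithm, and the function names `suffix_array` and `lcp_table` are hypothetical.

```python
from typing import List


def suffix_array(text: str) -> List[int]:
    """Naive suffix sorting (a stand-in for a real construction algorithm)."""
    return sorted(range(len(text)), key=lambda i: text[i:])


def lcp_table(text: str, sa: List[int]) -> List[int]:
    """Kasai's algorithm: lcp[r] is the length of the longest common prefix
    of the suffixes at ranks r-1 and r in the suffix array; lcp[0] is 0."""
    n = len(text)
    rank = [0] * n
    for r, start in enumerate(sa):
        rank[start] = r

    lcp = [0] * n
    h = 0  # current match length, carried over between iterations
    for i in range(n):
        if rank[i] > 0:
            j = sa[rank[i] - 1]  # suffix that precedes suffix i in sorted order
            while i + h < n and j + h < n and text[i + h] == text[j + h]:
                h += 1
            lcp[rank[i]] = h
            if h > 0:
                h -= 1  # the next suffix shares at least h-1 symbols
        else:
            h = 0
    return lcp


if __name__ == "__main__":
    seq = "ACGACG"
    sa = suffix_array(seq)
    for r, (start, l) in enumerate(zip(sa, lcp_table(seq, sa))):
        print(r, start, l, seq[start:])
```

With the LCP table available alongside the suffix array, many suffix-tree style traversals (for example, finding repeated substrings) can be carried out on flat arrays.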