The Pegasus Workflow Management System encompasses a set of technologies which help workflow-based applications execute in a number of different environments, including desktops, campus clusters, grids, and clouds. It bridges the scientific domain and the execution environment by automatically mapping high-level workflow descriptions onto distributed resources. It automatically locates the necessary input data and computational resources necessary for workflow execution. It enables scientists to construct workflows in abstract terms without worrying about the details of the underlying execution environment or the particulars of the low-level specifications required by the middleware (Condor, Globus, or Amazon EC2). It bridges the current cyberinfrastructure by effectively coordinating multiple distributed resources.
"TCP Input Text" extracts TCP ports and fully qualified domain names (FQDN) from search results into a .csv file and individual shell scripts for nmap and nc (a.k.a. netcat) to provide assurance of a listening TCP service since the time that has passed of the last crawl performed by the search engine.
FuzzyOcr is a plugin for SpamAssassin that can be used on image spam. It supports optical character recognition using different engines and settings, a fuzzy word matching algorithm applied to OCR results, an image hashing system to learn the unique properties of known spam images, dimension, size, and integrity checking of images, and content-type verification for the containing email message.
The Android Scripting Environment (ASE) brings scripting languages to Android by allowing you to edit and execute scripts and interactive interpreters directly on the Android device. These scripts have access to many of the APIs available to full-fledged Android applications, but with a greatly simplified interface that makes it easy to handle intents, start activities, make phone calls, send text messages, scan bar codes, poll location and sensor data, use text-to-speech, and more. Scripts can be run interactively in a terminal, started as a long running service, or started via Locale. Python, Perl, JRuby, Lua, BeanShell, and Bourne shell are currently supported.
CTL is a cross-platform control dispatching tool that makes it easy to automate any kind of distributed systems management or application provisioning task. Rather than writing complex and error-prone scripts that over utilize "for loops", CTL handles the network dispatching for you and allows you to focus on the actual management tasks you need to accomplish. In addition to being a dispatching tool, CTL comes with pre-built cross-platform utilities so you don't have to script actions like file distribution or process and port checking. You can also write your own custom utilities and share them with others in your organization. Custom utilities are defined in XML, and your scripting can be done in multiple scripting languages (Perl, Python, etc.), *nix shell, Windows batch, and/or Ant.
PDFassassin is a module for SpamAssassin that allows for the scanning of PDF files in email message attachments. Email bodies are scanned upon connection and checked for PDF attachments. Text is extracted from the PDF via pdftotext and scanned by SpamAssassin. Should the PDF contain images, the gocr program is called to extract the text content. The total spam score of the PDF is compared against the global required_score setting; if it's higher, a score equal to the one specified in pdf.cf is appended to the overall score of the email message.