KeyBox provides a way to manage OpenSSH v2 public keys, and can start a Web-based SSH terminal to execute commands and scripts on multiple SSH sessions simultaneously. The authorized_keys file is generated and distributed based on relationships maintained in the application. This allows for centralized management to help prevent public key sprawl. Also, composite terminals or scripts can be created so that commands can be shared across SSH sessions.
Weed-FS is a simple and highly scalable distributed file system. There are two objectives: to store billions of files, and to serve the files fast! Instead of supporting full POSIX file system semantics, it implements only a key-file mapping. Instead of managing all file metadata in a central master, it manages file volumes in the central master and lets volume servers manage files and the metadata. This relieves concurrency pressure from the central master and spreads file metadata into volume servers' memories, allowing faster file access with just one disk read operation. It is modelled on Facebook's Haystack design paper. Only 40 bytes of disk storage are required for each file's metadata, and disk reads are O(1).
uma::bson is a DOM-style C++ API for reading/writing BSON data. Unlike the MongoDB C++ API, which exposes a read-only interface with a separate interface for creating a BSON representation, this API allows reading/writing on the existing data. The API is designed primarily for serialising/deserialising BSON data to/from streams (files, socket connections, etc.).
Overthere is a Java library to manipulate files and execute processes on remote hosts, i.e. do stuff "over there". It was built for and is used in the XebiaLabs deployment automation product Deployit as a way to perform tasks on remote hosts, e.g. copy configuration files, install EAR files, or restart Web servers. Another way of looking at it is to say that Overthere gives you java.io.File and java.lang.Process as they should have been: as interfaces, created by a factory and extensible through an SPI mechanism.
RunDeck is a command automation hub that helps you automate ad-hoc and routine procedures in data center or cloud environments. It allows you to run tasks on any number of nodes from a Web-based or command-line interface. It also includes other features that make it easy to scale up your scripting efforts, including access control, workflow building, scheduling, logging, and integration with external sources for node and context data.
Rerun is a lightweight tool-building framework useful to those implementing management procedure with shell scripts. It will help you organize your implementation into well-defined modular interfaces. Collections of management modules can be archived and delivered as a single executable to facilitate team handoffs. Using the "stubbs" module, rerun will even facilitate unit tests. When users execute rerun module commands, rerun can record execution data into log files which can later be replayed.
Apache OpenNLP is a machine learning based toolkit for the processing of natural language text. It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution. These tasks are usually required to build more advanced text processing services.