Xtract is an extensible, scriptable data extraction framework. It is meant to be a unified framework for performing data extraction tasks, and it includes a sample module that identifies JPEG files. A common, shared framework like this makes it possible to develop an easily deployable, commonly held set of data extraction tools that exceed the capability of any closed-source alternative. The script-based nature of Xtract makes it portable across architectures, and its modular design makes it easy to add capabilities for identifying new types of files or other data.
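The modular idea can be illustrated with a minimal Python sketch. This is not Xtract's actual API or module format, which the description does not specify. The names `DETECTORS`, `register`, `is_jpeg`, and `identify` are hypothetical. The only external fact relied on is that JPEG data starts with the bytes FF D8 FF.

```python
from typing import Callable, Dict, List

# Hypothetical registry mapping a type name to a detector function.
# This is an illustration only, not Xtract's real module interface.
DETECTORS: Dict[str, Callable[[bytes], bool]] = {}


def register(name: str) -> Callable[[Callable[[bytes], bool]], Callable[[bytes], bool]]:
    """Decorator that adds a detector to the registry under the given name."""
    def wrap(fn: Callable[[bytes], bool]) -> Callable[[bytes], bool]:
        DETECTORS[name] = fn
        return fn
    return wrap


@register("jpeg")
def is_jpeg(data: bytes) -> bool:
    # JPEG data begins with the start-of-image marker FF D8,
    # followed by the FF byte that opens the next marker.
    return data[:3] == b"\xff\xd8\xff"


def identify(data: bytes) -> List[str]:
    """Return the names of all registered detectors that match the data."""
    return [name for name, detect in DETECTORS.items() if detect(data)]
```

Under this sketch, supporting a new file type means writing one more function decorated with `@register`, which mirrors the kind of easy extension the description refers to.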
Hadoop Studio is a map-reduce integrated development environment (IDE) based on NetBeans. It makes it easy to create, understand, and debug map-reduce applications based on Hadoop, without requiring development-time access to a map-reduce cluster. The studio provides a real-time workflow view of a map-reduce job, which displays the individual inputs, outputs, and interactions between the phases of the job. The workflow view updates in real time as the developer changes the code. The studio then generates Java sources and compiles them into a binary jar file, which can be run on a normal Hadoop cluster.
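To make the phases of a map-reduce job and their intermediate inputs and outputs concrete, here is a small in-memory Python sketch of a word count. It uses neither Hadoop nor Hadoop Studio, and it runs on a single machine. The function names `map_phase`, `shuffle_phase`, `reduce_phase`, and `run_job` are hypothetical.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_phase(lines: Iterable[str]) -> Iterator[Tuple[str, int]]:
    """Map: emit a (word, 1) pair for every word in every input line."""
    for line in lines:
        for word in line.split():
            yield (word.lower(), 1)


def shuffle_phase(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Shuffle: group all emitted values by their key."""
    groups: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return dict(groups)


def reduce_phase(groups: Dict[str, List[int]]) -> Dict[str, int]:
    """Reduce: collapse each key's list of values into a single count."""
    return {key: sum(values) for key, values in groups.items()}


def run_job(lines: Iterable[str]) -> Dict[str, int]:
    """Chain the three phases, passing each phase's output to the next."""
    return reduce_phase(shuffle_phase(map_phase(lines)))
```

Calling `shuffle_phase(map_phase(lines))` on its own exposes the intermediate data between phases, which is the kind of information a workflow view would display.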
ARPWatch Web-interface and SQL-storage includes an SQL converter and a Web interface for arpwatch. The SQL converter lets you store arpwatch messages in a MySQL database. The Web interface displays mailbox and database reports with selection by period, IP address, and MAC address.
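The store-then-report pattern described here can be sketched in Python. This sketch makes several assumptions: it uses the standard-library `sqlite3` module as a stand-in for MySQL, the `sightings` table and its columns are a hypothetical schema rather than the project's own, and the event strings are illustrative values only.

```python
import sqlite3
from typing import List, Optional, Tuple


def open_store() -> sqlite3.Connection:
    """Create an in-memory database with a hypothetical sightings table."""
    conn = sqlite3.connect(":memory:")  # stand-in for a MySQL connection
    conn.execute(
        "CREATE TABLE sightings ("
        "seen_at TEXT NOT NULL, ip TEXT NOT NULL, "
        "mac TEXT NOT NULL, event TEXT NOT NULL)"
    )
    return conn


def store(conn: sqlite3.Connection, seen_at: str, ip: str, mac: str, event: str) -> None:
    """Insert one report; seen_at is an ISO 8601 timestamp string."""
    conn.execute("INSERT INTO sightings VALUES (?, ?, ?, ?)", (seen_at, ip, mac, event))


def report(conn: sqlite3.Connection, start: str, end: str,
           ip: Optional[str] = None, mac: Optional[str] = None) -> List[Tuple[str, str, str, str]]:
    """Select rows within a period, optionally filtered by IP and MAC address."""
    sql = "SELECT seen_at, ip, mac, event FROM sightings WHERE seen_at BETWEEN ? AND ?"
    params: List[str] = [start, end]
    if ip is not None:
        sql += " AND ip = ?"
        params.append(ip)
    if mac is not None:
        sql += " AND mac = ?"
        params.append(mac)
    sql += " ORDER BY seen_at"
    return conn.execute(sql, params).fetchall()
```

Storing timestamps as ISO 8601 strings keeps the period filter a plain string comparison, and the parameterized queries avoid building SQL from user-supplied filter values.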