ServingXML is a markup language for expressing XML pipelines and an extensible Java framework for defining the elements of the language. It defines a vocabulary for expressing flat-XML, XML-flat, flat-flat, and XML-XML transformations in pipelines. The accompanying console app supports reading content as XML files, flat files, SQL queries, or SAX events, and writing it as XML, HTML, PDF, or mail attachments. This software is especially suited for converting flat file or database records to XML, with its support for namespaces, variant record types, multi-valued fields, segments and repeating groups, hierarchical grouping of records, and row-by-row validation with XML Schema. There is also an API for embedding the software in a Java application.
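As a rough illustration of the flat-to-XML direction described above, the sketch below turns delimited records into nested XML elements using only the Python standard library. It does not use ServingXML's markup or Java API; the function name `flat_to_xml` and the `records`/`record` tag names are hypothetical, and it assumes the header fields are valid XML element names.

```python
import csv
import io
import xml.etree.ElementTree as ET


def flat_to_xml(text, root_tag="records", record_tag="record"):
    """Convert delimited text with a header row into an XML string.

    Hypothetical helper for illustration only; not part of ServingXML.
    """
    reader = csv.DictReader(io.StringIO(text))
    root = ET.Element(root_tag)
    for row in reader:
        # One element per flat-file record.
        rec = ET.SubElement(root, record_tag)
        for name, value in row.items():
            # One child element per field, named after the header column.
            ET.SubElement(rec, name).text = value
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    print(flat_to_xml("id,name\n1,Ann\n2,Bob\n"))
```

A real pipeline would add the features the description lists, such as namespaces, variant record types, and per-row schema validation, none of which this sketch attempts.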
Weed-FS is a simple and highly scalable distributed file system. There are two objectives: to store billions of files, and to serve the files fast! Instead of supporting full POSIX file system semantics, it implements only a key-file mapping. Instead of managing all file metadata in a central master, it manages file volumes in the central master and lets volume servers manage files and the metadata. This relieves concurrency pressure from the central master and spreads file metadata into volume servers' memories, allowing faster file access with just one disk read operation. It is modelled on Facebook's Haystack design paper. Only 40 bytes of disk storage are required for each file's metadata, and disk reads are O(1).
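The key-file mapping idea can be sketched as follows: many small files are appended to one large volume file, and an in-memory index maps each key to an offset and size, so a lookup costs one memory access plus one read. This is a minimal sketch under assumptions, not Weed-FS's actual on-disk format or API; the `Volume` class and its methods are hypothetical, and an in-memory buffer stands in for the volume file.

```python
import io


class Volume:
    """Toy append-only volume: many small files packed into one large file."""

    def __init__(self, backing=None):
        # Any seekable binary file object works; BytesIO keeps the sketch
        # self-contained (assumption: a real volume would be a disk file).
        self._data = backing if backing is not None else io.BytesIO()
        # In-memory index: key -> (offset, size). Hypothetical layout.
        self._index = {}

    def put(self, key, blob):
        # Append the blob at the end of the volume and remember where it is.
        offset = self._data.seek(0, io.SEEK_END)
        self._data.write(blob)
        self._index[key] = (offset, len(blob))

    def get(self, key):
        # Constant-time index lookup in memory, no disk access needed.
        offset, size = self._index[key]
        self._data.seek(offset)
        # The single read against the volume.
        return self._data.read(size)


if __name__ == "__main__":
    volume = Volume()
    volume.put(1, b"hello")
    volume.put(2, b"world!")
    print(volume.get(2))
```

Because the index holds only an offset and a size per key, the per-file metadata stays small enough to keep entirely in a volume server's memory, which is the property the description relies on.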
OpenSAML is a portable implementation of the Security Assertion Markup Language (SAML) standard for the formation and exchange of authentication, attribute, and authorization data using XML, as defined by OASIS. There are interfaces for a range of languages, including C++ and Java.
Log4php is a PHP port of Log4j, the most popular Java logging framework. It supports configuration through XML and properties files (with the same structure as log4j) and custom Configurators. File, RollingFile, DailyFile, Echo, Console, Mail, PEAR::Db, PHP error, Syslog or NT events, and socket appenders are supported. Simple, TTCC, Pattern, Html, and Xml Layouts are supported. It also supports Filters, custom Levels, and Loggers. Internal debugging can be switched on and off. Log4php can be used inside a class or inside a main/sub function.
Apache OpenNLP is a machine learning based toolkit for the processing of natural language text. It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution. These tasks are usually required to build more advanced text processing services.
Probus is a Web framework for PHP5 development, based on the MVC pattern. It has excellent support for data caching and provides convenient AJAX support. The framework offers the following features: many programming patterns can be used, including MVC; a simple command-line tool generates projects; standard plugins are included for AJAX, RSS, and form validation; new plugins are simple to create; its database abstraction layer supports MySQL, PgSQL, MSSQL, Oracle, and many others; data is cached automatically; its configuration system uses YAML syntax; and components from external frameworks such as Zend Framework, symfony, Code Igniter, Kohana, and CakePHP can be used.