Niocchi is a Java crawler library implementing synchronous I/O multiplexing. This specific type of implementation allows crawling tens of thousands of hosts in parallel on a single low end server. Niocchi has been designed for big search engines that need to crawl massive amount of data, but can also be used to write no-frills crawlers.
lambdaj is a Java library that allows you to manipulate collections in a pseudo-functional and statically typed way. The main purpose of lambdaj is to partially eliminate the burden of writing (often nested and poorly readable) loops while iterating over collections. In particular, it allows you to iterate a collection in order to filter its items on a given condition, convert each item with a given rule, extract a given property from each item, sort the items on the values of one of their properties, group or index the items on the value of one or more properties, invoke a method on each item, or aggregate the items or the values of one of their properties. It allows all this without having to write a single explicit loop.
RestyGWT is a GWT generator for REST services and JSON encoded data transfer objects. RestyGWT Rest Services allow you to define an asynchronous service API which is then implemented via GWT deferred binding by RestyGWT. It features generation of asynchronous RESTful JSON based service proxies, encoding and decoding of Java objects to JSON, and an easy-to-use REST API.
treap.py is a treap implementation for Python. A treap is a hybrid of a binary tree and a binary heap that is self-balancing and is O(nlog2(n)) for most operations, including deleting a value, inserting a value, finding the least value, and finding the greatest value. This particular treap implementation looks like a dictionary to the caller, but it also supports getting an ordered list (forward or reverse) in O(n) time. The code is available as pure Python (should run on about any Python implementation supporting generators, but was tested on CPython 2.6) or as part Python and part Cython for performance. The version with Cython should run on CPython or Unladen Swallow, but was only tested on CPython 2.6.
SkrubCRM is a CRM that can be used easily by companies of all sizes. Larger organizations can rely on it to deliver constantly and reliably. It has comprehensive role based security. It can use any Hibernate supported database. It does not require an application server to run; it can run on a Web server like Tomcat or Jetty. It can support millions of customers easily. It is extremely fast. It complies with ITIL, SOX, and other international regulations. Its internationalization and localization features allow it to be translated into any language easily.
Tranche is file storage and dissemination software. Designed and built with scientists and researchers in mind, Tranche can handle very large data sets, is secure and scalable, and all data sets are citable in scientific journals. Features include a fully decentralized architecture, support for very large files, very long-term file persistence/preservation, file immutability/integrity, provenance, encryption, licensing, versioning, and citability.