DKPro WSD provides UIMA components which encapsulate corpus readers, linguistic annotators, lexical semantic resources, WSD algorithms, and evaluation and reporting tools. You configure the components, or write new ones, and arrange them into a data processing pipeline. DKPro WSD is modular and flexible. Components which provide the same functionality can be freely swapped. You can easily run the same algorithm on different data sets, or test several different algorithms on the same data set.
OrientDB is a NoSQL DBMS which can store 150,000 documents per second on common hardware. Although it is a document-based database, relationships are managed as in graph databases, with direct connections among records. You can traverse all or part of a tree or graph of records in a few milliseconds. It supports schema-less, schema-full, and schema-mixed modes, has a strong security profiling system based on users and roles, and supports SQL among its query languages. Thanks to the SQL layer, it's straightforward to use for people skilled in the relational world.
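The idea of direct connections among records can be illustrated with a minimal Java sketch. This is not OrientDB's API: the `LinkedRecords` and `Doc` classes and the `traverse` method are hypothetical names invented for the illustration. Each record holds references to the records it links to, so a traversal follows pointers instead of performing joins.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

// Hypothetical illustration only; none of these names come from OrientDB.
public class LinkedRecords {

    /** A document that stores direct references to other documents. */
    static final class Doc {
        final String name;
        final List<Doc> links = new ArrayList<>();

        Doc(String name) {
            this.name = name;
        }
    }

    /** Breadth-first walk that follows links up to maxDepth hops from start. */
    static List<String> traverse(Doc start, int maxDepth) {
        Set<Doc> seen = new LinkedHashSet<>();
        Deque<Doc> frontier = new ArrayDeque<>();
        seen.add(start);
        frontier.add(start);
        for (int depth = 0; depth < maxDepth && !frontier.isEmpty(); depth++) {
            Deque<Doc> next = new ArrayDeque<>();
            for (Doc doc : frontier) {
                for (Doc linked : doc.links) {
                    // Set.add returns false for already visited records,
                    // which keeps cycles from looping forever.
                    if (seen.add(linked)) {
                        next.add(linked);
                    }
                }
            }
            frontier = next;
        }
        List<String> names = new ArrayList<>();
        for (Doc doc : seen) {
            names.add(doc.name);
        }
        return names;
    }

    public static void main(String[] args) {
        Doc a = new Doc("a");
        Doc b = new Doc("b");
        Doc c = new Doc("c");
        a.links.add(b);
        b.links.add(c);
        c.links.add(a); // cycle back to the start
        System.out.println(traverse(a, 1)); // prints [a, b]
        System.out.println(traverse(a, 2)); // prints [a, b, c]
    }
}
```

Limiting the depth is what makes it possible to walk only part of a graph rather than all of it.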
DKPro Core is a collection of software components for natural language processing (NLP) based on the Apache UIMA framework. Many powerful and state-of-the-art NLP components are already freely available in the NLP research community. New and improved components are being developed and released continuously. The components cover the whole range of NLP-related processing tasks. DKPro Core provides wrappers for such third-party tools as well as original NLP components. DKPro Core builds heavily on uimaFIT, which allows for rapid and easy development of NLP processing pipelines.
C5Connector.Java is a backend for the Core Five file manager. It provides a bridge between the file manager and a storage backend and works like a transparent virtual file system or a proxy. It includes a default implementation for accessing the local filesystem (on the server side). The API encapsulates the building of JSON and is intended to be used within JSPs as well as with a template engine like Velocity or FreeMarker.
documentr is a Web-based tool for editing and presenting software documentation. It allows you to easily maintain documentation for multiple products and product branches. Edits can easily be copied between branches, with merge conflicts being handled gracefully. It uses Markdown as its markup language, along with some extensions, and has a role-based permission system.
TomP2P is a P2P-based, high-performance key-value pair storage library. Each peer has a table (either disk-based or memory-based) to store its values. A single value can be queried or updated with a secondary key. The underlying communication framework uses Java NIO to handle many concurrent connections.
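One way to picture a table whose values are addressed by a secondary key is a two-level map, sketched below. This is an assumption about the general shape of such a table, not TomP2P's actual API or storage layout: the `PeerTable` class and its `put` and `get` methods are hypothetical, and only the memory-based case is shown.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.Optional;

// Hypothetical in-memory table; this is not TomP2P's API.
public class PeerTable {
    // Primary key -> (secondary key -> value).
    private final Map<String, Map<String, String>> table = new HashMap<>();

    /** Stores or overwrites the single value at (primaryKey, secondaryKey). */
    public void put(String primaryKey, String secondaryKey, String value) {
        table.computeIfAbsent(primaryKey, k -> new HashMap<>())
             .put(secondaryKey, value);
    }

    /** Looks up a single value; empty if either key is unknown. */
    public Optional<String> get(String primaryKey, String secondaryKey) {
        Map<String, String> entries = table.get(primaryKey);
        if (entries == null) {
            return Optional.empty();
        }
        return Optional.ofNullable(entries.get(secondaryKey));
    }

    public static void main(String[] args) {
        PeerTable peer = new PeerTable();
        peer.put("song", "title", "Blue");
        peer.put("song", "artist", "Ann");
        peer.put("song", "title", "Green"); // updates one value, leaves "artist" alone
        System.out.println(peer.get("song", "title").orElse("missing")); // prints Green
        System.out.println(peer.get("song", "year").isPresent());        // prints false
    }
}
```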
CKEditor.Java provides an object with a simple interface for creating and configuring an editor instance. This makes the deployment of the CKEditor a piece of cake. It is intended to be used in servlet-based systems like JSP as well as template engines like Velocity or FreeMarker. There are some notable differences between this library and the official CKEditor for Java library. CKEditor.Java isn't focused on JSP and can be used in all servlet-based systems. A tag library isn't provided yet! It also has advanced handling of properties. It's possible to set all CKEditor configuration for the entire Web application in one properties file. This keeps the front-end code clean. The state-of-the-art logging facade slf4j is used.
ERDesignerNG is a Java-based entity relationship design and database modeling tool. It has a powerful WYSIWYG interface for physical database design and supports 2D and 3D database modeling, browsing, and design. It handles tables, relations, indexes, and comments, and supports a central repository for all models (Model Repository), subject areas, and domains. Supported databases are MySQL, Oracle, Microsoft SQL Server, PostgreSQL, and H2. It creates the SQL DDL statements for schema creation, has an integrated schema version control system, and can generate schema migration scripts for every change. It stores the database definition as XML files for further processing and can export the database schema as GIF, BMP, JPEG, or SVG files. It also has an integrated reverse engineering module for existing schemas and a built-in schema compare feature, can generate schema documentation as PDF, HTML, RTF, and other formats, and supports schema conversion between different dialects.
JID is a system which can deserialize, update, and reserialize a map of 100,000 entries in half a millisecond on an i7 at 3.40 GHz. Serialization/reserialization timings are largely independent of the size and complexity of the table entries. Incremental serialization/deserialization is used to achieve high performance.
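The general idea behind incremental serialization and deserialization can be sketched as follows. This is not JID's implementation or API: the `LazyEntry` class and its methods are hypothetical, and the sketch assumes string values encoded as UTF-8. An entry keeps its serialized bytes, decodes them only when read, and re-encodes only after it has been changed, so untouched entries cost almost nothing to reserialize.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical illustration of lazy (incremental) serialization; not JID's API.
public class LazyEntry {
    private byte[] bytes;        // serialized form; null once the value has changed
    private String value;        // deserialized form; null until first read or write
    private int decodeCount = 0; // counts how often bytes were actually decoded

    public LazyEntry(byte[] serialized) {
        this.bytes = serialized.clone();
    }

    /** Decodes the bytes on first access only. */
    public String get() {
        if (value == null) {
            value = new String(bytes, StandardCharsets.UTF_8);
            decodeCount++;
        }
        return value;
    }

    /** Replaces the value and marks the cached bytes as stale. */
    public void set(String newValue) {
        value = newValue;
        bytes = null;
    }

    /** Reuses the cached bytes unless the value was changed. */
    public byte[] serialize() {
        if (bytes == null) {
            bytes = value.getBytes(StandardCharsets.UTF_8);
        }
        return bytes.clone();
    }

    public int decodeCount() {
        return decodeCount;
    }

    public static void main(String[] args) {
        LazyEntry entry = new LazyEntry("old".getBytes(StandardCharsets.UTF_8));
        entry.serialize();                       // untouched entry: no decoding needed
        System.out.println(entry.decodeCount()); // prints 0
        entry.set("new");
        System.out.println(new String(entry.serialize(), StandardCharsets.UTF_8)); // prints new
    }
}
```

In a map of many such entries, updating one entry would require re-encoding only that entry, which is one plausible reason timings could stay largely independent of the size of the other entries.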