Duke is a fast and flexible record linkage engine. It does not use the traditional blocking (sort by key) approach, but instead relies on Lucene. This makes it high-performance (able to process 1,000,000 records in ~10 minutes). Duke can be run from the command line, but also has an API allowing incremental linking applications to be built easily. It supports reading data from CSV, JDBC, SPARQL, and NTriples, and also supports a number of string comparators and string normalizers.
Geo Bookmark is a simple widget that adds a bookmark with the current time and geographical position to the home screen. The interface is kept minimal to allow operation in situations that do not permit extensive interaction with the phone (e.g. while driving a car). Bookmarks are color coded. Blue icons represent exact locations, red icons mean that the widget could not get a GPS fix within 40 seconds and therefore recorded the last known location (which may or may not have been determined by means other than GPS). In order to view a bookmark, a suitable maps application needs to be installed (e.g. Google Maps or OSMAnd).
AtMail is a webmail client. The project aims to provide an elegant client for existing IMAP mailservers, with less bloat and a focus on an intuitive, simple user interface. Features include complete Webmail functionality, address-book support, video mail, an AJAX interface, drag'n'drop, and more.
Weed-FS is a simple and highly scalable distributed file system. There are two objectives: to store billions of files, and to serve the files fast! Instead of supporting full POSIX file system semantics, it implements only a key-file mapping. Instead of managing all file metadata in a central master, it manages file volumes in the central master and lets volume servers manage files and the metadata. This relieves concurrency pressure from the central master and spreads file metadata into volume servers' memories, allowing faster file access with just one disk read operation. It is modelled on Facebook's Haystack design paper. Only 40 bytes of disk storage are required for each file's metadata, and disk reads are O(1).
Gizmo Daemon is a program for controlling your computer based on events from input devices. It has built-in support for all Linux input devices, including keyboards with special keys, joysticks, remotes, dials, and more. It lets you control applications, launch programs, change the system volume, switch desktops, and directly control Amarok. It can visualize system events (such as Amarok sound output, CPU usage, etc.) on capable devices (keyboards with LEDs, Griffin PowerMate, etc.). It also features support for LIRC and RF based remote controls, allowing it to have per-application key mappings and configurable sensitivity settings.
Apache OpenNLP is a machine learning based toolkit for the processing of natural language text. It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution. These tasks are usually required to build more advanced text processing services.
BitNami MAMPStack Native Installer is an easy-to-install environment to develop and deploy PHP applications. It includes pre-configured, ready-to-run versions of Apache, MySQL, PHP, and phpMyAdmin. Amazon AWS PHP SDK makes it easier to develop PHP applications that run on Amazon Web Services. The AWS PHP SDK includes the AWS PHP library, code samples, and documentation.
JasperStarter is a commandline launcher and batch compiler for JasperReports. It can run any JasperReport that needs a JDBC, CSV, or empty data source, use any database for which a JDBC driver is available, execute reports that need runtime parameters, print directly to the system default or a given printer, optionally show a printer dialog to choose a printer, optionally show a print preview, export to file in the PDF, RTF, .docx, .odt, and HTML formats, export multiple formats in one commanding call, print and export in one commanding call, and integrate in non-Java applications (for example PHP, Python) A binary executable is available for Windows. JapserStarter is able to prompt for report parameters. Any parameter whose class has a string constructor is accepted. Additionally, the following types are supported or have special handlers: date, image (see usage), and locale.