PDF OCR is a simple drag-and-drop utility that converts PDFs and images into text documents. It uses optical character recognition (OCR) to extract the text of a PDF or image, which is particularly useful for PDFs and images produced by the scan-to-PDF function of a scanner or photocopier. OCR is performed by the Tesseract engine, and more than 20 languages are currently supported.
Bitsy is a small, fast, embeddable, durable in-memory graph database that implements the Blueprints API. It is an ACID database that recovers cleanly from power failures and crashes. Designed for multi-threaded OLTP applications, Bitsy implements optimistic concurrency control and stores data in human-readable JSON-encoded text files. Common management tasks such as online backups can be done over JMX. The speed and simplicity of Bitsy rely on three design principles: No Seek, No Socket, NoSQL.
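For readers unfamiliar with optimistic concurrency control, the general idea can be sketched in a few lines of Java. This is only an illustration of the technique under simple assumptions (one version counter per key, conflicts reported to the caller); it is not Bitsy's code or API, and the names `OptimisticStore`, `Versioned`, `read`, and `commit` are hypothetical.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

/** Hypothetical sketch of optimistic concurrency control; not Bitsy's implementation. */
class OptimisticStore {
    /** An immutable value paired with the version at which it was written. */
    static final class Versioned {
        final String value;
        final long version;

        Versioned(String value, long version) {
            this.value = value;
            this.version = version;
        }
    }

    private final Map<String, Versioned> data = new ConcurrentHashMap<>();

    /** Returns the current value and version, or null if the key is absent. */
    Versioned read(String key) {
        return data.get(key);
    }

    /**
     * Writes newValue only if the key is still at expectedVersion
     * (0 means "the key must not exist yet"). Returns false on conflict,
     * in which case the stored value is left unchanged.
     */
    boolean commit(String key, long expectedVersion, String newValue) {
        boolean[] committed = {false};
        // compute() runs atomically per key, so the check and the write cannot interleave.
        data.compute(key, (k, current) -> {
            long currentVersion = (current == null) ? 0 : current.version;
            if (currentVersion != expectedVersion) {
                return current; // someone else committed first
            }
            committed[0] = true;
            return new Versioned(newValue, currentVersion + 1);
        });
        return committed[0];
    }
}
```

A caller reads a value, does its work without holding any lock, and then calls `commit` with the version it originally saw. If another thread committed in the meantime, `commit` returns false and the caller is expected to re-read and retry.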