PDF OCR is a simple drag-and-drop utility that converts PDFs and images into text documents. It uses OCR (optical character recognition), performed by the Tesseract engine, to extract the text of the PDF or image. This is particularly useful for PDFs and images that were created via a scan-to-PDF function in a scanner or photocopier. It currently supports over 20 languages.
TeXstudio is a fully featured LaTeX editor. It is designed to make writing LaTeX documents as easy and comfortable as possible. It features an integrated PDF viewer with (almost) word-level synchronization, live inline preview, advanced syntax highlighting, and live checking of references, citations, LaTeX commands, spelling, and grammar.
LogicalDOC is a Web-based document management system that is easy to use and learn. Its architecture leverages best-of-breed Java technology to achieve a powerful and flexible solution. It provides a powerful search engine (Lucene), a Web service interface (JAX-WS via CXF) compatible with .NET and PHP, versioning, document annotation, a WebDAV interface, and import from and export to .zip files. Documents can be organized into hierarchical folders, searched using the integrated search engine, or browsed by tag. The system is extensible thanks to the technologies used (Spring-Hibernate) and its plugin architecture.
The National Space Science Data Center's (NSSDC) Common Data Format (CDF) is a self-describing data abstraction for the storage and manipulation of multidimensional data in a platform- and discipline-independent fashion. It consists of a scientific data management package (known as the "CDF Library") that allows programmers and application developers to manage and manipulate scalar, vector, and multidimensional data arrays.
The Common Pipeline Library provides a highly robust set of functions for manipulating signals and images. It is primarily intended for building VLT (Very Large Telescope) instrument pipelines, but is also useful for generic data handling. It includes a number of useful low-level data types, medium-level data access methods, standard implementations of commonly used signal processing and data reduction tasks, and dynamic loading of "recipes" for data processing.