libextractor is a library for extracting meta-data from files of arbitrary type. It is designed to use helper libraries to perform the actual extraction, and to be easily extended by linking against external extractors for additional file types. The goal is to provide developers of file-sharing networks, file managers, and WWW-indexing bots with a universal library for obtaining meta-data about files. It includes a command-line tool and bindings for Java (JNI) and Python.
|Tags||Software Development, Libraries, Internet, Web, Indexing/Search, Communications, File Sharing, Text Processing, Indexing|
|Licenses||GPLv3 or later|
|Operating Systems||POSIX, BSD, Unix, Linux, Windows, Mac OS X|
|Translations||Kinyarwanda, French, German, Irish, Romanian, Swedish, Vietnamese, Dutch, Italian, Ukrainian|
Release Notes: This release adds a plugin for extracting audio previews and ensures that one blocking (or slow) plugin does not prevent other plugins from progressing.
Release Notes: This release fixes silent IPC issues on slow machines, which previously triggered a timeout that resulted in incomplete meta-data extraction. It also updates the Dutch translation, requires an external installation of libltdl, and fixes build issues with recent versions of libavcodec and libtidy.
Release Notes: This release fixes plugin discovery on OS X. Nothing has changed on other systems, so there is no need to update from 1.0.0 on non-OS X systems.
Release Notes: Major changes to the plugin mechanism now allow out-of-process plugins full random access to the entire file. Most plugins have been rewritten for the new plugin API. The external (libextractor) API remains unchanged and compatible with 0.6. As part of the rewrite, many plugins were changed to use standard third-party libraries (libjpeg, libtiff, libgif, libtidy, and libmagic) for parsing. A new plugin based on gstreamer replaces many existing multimedia plugins. Automated test cases for almost all of the plugins were also written, and the documentation was updated.
Release Notes: This release adds support for Matroska, fixes some minor bugs (leaks on error-handling paths), and does some minor code cleanup (fixing compiler warnings about dead code).
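
Several of the release notes above concern plugin isolation: plugins run apart from the caller, and one blocking or slow plugin should not prevent the others from progressing. The following is a minimal Python sketch of that idea only. It is not libextractor's implementation or API: the plugin functions, the `extract` helper, and the timeout value are all hypothetical, and worker threads stand in for the separate processes that libextractor actually uses.

```python
import concurrent.futures
import time

# Hypothetical plugins: each receives the file contents as bytes
# and returns a dict of meta-data it could determine.
def mime_plugin(data):
    if data.startswith(b"\x89PNG"):
        return {"mimetype": "image/png"}
    return {}

def size_plugin(data):
    return {"size": len(data)}

def slow_plugin(data):
    time.sleep(1.0)  # simulates a plugin that blocks for too long
    return {"slow": "done"}

def extract(data, plugins, timeout=0.2):
    """Run every plugin in its own worker; skip any that miss the deadline."""
    pool = concurrent.futures.ThreadPoolExecutor(max_workers=len(plugins))
    futures = {pool.submit(plugin, data): plugin.__name__ for plugin in plugins}
    # Wait up to `timeout` seconds in total, not per plugin.
    done, not_done = concurrent.futures.wait(futures, timeout=timeout)
    meta = {}
    for future in done:
        meta.update(future.result())
    skipped = sorted(futures[future] for future in not_done)
    pool.shutdown(wait=False)  # do not block on the stragglers
    return meta, skipped
```

Calling `extract(data, [slow_plugin, mime_plugin, size_plugin])` returns the meta-data from the two fast plugins together with the name of the plugin that missed the deadline. Unlike a separate process, a thread cannot be forcibly terminated, so a real out-of-process design can also kill and restart a plugin that hangs.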