Emdros is a corpus query system for storing and searching linguistically annotated text. It is very generic, supporting almost any kind of annotation from almost any linguistic theory. All linguistic levels of analysis are supported, including phonology, morphology, the lexical level, syntax, and discourse. The core libraries act as a middleware layer between a client and an underlying SQL database. MySQL, PostgreSQL, and SQLite are supported.
| Tags | Database Database Engines/Servers Text Processing Linguistic General Indexing Information Management education |
|---|---|
| Licenses | GPL |
| Operating Systems | Windows Windows POSIX Linux Solaris Mac OS X |
| Implementation | C++ |
Recent releases


Release Notes: Experimental support for full text search was added. The RenderObjects and RenderXML features were enhanced. General speed improvements were obtained. Support was added for building on recent Linux distributions, as well as support for building with qmake.


Release Notes: Upgraded to SQLite 3.6.17 and PCRE 8.01. The TIGERXML importer is now more lenient towards the input. The Emdros Query Tool can now output in XML format, and can create PNGs from the command line.


Release Notes: Support for gcc 4.4 was added, meaning it can now be compiled on the latest Linux distributions, such as Fedora 11 and Ubuntu 9.04. Various bugs were fixed.


Release Notes: A serious bug in the topographic language that could lead to incomplete or misleading query results was fixed. The Emdros Query Tool was given tree output and KWIC output options, as well as new ways of "harvesting" results. Lots of other small improvements and bugfixes were made.


Release Notes: Support for Solaris was reinstated.