Catxdoc is a bash script to extract plain text (in UTF-8) from .docx files to stdout (in one line). The requirements other than bash are the "file", "unzip", and "sed" commands.
saahriktu 12 Mar 2010 10:10
Software to monitor for open files on your system in real time.
A Java deduplication / record linkage engine.