Redet is a tool for developing and executing regular expressions using any of more than 50 search programs, editors, and programming languages, intended both for developing regular expressions for use elsewhere and as a search tool in its own right. For each program in each locale, a palette showing the available constructs is provided. The properties of each program are determined by runtime tests, which guarantees that they will be correct for the program version and locale. Additional features include persistent history, extensive help, a variety of character entry tools, and the ability to change locale while running. Redet is highly configurable and fully supports Unicode.
SndBite is a specialized audio editor designed for breaking large recordings into smaller components with great efficiency. Its principal intended application is in linguistic research where it is often desirable to put each word or sentence into a separate file before further processing. It is also useful for measuring pause durations. Its features include multiple simultaneous views of the waveform at different resolutions, the ability to position window edges at transitions between sound and silence, automated setting of cut points at zero-crossings, automatic filename generation easily controlled by the user, and optional automatic playback on window motion. It is scriptable and may be run in batch mode without the GUI.
Tamil Converters is a collection of programs for converting among a variety of encodings and transliterations of Tamil, including: Unicode, ISCII, TSCII, ITRANS, the International Phonetic Alphabet, the Koln, Penn, and Colloquial Tamil romanizations, ISO-15919 transliteration, and Unicode character names enclosed in angle brackets (as in POSIX locale source files).
uni2ascii and ascii2uni provide conversion in both directions between UTF-8 Unicode and more than thirty 7-bit ASCII equivalents, including RFC 2396 URI format and RFC 2045 Quoted Printable format, the representations used in HTML, SGML, XML, OOXML, the Unicode standard, Rich Text Format, POSIX portable charmaps, POSIX locale specifications, and Apache log files. It can also convert between the escapes used for Unicode in languages such as Ada, C, Common Lisp, Java, Pascal, Perl, Postscript, Python, Scheme, and Tcl.
The Unicode Utilities are a set of programs for manipulating and analyzing Unicode text. uniname prints any combination of the character offset of each character, its byte offset, its hex code value, its encoding, the glyph itself, and its name. unidesc reports the character ranges to which different portions of the text belong. unihist generates a histogram of the characters in its input. ExplicateUTF8 determines and explains the validity of a sequence of bytes as a UTF-8 encoding. unirev reverses UTF-8 strings. unifuzz tests other programs' unicode handling.
UnicodeDataBrowser is a browser for the UnicodeData.txt file, which contains much useful information but is not easily read by humans. It creates a scrollable table in which columns represent properties. The table may be sorted on any column. Abbreviations are expanded and characters cross-referenced in decomposition and casing fields are named. Regular expression search restricted to a selected column is available. The set of characters for which information is displayed may be restricted to those characters matching a regular expression on a specified property.
WAVE Utilities is a set of three programs for dealing with WAVE format audio files. Some software is unable to parse complex WAVE files containing such things as playlists and padding. SimplifyWave converts complex files into files that such programs can read by stripping everything other than the data chunk. RepairWave inserts the required data chunk id and size information into ill-formed files in which the audio data directly follows the header. InfoWave extracts information from a RIFF/WAV or RIFX/WAV file and reports on the contents of the file.
WordGenerator generates hypothetical words from specifications of their syllable structure. The user specifies the maximum length of the words in syllables, the abstract structure of syllables in the language (in terms of such units as consonants and vowels or onsets and rhymes), and the actual sounds that comprise each abstract class (e.g. the list of vowels in the language); WordGenerator then generates the words that conform to this specification. Such lists are useful to field linguists exploring the vocabulary of a language, and to designers of artificial languages.
Xlit converts text from one writing system into another. It allows the user to define a transliteration simply by typing the input strings in one window and the strings to which they are to be mapped in another. Transliteration may be restricted to regions bounded by specified delimiters or their complements. Transliteration may also be performed by external commands or plugins. Xlit can also convert one type of delimiter to another, e.g. from HZ escapes to XML. Xlit can read and write transliteration definitions in its own format and as Yudit keymaps. It can be run in batch mode without the GUI.
What's with the fixed width pages? I hate being forced either to scroll horizontally or use a really wide window.
interesting - somewhat like pic This is interesting and looks promising. It looks rather like a development of pic.
comparison How does BigMath compare to other similar libraries such as GNU MP? Any particular advantages?
Re: very useful > This tool certiainly does one thing very > well but it strays from the Unix > philosophy by having a very non-standard > command-line interface. True....
documentation could use work This is a useful toolkit but the documentation badly needs improvement. There isn't nearly enough detail as to the meaning of the options, what units they are in, w...