rxvt-unicode is a clone of the well known terminal emulator rxvt, modified to store text in Unicode and to use locale-correct input and output. It also supports mixing multiple fonts at the same time, including Xft fonts, real transparency, and an optional embedded Perl interpreter that allows you to extend your terminal easily.
The Heirloom Toolchest is a collection of standard Unix utilities. It was derived from original Unix material released as open source by Caldera and Sun, and contains multiple versions of each utility corresponding to SVID3/SVR4, SVID4/SVR4.2MP, POSIX.2-1992/SUSV2, POSIX.1-2001/SUSV3, and 4BSD (SVR4 /usr/ucb). It processes lines of arbitrary length and in many cases binary input data, supports characters in UTF-8 and many East Asian encodings, and contains more than 100 individual utilities including bc, cpio, diff, ed, file, find, grep, man, nawk, oawk, pax, ps, sed, sort, spell, and tar. Extensive documentation is included.
Traditional vi is derived from the 4BSD source and includes support for modern operating systems, 8-bit input, multi-byte character encodings like UTF-8, and features demanded by POSIX.2. It contains few additions beyond this, so it is of interest for those who look for a small but well-defined vi implementation close to that of most commercial Unix systems. It also has some features to cope with primitive terminals or slow connections.
Java+ is a Java preprocessor that adds its features to any Java compiler. It has long, multi-line strings with executable inclusions like Perl or Ruby, optionally segregates Java+ strings into ResourceBundle files, eliminates the need for JSP or ASP and their implied need for Java compilers on deployment servers (a security concern), and adds absolutely no overhead in either space or time. There are graphical and command line interfaces, and a simple, general, and recursive string syntax.
The Universal Text Recognizer and Converter (Utrac) is a commandline tool and a C library that recognizes the encoding of an input file (UTF-8, ISO-8859-1, CP437, etc.) and its end-of-line type (CR, LF, or CRLF). It features automatic recognition (depending on the file and on the system's locale, reliable in most cases), assistance for verification or manual recognition, and conversion to another charset and/or end-of-line type.