4 projects tagged "utf8"
utfout is a command-line tool that can produce UTF-8 (Unicode) strings in various ways and direct them to standard output, standard error, or direct to the terminal without the need for shell support. Strings can be repeated, delayed, randomly-generated, written to arbitrary file descriptors, interspersed with other characters, and generated using ranges. printf(1)-style escape sequences are supported along with extended escape sequences. utfout(1) sits somewhere between echo(1) and printf(1).
ULS is an library for general purpose lexical analysis, with support for UTF-8. It comes with C/C++ libraries and a couple of tools for Linux and Windows. It's a intuitive, practical, flexible, and optimized tokenizer. ULS can instantiate multiple objects for lexical analyses. The objects can process multiple (nested) inputs of different languages. The language specification is specified in configuration file suffixed by *.ulc. ULS can tokenize the input file, which encoded by UTF-8. The input files may contain the words in a localized language as identifiers. ULS can stream the tokens from many input files to another output (files).The stream can be stored in a *.uls file and replayed from it whenever necessary.
WebAbility is an advanced Web development platform that contains WebFlow, a content management system (CMS), a security advanced system (SAS), a powerful workflow engine, and Web site wrappers. It supports multiple database connections, multiple languages, multiple presentation templates, page, script, and library versioning, XML and Web Services integration, and advanced security management. It uses plugins to extend the software for tasks such as e-commerce, portals, intranets, and editorial systems.