libutf8 is a reasonably fast, robust encoder/decoder for UTF-8, written as a C library. It encodes UCS-4 (32-bit characters) into UTF-8 and decodes UTF-8 into UCS-4. It is a "safe" UTF-8 decoder: it rejects overlong character representations and other illegal sequences. The library provides simple functions for manipulating characters and strings, plus a more powerful API for efficiently handling streamed data (e.g. from a TCP connection).
| Tags | Text Processing |
|---|---|
| Licenses | GPL |
| Operating Systems | Unix |
| Implementation | C |
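To illustrate what "safe" decoding means in the description above, here is a minimal sketch in C. It is not libutf8's API: the names `sketch_utf8_encode` and `sketch_utf8_decode` are hypothetical, and the sketch assumes the modern Unicode range (U+0000 to U+10FFFF, surrogates excluded), which may be narrower than what the library itself accepts.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper (not part of libutf8): encode one code point.
 * Returns the number of bytes written (1-4), or 0 if cp is not encodable. */
size_t sketch_utf8_encode(uint32_t cp, unsigned char out[4])
{
    if (cp < 0x80) {
        out[0] = (unsigned char)cp;
        return 1;
    }
    if (cp < 0x800) {
        out[0] = (unsigned char)(0xC0 | (cp >> 6));
        out[1] = (unsigned char)(0x80 | (cp & 0x3F));
        return 2;
    }
    if (cp < 0x10000) {
        if (cp >= 0xD800 && cp <= 0xDFFF)
            return 0; /* surrogates are not valid scalar values */
        out[0] = (unsigned char)(0xE0 | (cp >> 12));
        out[1] = (unsigned char)(0x80 | ((cp >> 6) & 0x3F));
        out[2] = (unsigned char)(0x80 | (cp & 0x3F));
        return 3;
    }
    if (cp <= 0x10FFFF) {
        out[0] = (unsigned char)(0xF0 | (cp >> 18));
        out[1] = (unsigned char)(0x80 | ((cp >> 12) & 0x3F));
        out[2] = (unsigned char)(0x80 | ((cp >> 6) & 0x3F));
        out[3] = (unsigned char)(0x80 | (cp & 0x3F));
        return 4;
    }
    return 0; /* beyond the assumed Unicode range */
}

/* Hypothetical helper (not part of libutf8): decode one code point from
 * the first bytes of s. Returns the number of bytes consumed (1-4), or 0
 * if the input is truncated, malformed, overlong, or out of range. */
size_t sketch_utf8_decode(const unsigned char *s, size_t len, uint32_t *cp)
{
    /* Smallest code point that legitimately needs n bytes, indexed by n. */
    static const uint32_t min_cp[5] = { 0, 0, 0x80, 0x800, 0x10000 };
    size_t n, i;
    uint32_t v;

    if (len == 0)
        return 0;
    if (s[0] < 0x80) {
        *cp = s[0];
        return 1;
    } else if ((s[0] & 0xE0) == 0xC0) {
        n = 2; v = s[0] & 0x1F;
    } else if ((s[0] & 0xF0) == 0xE0) {
        n = 3; v = s[0] & 0x0F;
    } else if ((s[0] & 0xF8) == 0xF0) {
        n = 4; v = s[0] & 0x07;
    } else {
        return 0; /* stray continuation byte or invalid lead byte */
    }
    if (len < n)
        return 0; /* truncated sequence */
    for (i = 1; i < n; i++) {
        if ((s[i] & 0xC0) != 0x80)
            return 0; /* expected a continuation byte */
        v = (v << 6) | (uint32_t)(s[i] & 0x3F);
    }
    if (v < min_cp[n])
        return 0; /* overlong: value fits in a shorter encoding */
    if (v > 0x10FFFF || (v >= 0xD800 && v <= 0xDFFF))
        return 0; /* out of range or surrogate */
    *cp = v;
    return n;
}
```

With these helpers, U+20AC encodes to the three bytes `E2 82 AC` and decodes back to the same value, while the two-byte sequence `C0 AF` (an overlong encoding of `/`) is rejected. The overlong check matters because accepting such sequences lets the same character slip past byte-level filters in a disguised form.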
Recent releases


Release Notes: Major cleanups and an ABI change. New character classifier functions have been added, along with some bugfixes.


Release Notes: This release has an updated build system and a new URL.


No changes have been submitted for this release.