uni2ascii 4.5 (Default branch)

uni2ascii and ascii2uni convert between UTF-8 Unicode and 29 7-bit ASCII equivalents including: hexadecimal and decimal HTML and SGML numeric character references, \u-escapes, standard hexadecimal, raw hexadecimal, and RFC2396 URI format. Such ASCII equivalents are useful for entering Unicode in program source or in programs that are not 8-bit safe, and for testing and debugging. Several options allow Unicode to be converted to approximately equivalent ASCII, e.g. by stripping diacritics. An optional GUI is provided. License: GNU General Public License v3 Changes:
Microsoft-style HTML entities lacking the final semi-colon are now passed on by default rather than converted by ascii2uni. The new -m flag causes them to be converted. Error messages and warnings from ascii2uni now include the line number.

More...