This converter was originally taken from Shaun Moss's website. His site appears to be offline so I rescued this from the google cache. Changed some stuff too. I use this to convert the email addresses I put on web pages to things the email harvester bots won't understand.


Unicode Characters to HTML Entities Converter

A utility to convert Unicode characters to decimal and hexadecimal HTML entities.
(by Shaun Moss, adapted from: ASCII to HEX to Unicode Converter by Mike Golding)

The partial conversions do not convert characters with a code of 127 or less, (i.e. plain old ASCII characters), which can appear in HTML code as they are.
More about character sets.


Unicode characters:
 

HTML entities:
decimal, full:
decimal, partial: *
hexadecimal, full:
hexadecimal, partial:

* This is the result I prefer for HTML code. I have noticed some quirkiness with the hexadecimal codes - sometimes the HTML entity is not converted to the character and appears on the page unconverted (e.g. 월 instead of 월). Also, although some text editors will allow you to type Latin 1 characters (e.g. é, ç) into your HTML code, these characters do not always render properly (not sure why, if you know please email me). For maximum reliability use decimal HTML entities for any non-ASCII characters.