Character Encoding in HTML

Encoding of a character is a way of changing bytes into characters. To authenticate or show a HTML document, a program should prefer character encoding. The largely general character set or character encoding in use on system is ASCII and this is most likely and broadly employed character set for encoding text electronically.

Encoding using ASCII supports the lower and upper alphabet, the numbers from 0 to 9 and some other extra characters. It takes total of 128 characters.

A lot of languages employ either Latin characters or completely different alphabets. ASCII is not addressing these characters, so character encoding is to be learnt to employ non ASCII characters.

ISO generated a series of character sets to handle with different national characters. The largely employed encoding is ISO – 8859-1 for English and other western eurocent languages document.

The underlying is the character set employed around the globe

Character Encoding in HTML Assignment Help Through Online Tutoring and Guided Sessions at MyAssignmentHelp

The Unicode consortium has developed a way to display all characters of various languages. So, Unicode Character encoding has to be employed to generate documents which use characters from multiple character sets. It also specifies encoding to work with a string in extra ways to build enough space for the large character set it encompasses. It is called as UTF 8 , UTF16 and UTF 32

The 1st 256 characters of Unicode set of characters related to the 256 characters of ISO – 8859 -1

HTML 4 processors must sustain UTF 8 and XML processors are believed to support UTF 8 and UTF 16, so all XHTML related processors must also support UTF 16