Convert HTML entities to characters:
<?php$str = "<© W3CSçh°°¦§>";echo html_entity_decode($str);?>The HTML output of the above code is as follows (view source code):
<!DOCTYPE html><html><body><© W3CSçh°°¦§></body></html>The browser output of the above code is as follows:
<© W3CSçh°°¦§>The html_entity_decode() function converts HTML entities into characters.
The html_entity_decode() function is the inverse of the htmlentities() function.
html_entity_decode( string,flags,character-se t)
parameter | describe |
---|---|
string | Required. Specifies the string to decode. |
flags | Optional. Specifies how quotation marks are handled and which document type is used. Available quote types: ENT_COMPAT - Default. Only double quotes are decoded. ENT_QUOTES - Decode double and single quotes. ENT_NOQUOTES - Do not decode any quotes. Additional flags specifying the document type to use: ENT_HTML401 - Default. Code processed as HTML 4.01. ENT_HTML5 - code processed as HTML 5. ENT_XML1 - Code processed as XML 1. ENT_XHTML - as XHTML processing code. |
character-set | Optional. A string specifying the character set to be used. Allowed values: UTF-8 - Default. ASCII compatible multi-byte 8-bit Unicode ISO-8859-1 - Western Europe ISO-8859-15 - Western Europe (added euro symbol + French and Finnish letters missing from ISO-8859-1) cp866 - DOS-specific Cyrillic character set cp1251 - Windows-specific Cyrillic character set cp1252 - Windows-specific Western European character set KOI8-R - Russian BIG5 - Traditional Chinese, mainly used in Taiwan GB2312 - Simplified Chinese, national standard character set BIG5-HKSCS - Big5 with Hong Kong extension Shift_JIS - Japanese EUC-JP - Japanese MacRoman - Character set used by the Mac operating system Note: In versions prior to PHP 5.4, unrecognized character sets were ignored and replaced by ISO-8859-1. As of PHP 5.4, unrecognized character sets are ignored and replaced by UTF-8. |
Return value: | Returns the converted string. |
---|---|
PHP version: | 4.3.0+ |
Update log: | In PHP 5, the default value of the character-set parameter changed to UTF-8. In PHP 5.4, additional flags have been added to specify the document type used: ENT_HTML401, ENT_HTML5, ENT_XML1, and ENT_XHTML. In PHP 5.0, support for multibyte encodings was added. |
Convert some HTML entities to characters:
<?php$str = "Jane & 'Tarzan'";echo html_entity_decode($str, ENT_COMPAT); // Will only convert double quotesecho "<br>";echo html_entity_decode($str, ENT_QUOTES) ; // Converts double and single quotesecho "<br>";echo html_entity_decode($str, ENT_NOQUOTES); // Does not convert any quotes?>The HTML output of the above code is as follows (view source code):
<!DOCTYPE html><html><body>Jane & 'Tarzan'<br>Jane & 'Tarzan'<br>Jane & 'Tarzan'</body></html>The browser output of the above code is as follows:
Jane & 'Tarzan'Jane & 'Tarzan'Jane & 'Tarzan'Convert some HTML entities to characters using the Western European character set:
<?php$str = "My name is Øyvind Åsane. I'm Norwegian.";echo html_entity_decode($str, ENT_QUOTES, "ISO-8859-1");?>The HTML output of the code above will be (View Source):
<!DOCTYPE html><html><body>My name is Øyvind Åsane. I'm Norwegian.</body></html>The browser output of the above code is as follows:
My name is Øyvind Åsane. I'm Norwegian.