PHP htmlentities() function

Author: Eve Cole  Update Time: 2024-12-24 09:36:01

Example

Convert some characters into HTML entities:

<?php
$str = "<© W3CSçh°°¦§>";
echo htmlentities($str);
?>

The HTML output of the above code is as follows (view source code):

<!DOCTYPE html>
<html>
<body>
&lt;&copy; W3CS&ccedil;h&deg;&deg;&brvbar;&sect;&gt;
</body>
</html>

The browser output of the above code is as follows:

<© W3CSçh°°¦§>

Definition and usage

The htmlentities() function converts every character that has an HTML character entity equivalent into that entity.

Tip: To convert HTML entities back to characters, use the html_entity_decode() function.
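
As a rough illustration of the tip above, the sketch below encodes a string and decodes it again. The input string and variable names are made up for the example, and the flags and character set are passed explicitly so the result does not depend on the PHP version or configuration.

```php
<?php
// Illustrative input; assumes this file is saved as UTF-8.
$original = '<b>Café & "Bar"</b>';

// Explicit flags and encoding keep the behavior the same across PHP versions.
$flags = ENT_QUOTES | ENT_HTML401;

$encoded = htmlentities($original, $flags, 'UTF-8');
$decoded = html_entity_decode($encoded, $flags, 'UTF-8');

echo $encoded, "\n"; // &lt;b&gt;Caf&eacute; &amp; &quot;Bar&quot;&lt;/b&gt;
var_dump($decoded === $original); // bool(true)
```

Using the same flags and character set in both calls is what makes the round trip symmetric in this sketch.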

Tip: Use the get_html_translation_table() function to return the translation table used by htmlentities().
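
The following sketch shows one way to inspect that translation table. It assumes the HTML_ENTITIES table with ENT_QUOTES | ENT_HTML401 and UTF-8; the sample string is the one from the example at the top of this page.

```php
<?php
// Assumes this file is saved as UTF-8.
$flags = ENT_QUOTES | ENT_HTML401;

// Keys are characters, values are the entities that replace them.
$table = get_html_translation_table(HTML_ENTITIES, $flags, 'UTF-8');

echo $table['<'], "\n"; // &lt;
echo $table['©'], "\n"; // &copy;
echo $table["'"], "\n"; // &#039;

// For valid input, applying the table with strtr() should give the same
// result as htmlentities() called with the same flags and character set.
$str = "<© W3CSçh°°¦§>";
var_dump(strtr($str, $table) === htmlentities($str, $flags, 'UTF-8'));
```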

Syntax

htmlentities(string, flags, character-set, double_encode)

Parameter	Description
string	Required. Specifies the string to be converted.
flags	Optional. Specifies how to handle quotes, invalid encodings, and which document type to use. Before PHP 8.1 the default is ENT_COMPAT | ENT_HTML401; as of PHP 8.1 it is ENT_QUOTES | ENT_SUBSTITUTE | ENT_HTML401.
	Available quote types:
	ENT_COMPAT - Encodes double quotes only (default before PHP 8.1).
	ENT_QUOTES - Encodes both double and single quotes.
	ENT_NOQUOTES - Does not encode any quotes.
	Invalid encoding:
	ENT_IGNORE - Discards invalid encodings instead of having the function return an empty string. Avoid this flag, as it may have security implications.
	ENT_SUBSTITUTE - Replaces invalid encodings with the Unicode replacement character U+FFFD (UTF-8) or &#xFFFD; (other character sets) instead of returning an empty string.
	ENT_DISALLOWED - Replaces code points that are invalid in the specified document type with the Unicode replacement character U+FFFD (UTF-8) or &#xFFFD;.
	Additional flags specifying the document type to use:
	ENT_HTML401 - Default. Handles code as HTML 4.01.
	ENT_HTML5 - Handles code as HTML 5.
	ENT_XML1 - Handles code as XML 1.
	ENT_XHTML - Handles code as XHTML.
character-set	Optional. A string specifying the character set to be used. Allowed values:
	UTF-8 - Default (as of PHP 5.4). ASCII-compatible multi-byte 8-bit Unicode
	ISO-8859-1 - Western European
	ISO-8859-15 - Western European (adds the euro sign and the French and Finnish letters missing from ISO-8859-1)
	cp866 - DOS-specific Cyrillic character set
	cp1251 - Windows-specific Cyrillic character set
	cp1252 - Windows-specific Western European character set
	KOI8-R - Russian
	BIG5 - Traditional Chinese, mainly used in Taiwan
	GB2312 - Simplified Chinese, national standard character set
	BIG5-HKSCS - Big5 with Hong Kong extensions
	Shift_JIS - Japanese
	EUC-JP - Japanese
	MacRoman - Character set used by Mac OS
	Note: In versions prior to PHP 5.4, unrecognized character sets were ignored and replaced by ISO-8859-1. As of PHP 5.4, unrecognized character sets are ignored and replaced by UTF-8.
double_encode	Optional. A Boolean value that specifies whether to encode existing HTML entities. TRUE - Default. Every entity is converted again. FALSE - Existing HTML entities are not encoded again.
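
To make the double_encode parameter and the document-type flags more concrete, here is a small sketch. The input strings and variable names are invented for illustration; the comments show the output expected with the explicit flags used here.

```php
<?php
// Illustrative input that already contains one entity (&amp;).
$input = "Tom &amp; Jerry's <b>show</b>";
$flags = ENT_QUOTES | ENT_HTML401;

// double_encode = true (default): the existing &amp; is encoded again.
$twice = htmlentities($input, $flags, 'UTF-8', true);
echo $twice, "\n"; // Tom &amp;amp; Jerry&#039;s &lt;b&gt;show&lt;/b&gt;

// double_encode = false: the existing &amp; is left as it is.
$once = htmlentities($input, $flags, 'UTF-8', false);
echo $once, "\n"; // Tom &amp; Jerry&#039;s &lt;b&gt;show&lt;/b&gt;

// The document type changes how a single quote is written.
$html401 = htmlentities("'", ENT_QUOTES | ENT_HTML401, 'UTF-8');
$html5   = htmlentities("'", ENT_QUOTES | ENT_HTML5, 'UTF-8');
echo $html401, "\n"; // &#039;
echo $html5, "\n";   // &apos;
```

Passing FALSE for double_encode can be useful when the input may already have been encoded once, although it also means a literal "&amp;" typed by a user is not escaped further.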

Technical details

Return value:	Returns the converted string. If string contains an invalid encoding, an empty string is returned unless the ENT_IGNORE or ENT_SUBSTITUTE flag is set.
PHP version:	4+
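Changelog:	In PHP 8.1, the default value of the flags parameter changed from ENT_COMPAT to ENT_QUOTES | ENT_SUBSTITUTE | ENT_HTML401. In PHP 5.6, the default value of the character-set parameter changed to the value of the default_charset configuration option. In PHP 5.4, the default value of the character-set parameter changed to UTF-8, and ENT_SUBSTITUTE, ENT_DISALLOWED, ENT_HTML401, ENT_HTML5, ENT_XML1, and ENT_XHTML were added. In PHP 5.3, ENT_IGNORE was added. In PHP 5.2.3, the double_encode parameter was added. In PHP 4.1, the character-set parameter was added.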
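
The return-value rule for invalid encodings can be sketched as follows. The input is a made-up string holding a single ISO-8859-1 byte that is not valid UTF-8, and the flags are passed explicitly because the default flags differ between PHP versions.

```php
<?php
// "\xE9" is "é" in ISO-8859-1, but on its own it is not valid UTF-8.
$invalid = "caf\xE9";

// Neither ENT_IGNORE nor ENT_SUBSTITUTE: an empty string is returned.
$strict = htmlentities($invalid, ENT_QUOTES | ENT_HTML401, 'UTF-8');
var_dump($strict); // string(0) ""

// ENT_SUBSTITUTE: the invalid byte becomes U+FFFD (bytes EF BF BD in UTF-8).
$substituted = htmlentities($invalid, ENT_QUOTES | ENT_SUBSTITUTE | ENT_HTML401, 'UTF-8');
var_dump($substituted === "caf\xEF\xBF\xBD"); // bool(true)

// ENT_IGNORE: the invalid byte is silently dropped (not recommended).
$ignored = htmlentities($invalid, ENT_QUOTES | ENT_IGNORE | ENT_HTML401, 'UTF-8');
var_dump($ignored); // string(3) "caf"
```

An unexpectedly empty result from htmlentities() is therefore often a sign that the input is not in the character set passed to the function.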

More examples

Example 1

Convert some characters into HTML entities:

<?php
$str = "Jane & 'Tarzan'";
echo htmlentities($str, ENT_COMPAT); // Will only convert double quotes
echo "<br>";
echo htmlentities($str, ENT_QUOTES); // Converts double and single quotes
echo "<br>";
echo htmlentities($str, ENT_NOQUOTES); // Does not convert any quotes
?>

The HTML output of the above code is as follows (view source code):

<!DOCTYPE html>
<html>
<body>
Jane &amp; 'Tarzan'<br>
Jane &amp; &#039;Tarzan&#039;<br>
Jane &amp; 'Tarzan'
</body>
</html>

The browser output of the above code is as follows:

Jane & 'Tarzan'
Jane & 'Tarzan'
Jane & 'Tarzan'

Example 2

Convert some characters to HTML entities by using the Western European character set:

<?php
// Assumes this file is saved as ISO-8859-1, so that $str holds ISO-8859-1 bytes
$str = "My name is Øyvind Åsane. I'm Norwegian.";
// ENT_QUOTES converts both double and single quotes; the input is read as the Western European character set
echo htmlentities($str, ENT_QUOTES, "ISO-8859-1");
?>

The HTML output of the above code is as follows (view source code):

<!DOCTYPE html>
<html>
<body>
My name is &Oslash;yvind &Aring;sane. I&#039;m Norwegian.
</body>
</html>

The browser output of the above code is as follows:

My name is Øyvind Åsane. I'm Norwegian.