JustConvertAll-in-One Convert

HTML Entity Decoder

Decode HTML entities back to plain text instantly online. Supports all named entities (&, <, >), numeric ('), and hex (') references. Free — runs in your browser.

Related Tools

Advertisement

HTML entity decoding converts entity references back to their original characters: &amp; → &, &lt; → <, &copy; → ©, &#65; → A, and &#x41; → A. There are over 2,000 named HTML5 entities covering Latin characters, Greek symbols, mathematical operators, arrows, typographic punctuation, and emoji. Numeric entities (decimal and hexadecimal) cover the full Unicode range.

HTML-encoded text appears in API responses from content management systems, RSS feed item bodies, scraping output from HTML parsers that preserve entity references, HTML email content, and data exported from WYSIWYG editors like TinyMCE or CKEditor. When this text is processed as plain text (stored in a database, compared, or displayed in a non-HTML context), the entities must be decoded first.

This tool decodes all named HTML5 entities, decimal numeric references (&#160;), and hexadecimal numeric references (&#xA0;). Decoding is performed via the browser's own HTML parser: the input is assigned to a temporary element's innerHTML, and .textContent is read back — guaranteeing correct decoding of every entity the browser supports.

Common Use Cases

Cleaning RSS feed content for plain-text processing

RSS 2.0 feed item descriptions are HTML fragments with entities like &amp;, &lt;, and &quot; preserved by the feed generator. A news aggregation pipeline that extracts article summaries for full-text search indexing must decode these entities before indexing, otherwise the search index contains literal &amp; strings that will never match user queries.

Normalizing CMS API response text

WordPress's REST API returns post titles and content with HTML entities preserved (e.g., AT&amp;T for 'AT&T', &#8211; for an en dash). When consuming these responses in a mobile app or email newsletter template that renders plain text rather than HTML, decoding entities here produces the clean text string the renderer expects.

Fixing double-encoded content in databases

Legacy systems that ran user input through HTML encoding before storing it in a database, and then encoded it again on output, produce strings like &amp;amp;lt;. Identifying the degree of encoding and decoding iteratively here clarifies how many decode passes the data migration script must apply to recover the original unencoded text.

Extracting plain text from scraped HTML

Web scrapers using Cheerio, BeautifulSoup, or Puppeteer sometimes return text nodes with HTML entities still intact, especially from server-rendered pages with improper encoding handling. Decoding the scraped text here converts &nbsp; to spaces, &mdash; to em dashes, and named entities to their proper Unicode characters before storing in a database.

About HTML Entity Decoding

Common use cases