Question 1

What is a Unicode homoglyph?

Accepted Answer

A homoglyph is a character that looks visually identical or very similar to another character but has a different Unicode code point. For example, the Cyrillic 'а' (U+0430) looks identical to the Latin 'a' (U+0061) but is a completely different character. Homoglyphs are used in domain spoofing, text watermarking, and obfuscation.

Question 2

What are zero-width characters?

Accepted Answer

Zero-width characters are Unicode code points that take up no visible space in rendered text. Examples include Zero-Width Space (U+200B), Zero-Width Non-Joiner (U+200C), and Zero-Width Joiner (U+200D). They can be inserted between characters to break string matching and copy-paste detection without affecting visual appearance.

Question 3

What is HTML entity encoding?

Accepted Answer

HTML entity encoding replaces characters with their HTML numeric or named entities. For example, 'A' becomes '&#65;' and '&' becomes '&'. This is useful for embedding text in HTML without triggering parsing, or for obfuscating email addresses from simple scrapers that don't decode entities.

Question 4

Can obfuscated text be decoded?

Accepted Answer

Yes. Zero-width characters can be stripped by removing known Unicode code points. HTML entities can be decoded by parsing them. Homoglyphs are harder to reverse automatically since the mapping is not always 1:1, but this tool's Decode mode strips zero-width characters and decodes HTML entities from obfuscated text.

Question 5

What are common uses for text obfuscation?

Accepted Answer

Common uses include: watermarking documents to track leaks (each copy gets a unique zero-width pattern), protecting email addresses from spam harvesters, bypassing keyword filters in text processing pipelines, testing how applications handle unusual Unicode input, and generating visually distinctive text for creative or artistic purposes.

Text String Obfuscator

How to Use the String Obfuscator

Obfuscation Methods Explained

Homoglyph Substitution

Zero-Width Character Injection

HTML Entity Encoding

Decode Mode

Frequently Asked Questions