Question 1

What is a homoglyph attack?

Accepted Answer

A homoglyph attack (also called a homograph attack) uses Unicode characters that look identical or very similar to ASCII characters to impersonate legitimate domain names, email addresses, or text. For example, the Cyrillic letter а (U+0430) is indistinguishable from the Latin letter a (U+0061) in most fonts, even though the two are different code points. A phishing site could register pаypal.com with the Cyrillic а in place of the first Latin a, and a human reader would see it as paypal.com.
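To make the distinction concrete, here is a minimal Python sketch that prints the code point and Unicode name of each letter and shows that the two domain strings compare as different. The helper name `describe` is just an illustrative choice, not part of any library.

```python
import unicodedata

LATIN_A = "\u0061"      # Latin small letter a
CYRILLIC_A = "\u0430"   # Cyrillic small letter a

def describe(ch: str) -> str:
    """Return the code point and official Unicode name of one character."""
    return f"U+{ord(ch):04X} {unicodedata.name(ch)}"

legit = "paypal.com"
spoof = "p" + CYRILLIC_A + "ypal.com"   # renders like paypal.com

print(describe(LATIN_A))      # U+0061 LATIN SMALL LETTER A
print(describe(CYRILLIC_A))   # U+0430 CYRILLIC SMALL LETTER A
print(legit == spoof)         # False
```

The strings have the same length and look the same on screen, but any byte-level or code-point-level comparison treats them as unrelated.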

Question 2

How do browsers protect against homoglyph domain attacks?

Accepted Answer

Modern browsers fall back to displaying an internationalized domain name (IDN) in its Punycode form (e.g., xn--pypal-4ve.com) when a label mixes scripts in a suspicious way or otherwise fails the browser's IDN display policy. Chrome, Firefox, and Safari each apply their own heuristics to detect confusable domains, so the same domain may be rendered differently from one browser to another. However, domains whose labels use only characters from a single confusable script (all-Cyrillic, for example) may still display as Unicode in some browsers.
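The following Python sketch illustrates the two ideas in this answer: converting an IDN to its Punycode form, and flagging a label that mixes scripts. It is not how any browser implements its policy. The function names are hypothetical, and because Python's standard library does not expose the Unicode Script property, the script is approximated from the first word of each character's Unicode name, which is an assumption that only holds for common alphabets.

```python
import unicodedata

def to_ascii(domain: str) -> str:
    """Convert a Unicode domain to its ASCII (Punycode) form via the stdlib idna codec."""
    return domain.encode("idna").decode("ascii")

def scripts_in_label(label: str) -> set[str]:
    """Approximate the scripts used in a label (heuristic: first word of the char name)."""
    return {
        unicodedata.name(ch, "UNKNOWN").split()[0]
        for ch in label
        if ch.isalpha()
    }

def is_mixed_script(domain: str) -> bool:
    """True if any single label contains letters from more than one script."""
    return any(len(scripts_in_label(label)) > 1 for label in domain.split("."))

spoof = "p\u0430ypal.com"            # Cyrillic a in the first label
print(to_ascii(spoof))               # xn--pypal-4ve.com
print(is_mixed_script(spoof))        # True
print(is_mixed_script("paypal.com")) # False
```

A label written entirely in Cyrillic, such as "\u0440\u0430\u0443", passes this check because it contains only one script, which is the single-script gap described above.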

Question 3

What are Unicode confusable characters?

Accepted Answer

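The Unicode Consortium maintains an official confusables.txt data file, published as part of Unicode Technical Standard #39 (Unicode Security Mechanisms), that maps each confusable character to a representative "prototype" it can be visually mistaken for. This includes Cyrillic letters that look like Latin letters, Greek letters that resemble Latin letters, and many other cross-script lookalikes. Two strings are treated as confusable when they reduce to the same sequence of prototypes, called a skeleton. The data is used by security tools and browser implementations to detect potential spoofing.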
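As a rough illustration of skeleton comparison, the Python sketch below maps characters to prototypes and compares the results. The `CONFUSABLE_SUBSET` table is a small hand-picked sample chosen for this example, not the real confusables.txt, and the function names are hypothetical; a real tool would load the full data file.

```python
import unicodedata

# Hand-picked illustrative subset; the real confusables.txt has thousands of entries.
CONFUSABLE_SUBSET = {
    "\u0430": "a",  # CYRILLIC SMALL LETTER A
    "\u0435": "e",  # CYRILLIC SMALL LETTER IE
    "\u043e": "o",  # CYRILLIC SMALL LETTER O
    "\u0440": "p",  # CYRILLIC SMALL LETTER ER
    "\u0441": "c",  # CYRILLIC SMALL LETTER ES
    "\u0443": "y",  # CYRILLIC SMALL LETTER U
    "\u0445": "x",  # CYRILLIC SMALL LETTER HA
    "\u03bf": "o",  # GREEK SMALL LETTER OMICRON
}

def skeleton(text: str) -> str:
    """Normalize to NFD, replace each character with its prototype, normalize again."""
    decomposed = unicodedata.normalize("NFD", text)
    mapped = "".join(CONFUSABLE_SUBSET.get(ch, ch) for ch in decomposed)
    return unicodedata.normalize("NFD", mapped)

def is_confusable(a: str, b: str) -> bool:
    """True if two different strings reduce to the same skeleton."""
    return a != b and skeleton(a) == skeleton(b)

print(is_confusable("paypal", "p\u0430ypal"))  # True
print(is_confusable("paypal", "paypal"))       # False
```

Comparing skeletons instead of raw strings means one check covers every combination of substitutions that the table knows about.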

Question 4

How can I protect my domain from homoglyph spoofing?

Accepted Answer

Register defensive variants of your domain name that use IDN homoglyphs, starting with the most convincing lookalikes. Use SPF, DKIM, and DMARC email authentication to prevent spoofing of your own domain; note that these do not stop mail sent from a separately registered lookalike domain. Monitor Certificate Transparency logs to detect certificates issued for lookalike domains. Train users to check URLs carefully and hover over links before clicking. Some registries also restrict mixed-script or confusable registrations, but coverage varies by TLD, so this should not be relied on as the only safeguard.
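One way to build a list of candidates for defensive registration or monitoring is to enumerate single-character substitutions of your own label. The Python sketch below does this under stated assumptions: the `LOOKALIKES` table is a tiny hand-picked sample rather than the full Unicode confusables data, and the function names are hypothetical.

```python
# Hand-picked illustrative lookalikes for a few Latin letters.
LOOKALIKES = {
    "a": ["\u0430"],            # Cyrillic a
    "c": ["\u0441"],            # Cyrillic es
    "e": ["\u0435"],            # Cyrillic ie
    "o": ["\u043e", "\u03bf"],  # Cyrillic o, Greek omicron
    "p": ["\u0440"],            # Cyrillic er
}

def single_substitution_variants(label: str) -> list[str]:
    """Return every variant of label with exactly one character swapped for a lookalike."""
    variants = []
    for i, ch in enumerate(label):
        for sub in LOOKALIKES.get(ch, []):
            variants.append(label[:i] + sub + label[i + 1:])
    return variants

def watchlist(label: str, tld: str) -> list[str]:
    """Punycode form of each variant, suitable for registration checks or log searches."""
    return [
        (variant + "." + tld).encode("idna").decode("ascii")
        for variant in single_substitution_variants(label)
    ]

for name in watchlist("paypal", "com"):
    print(name)
```

The resulting ASCII names can be checked against registrar availability or searched for in Certificate Transparency monitoring output; multi-character substitutions are left out here to keep the list short.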

Question 5

Is homoglyph generation illegal?

Accepted Answer

Generating homoglyph strings is a legitimate security research and testing technique. This tool is intended for security professionals testing their defenses, developers who want to understand the attack surface, and educators. Using homoglyphs to actually deceive users, for example by registering a phishing domain, is illegal in most jurisdictions under fraud, computer misuse, or trademark laws, and bad-faith registrations can be challenged under ICANN's domain dispute policies.

Homoglyph Detector
