This is a Python version of the samesame repo for generating homograph strings
#Large Language Models# Code and Resources for "LLM-Powered Grapheme-to-Phoneme Conversion: Benchmark and Case Study", introducing methods to leverage LLMs for G2P tasks without additional training, featuring Sentence-Bench ...
Given a TLD zone file, PhishCanary extracts Internationalized Domain Names (IDNs) that are homoglyphs of specified target domain names.
Benchmarking notebooks for various Persian G2P models, comparing their performance on the SentenceBench dataset, including Homo-GE2PE and Homo-T5.
Get all possible homograph punycoded domains of a particular domain
HomoRich: The first large-scale Persian homograph dataset for G2P conversion, featuring 528K annotated sentences with balanced pronunciation variants and dual phoneme representations.
Solving various image processing, machine learning, and deep learning problems. Assignments for Computer Vision Course in UGR.
A project to protect you from homograph phishing attacks like "góog1e.com" or "fącebook.com"
This is a simple JavaScript-based project that checks a given URL for homograph, homoglyph, and IDN tricks, as well as other suspicious formats.
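
Several of the projects above generate or detect punycoded homograph domains. A minimal Python sketch of that idea follows; it is not taken from any of the listed repositories. The three-entry `CONFUSABLES` table and the function names are hypothetical, chosen only for illustration, and a real tool would use a much larger confusables list. The encoding relies on Python's built-in `idna` codec.

```python
from itertools import product

# Hypothetical, deliberately tiny table: Latin letter -> Cyrillic look-alike.
CONFUSABLES = {"a": "\u0430", "e": "\u0435", "o": "\u043e"}


def homograph_variants(label):
    """Yield every spelling of `label` with at least one look-alike swapped in."""
    choices = [(ch, CONFUSABLES[ch]) if ch in CONFUSABLES else (ch,) for ch in label]
    for combo in product(*choices):
        candidate = "".join(combo)
        if candidate != label:  # skip the unmodified original
            yield candidate


def punycoded_variants(domain):
    """Return the ASCII (punycode) form of each variant of the first label."""
    name, _, rest = domain.partition(".")
    return [f"{v}.{rest}".encode("idna").decode("ascii") for v in homograph_variants(name)]


def is_idn(domain):
    """True if any label carries the 'xn--' ASCII-compatible-encoding prefix."""
    return any(label.startswith("xn--") for label in domain.lower().split("."))


if __name__ == "__main__":
    for variant in punycoded_variants("apple.com"):
        print(variant, is_idn(variant))
```

The `is_idn` check only flags that a domain is internationalized; deciding whether it imitates a specific target would still require comparing the decoded name against that target with a confusables table.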