This is probably both far deeper than you need and not broad enough to cover your use case, but the Unicode Consortium has had to deal with attacks against internationalised domain names and came up with this list of confusables (homoglyphs: characters with the same or similar rendering):
http://www.unicode.org/Public/security/latest/confusables.txt
Might make a starting point at least.
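
If it helps, here is a rough sketch of how that file could be used. It assumes each data line has the layout `source ; target ; type # comment`, with code points written in hex, and it uses a few hand-typed sample lines rather than the real file. The names `parse_confusables` and `skeleton` are my own, and the skeleton step only loosely follows the idea of mapping every character to a prototype, so treat it as an illustration rather than a full implementation of the Unicode recommendations.

```python
import unicodedata

# Hand-typed sample lines in the assumed layout of confusables.txt:
#   source ; target ; type # comment
SAMPLE = """\
# sample data, not the real file
0430 ;	0061 ;	MA	# CYRILLIC SMALL LETTER A -> LATIN SMALL LETTER A
0435 ;	0065 ;	MA	# CYRILLIC SMALL LETTER IE -> LATIN SMALL LETTER E
0440 ;	0070 ;	MA	# CYRILLIC SMALL LETTER ER -> LATIN SMALL LETTER P
"""


def parse_confusables(lines):
    """Build a dict mapping each source character to its target string."""
    table = {}
    for line in lines:
        # Drop a possible byte order mark, then the trailing comment.
        line = line.lstrip("\ufeff").split("#", 1)[0].strip()
        if not line:
            continue
        fields = [field.strip() for field in line.split(";")]
        if len(fields) < 2:
            continue
        source = "".join(chr(int(cp, 16)) for cp in fields[0].split())
        target = "".join(chr(int(cp, 16)) for cp in fields[1].split())
        table[source] = target
    return table


def skeleton(text, table):
    """Decompose, replace each character by its prototype, decompose again."""
    decomposed = unicodedata.normalize("NFD", text)
    mapped = "".join(table.get(ch, ch) for ch in decomposed)
    return unicodedata.normalize("NFD", mapped)


table = parse_confusables(SAMPLE.splitlines())
spoofed = "p\u0430yp\u0430l"  # Latin letters with Cyrillic "a" mixed in
looks_alike = skeleton(spoofed, table) == skeleton("paypal", table)
print(looks_alike)
```

Two strings whose skeletons match are candidates for being visually confusable. To try it against the real data, download the file and pass the open file object (for example `open(path, encoding="utf-8-sig")`) to `parse_confusables` instead of the sample lines.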