Here is what I was looking for: http://untroubled.org/spam/
This archive has around a gigabyte of compressed accumulated spam messages dating 1998 – 2011. Now I just need to get non-spam email. So I’ll just query my own Gmail for that using the getmail program and the tutorial at mattcutts.com