It's always nice to get feedback and I've already recieved some for Classifier4J – which is kind of scary for unreleased code.
Mike was kind enough to help me out with an extract of the Javablogs database, so now I have about 1.5 million words to analyse.