Represent documents in semantic word space and compare distances
Important words occur more frequently within a document than across other documents
Subtract each word's frequency in the target text from its frequency in the source text. Take the top n words by difference.
Compare the distance between texts
Compare topic-word frequencies between texts using t-tests
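
The steps above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not a reference implementation: the notes do not say whether frequencies are raw or relative (relative is assumed here), a plain bag-of-words frequency vector stands in for the "semantic word space", cosine distance is one possible distance measure, and Welch's t statistic is one possible form of the t-test. The function names (`relative_freqs`, `top_keywords`, `cosine_distance`, `welch_t`) are hypothetical.

```python
import math
import re
from collections import Counter
from statistics import mean, variance


def relative_freqs(text):
    """Map each lowercase word to its share of all tokens in the text."""
    tokens = re.findall(r"[a-z']+", text.lower())
    counts = Counter(tokens)
    total = sum(counts.values())
    return {w: c / total for w, c in counts.items()} if total else {}


def top_keywords(source, target, n=10):
    """Top n words by (frequency in source) minus (frequency in target)."""
    src, tgt = relative_freqs(source), relative_freqs(target)
    diff = {w: f - tgt.get(w, 0.0) for w, f in src.items()}
    # Largest difference first; ties are broken alphabetically.
    return sorted(diff, key=lambda w: (-diff[w], w))[:n]


def cosine_distance(a, b):
    """1 - cosine similarity of two sparse frequency vectors (dicts)."""
    dot = sum(f * b.get(w, 0.0) for w, f in a.items())
    norm_a = math.sqrt(sum(f * f for f in a.values()))
    norm_b = math.sqrt(sum(f * f for f in b.values()))
    # An empty text has no direction; treat it as maximally distant.
    return 1.0 - dot / (norm_a * norm_b) if norm_a and norm_b else 1.0


def welch_t(xs, ys):
    """Welch's t statistic for two samples of per-document word frequencies.

    Assumes each sample has at least two values and nonzero pooled variance.
    """
    se = math.sqrt(variance(xs) / len(xs) + variance(ys) / len(ys))
    return (mean(xs) - mean(ys)) / se


if __name__ == "__main__":
    source = "the cat sat on the mat the cat purred"
    target = "the dog sat on the log"
    print(top_keywords(source, target, n=2))
    print(cosine_distance(relative_freqs(source), relative_freqs(target)))
    # Hypothetical per-document frequencies of one topic word in two groups.
    print(welch_t([0.01, 0.02, 0.03], [0.02, 0.03, 0.04]))
```

Only the t statistic is computed here; turning it into a p-value needs the t distribution, for which a statistics library would normally be used.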