Stable Random Projection
I have a new article out this month on dimensionality reduction for massive digital libraries. Because the technique has applications beyond the specific tasks outlined there, I want to link to a few things here.
The article in Cultural Analytics.
Instructions for making the best use of those features in your own projects, in Creating Data.