It is an interesting coincidence that Eitan and I were discussing his research proposal and got to talking about multi-word tokens and their role in natural language processing, and that a feed item (Google Patent Granted on Semantic Units (Meaningful Compounds)) on the SEO by the SEA blog also discusses this issue, though it calls the concept a semantic unit rather than a multi-word token.
The blog post is an interesting read, as are most of the posts on that blog.