It is an interesting coincidence: Eitan and I were discussing his research proposal and got to talking about multi-word tokens and their role in natural language processing, and then a feed item (Google Patent Granted on Semantic Units (Meaningful Compounds)) on the SEO by the SEA blog turned out to discuss the same issue, though it calls it a semantic unit rather than a multi-word token.
The blog post is an interesting read, as are most of the posts on that blog.
Thanks for the link, Shlomo,
I will read it as soon as I get back from miluim.