Language has a pretty interesting property known as Zipf’s Law. That is, language data (and even subsets of language data) have a Zipfian distribution. There are a small number of highly frequent words, and a large number of highly infrequent words. Moreover, the frequent words tend to be short, grammatical (words that are grammatically required but … Continue reading Zipf’s Law
Copy and paste this URL into your WordPress site to embed
Copy and paste this code into your site to embed