

The pink line is the theoretical Zipf distribution, which is found to be `f/n^0.94`, where f is the frequency of the top-ranked word and n is the rank of the word. The dark blue data points represent the top 20 occurring English words (with the first few labeled). This gives us a hyperbola, that we met before.) I have included the "Theoretical Zipf Distribution, based on the n-th ranked word occurring approximately `1/n` times the frequency of the highest ranked word. This Corpus is the count of how often one million words were used in a variety of books, newspapers and other publications.

(The first 20 words in the Brown Corpus, published in 1967. The next ranked word, "of", occurred around `3.6%` of the time (or about `1/2` as often as the top-ranked word.) The third most popular word was "and", with a frequency of `2.8%`, or roughly `1/3` of the frequency of the top ranked word. The most common word, "the" occurred around `70,000` times (or `7%` of the million words counted). The table is based on the Brown Corpus, a careful study of a million words from a wide variety of sources including newspapers, books, magazines, fiction, government documents, comedy and academic publications. Zipf originally developed his law in response to the observation that the frequency of words was inversely proportional to the rank of each word.įor example, the most common 20 words in English are listed in the following table.


Application 2: Zipf DistributionsĬonsider the most common words in English. Graph of `y=100(0.82)^t` on semilogarithmic axes. Thanks much in advance.0 0.2 0.1 2 1 20 10 100 5 10 15 20 25 30 p t Open image in a new page Perhaps I simply haven't found the right search phrase(s).Īt any rate, the question probably doesn't need an MWE, but to provide one for a simple 'answer by demonstration', here is a simple plot where I'd like to decrease the length (distance from top of tick mark to axis) by (say) 50%, relative to the default length.
#Mathematica 7 bold axes numbers how to#
While there are any number of posts on how to fiddle (from the Latin) with tick labels, or tick placements, or tick color, I haven't been able to find anything (in terms of worked examples, or in the documentation), for how to change the length and width of the ticks themselves.
