Web Science/Part2: Emerging Web Properties/Advanced statistical descriptive models for the Web/The Zipf law for text/quiz

From Wikiversity
Jump to navigation Jump to search

What do you know about Zipf law?

Plotting the rank of words against the frequency appear as a straight line
the word rank multiplied by its frequency is supposed to be roughly constant
on the simple english wikipedia dataset the law only seams to hold for the top ranked words
Zipf's law has been falsified for many years and is only taught for historical reasons