I often read that Pitman-Yor Process has power-law properties. Let's say I am interested in modelling English word's distribution (which follows power-law). Using CRP metaphor, words come and get assigned to tables using CRP probabilities. Now I want to draw samples from this CRP to show that it actually captures power-law properties. How would I conduct such sampling?
What I thought initially was to treat it as sampling from a multinomial distribution defined by the seating arrangements (tables, #ofCustomers). But this doesn't seem to be correct.
Any idea?
[link][3 comments]