davidjackson > 09-02-2018, 06:58 PM
Emma May Smith > 09-02-2018, 08:57 PM
davidjackson > 09-02-2018, 09:33 PM
MarcoP > 10-02-2018, 11:23 AM
(09-02-2018, 06:55 PM)davidjackson Wrote: You are not allowed to view links. Register or Login to view. A confession - my excel formula was pointing to the wrong column, so the above are wrong. (The formula prediction was running against tokens found, no total types). So my above post was a waste of everybody's time, sorry.
davidjackson > 10-02-2018, 09:12 PM
Torsten > 11-02-2018, 02:21 AM
davidjackson > 11-02-2018, 10:34 AM
Quote: The computer simulation is using a constant generation algorithm. Moreover for this implementation some aspects are simplified. Therefore a constant graph is expected.
Quote: The result demonstrates again that the text of the VMS is not homogenous. Therefore Heap's law didn't apply here.Actually, I would argue that Heap's is applying here, as we see in the the stars graph (and herbal in a previous post). Remember we are dealing with a statistical law not a true power law, and as such variation is to be expected. But the prediction line and real plots both have a weak constant association.
Torsten > 11-02-2018, 12:09 PM
MarcoP > 11-02-2018, 12:25 PM
(10-02-2018, 09:12 PM)davidjackson Wrote: You are not allowed to view links. Register or Login to view.Since the ratio of creation is constant, I get two rapidly diverging lines, as illustrated when I overlay two trend lines and expand them outwards:
davidjackson > 11-02-2018, 12:44 PM
Quote:You have to split the text of the VMS into different parts. If you then compare the graphs for the different sections this can be interesting. It seems for instance that the graph for herbal pages in Currier A is similar to the graph for herbal pages in Currier B. This is an interesting result. Even if they use different word types they have something in common.This is what I have been trying to do, in a very roundabout way. It will be interesting to see how the labels respond, for example.
Quote:MarcoP: David, what do you mean when you say that the ratio of creation is "constant"? Do you mean that the number of unique words increases linearly with text length?