ReneZ > 22-04-2026, 03:48 PM
Grove > 22-04-2026, 04:05 PM
JoJo_Jost > 22-04-2026, 04:26 PM
JoJo_Jost > 22-04-2026, 04:35 PM
oeesordy > 22-04-2026, 06:12 PM
Quote:JoJo If we group all tokens from position 2 onwards (32,425 words), they have an average length of 4.49 and an anomaly rate of 28.2%. Even so, the first word of each line deviates significantly, with an average length of 4.91 and an anomaly rate of 41.5%.
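For readers who want to reproduce this kind of comparison, here is a minimal sketch of the grouping described in the quote. It assumes the transcription is available as a list of lines with space-separated tokens. The name `position_stats` and the `is_anomalous` predicate are hypothetical: the thread does not say how the anomaly rate is defined, so the predicate below is only a placeholder, and the sample lines are toy input, not manuscript data.

```python
from statistics import mean

def position_stats(lines, is_anomalous):
    """Compare line-initial tokens with tokens from position 2 onwards."""
    first, rest = [], []
    for line in lines:
        tokens = line.split()
        if not tokens:
            continue  # skip empty lines
        first.append(tokens[0])
        rest.extend(tokens[1:])

    def summarize(tokens):
        if not tokens:
            return {"count": 0, "avg_len": None, "anomaly_rate": None}
        return {
            "count": len(tokens),
            "avg_len": mean(len(t) for t in tokens),
            "anomaly_rate": sum(1 for t in tokens if is_anomalous(t)) / len(tokens),
        }

    return {"first": summarize(first), "rest": summarize(rest)}

# Toy input and a placeholder anomaly rule (assumption, not the thread's definition)
sample = ["qokeedy shedy daiin", "ykal ar", ""]
stats = position_stats(sample, is_anomalous=lambda t: t.startswith("y"))
print(stats["first"], stats["rest"])
```

Keeping the anomaly rule as a parameter means the same split can be reused once the actual definition is known.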
dashstofsk > 22-04-2026, 07:28 PM
JoJo_Jost > 23-04-2026, 06:41 AM
(22-04-2026, 07:28 PM)dashstofsk Wrote: I am a bit puzzled by your figures for o_rate. In the manuscript the frequency of character o is greater than the frequency for character y. Yet the o_rate values are less than the y_ending values. It appears that o_rate is the character frequency. But qo_rate seems to be the frequency of words starting qo. To make everything consistent would it not help to have o_rate as the frequency of words with that character?
Also what are the columns t and p in your table?
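The distinction raised in the quote can be illustrated with a small sketch. It contrasts a character-level frequency (share of all characters) with word-level rates (share of words matching a condition). The helper names `char_frequency` and `word_rate` and the four toy tokens are assumptions for illustration only; they are not the definitions used for the table under discussion.

```python
def char_frequency(words, ch):
    """Share of all characters in `words` that are `ch`."""
    total = sum(len(w) for w in words)
    return sum(w.count(ch) for w in words) / total if total else 0.0

def word_rate(words, predicate):
    """Share of words for which `predicate` is true."""
    return sum(1 for w in words if predicate(w)) / len(words) if words else 0.0

# Toy EVA-like tokens (not real data)
words = ["qokol", "chedy", "okaiin", "dy"]

o_char = char_frequency(words, "o")                        # 3 of 18 characters
o_word = word_rate(words, lambda w: "o" in w)              # 2 of 4 words
y_end = word_rate(words, lambda w: w.endswith("y"))        # 2 of 4 words
qo_start = word_rate(words, lambda w: w.startswith("qo"))  # 1 of 4 words
print(o_char, o_word, y_end, qo_start)
```

In this toy list o occurs more often than y as a character (3 versus 2), yet the character-level o rate is lower than the word-level y-ending rate, because the two figures use different denominators. Computing every column per word would make them directly comparable.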
JoJo_Jost > 23-04-2026, 06:53 AM
JoJo_Jost > 23-04-2026, 07:26 AM
(or not)
dashstofsk > 23-04-2026, 08:46 AM