Jorge_Stolfi > 10 hours ago
(Yesterday, 02:48 PM)quimqu Wrote: You are not allowed to view links. Register or Login to view.I don’t like throwing out baseless theories or overcomplicating things, but I can’t help noticing that the characters most common at the beginnings of lines (except the first lines of paragraphs) are also the ones that most resemble Arabic numerals in the Voynich.
oaken > 2 hours ago
(Yesterday, 02:48 PM)quimqu Wrote: You are not allowed to view links. Register or Login to view.I don’t like throwing out baseless theories or overcomplicating things, but I can’t help noticing that the characters most common at the beginnings of lines (except the first lines of paragraphs) are also the ones that most resemble Arabic numerals in the Voynich.
(10 hours ago)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.However, there is the problem that any position-dependent cipher would almost surely encode each occurrence of the same word type as two or more distinct words. That would flatten out the frequency x rank plot ("Zipf plot") at the high frequency end -- which does not seem to be the case.
quimqu > 2 hours ago
(10 hours ago)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.One could test that theory by separating all lines into subsets based on their first glyph, computing the word frequency distribution of each subset (excluding the first word of each line) and comparing those distributions.
oaken > 2 hours ago
(2 hours ago)quimqu Wrote: You are not allowed to view links. Register or Login to view.(10 hours ago)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.One could test that theory by separating all lines into subsets based on their first glyph, computing the word frequency distribution of each subset (excluding the first word of each line) and comparing those distributions.
I did a fast check. The cosine similarity plot suggests that the first character of a line doesn’t create a separate "language" within the manuscript. It may nudge the line in a certain direction, but overall the vocabulary quickly falls back into the same general pattern as the rest of the text.