Jorge_Stolfi > 17-09-2025, 02:27 AM
(16-09-2025, 02:48 PM)quimqu Wrote: You are not allowed to view links. Register or Login to view.I don’t like throwing out baseless theories or overcomplicating things, but I can’t help noticing that the characters most common at the beginnings of lines (except the first lines of paragraphs) are also the ones that most resemble Arabic numerals in the Voynich.
oaken > 17-09-2025, 09:43 AM
(16-09-2025, 02:48 PM)quimqu Wrote: You are not allowed to view links. Register or Login to view.I don’t like throwing out baseless theories or overcomplicating things, but I can’t help noticing that the characters most common at the beginnings of lines (except the first lines of paragraphs) are also the ones that most resemble Arabic numerals in the Voynich.
(17-09-2025, 02:27 AM)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.However, there is the problem that any position-dependent cipher would almost surely encode each occurrence of the same word type as two or more distinct words. That would flatten out the frequency x rank plot ("Zipf plot") at the high frequency end -- which does not seem to be the case.
quimqu > 17-09-2025, 09:50 AM
(17-09-2025, 02:27 AM)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.One could test that theory by separating all lines into subsets based on their first glyph, computing the word frequency distribution of each subset (excluding the first word of each line) and comparing those distributions.
oaken > 17-09-2025, 09:56 AM
(17-09-2025, 09:50 AM)quimqu Wrote: You are not allowed to view links. Register or Login to view.(17-09-2025, 02:27 AM)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.One could test that theory by separating all lines into subsets based on their first glyph, computing the word frequency distribution of each subset (excluding the first word of each line) and comparing those distributions.
I did a fast check. The cosine similarity plot suggests that the first character of a line doesn’t create a separate "language" within the manuscript. It may nudge the line in a certain direction, but overall the vocabulary quickly falls back into the same general pattern as the rest of the text.