16-02-2026, 01:34 PM
(16-02-2026, 11:34 AM)kckluge Wrote: You are not allowed to view links. Register or Login to view."if the SPS is a version of the SBJ, then each Voynichese word in the former corresponds roughly to ... 1.089 [Chinese characters]."That implies a bound on how undersegmented the Voynichese can be. If "...the Author often missed word breaks when taking dictation....", then according to this claim he/she didn't miss more than ~8% of them. ... As I pointed out in my earlier post, treating uncertain spaces as spaces results in only 129 of the occurences of 'daiin' being as a word, with all but 8 of the remaining 177 occurences being as a word suffix or prefix. Which implies a far higher rate of undersegmentation if we assume 'daiin' is typical.
There are many confusing facts to account for.
First, just to clarify, the word counts in the paper consider commas as word spaces, like "." and "-" (line break).
Second, we don't know the language and what sort translation the source book was. If the Author was in, say, a Cantonese-speaking area, we may hope that the Chinese characters were basically those of the digital file, and Dictator read each Chinese character as one Cantonese syllable. But if the thing happened in Vietnam, the text as read by the Dictator may have not been one-Vietnamese-syllable-for-one-Chinese-character-in-my-file, because the grammar of the two languages is completely different. However the close match of the word counts makes this possibility rather unlikely.
Moreover, the SPS version of the "Rooster" recipe almost certainly omitted the "taste and warmth" field of the Chinese version, and the "veterinary uses" and "where it grows" fields at the end. If similar omissions occur in many other recipes too, they will affect the ratio above (average words per parag / average chars per recipe) Thus the actual correspondence of VMS words per Chines chars may be closer to 1 than to 1.08.
On the other hand, that 8% is the net discrepancy in the word count. The transcription of the SPS may have bogus spaces as well as missing ones. It could be 18% missing word spaces and 10% bogus ones. For example, in other SPS parags one finds "otchod.aiin" and "dalchd.aiin" that seem to be improperly joined and split "daiin".
Anyone who has tried to transcribe the VMS knows that the spaces marked with comma are only a subset of the dubious spaces, and that many glyph spaces that do not seem dubious at all may in fact be missing word spaces, or vice-versa.
Quote:Alan Farne's thesis ... "There were two surprising conclusions from the analysis of these manuscripts: first, the scribes of both direct copies neither added nor omitted any words."
But those scribes were copying from other clean-copy manuscripts written by other scribes for sale or commission, where word spaces were clearly distinct from glyph spaces (otherwise they would necessarily have made many wrong calls when trying to determine whether there was a space of not).
And one must wonder how ignorant those Scribes were of Greek, really. They were not fluent, sure, seeing that they made many spelling errors. But maybe they knew enough to parse words even if the spaces were ambiguous? Greek used to have special diacritics on initial vowels, and a special form for final sigma, and things like that...
Whereas the VMS Scribe was copying from the Author's draft. Even if it was a second draft, not the raw dictation, it would probably have been much more loose with spacing.
Quote:And requires an explanation if we are asked to believe it is not.
I am not asking people to believe. I am presenting the evidence as best as I can so that they
can draw their own conclusions
To me, the conclusion "SPS≈SBJ, daiin≈主" much more certain and well-founded than, say, the 1400s date, or the Five Scribes theory. But I don't need others to believe that. I am not posting here to boost my TikTok views or promote some book. If some people do not agree now, well, maybe they will later, as I get more matches.
All the best, --stolfi
