09-05-2026, 10:33 PM
(09-05-2026, 01:33 PM)Grove Wrote: You are not allowed to view links. Register or Login to view. “ The character 血 is matched with the EVA string aiinqot “
Hi Jorge, does this mean spaces are irrelevant in your theory?
I added that crib just yesterday, and I am still quite unsure about it.
Spaces do seem to be relevant, since, on average, one Chinese character in the SBJ corresponds to one Voynichese token in the SPS, almost exactly. That is the case not only for individual entries and for gaps between entries, but also for the histograms of entry length and parag lengths.
But spaces also seem to be often wrong, in both senses. It must be often the case that a Chinese character corresponds to only part of a word in the transcription file, or to two or more words together. How often does that happen? I do not know yet.
Besides the potential confusion with single-vowel syllables, which I mentioned in the previous post, there is also the fact that many concepts that are expressed by a single word in Western languages are expressed by two-character compounds in those candidate East Asian languages; and (as in any language) the meaning of the compound is usually related to the parts, but cannot be deduced from them. Like 瘾疹 yǐn zhěn = "urticaria", literally "craving rash". Nowadays, when Mandarin is transcribed phonetically as pinyin, the two syllables of such compounds are often stuck together without a space (yǐnzhěn), possibly to facilitate the parsing by Westerners. So perhaps the hypothetical Dictator too sometimes read such compounds together as single words. Or perhaps the Author recognized that the two syllables were a compound, and joined them in his notes.
For that reason and other similar reasons, when I try to pair an SBJ recipe to an SPS parag, I ignore all spaces in the latter, and look only for substrings of EVA characters that seem to be correlated with the Chinese keywords characters.
In the case of 血 = "blood" (which, not surprisingly, is one of the most common characters in the SBJ), I am trying to see whether it can be another crib, the same way I found the others. Namely, I took the SBJ entries that have I have already tentatively paired and that use that character, and looked for some common string that appeared around the corresponding positions in the SPS parags, modulo some "spelling" variations.
On some (not all) of those parags, the string "aiinqok" seemed to appear at the right places, with some variations. Currently I am assuming that the variations can be described by the RE pattern /aii?(in|r)q?o?[ktpf]/, that would match "aiirk", "aiiinqop", etc.
The "?" and "(in|r)" in the pattern means that the string may start with aiin, aiiin, air, aiir; and that the "qo" is optional.
In a previous post, I explained why iin <-> ir is an acceptable variation; namely because they can easily look the same in moderately sloppy handwriting. Said another way, the "ink distance" between the two is rather small.
In another post I also explained why omitting the q or qo is also an acceptable variation. Namely, because there is a vague hint that it is not phonetic but a symbol for "and", like Western "&", that the Author added because he felt it was necessary for comprehension. (In a text like the SBJ, the phrase "apples, bananas, and cherries are good" would be written "林檎香蕉櫻桃美", literally "apple banana cherry good". My hunch is that the Author found that quite confusing, and would write down "apple qobanana qocherry good" instead.)
Anyway, now I am trying to add 血 ≈ aiinqok (and variations) as a required keyword pair when matching entries that use that character.
Maybe it will work,maybe not. It may be that the Voynichese for 血 is not quite that string but some other string that may or may not overlap with it. It may be that the entry-parag pairs were I got that string from were incorrect and the Voynichese for "blood" is nothing like it. Maybe I will need to allow other variations, like o instead of a, or d replacing k (another pair of glyphs that are close in "ink distance"). Stay tuned.
All the best, --stolfi