oshfdk > 17-06-2025, 06:28 PM
(17-06-2025, 03:03 PM)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.After commenting out the subsection titles on both files, I counted again the number of words and parags, and basic statistics (min, max, average, and standard deviation) of the number of words per paragraph (nwp):
Code:statistic ! bencao ! starps
-----------+---------+--------
parags | 354 | 330
words | 10874 | 10457
min nwp | 7 | 11
max nwp | 76 | 72
avg nwp | 30.8 | 31.7
dev nwp | 8.5 | 11.2
Considering the missing bifolio in the SPS quire, we have 6 surprising near coincidences: number of entries, and the mode, min, max, average, and deviation of the number of words per paragraph. (The total number of words is not an extra coincidence since it is the average npw times the number of entries.)
Jorge_Stolfi > 17-06-2025, 06:44 PM
(17-06-2025, 08:35 AM)oshfdk Wrote: You are not allowed to view links. Register or Login to view.(16-06-2025, 11:17 PM)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.the word structure (quite unlike that of "European" languages, but just like that of the "Chinese" languages);What do you mean exactly by "the word structure"? In all flavours of Chinese a word /correction/ syllable consists of an initial sound, a vowel and a final, as far as I know.
Quote:In Voynichese the structures of words /correction/ or parts of words are much more complex than that, and you certainly are aware of this, because there are very well known "Stolfi's" models of decomposing Voynichese words, that do not conform to a simple clean prefix-infix-suffix model.
Quote:I'd say if Voynichese was a phonetic representation of Chinese of any kind, this would be very obvious.
Quote:(16-06-2025, 11:17 PM)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.the lack of identifiable articles, copula verbs, inflectionsThere is lack of identifiable anything in the Voynich MS. Statistically it's not similar to any language, including Chinese.
Quote:[/font](16-06-2025, 11:17 PM)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view. the relatively large number of duplicated words, like chor chorDuplicated words are not a feature of Classical Chinese, as far as I know. In the manuscript that you mentioned as the possible source I've only found 43 instances of duplicated character tokens. ... [font=sans-serif]After removing punctuation, I've found 15 instances of repeated words in Opus Magus
Quote:[The Zodiac diagrams] are the only pieces of evidence for the Chinese theory so far that I personally would call specific. Is there a long form explanation of why these point to the Orient?
Quote: why was there at all the need to put European Zodiac signs onto the Chinese Zodiac?Because the Author must have been told the time of the year covered by each diagram, understood that the 24 diagrams roughly corresponded to the 12 sign of Western Zodiac, so he indicated that information in his copy.
Quote:We don't actually know the number of entries in the [Starred Parags section] of the Voynich MS.
Jorge_Stolfi > 17-06-2025, 06:53 PM
(17-06-2025, 11:22 AM)Aga Tentakulus Wrote: You are not allowed to view links. Register or Login to view.There was also Moses, who was pulled out of the Nile.
nablator > 17-06-2025, 07:35 PM
(17-06-2025, 06:28 PM)oshfdk Wrote: You are not allowed to view links. Register or Login to view.BTW, I'm not sure what method was used to determine the number of words for the Chinese manuscript.
DeepSeek Wrote:While word identification in space-less ancient Chinese requires expertise, it is reliable when guided by grammar, context, and philological training. Modern computational tools (e.g., NLP for Classical Chinese) also assist, but human judgment remains essential for ambiguity.
Jorge_Stolfi > 17-06-2025, 09:02 PM
(17-06-2025, 02:27 PM)Pepper Wrote: You are not allowed to view links. Register or Login to view.(17-06-2025, 08:35 AM)oshfdk Wrote: You are not allowed to view links. Register or Login to view.There are generally two kinds of Voynich theories: the solution kind (providing some specific plaintext for specific parts of the MS, be it labels, lines, etc) and the origin story kind, of which your Chinese theory is an example.I think the origin story is not at all convincing but that's largely irrelevant to whether the solution is correct or not, so it's a shame to get bogged down in arguments about it.
oshfdk > 17-06-2025, 09:20 PM
(17-06-2025, 06:44 PM)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.Unfortunately, there is a large number of schemes that the Author may have chosen to map the structure of a "Chinese" language to the structure of a VMS word. Even for a single language, at a single stage of evolution. The Author may have decided to use a singe glyph to encode the [You are not allowed to view links. Register or Login to view.] of Mandarin, as pinyin does. Or, conversely, he may have used two glyphs for a single phoneme, like the English use of "sh".
And then there is the tone. In Mandarin, is not a property of any specific vowel, but of the syllable as a whole. Therefore, if it is indicated by an explicit glyph in the VMS, this glyph may be inserted anywhere in the word, with the position changing randomly between occurrences of the same word. Or in some more complicated way.Check You are not allowed to view links. Register or Login to view. to see how insanely complicated the scheme could be.
Pinyin (PY) and the national Vietnamese script have settled to use diacritics over a specific vowel to indicate the tone. Another scheme that was used for Mandarin, in pre-Unicode days, was a digit placed at the end. Tibetan used a silent consonant prefixed to the syllable.
Another scheme, used for languages with many tones, or to compare tones in different languages, is to use digits 1-5 to denote pitch, and a sequence of digits to denote the pitch profile of the tone. Thus, for example, the flat tone of Madarin in PY jīng could be written as jing3, the ascending tone of míng as ming25, the dipping tone of shǒu as shou213, etc. And these digits could be inserted anywhere in the word, e.g. sh2o1u3.
oshfdk > 17-06-2025, 09:27 PM
(17-06-2025, 07:35 PM)nablator Wrote: You are not allowed to view links. Register or Login to view.(17-06-2025, 06:28 PM)oshfdk Wrote: You are not allowed to view links. Register or Login to view.BTW, I'm not sure what method was used to determine the number of words for the Chinese manuscript.
I asked DeepSeek, presumably more knowledgeable about Classical Chinese than Google Translate, if words can be reliably identified.
nablator > 17-06-2025, 09:37 PM
(17-06-2025, 09:27 PM)oshfdk Wrote: You are not allowed to view links. Register or Login to view.My question was not about how to do this, but what specific method was used by Prof. Stolfi.
oshfdk > 17-06-2025, 09:38 PM
(17-06-2025, 09:37 PM)nablator Wrote: You are not allowed to view links. Register or Login to view.(17-06-2025, 09:27 PM)oshfdk Wrote: You are not allowed to view links. Register or Login to view.My question was not about how to do this, but what specific method was used by Prof. Stolfi.
In bencaopinyin.txt:
# Transcription of Chinese characters to pinyin obtained by Google Translate.
Jorge_Stolfi > 17-06-2025, 09:47 PM
(17-06-2025, 04:57 PM)MarcoP Wrote: You are not allowed to view links. Register or Login to view.Still it's interesting that the vowels a,e,o,u in Giovanni Fontana's cipher (1420 ca) were rotations of the same simple shape. Example from You are not allowed to view links. Register or Login to view. cesto da uoue=egg basket