25-02-2023, 12:36 AM
Two arguments against the nomenclator cipher hypothesis that I have met with in discussions were:
1) It would be unusual to encipher a whole book in such manner
2) It would require a huge nomenclator to encipher a volume of this size
Considering argument #1, I think that the VMS is cryptologically unusual (if not unique) anyway, so the argument of unusualness-to-be does not look a strong one to me.
Considering argument #2, I tried to estimate the size of the nomenclator, assuming what is supposedly the worst-case scenario (OK, upon further consideration it is not the worst case of course, but let's say, a "worse case") - namely, that each unique plain text word is matched to a unique vord.
According to voynich.nu, the vocabulary of the VMS is comprised of ~8100 unique vords. That would be the size of the nomenclator in question. However, the nomenclator must contain not only vords, but also their translations into words. This way, the amount of tokens in the nomenclator doubles and results in 8100 x 2 = 16200.
What folio space would be required to hold that many of tokens? This we can deduce from Q20 which presents a convenient example of folio space filled almost exclusively with text, without drawings. 23 pages of Q20 (excluding the mostly empty f116v) contain circa 10700 vords. This means the average density of 10700/23 = 470 vords per page. Let's even assume a 30% margin for breaking text into columns, leaving more free space or increasing the letter size for better legibility etc. This yields the density of 470 x 0.7 = 330 vords per page. Assuming this density for the nomenclator space, 16200 tokens will occupy 49 pages or (regarding a "folio" in this case as a two-sided sheet) 49/2 = 25 folios. This is roughly two quires of the size of the present state of Q20 (without the missing folios). In the relative figures, this would be a (234 + 49)/234 = 21% increase in the total "thickness" of the MS.
The figures look significant but not like impossible.
This calculation implicitly assumes roughly equal average token length both for plain text and cipher text, but this would not be something unexpected judging by the average vord length.
1) It would be unusual to encipher a whole book in such manner
2) It would require a huge nomenclator to encipher a volume of this size
Considering argument #1, I think that the VMS is cryptologically unusual (if not unique) anyway, so the argument of unusualness-to-be does not look a strong one to me.
Considering argument #2, I tried to estimate the size of the nomenclator, assuming what is supposedly the worst-case scenario (OK, upon further consideration it is not the worst case of course, but let's say, a "worse case") - namely, that each unique plain text word is matched to a unique vord.
According to voynich.nu, the vocabulary of the VMS is comprised of ~8100 unique vords. That would be the size of the nomenclator in question. However, the nomenclator must contain not only vords, but also their translations into words. This way, the amount of tokens in the nomenclator doubles and results in 8100 x 2 = 16200.
What folio space would be required to hold that many of tokens? This we can deduce from Q20 which presents a convenient example of folio space filled almost exclusively with text, without drawings. 23 pages of Q20 (excluding the mostly empty f116v) contain circa 10700 vords. This means the average density of 10700/23 = 470 vords per page. Let's even assume a 30% margin for breaking text into columns, leaving more free space or increasing the letter size for better legibility etc. This yields the density of 470 x 0.7 = 330 vords per page. Assuming this density for the nomenclator space, 16200 tokens will occupy 49 pages or (regarding a "folio" in this case as a two-sided sheet) 49/2 = 25 folios. This is roughly two quires of the size of the present state of Q20 (without the missing folios). In the relative figures, this would be a (234 + 49)/234 = 21% increase in the total "thickness" of the MS.
The figures look significant but not like impossible.
This calculation implicitly assumes roughly equal average token length both for plain text and cipher text, but this would not be something unexpected judging by the average vord length.