quimqu > 03-06-2026, 04:02 PM
(03-06-2026, 09:55 AM)dashstofsk Wrote: You are not allowed to view links. Register or Login to view.So, this is biasing your numbers. It should not surprise that hands 2 and 3 and quires 13 and 20 have better than expected probabilities of sharing these words.
| Link type | Percentage |
|---|---|
| Cross-hand + cross-quire | 59.4% |
| Same-hand + cross-quire | 21.9% |
| Same-hand + same-quire | 16.9% |
| Cross-hand + same-quire | 1.7% |
| Section pair | Cross-hand | Cross-quire |
|---|---|---|
| Herbal ↔ Marginal stars | 91.9% | 99.8% |
| Biological ↔ Marginal stars | 100% | 100% |
| Herbal ↔ Biological | 65.5% | 100% |
| Token | Biological folio | Marginal stars folio |
|---|---|---|
| alchl | f76v | f113r |
| cholchey | f79r | f113r |
| fshedy | f80r | f115r |
| cheety | f80r | f112r |
| ckhedy | f84r | f112v |
Torsten > 04-06-2026, 11:37 AM
(03-06-2026, 04:02 PM)quimqu Wrote: You are not allowed to view links. Register or Login to view.For example, some of the semi-rare tokens linking Biological and Marginal stars are:
Token Biological folio Marginal stars folio alchl f76v f113r cholchey f79r f113r fshedy f80r f115r cheety f80r f112r ckhedy f84r f112v
| Folio | Token | Section |
|---|---|---|
| f76v | ralchl | Biological |
| f77r | dolchl | Biological |
| f81r | lchl | Biological |
| f105r | lchl | Marginal Stars |
| f106r | rarolchl | Marginal Stars |
| f107r | polchls | Marginal Stars |
| f113r | alchl | Marginal Stars |
(03-06-2026, 04:02 PM)quimqu Wrote: You are not allowed to view links. Register or Login to view.If I am not wrong, those links are all cross-hand and cross-quire. So I agree there is definitely a codicological component here. But the strongest lexical bridges do not seem to be reducible to simple "same scribe / same quire" proximity either.
dashstofsk > 04-06-2026, 02:39 PM
petronio > 04-06-2026, 05:25 PM
quimqu > 04-06-2026, 06:25 PM
(04-06-2026, 05:25 PM)petronio Wrote: You are not allowed to view links. Register or Login to view.Your rare-token hub result made me look more carefully at something I'd been tracking separately; boundary concentration at the folio level rather than corpus-wide.
The Stars folios (f111r, f111v, f108v) come out consistently low, around 0.37, against a corpus average closer to 0.59. You are not allowed to view links. Register or Login to view. similar. The herbal section runs higher, some pages above 0.80. So the pages with the widest rare-token reach also tend to have the most interior-distributed boundary patterns. Whether that's the same structural phenomenon from a different angle, or two independent signals that happen to co-occur in those sections, I'm still not sure. What do you think?
quimqu > 04-06-2026, 10:28 PM
(04-06-2026, 11:37 AM)Torsten Wrote: You are not allowed to view links. Register or Login to view.If different scribes wrote different sections, we would expect rare vocabulary to cluster by hand — each scribe's idiosyncratic word choices staying within that scribe's sections. That's not what we see either.
Torsten > 04-06-2026, 11:45 PM
(04-06-2026, 10:28 PM)quimqu Wrote: You are not allowed to view links. Register or Login to view.My point is not that rare-token links disprove vocabulary evolution. They probably don't. What interests me is that these semi-rare tokens often connect sections that, under a semantic interpretation, could plausibly be related.
For example, in a normal herbal manuscript, semi-rare words might be plant names. You would expect them to appear on the herb page itself, but also in recipes or remedies referring to that plant.
What I find interesting in the Voynich is that some of these semi-rare tokens connect Herbal folios with Marginal Stars folios. If the Stars section contains something recipe-like (which is of course only a hypothesis), then that is exactly the kind of pattern one might expect from a meaningful text.
quimqu > 05-06-2026, 07:46 AM
(04-06-2026, 11:45 PM)Torsten Wrote: You are not allowed to view links. Register or Login to view.The text contains almost no repeated phrases. Tiltman (1967), D'Imperio (1978), and Reddy & Knight (2011) all noted that sequences of three or more words virtually never repeat. Any meaningful text — descriptions of plants, recipes, instructions — produces repeated phrases ("take the root of," "boil in water," "apply to the"). The VMS doesn't.
Torsten > 05-06-2026, 09:01 AM
(05-06-2026, 07:46 AM)quimqu Wrote: You are not allowed to view links. Register or Login to view....
| Text | Unigram | Bigram | Improvement |
|---|---|---|---|
| VMS B | 2.30% | 2.50% | 8.85% |
| English | 4.72% | 11.9% | 151% |
| Arabic | 3.81% | 14.2% | 252% |
| Chinese | 16.5% | 19.8% | 19.7% |
| Hungarian | 5.84% | 13.0% | 123% |
quimqu > 05-06-2026, 09:37 PM
(05-06-2026, 09:01 AM)Torsten Wrote: You are not allowed to view links. Register or Login to view....
| Section | Candidate tokens | Section vocabulary | % vocabulary |
|---|---|---|---|
| Astronomical | 297 | 614 | 48.4% |
| Text-only | 449 | 995 | 45.1% |
| Cosmological | 439 | 1126 | 39.0% |
| Zodiac | 287 | 784 | 36.6% |
| Pharmaceutical | 409 | 1162 | 35.2% |
| Biological (balneological) | 427 | 1480 | 28.9% |
| Marginal stars only | 654 | 3339 | 19.6% |
| Herbal | 620 | 3490 | 17.8% |
qotey -> Herbal f4r, also found in Stars and Balneological
chetey -> Herbal f34r, also found in multiple non-Herbal folios
olchedy -> Herbal f26r, reused outside Herbal