Mauro > Yesterday, 07:41 AM
(09-06-2026, 11:26 PM)Torsten Wrote: You are not allowed to view links. Register or Login to view.Thank you for this analysis. The discrepancy between your distributional distances and my substitution rules is informative rather than contradictory — because I describe context-dependent rules:
"These rules for similar glyphs only apply with some restrictions. For instance 'o' and 'y' can replace each other only as the first or as the last sign. Another example is that 'o' is interchangeable with 'a' before 'l' and 'r' but not after 'q' or before 'k'." (You are not allowed to view links. Register or Login to view., p. 5). A word grid documenting the most frequent substitution relationships across the VMS vocabulary is available at You are not allowed to view links. Register or Login to view. (see also Timm 2014, pp. 66-82).
So "o" and "a" substitute in specific positions — as prefix elements before "l" and "r" — but not in all contexts. Their global distributional distance (0.55 in your measurement) is high because "o" after "q" has no "a" equivalent, pulling the global distances apart. But in the specific positions where they do substitute — "ol"/"al", "or"/"ar", "chol"/"chal" — they are interchangeable.
The same applies to "n" and "r" (0.63 in your measurement). Both appear word-finally, and in that position they substitute — dain/dair, sain/sair, okain/okair etc. But "r" also appears in other positions where "n" doesn't, making their global distributions different.
Your core pairs — ch/sh, k/t, p/f, r/l — show low distances because they substitute freely across many contexts. The pairs with higher distances in your analysis — o/a, o/y, n/r — substitute only in restricted positions, which dilutes their global similarity.
Note: Currier already noted in 1976 that the Voynich glyphs are constructed from shared base strokes — 'you can make up almost any of the other letters out of these two symbols i and e' (see The Nature of the Symbols in You are not allowed to view links. Register or Login to view.). Your distributional distances quantify this observation: glyph pairs with low distance are the pairs that share stroke structure.
Torsten > Yesterday, 08:42 PM
(Yesterday, 07:41 AM)Mauro Wrote: You are not allowed to view links. Register or Login to view.About the 'shared base strokes': it's true of course that VMS symbols are constructed with just a few of them, but I've never been conviced of the significance of this fact because I find it rather expected that any writing system will tend to re-use the same basic strokes. Ie. in the normal block letters Latin alphabet 'o', 'p', 'q', 'b', 'd', 'l' are all made with a circle (or none)+ a vertical bar (or none), but this does not mean they are related to each other.
quimqu > Yesterday, 09:37 PM
| Metric | Control | Lev-1 | Increase |
|---|---|---|---|
| Section similarity | 0.520 | 0.645 | +24% |
| Page similarity | 0.081 | 0.158 | +95% |
| Paragraph similarity | 0.031 | 0.069 | +123% |
| Context similarity | 0.038 | 0.084 | +121% |
Torsten > Yesterday, 11:33 PM
(Yesterday, 09:37 PM)quimqu Wrote: You are not allowed to view links. Register or Login to view....
Jorge_Stolfi > Today, 04:24 AM
(Yesterday, 08:42 PM)Torsten Wrote: You are not allowed to view links. Register or Login to view.In the Latin alphabet, "p" and "b" share a circle + bar but have completely different distributional contexts. In the VMS, your measurements show that glyph pairs sharing stroke structure — ch/sh (0.13), k/t (0.09), p/f (0.11) — have nearly identical distributional contexts. Visual similarity predicts distributional interchangeability. That relationship doesn't exist in writing systems encoding natural language. However, it does exist in the Voynich text.