nablator > 24-02-2026, 11:26 AM
(24-02-2026, 06:28 AM)Jorge_Stolfi Wrote: I am confused about what you mean by "word shuffled" and "token shuffled". Is that a random permutation of the tokens on each line, in each paragraph, or over the whole text?
quimqu > 24-02-2026, 04:18 PM
(24-02-2026, 11:26 AM)nablator Wrote: (24-02-2026, 06:28 AM)Jorge_Stolfi Wrote: I am confused about what you mean by "word shuffled" and "token shuffled". Is that a random permutation of the tokens on each line, in each paragraph, or over the whole text?
For me: a random permutation of word tokens over the whole text, using only paragraph text from Currier A pages and, separately, Currier B pages.
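A minimal sketch of the global word-token shuffle described above, assuming the text is available as a list of lines with space-separated word tokens. The function name `shuffle_tokens_globally` and the choice to keep the original number of tokens per line are assumptions for illustration, not details given in the thread.

```python
import random

def shuffle_tokens_globally(lines, seed=0):
    """Randomly permute all word tokens across the whole text.

    Assumption: each line is a string of space-separated tokens, and the
    shuffled text keeps the original number of tokens on every line.
    """
    # Pool every token of the text into one flat list.
    tokens = [tok for line in lines for tok in line.split()]
    rng = random.Random(seed)  # seeded so the shuffle is reproducible
    rng.shuffle(tokens)

    # Deal the shuffled tokens back out, line by line.
    shuffled_lines = []
    start = 0
    for line in lines:
        count = len(line.split())
        shuffled_lines.append(" ".join(tokens[start:start + count]))
        start += count
    return shuffled_lines
```

To treat Currier A and Currier B separately, the same function would be called once on the lines of each subset.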
nablator > 24-02-2026, 06:55 PM

quimqu > 26-02-2026, 08:25 PM
| Corpus / Subset | Tokens | Internal tail gap | Change in global gap if removed | Meaning |
|---|---|---|---|---|
| Natural languages (control) | varies | ~0 to 1e-5 | — | Word shuffle has almost no effect on long-range MI |
| Voynich (global) | 38262 | 0.001222 | — | Clear reduction after word shuffle |
| Voynich – Herbal | 10928 | 0.001198 | -0.000204 | Strong driver of the global gap |
| Voynich – Biological (balneological) | 6327 | 0.000402 | -0.000049 | Small positive contribution to global gap |
| Voynich – Cosmological | 2246 | ~0.00117 (global without it) | -0.000052 | Minor contributor |
| Voynich – Marginal stars only | 11646 | 0.000810 | +0.000209 | Reduces the global gap (diluting effect) |
| Voynich – Lines starting with "o" | 6700 | 0.001188 | Very small effect | Strong internal structure, but not a main driver |
| Voynich – Lines starting with "d" | 5819 | 0.000816 | Near zero effect | Moderate internal structure, minimal global impact |
| Torsten Timm (generated text) | varies | ~0.002–0.003 | — | Strong reduction after word shuffle |
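For readers who want to reproduce this kind of number, here is a rough sketch of how a "tail gap" could be computed. It assumes a plug-in (frequency-count) estimate of mutual information between symbols `d` positions apart, averaged over a set of large distances, for the original text minus the shuffled text. The estimator, the distance range, and the function names are assumptions; the thread does not specify them.

```python
import math
from collections import Counter

def mutual_information(seq, d):
    """Plug-in estimate of MI (in bits) between symbols d positions apart."""
    pairs = list(zip(seq, seq[d:]))
    n = len(pairs)
    if n == 0:
        return 0.0
    joint = Counter(pairs)
    left = Counter(a for a, _ in pairs)
    right = Counter(b for _, b in pairs)
    mi = 0.0
    for (a, b), c in joint.items():
        # p(a,b) / (p(a) p(b)) simplifies to c * n / (left[a] * right[b])
        mi += (c / n) * math.log2(c * n / (left[a] * right[b]))
    return mi

def tail_gap(original, shuffled, distances):
    """Mean MI over `distances` for the original minus that of the shuffle."""
    mi_orig = sum(mutual_information(original, d) for d in distances)
    mi_shuf = sum(mutual_information(shuffled, d) for d in distances)
    return (mi_orig - mi_shuf) / len(distances)
```

The plug-in estimate is biased upward on finite samples, which is one reason to report the difference against a shuffled baseline of the same length rather than the raw value.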
nablator > 26-02-2026, 09:54 PM

quimqu > 26-02-2026, 10:55 PM
| System | Tokens | Alphabet size | Tail MI gap |
|---|---|---|---|
| EVA | ~38,000 | 116 | 0.00122 |
| Currier | ~15,000 | 36 | 0.00269 |
| Torsten Timm | ~11,000 | 20 | ~0.002–0.003 |

| Grouping | EVA tail gap (approx) | Currier tail gap (approx) | Effect |
|---|---|---|---|
| Global baseline | 0.00122 | 0.00269 | Currier > EVA |
| Herbal section | High contributor | High contributor | Strong in both |
| Marginal / stars only | Dilutes gap | Dilutes gap | Weak structure |
| Biological | Moderate | Moderate | Section-dependent |

| Grouping | EVA pattern | Currier pattern |
|---|---|---|
| Lines starting with common glyphs | Small variation | Small variation |
| Rare initials | Minimal impact | Minimal impact |
nablator > 26-02-2026, 11:49 PM
(26-02-2026, 10:55 PM)quimqu Wrote: If the long-range MI gap were mainly an artifact of rare glyphs or over-fragmentation, reducing the alphabet should weaken the effect (that is at least the logical expectation: the text should behave more like natural languages and the gap should shrink, shouldn't it?).

quimqu > 27-02-2026, 09:48 AM
(26-02-2026, 11:49 PM)nablator Wrote: No idea, really. This is not at all what I have in mind. I should wait until I have something positive to report before I comment; I have more tests to do before that. I had a migraine today until an hour ago, so I tested nothing.
quimqu > 27-02-2026, 09:55 AM
| Shuffle scheme | Tail gap (MI raw − shuffle) | Normalized gap (÷ H1) |
|---|---|---|
| Global token shuffle | 0.00250 | 0.00094 |
| Shuffle within lines | ≈ 0 | ≈ 0 |
| Shuffle within paragraphs | ≈ 0 | ≈ 0 |

| Shuffle scheme | Tail gap | Normalized gap |
|---|---|---|
| Shuffle order of lines (lines kept intact) | 0.00251 | 0.00094 |
| Shuffle order of paragraphs (paragraphs kept intact) | 0.00062 | 0.00023 |
quimqu > 27-02-2026, 10:32 AM
| Shuffle scheme | Tail gap | Normalized gap |
|---|---|---|
| Global token shuffle | 0.00250 | 0.00094 |
| Shuffle within lines | ≈ 0 | ≈ 0 |
| Shuffle within paragraphs | ≈ 0 | ≈ 0 |
| Shuffle within pages | ≈ 0 | ≈ 0 |
| Shuffle order of lines | 0.00251 | 0.00094 |
| Shuffle order of paragraphs | 0.00062 | 0.00023 |
| Shuffle order of pages | ≈ 0.00205 | ≈ 0.00077 |
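The two families of shuffles in the table can be sketched as follows, assuming the text is represented as a list of units (lines, paragraphs, or pages), each unit being a list of word tokens. The function names are hypothetical, and this is only one plausible reading of the scheme labels.

```python
import random

def shuffle_within_units(units, seed=0):
    """Permute the tokens inside each unit; the order of units is kept."""
    rng = random.Random(seed)
    result = []
    for unit in units:
        tokens = list(unit)  # copy so the input is left untouched
        rng.shuffle(tokens)
        result.append(tokens)
    return result

def shuffle_unit_order(units, seed=0):
    """Permute the order of the units; each unit is kept intact."""
    rng = random.Random(seed)
    result = [list(unit) for unit in units]
    rng.shuffle(result)
    return result
```

With lines as units, `shuffle_within_units` corresponds to "Shuffle within lines" and `shuffle_unit_order` to "Shuffle order of lines"; passing paragraphs or pages as the units gives the other rows.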