kckluge > Today, 07:56 AM
(Yesterday, 11:26 PM)nablator Wrote: You are not allowed to view links. Register or Login to view.(Yesterday, 06:22 PM)kckluge Wrote: You are not allowed to view links. Register or Login to view.All of which is getting into the weeds. The point is that *if* Voynichese were a cipher that breaks up words into smaller chunks, the process that breaks them up is unlikely to be syllabification (and likely isn't deterministic in general?) due to the extremely low TTR that results.
The TTR of Voynichese can be reduced as much as you need... by simplifications, equivalences, re-spacing. However the babble-like sequences of similar words would not produced plausible Latin (or Chinese).
Jorge_Stolfi > Today, 10:02 AM
(Yesterday, 03:43 PM)nablator Wrote: You are not allowed to view links. Register or Login to view.I estimate it to 800-1000 in long texts (without many exotic words), more than double what Mandarin has (~400).
Jorge_Stolfi > Today, 10:33 AM
(Yesterday, 03:42 PM)Stefan Wirtz_2 Wrote: You are not allowed to view links. Register or Login to view.(19-05-2026, 08:25 PM)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.[..]There is no proof for some of them being just „slight variants“ or even an „abbreviation“ for daiin or anything else.
The Voynichese daiin (and sometimes slight variants like dain, kaiin and laiin, and the abbreviation dam)
ReneZ > 10 hours ago
(Today, 10:33 AM)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.But I think I do have good proof that dair, dain, kaiin and laiin can be variants of daiin. It seems you stll don't accept that evidence, but I hope it will eventually be irrefutable; I am working on that.
And I have also an explanation for those particular variations: the glyphs k, d and l (and, separately, r and in) would look very similar if written in a "cursive" handwriting, such as the Author may have used in the draft.
Jorge_Stolfi > 9 hours ago
(Today, 07:56 AM)kckluge Wrote: You are not allowed to view links. Register or Login to view.the distribution of the number of words between instances of words with a length-normalized edit distance below some threshold [...] is an incredibly significant statistical signature of the Voynich text that any theory about how the text is generated needs to reproduce.
Quote:It's for Stolfi to address that issue -- quantitatively -- in the context of explaining/defending his theory. [...] The "explanation" would be to show that the same anomalies that you see in the SPS are spelling system, and incidence of errors.
Quote:...I'm acutely aware that this is the "Chinese" theory thread (and not, for instance, the (AFAIK non-existant) "verbose cipher" theory thread), bur moderators and readers please bear with me because what follows will, in fact, wrap back around to being on topic...
Quote:3) results in a critique of Stolfi's approach here
Quote:All of which leads to wrapping this back around to discussing Stolfi's theory [...]. It's not *impossible* that Stolfi has stumbled into a solution with his approach, but [...] I think it is far more likely that he has fallen prey to the siren call of the "crib" that has lured so many mariners sailing on the treacherous seas of the Voynich Mss to their doom.
Quote: What he *should* be doing -- again, IMHO -- is taking texts in one or more SE Asian languages (dealer's choice), assigning some scheme for representing them with Voynich glyphs, simulating whatever "confused ignorant scribe" error processes he thinks are there, and then showing that you wind up with something that -- quantitatively, and for all the "greatest hits" properties (including "babble-like sequences of similar words") -- looks like Voynichese.
eggyk > 8 hours ago
nablator > 4 hours ago
(9 hours ago)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.Well, you have that text above to play with. Do you see "babble-like sequences" in it?
nablator > 3 hours ago
(4 hours ago)nablator Wrote: You are not allowed to view links. Register or Login to view.How to build a babble detector? Not sure. Some compression algorithm run on all (short) substrings?