(24-08-2025, 12:25 PM)nablator Wrote: You are not allowed to view links. Register or Login to view. (24-08-2025, 12:05 PM)dexdex Wrote: You are not allowed to view links. Register or Login to view.As far as I can tell, for the self citation you only require a couple rules:
[...]
I think that is the conceit behind the algorithm, and it is certainly not a complex process.
It seems simple in the articles by Timm & Schinner but when you look at the code of the generator:
You are not allowed to view links. Register or Login to view. has the "rules" definitions: similarElements, combinableLigature, allowedFollowerGlyphs, allowedInitialFollowerGlyphs, allowedPrecursorGlyphs, finalGlyphReplacements, selfIngroupGlyphReplacements, selfFinalGlyphReplacements, etc.
The way the generator uses these rules is also quite complex: You are not allowed to view links. Register or Login to view. (1300 lines)
The result is far from convincing Voynichese, so more rules would be required to make the generated text indistinguishable from actual VM lines at a glance. The question remains: why would anyone choose to do it the hard way, by following a large set of (mostly useless) rules, instead of doing something much simpler that is more algorithmic and less rule-based? Mostly useless because natural languages are a lot less restrictive: there is no need for so many word-building rules if the aim is only to imitate natural languages.
I am not discounting the possibility, however, that the rules can be expressed in a more elegant and compact way. In a board game like Go the patterns are emergent, not hard-coded into the rules.
As far as I can tell, similarElements + combinableLigature + GlyphReplacements encode adding individual strokes. The functions look more complex because they have to encode that fact in a transcription scheme that doesn't fully support this.
The line finals/line beginnings are there to support the hypothesis that typically line endings are copied from previous line endings, certainly something that would be a simple choice and very convenient for a scribe.
My point being that looking at the generator code is misleading, because a large part of this code is 'encoding' the simple rules into a rote algorithm for a computing machine to perform. They are, however, very natural for humans. To borrow your Go a perfect example: there are essentially three rules to the game of go (the rule of turn order white then black; the rule of liberty counting; the rule of ko) but a program that allows playing go will have to explicitly do the liberty counting in loops, encode the board size, encode the current state of the game, verify its correctness, display it etc etc. Most of these a human does at a glance; and it is why Go eluded computers for so long and why Go playing programs, even though they are stronger by magnitudes than the best players, still have blind spots that wouldn't fool an amateur after their second lesson (they can't do long ladders the way humans do, and this fact can be exploited to beat them by playing a very stupid game that fools the computer).
I am assuming that the description in the Timm article is accurate, but I have no indication to think otherwise. While the example .txt files are quite different than Voynichese, that is to be expected because it's made by a random process that grows in wildly different directions depending on initial seeds; still, on many key statistical features used to support the meaningfulness hypothesis, a significant proportion of the parameter space is hard to distinguish statistically from the VMS.
So, it is plausible that this scheme can generate
a gibberish-but-looks-non-gibberish text with somewhat similar characteristics. Allowing for some scribal freedom (the pages that look like the first column was made first, for instance, might just be a scribe getting bored and doing something weird), I think it's a pretty convincing argument that it is a simple process that does a pretty good job of emulation of Voynichese, devisable and executable by someone in the 15th century. That doesn't mean it's how it was done, or even close, or that Voynich necessarily doesn't contain meaningful text -- but its plausibility has to be acknowledged.
(24-08-2025, 04:05 PM)dashstofsk Wrote: You are not allowed to view links. Register or Login to view.And this is the scenario that I consider to be plausible:
The author is not a skilled writer. Is no Charles Dickens who had the ability to write at long sittings page after page without correction. He possibly was unfamiliar with the secret sciences that seem to be the subject of the VMS, and would be unable to write with authority on it. His nescience would have been immediately obvious and his book would have been immediately dismissed.
Not being able to write convincingly he had the inspiration to create a fabricated narrative in a bogus alphabet and claimed that it was a product from some distant undiscovered land. No need for any grammatical correctness or to give the text meaning. He could just bash the words out as they came. This would actually be more rewarding since a book from some unknown land would be viewed with curiosity and could command a higher premium.
Indeed, this is plausible. And in the context of a 'quack doctor prop' theory, you don't need the craftsmanship to be that great -- you just need something that suggests vast knowledge that will pass cursory inspection. The VMS's usefulness for that is arguably incredible: we still don't know if it contains meaningful text or not, and despite crappy illustrations and no discernible use, it is viewed with curiosity centuries later. It would be a fantastic prop; less so as a book for sale, since why would you buy a book you can't read? (Of course, anything can happen, or it might be a misguided attempt at a book sale hoax etc - but my point is just that I find those explanations way less attractive than the quack doctor hypothesis)
The process to generate it has to be simple enough, but self-copying is certainly plausible.