oshfdk > 21-05-2026, 03:15 PM
(21-05-2026, 01:17 PM)Dunsel Wrote: You are not allowed to view links. Register or Login to view.No, the manuscript never explored every possible word combination. A realistic copy-mutate system would stay conservative, reusing active word families and nearby variants instead of wandering randomly through all legal forms.
Jorge_Stolfi > 21-05-2026, 03:29 PM
(21-05-2026, 08:43 AM)Torsten Wrote: You are not allowed to view links. Register or Login to view.The frequency-connectivity correlation arises through a feedback loop inherent in the copying process. Frequent words are more likely to be selected as copying templates, generating more variants; the existence of more variants increases the probability that members of that word family are selected in subsequent copying events. This self-reinforcing cycle ensures that the most frequently used words accumulate the most similar neighbors—precisely the pattern you document.
Jorge_Stolfi > 21-05-2026, 03:48 PM
(19-05-2026, 07:32 AM)dashstofsk Wrote: You are not allowed to view links. Register or Login to view.This is my hypothesis also. The writer wanted to make people believe that this was a rare piece from some distant land where they used a strange alphabet. In order to create the deception he had to give the manuscript a semblance of genuineness, give it some definite structure. Otherwise unstructured random writing would have been quickly dismissed as a fraud.
Quote:For me the actual method is not really important. It is enough to show evidence that the manuscript is artificial, constructed and meaningless.
Dunsel > 21-05-2026, 04:01 PM
(21-05-2026, 03:15 PM)oshfdk Wrote: You are not allowed to view links. Register or Login to view.(21-05-2026, 01:17 PM)Dunsel Wrote: You are not allowed to view links. Register or Login to view.No, the manuscript never explored every possible word combination. A realistic copy-mutate system would stay conservative, reusing active word families and nearby variants instead of wandering randomly through all legal forms.
And how was this achieved? For example, chol and chedy are two very common words and olchedy appears more than 70 times, what exactly happens when the scribes attempt to merge chol and chedy together into cholchedy? What stops them? Why cholchedy only occurs twice and qolchedy 11 times, even though chol is almost 3 times as popular as qol?
Jorge_Stolfi > 21-05-2026, 04:01 PM
(21-05-2026, 02:12 PM)Dunsel Wrote: You are not allowed to view links. Register or Login to view.I reworked one of my comparison tests to operate at the sheet level. What this appears to show is that the manuscript is not behaving like a collection of unrelated pages. Certain sheets are much more similar to specific other sheets than to the manuscript as a whole, especially in Quire 13 and Quire 20.
Quote:That is consistent with what I would expect from a copy-and-modify process, where words and word families are repeatedly reused and gradually altered over time, creating clusters of closely related sheets instead of uniformly random text.
oshfdk > 21-05-2026, 04:36 PM
(21-05-2026, 04:01 PM)Dunsel Wrote: You are not allowed to view links. Register or Login to view.Quite simply, in human terms: does the word “look right” within the current local family of forms being copied?
Dunsel > 21-05-2026, 04:49 PM
(21-05-2026, 03:48 PM)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.The hoax/fraud theories have several problems.
First, even if we assume that the Author used old vellum, the VMS must have been written before 1600. That was decades before probability theory was invented, and centuries before it was applied to languages. The Author would not have even imagined what the phrase "like natural language" meant. Even computing character frequencies would have been a hard and unusual task. So how come the output of his gibberish generation method ended up having all the properties of natural languages that we see in Voynichese -- lexicon size, Zipf-like word frequencies, 10 bits of word entropy, etc?
And the same applies to the intended targets of the fraud. How could they tell that the text was not "like a natural language"? So then why would the Author worry about that?
On the other hand, if his intent was to make it look "natural", why did he make Voynichese so unlike European languages? Even today, when we know that natural languages can be very different from European ones, people will look at those very short words with complex structure, the absence of recognizable articles and prepositions, the repetitiousness -- and immediately think "gibberish" and "fraud".
Moreover, people rarely imagine things beyond their experience. An invented language will usually be similar to the languages that the Author knows, even if he tries hard to make it "exotic". If all the languages that someone knows have polysyllabic words, articles, prepositions, inflections -- his invented language will quite probably have them too. Edward Kelley, the con man who ruined the life of John Dee, invented an "Enochian language" that was supposedly used by angels in Heaven. Kelley's "Enochian" turned out to be quite similar to Greek, Hebrew,and other languages that he was familiar with. Thus, if the Voynichese language had been invented by a Medieval European scholar, it should look like that, too.
And then there is the question, why would the Author create a book like this? Whether the intent was to sell the book or to impress, he surely would have produced a very different book -- with more figures and less text, with figures suggesting more valuable secrets like turning lead into gold, curing someone from an arrow shot straight though the head, restoring the youth of an elderly person, getting twelve nymphs into one's bed...
Why would he waste time with the ~25 pages of Quire 20, with boring-looking text and no figures?
Why would he have so many plants in the herbal sections? If the "selling point" of the book was that the plants were from a distant land and unknown in Europe, it would not make much difference if the book had 50 or 30 unknown plants, instead of 130+.
Quote:For me the actual method is not really important. It is enough to show evidence that the manuscript is artificial, constructed and meaningless.
The method is important if it implies knowledge that was not generally available at the time, or that it was too complicated and cumbersome to use.
All the best, --stolfi
Dunsel > 21-05-2026, 05:09 PM
(21-05-2026, 04:36 PM)oshfdk Wrote: You are not allowed to view links. Register or Login to view.(21-05-2026, 04:01 PM)Dunsel Wrote: You are not allowed to view links. Register or Login to view.Quite simply, in human terms: does the word “look right” within the current local family of forms being copied?
The whole idea of the copy-mutate system, if I understood @Torsten correctly, is to formalize the process, to be able to make testable predictions, etc.
I see two possibilities for where copy+mutate research leads: either some well defined rule based reasonably simple copy+mutate system successfully explains most statistical properties of the text, which would mean that the scribes actually used something similar to this precise rule-based system system. Or scribes mostly relied on their intuition, visual patterns and imagination and even if the process included looking back and reusing past patterns, there will be no way of identifying the process exactly, in which case we can't really learn much from copy+mutate formalizations. Is this correct?
Dunsel > 21-05-2026, 05:13 PM
(21-05-2026, 04:01 PM)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.On the contrary, the copy-and-modify process should create a gradual transition across quires and sections.
Maybe there is a reordering of the sheets that will make the transitions between sections less abrupt. But that would be an artifact of the reordering, not a real gradual change.
All the best, --stolfi
dashstofsk > 21-05-2026, 06:33 PM
(21-05-2026, 03:48 PM)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.why did he make Voynichese so unlike European languages?