dexdex > 13-03-2026, 08:42 PM
Jorge_Stolfi > 13-03-2026, 08:48 PM
(13-03-2026, 04:31 PM)dexdex Wrote: You are not allowed to view links. Register or Login to view.The methods devised by participants of g&b experiment can match Voynichese quite well (and seemingly better than any proposed languages),
dexdex > 13-03-2026, 08:51 PM
(13-03-2026, 08:48 PM)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.(13-03-2026, 04:31 PM)dexdex Wrote: You are not allowed to view links. Register or Login to view.The methods devised by participants of g&b experiment can match Voynichese quite well (and seemingly better than any proposed languages),
By what criteria, and who measured them?
A simple first-order Makov chain will produce gibberish with most of the same vital statistics as the Shakespeare corpus -- Zipf plot, entropy per word, n-gram frequencies, word pair frequencies. Someone who does not know English would say that the synthetic text matches Shakespeare "quite well". It would match it better than Agatha Christie, Nicholas Culpeper, or any other natural language text would.
Would that suggest that Hamlet may be just gibberish? That it cannot be a natural language?
The obvious implementation of a zero-order word-based Markov would use a single table of word probabilities. For a first-order word Markov, one would need a separate table for each word X in the lexicon, that gives the frequencies of each word Y following word X. That would be quite large if it was complete; but if the frequencies are computed from a fixed "seed" text of N words, all those tables together will have at most N-1 entries.
But then, instead of a table, the seed text itself can be used to represent those frequency distributions implicitly. To generate the next word Y, you take the previous generated word X, pick a random occurrence of X in the seed text, and copy the following word. This trick (which works for higher-order Markov chains as well) makes it possible to chose the next outout word without generating random numbers with unequal probabilities, which a scribe in the 1400s would hardly be able to do.
Thorsten&Timm's "self-copying" method seems to be quite similar to this implementation of a Markov chain. Except that the generated text itself is continuously appended to the seed text, and a separate (non-trivial!) algorithm is used to occasionally create new words by mutation of previous ones. Ah, and also a time-travel routine is called initially to fetch the last page of the output (page f116r) from the future to use as the initial seed text (page f0v), which is discarded at the end.
All the best, --stolfi
Jorge_Stolfi > 13-03-2026, 09:19 PM
(13-03-2026, 08:42 PM)dexdex Wrote: You are not allowed to view links. Register or Login to view.I found an estimate of around 2 pages a day for a good scribe
dexdex > 13-03-2026, 09:27 PM
(13-03-2026, 09:19 PM)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.I guess that this estimate is for a good scribe writing high quality text with neat uniform "font", ornate letters, etc. Not just a faithful copy.While it's certainly not just 'scribbling,' this estimate is from the era when lay education was already a thing, necessitating more books being copied. The copying hand was a little more lax than ornate books, but reproducing text readably and with decent spacing still takes time even if it doesn't need to be perfect. A book reproducing extensive calligraphy and very elaborate letters, exhibiting neatness, would take much longer. There are reports of one such copying where the contracted scribe had to produce folios continuously for the client: the rate came out to about a page a day, so a year for a tome of Voynich size (well, 9+ months, but if it takes this long you will run into shorter days, the cold season, and illnesses interrupting your work, even in Italy, so a year is a more realistic estimate).
Jorge_Stolfi > 13-03-2026, 10:35 PM
(13-03-2026, 09:27 PM)dexdex Wrote: You are not allowed to view links. Register or Login to view.The Voynich is assuredly not of this quality. Going further, there is no line spacing, so one might infer it should be a 'quicker' copy, but the handwriting is not of 'shorthand' quality inasmuch as we can even guess at it, and the drawings need to be reproduced and text wrapped around them, so it can't be freely written in the way a quick copy would be. I would say a typical copy's time of production is a decent guess.
ReneZ > 13-03-2026, 11:29 PM
dexdex > 13-03-2026, 11:41 PM
(13-03-2026, 11:29 PM)ReneZ Wrote: You are not allowed to view links. Register or Login to view.My feeling is that the 'committing to paper' part is perhaps not the largest part of the entire creation process.If it contains meaning, that's quite probable, though it could be an encipherment of a plaintext book that no longer exists. Then the author and scribes are technically different, and their motives can be separate and so the time investment may be 'worth it' from each standpoint when it might not be worth writing a book only to encipher it...
Also, if the MS is a fair copy, one has to include the time both of creating the draft and creating the copy.