Jorge_Stolfi > 05-11-2025, 09:25 PM
quimqu > 05-11-2025, 11:37 PM
(05-11-2025, 04:59 PM)Jorge_Stolfi Wrote: The most interesting result is the proximity of the Spanish "head" texts to the Portuguese "tail" ones, and vice-versa. Those are essentially distinct texts, in rather different languages and quite different spellings, technically by distinct authors (Machado in the latter, and Tapía translating Machado in the former). What they had in common was the higher-level nature and style of the work (grammatical variety, clause length, predominant verbal tenses, etc.), the general topic (which determined proper names and common concepts and actions) and whatever part of the author's style could survive the translation.
Jorge_Stolfi > 06-11-2025, 01:02 AM
(05-11-2025, 11:37 PM)quimqu Wrote: Maybe a good test would be to run a text translated between languages that are less closely related than Portuguese and Spanish, and see how the graph behaves. Because then, if the dots are close, we will have a tool to study the Voynich text independently from the alphabet or the "language".
quimqu > 06-11-2025, 09:57 AM
Jorge_Stolfi > 06-11-2025, 10:19 AM
(06-11-2025, 09:57 AM)quimqu Wrote: a few tokens like ol, shedy, and qokedy form the main loop of tightly connected words
Jorge_Stolfi > 06-11-2025, 10:36 AM
(06-11-2025, 09:57 AM)quimqu Wrote: the next word is not random. it depends on the previous two or three words. ... Conditional entropy drops sharply when more context is known
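For readers who want to try the "conditional entropy drops with more context" claim on their own word files, here is a minimal sketch. It is not the code used in this thread: the function name conditional_entropy and the plug-in (raw frequency) estimator are assumptions made for illustration. The input is a plain list of words, such as one of the one-word-per-line files split on newlines.

```python
import math
from collections import Counter

def conditional_entropy(words, k):
    """Plug-in estimate, in bits, of H(next word | previous k words).

    Hypothetical helper for illustration; k = 0 gives the plain
    word-frequency entropy.
    """
    if k < 0 or len(words) <= k:
        raise ValueError("need k >= 0 and more than k words")
    context_counts = Counter()  # how often each k-word context occurs
    joint_counts = Counter()    # how often each (context, next word) pair occurs
    for i in range(k, len(words)):
        ctx = tuple(words[i - k:i])
        context_counts[ctx] += 1
        joint_counts[(ctx, words[i])] += 1
    total = len(words) - k
    h = 0.0
    for (ctx, _word), n in joint_counts.items():
        # p(context, word) * log2 p(word | context)
        h -= (n / total) * math.log2(n / context_counts[ctx])
    return h

# Toy input: a strictly alternating sequence.
sample = "a b a b a b a b".split()
h0 = conditional_entropy(sample, 0)  # 1 bit: two equally frequent words
h1 = conditional_entropy(sample, 1)  # 0 bits: the previous word fixes the next
```

One caution when reading such numbers: with a few thousand words, most two- or three-word contexts occur only once, so this estimator falls toward zero as k grows for any text, shuffled or not. Comparing against a scrambled version of the same file at the same k is the safer way to judge whether a drop means anything.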
quimqu > 06-11-2025, 11:52 AM
(06-11-2025, 10:19 AM)Jorge_Stolfi Wrote: I am guessing that an edge directed from A to B counts the times that A appears before B in some parag. Is that correct?
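The reading of the graph guessed at in the quoted question can be sketched as follows. This is only an illustration of that guess, not a description of how the actual graph was built: the function name directed_edge_counts and the window parameter are hypothetical. With window=1 an edge A -> B counts B immediately following A; with window=None it counts B appearing anywhere after A in the same paragraph. Edges never cross a paragraph boundary.

```python
from collections import Counter

def directed_edge_counts(paragraphs, window=1):
    """Count directed pairs (A, B) where A precedes B within a paragraph.

    Hypothetical helper: `window` is the maximum distance between A and B
    (1 = adjacent words only, None = anywhere later in the paragraph).
    """
    edges = Counter()
    for parag in paragraphs:
        words = parag.split()
        for i, a in enumerate(words):
            if window is None:
                stop = len(words)
            else:
                stop = min(len(words), i + 1 + window)
            for b in words[i + 1:stop]:
                edges[(a, b)] += 1
    return edges

# Made-up toy paragraphs (not a real transcription), one string per paragraph.
toy = ["ol shedy qokedy ol", "shedy ol"]
adjacent = directed_edge_counts(toy, window=1)
anywhere = directed_edge_counts(toy, window=None)
```

The two settings give quite different graphs on the same text, which is why the definition of an edge matters when interpreting a "loop of tightly connected words".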
quimqu > 06-11-2025, 04:04 PM
(06-11-2025, 10:36 AM)Jorge_Stolfi Wrote: So far this is a property of almost any text written in any natural language. And even of encrypted text, if each original word type is mapped to one encrypted word type. Or to a small number of types.
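The point about encrypted text in the quote above can be checked directly for the one-to-one case. The sketch below is an illustration under stated assumptions: ngram_profile and substitute are hypothetical names, and the "cipher" is just an arbitrary codebook that gives each word type its own code word. Because the mapping is one-to-one, every n-gram count is carried over unchanged, so any statistic computed from those counts (including conditional entropy) is the same for the plain and the encoded text.

```python
from collections import Counter

def ngram_profile(words, n):
    """Sorted n-gram counts, ignoring which words make up each n-gram."""
    grams = Counter(tuple(words[i:i + n]) for i in range(len(words) - n + 1))
    return sorted(grams.values(), reverse=True)

def substitute(words):
    """Replace each word type by its own arbitrary code word (one-to-one)."""
    codebook = {}
    encoded = []
    for w in words:
        if w not in codebook:
            codebook[w] = f"w{len(codebook):04d}"
        encoded.append(codebook[w])
    return encoded

# Toy sentence, made up for the example.
plain = "the cat saw the dog and the dog saw the cat".split()
cipher = substitute(plain)
same = all(ngram_profile(plain, n) == ngram_profile(cipher, n) for n in (1, 2, 3))
```

The sketch does not cover the second case in the quote, where one word type maps to a small number of encrypted types; there the counts are split among the alternatives and the statistics are only approximately preserved.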
Jorge_Stolfi > 06-11-2025, 08:54 PM
  lines   words    bytes  file
-------  ------  -------  ------------
   9058   99049   531241  engl/chr/true.txt  English, Culpeper's Herbal, herbal section.
    845    7684    45404  voyn/hea/true.txt  Voynichese, Herbal-A, parags only.
    359    3417    20400  voyn/heb/true.txt  Voynichese, Herbal-B, parags only.
    661    6804    41140  voyn/bio/true.txt  Voynichese, Bio, parags only.
   1111   11555    72750  voyn/str/true.txt  Voynichese, Starred Parags, parags only.

  lines   words    bytes  file
-------  ------  -------  ------------
   9092   99049   532343  engl/chr/fake.txt
    842    7684    45592  voyn/hea/fake.txt
    346    3417    20394  voyn/heb/fake.txt
    646    6804    41173  voyn/bio/fake.txt
   1125   11555    72839  voyn/str/fake.txt

  lines   words    bytes  file
-------  ------  -------  ------------
  99049   99049   531241  engl/chr/true.wdp
   7684    7684    45404  voyn/hea/true.wdp
   3417    3417    20400  voyn/heb/true.wdp
   6804    6804    41140  voyn/bio/true.wdp
  11555   11555    72745  voyn/str/true.wdp

  lines   words    bytes  file
-------  ------  -------  ------------
  99049   99049   532343  engl/chr/fake.wdp
   7684    7684    45592  voyn/hea/fake.wdp
   3417    3417    20394  voyn/heb/fake.wdp
   6804    6804    41172  voyn/bio/fake.wdp
  11555   11555    72829  voyn/str/fake.wdp
ReneZ > 07-11-2025, 12:46 AM
(06-11-2025, 10:36 AM)Jorge_Stolfi Wrote: A millennium ago, Jacques Guy wrote such a generator, which he called "monkey". Alas I can find neither the program nor its output. Maybe some other old-timer kept a copy?