Why and how the text could be Bavarian

Why and how the text could be Bavarian - Printable Version

+- The Voynich Ninja (https://www.voynich.ninja)
+-- Forum: Voynich Research (https://www.voynich.ninja/forum-27.html)
+--- Forum: Theories & Solutions (https://www.voynich.ninja/forum-58.html)
+--- Thread: Why and how the text could be Bavarian (/thread-5312.html)

Pages: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23

RE: Why and how the text could be Bavarian - JoJo_Jost - 19-02-2026

After further examining the cipher and developing one myself, I have a different theory. I believe that the developer of the cipher was unable to develop a truly effective cipher. He was likely a simple monk or physician who wanted to keep his herbal recipes and possibly some ideas secret. He invented a secret code without realising that its rules were often contradictory.

How did I come to this conclusion? Because it happened to me too. Big Grin

That could explain a lot.

1. Low entropy: An overly rigid formula, probably very repetitive, possibly even a shorthand with too little variation, too little ‘clean text’. I have now examined several Bavarian texts, and the differences are dramatic, as are their effects on certain statistical characteristics. I will present my findings here shortly.

2. Courier A/B: He did not consistently adhere to his own rules, or rather, he realised over time that they did not work so well and kept adjusting them without changing the old, already encrypted texts. (This happens to me all the time – the longer I work on it, the more I realise that this doesn't work, that doesn't work, and I'm constantly changing everything possible.)

If, in addition, several people worked on the encryption, it could be that
a. The dialects they spoke differed slightly, and so they naturally encrypted the text differently.
b. Precisely because the cipher was inconsistent, misunderstandings and differences arose between the writers.
c. This could even explain why the same word sequence was written slightly differently each time, simply because no one remembered how they had done it last time.

3. Word repetitions: He simply encrypted the same word again without realising it. (This has happened to me too.)

4. Inconsistency with the “e” (chol/cheol): Sometimes he forgot a step or added an extra one, resulting in three “e's – my current cipher allows for several possibilities for the use and non-use of the 'e”, so this can easily happen.

5. Others copied from the template that he or several others had previously encrypted and made further mistakes because they did not understand what they were writing.

Strictly speaking, a bad cipher writer explains the anomalies better than a good one. And that is another possible reason why no one has yet succeeded in cracking the code – not because it is too ingenius, but because he was simply too bad at it.

The German language is really complex, and although this chipher can cover some of the German ‘anomalies’, it's still enough to drive you mad.

To pique your curiosity: I can prove to you, at least as things stand at present, that German/Bavarian fits well with the VMS if it is based on a real language. More on this shortly.

RE: Why and how the text could be Bavarian - JoJo_Jost - 22-02-2026

Note 1. The following analyses were computed using Claude. Once you look at the word lists, you'll understand why there was really no other way.
I checked the Python code myself, as far as I understand it as a non-programmer and had other AI systems rigorously cross-check the values. I've taken random samples too, but I still can't offer any guarantees — AI is AI, after all…

Note 2. ED "DG97EEB" kindly shared his truly comprehensive and detailed study on Voynich grammar with me, which has partly confirmed and significantly advanced my own research. I'm deeply grateful to him for that.

Content:

I was curious whether the ideas behind my cipher approach, which I've presented here before, might actually fit the typical features of the VMS.
So I ran some quantitative comparisons between VMS text and real Middle High German (MHG) medical manuscripts.

The texts I compared:
Eva = a cleaned-up version
Ortloff von Baierland (first 100 pages without the table of contents, transcribed via Transkribus — there are some errors, but it should have been good enough for this sort of comparison work)
The Breslau Pharmacopoeia
Admonter Bartholomäus
A collection of cooking recipes

All MHDeutsch with Bavarian influence, 15th century and earlier. (The texts were tokenised with proper Unicode normalisation for medieval scribal characters.)

The numbers:
What I tested: I tested the hypothesis that the VMS contains medieval Bavarian German, encrypted using a system where function words are systematically absorbed into content words as prefixes.

If this assumption were correct, you'd expect it to show up in the distribution. Those 2–3 character function words no longer appear as separate tokens — they're embedded in the content words, which makes the average VMS word longer. The ~30 percentage point gap in the ≤3 category fits rather nicely with the combined frequency of articles + prepositions + conjunctions in the MHG texts.

Filename: Analyse2.png Size: 84.29 KB 22-02-2026, 11:35 AM

Note: For the prefix “o,” only words longer than 2 letters were counted. I am convinced that o / ol / or / etc. represent separate, distinct words.

Articles (o Präfix): There's a clear convergence with the VMS here. The values naturally depend on what's being described and on the author's personal style as well. That's why we see noticeable differences between the German texts themselves. And of course, we can only guess what exactly was lumped together under "articles."

Prepositions (qo Präfix): We see more pronounced outliers (marked in red), but also a clear match with the cooking recipes. Which proves once again that it very much depends on what's being described.

The daiin/"und" rate doesn't fit. Daiin is one of the most frequent words in the VMS, and "und" is an extremely common word in German. This remains the weakest point of the model and requires further explanation, possibly depending on text type or encoding function.

Verb prefixes (y Präfix): The y-prefix in the VMS sits at 4.1%, while the MHG verb prefix rate (ge-, ver-, be-, er-, zer-, ent-) comes in at 3.6–4.4% across all four texts. That's a remarkably tight fit

Average word length is also interesting. It's higher in the VMS than in the original texts. In this context, that supports the theory that the shorter words were absorbed into the main words via the qo prefix.

What I find particularly interesting about the word length distribution is this: it's measurable independently of any decryption hypothesis. Whatever language is hiding in the VMS — it needs to explain why words of 2–3 characters are so dramatically underrepresented. In natural German (and most European languages), they account for roughly half of all words. In the VMS, it's only 19%. That rules out quite a few hypotheses — or at the very least forces them to offer a mechanism that makes short words disappear.

Now, what does all of this actually tell us about the likelihood that the VMS could be Bavarian German? On its own, none of these metrics prove anything — but taken together, they paint a rather consistent picture. Article rates, preposition rates, word length distribution, average word length — they all land in ranges that are compatible with medieval Bavarian German once you account for an absorption cipher. None of these metrics contradict the hypothesis. While each metric alone proves nothing, their convergence across multiple independent texts is notable.The one that fits least well (the daiin/"und" rate) is also the one most sensitive to text type.

What does this actually tell us about the probability that VMS is Bavarian German? Taken individually, these figures prove nothing – but together they paint a fairly consistent picture. Article frequency, preposition frequency, word length distribution, average word length – all of these fall within ranges that are compatible with medieval Bavarian German, taking absorption encryption into account. None of them provide a result that clearly contradicts the hypothesis.

I know that this is not proof, of course. But if the VMS had nothing to do with German, one would expect at least some of these metrics to deviate significantly. However, they do not. They are consistently within the correct range across four different (in some cases very long) comparison texts.

Let's put it this way: for a 600-year-old manuscript that no one can read, that's not a bad starting point, at least. Wink

....................
Artikel / Articles (69): ain, aine, ainem, ainen, ainer, aines, ayn, ayne, aynem, aynen, ayner, aynes, d, daer, das, dat, daz, de, deer, dem, deme, den, dene, der, dere, des, dez, di, die, diese, diesem, diesen, dieser, dieses, dirre, dise, disem, disen, diser, dises, disiu, ditz, diu, dr, dy, dye, dz, eem, een, eer, ees, ein, eine, einem, einen, einer, eines, eyn, eyne, eynem, eynen, eyner, eynes, eyns, te, tem, ten, ter, tes

Konjunktionen / Conjunctions: "und" (daiin): un, und, unde, unt, vn, vnd, vnde, vndo, vnnd, vnt

Präpositionen / Prepositions(65) ab, abe, an, ane, auf, auff, aus, auss, auz, bei, bi, bis, biz, cze, czu, dar, durch, fur, fúr, für, gegen, in, inne, mit, mite, mitt, myt, nach, neben, nebent, ob, over, sunder, uber, uf, uff, umb, umbe, under, unter, unz, uon, uor, us, uz, vff, vnder, voer, von, vor, vore, vss, vur, vür, wider, yn, ze, zem, zen, zer, zu, zue, zuo, zwischen, über

Verbvorsilben / Verb prefixes (6) ge-, ver-, be-, er-, zer-, ent- (length > prefix+1)

RE: Why and how the text could be Bavarian - JoJo_Jost - 22-02-2026

If we take this approach as a starting point, then all other medieval language candidates — Latin, Spanish, French, early Italian, Middle English, Middle Czech, Middle Hungarian, Middle Bulgarian, Middle Romanian, Old Catalan, Old Occitan, Old Frisian, and Hebrew — fail on at least one of the following criteria: high article-like frequency (19%), frequent prepositional elements (14%), productive verb prefixes (4%), a common coordinating conjunction (3%), an average word length of about five characters, and a strong underrepresentation of 2–3-letter tokens. Only Dutch and other Germanic languages come close — but Bavarian, with its characteristic word shortening, provides the tightest fit.

But of course we don't know how the cipher, if it is one, really works....

RE: Why and how the text could be Bavarian - nablator - 24-02-2026

I just found a great online source for diplomatic transcriptions (*_dipl.pdf) of German early modern manuscripts (and printed books?)

There are useful metadata, for example:
language-area: nordbairisch
language-region: ostoberdeutsch
language-type: oberdeutsch

A You are not allowed to view links. Register or Login to view. selecting You are not allowed to view links. Register or Login to view. (language-type: oberdeutsch) including Nordbairisch, Mittelbairisch, Südbairisch.

Wunderbar! Smile

RE: Why and how the text could be Bavarian - JoJo_Jost - 24-02-2026

And nablator, what does the bresslauer say to u?

RE: Why and how the text could be Bavarian - nablator - 24-02-2026

(24-02-2026, 01:46 PM)JoJo_Jost Wrote: You are not allowed to view links. Register or Login to view.And nablator, what does the bresslauer say to u?

Archive.org has OCRs, usually not perfect so I might use another source: You are not allowed to view links. Register or Login to view.

Also I found a PDF of Das Arzneibuch Ortolfs von Baierland: You are not allowed to view links. Register or Login to view.
Is it OK?

I need to clean them up. I'll post the mutual information graphs in the other thread.

RE: Why and how the text could be Bavarian - JoJo_Jost - 24-02-2026

@ nablator mhdwb: There's no way to download it completely, is there?

You are not allowed to view links. Register or Login to view. This is not an original transkription of the whole text, it seems to be another text. Huh

Look here:

You are not allowed to view links. Register or Login to view. thats is my source...

RE: Why and how the text could be Bavarian - Rafal - 24-02-2026

Quote:After further examining the cipher and developing one myself, I have a different theory

Do you consider now Voynichese a cipher?
You know, there is a big difference between a language written phonetically with custom letters and a cipher.

A cipher can completely change the nature of the text while a custom script shouldn't.

RE: Why and how the text could be Bavarian - JoJo_Jost - 24-02-2026

(24-02-2026, 03:31 PM)Rafal Wrote: You are not allowed to view links. Register or Login to view.
Quote:After further examining the cipher and developing one myself, I have a different theory
Do you consider now Voynichese a cipher?
You know, there is a big difference between a language written phonetically with custom letters and a cipher.
A cipher can completely change the nature of the text while a custom script shouldn't.

Yes, my thesis is that VMS is an absorption cipher of a Bairish text.

The analysis of Bairish texts is important because the simple ciphers used around 1430 do not alter the text as much as more complex ciphers would do. In this respect, the underlying text should already exhibit characteristics that make an absorption cipher possible.

RE: Why and how the text could be Bavarian - nablator - 24-02-2026

(24-02-2026, 03:14 PM)JoJo_Jost Wrote: You are not allowed to view links. Register or Login to view.@ nablator mhdwb: There's no way to download it completely, is there?

Just click Ea-837-23.pdf under the viewer. It seems complete (215 pages).

Quote:You are not allowed to view links. Register or Login to view. This is not an original transkription of the whole text, it seems to be another text.

It's older, from 14th century manuscripts, with Latin chapter titles.

See p. 80 of the PDF:

Quote:... darna wo man dy nature eynes iclichen menschen sal erkennen
vnde wo man eynen gesunden lernen sal, daz her icht seych
werde, vnde ef he seych werde, wo man eme helfen sal (daz er
gesunt werde).

2. de quattuor elementen.
God hat vyr elementen gemacht: vur, lucht, waszer, erden;
darvt wert alle mynscheyt vnde alle frucht, dy op ertryke yst...

matches your p. 15:

Quote:... dar
nach wie man eyns yegklichen menschen natur sol
erkennen. vnd wie man den gesunten lere sal, das er nit
siech werd. vnnd wie man im helffen sol ob er siech
wurde.

Von den vier elementen
Got hat vier element gemachet. Das ist feur
luft wasser und erd. Dauon wirt alle men-
scheyt vnnd alle frùcht, so auff ertrych sind...