I know that the spaces in the VMS have been studied for quite some time and that some people believe they are not actually spaces. I was interested in determining which trigrams or tokens span the boundaries, and which of these occur frequently enough to warrant inclusion in the analysis.
I’m starting to find "y qo" = "yqo" a very, very amusing theory. The "y" in VMS—the "9"—is considered an abbreviation typical of Latin texts; it often appears at the end of a word, or at the beginning, but rarely in the middle.
I think there’s a rule that says: always separate the y with a space.
Why would the cipherer do that?
The result is that the y appears extremely frequently at the end or beginning of a word. Anyone, especially in the Middle Ages, who sees this will immediately think: That’s Latin.
What’s even cooler is that he perfectly conceals important bigrams and trigrams this way.
The same effect applies to “qo,” which, according to the rule, must always appear at the beginning of a word... Everyone immediately thinks of “quod,” “quorum,” and the other abbreviations that exist here, so again: This must be Latin!
And that’s exactly how it went for most people nowadays who are just starting to look into this.
If the cipherer thought: I’ll use a trigram as a letter/unit—for example, yqo—then I’ll always separate it into y and qo with a space.
This kills two birds with one stone:
1. It conceals the actual language (here clearly Bavarian
, okay... or another
)
2. he obscures the crucial bi/trigram or word.
Let’s take a look at the possible combinations:
yqo
yqok
dyqo
dyqok
edyqo
edyqok
eyqo
yqoke
That’s interesting....
And in my opinion, it solves the problem i had with the Cores ....
The same could apply, for example, to aiin:
aiino
aiinch
aiinsh
aiinqo
aiind
aiiny
AIN
aino
ainch
ainsh
ainqo
We know that aiin frequently appears at the end of words. This could indicate that this is also a possible misinterpretation.
So instead of being an ending, the aiin family could be a left-hand part of larger units that are split by a space.
What else there is:
yo / ych / yd / ysh
dyqo / dyo / dych
edyqo / edyo / edych
ro / rch / ra / rsh
lo / lch / lsh / lqo
But if that’s true, that the spaces ar no spaces a lot of statistical findings would collapse like houses of cards or become highly distorted.
Here is the table: How can these abrupt transitions be explained? A normal language likely doesn’t have this, because then the plaintext language would have to have a very long list of “phonetically impossible” bigrams—from this perspective, the simpler explanation is: a consistent cipher system that uses spaces as camouflage, rather than random phonotactic restrictions in many bigrams.
(Overlaps included; counts are not additive.)
The numbers are so strikingly clear that you really have to stop and think about what’s going on here. I’ve already mentioned the simplest explanation for this. And I have at least one theory about it...