The Voynich Ninja

Full Version: Regaining the lost order
You're currently viewing a stripped down version of our content. View the full version with proper formatting.
Pages: 1 2 3
The VMS text is enigmatic on many levels - on the character level, on the word level, on the line level, and even on the alphabet level as well. Dealing with the word and line levels, one notices several features that are simply too well-manifested to be neglected:

- You are not allowed to view links. Register or Login to view.
- You are not allowed to view links. Register or Login to view.
- You are not allowed to view links. Register or Login to view.

Shifting to the folio and paragraph levels, there have been some You are not allowed to view links. Register or Login to view. in the attempts to detect the structure of narration in the VMS text.

The latter point suggests that the plain text, beside being encrypted on the character or/and word level, also underwent some kind of shuffling on the word level - and this has been my idea since I finished that blog post of mine. This point is made stronger if the former three bullets are cumulatively taken into account. In fact, shuffling would explain them all right away.

Vord shuffling would also explain why the text is so difficult for us to decode: to approach vord meanings, one would strongly prefer correct vord order, and, on the other hand, it is difficult to approach correct vord order without any idea of the individual vords' meaning. Much like the chicken-and-egg problem, you see.

Now, what could have been this shuffling? Let's say we are pretty sure that it could not have been as simple (or as complex - for the medieval mind) as pure random shuffling of all vords in a folio. That's because that would have been a one-way process, and the output text would have been made virtually impossible for the reader to decode. Unless - and here is the first option that I can suggest right away - shuffling is run against very short vord chunks of constant length. Suppose you break the whole text into three- or four-word chunks, and then shuffle words randomly within those chunks. Then the plain text order would be fairly simple to reconstruct for anyone who has the idea of the grammar of the language of the plain text. Note that, in principle, the chunks do not have to be line-based. In other words, suppose e.g. that we have a three-line source text, the first three-word chunk can be comprised of the first words of the three lines, the second chunk then would be comprised of the second words of the three lines, and so forth. The shuffling is thus vertical instead of lateral. In principle, this is compatible with examples shown in the multipass thread where it appears as if first vords of lines in a paragraph were written in a single pass, and general sequence of writing was not left-to-right, top-to-bottom, but rather top-to-bottom, left-to-right - at least in part.

The second option that I can imagine is technically also shuffling from the perspective of the observed result, but from the perspective of the algorithm I would rather call it "loading". Suppose that one has the available portion of folio space (it may be as simple as a rectangle box or it may be more complex as the folio space limited by images and other portions of the text), and then he proceeds with loading that space with vords, placing vords into certain (possibly even voluntary) positions of that space. What would be the mechanisms that would allow the reader to reconsruct the original vord order? I can think of some, as follows.

a) A static pre-determined map of where exactly you should place each word of plain text. Possibly not the most probable option, since consistent regularity would probably have been discovered by researchers quite quickly.

b) A deterministic algorithm of dynamic word placing. Think of the knight's move for example. Possibly not very probable as well, since that would not explain the right-alignment feature mentioned above, unless there are some specific alignment adjustments in the algorithm.

c) The dedicated pointer. This is like option a), but the presence of dedicated pointer allows you to implement different order for each new case. The dedicated pointer is some sequence or pattern in the text which specifies where each word is located.

d) Positional markers. This is actually an extended variant of option b), only this time the algorithm is supported by positional markers in the text. The abundance of gallows which visually look like markers indeed, and also the baseline jumps of the leading o (and sometimes y) characters, vaguely hint at this possibility.

e) Each vord has a pointer in itself as to where the next vord is located. The leading or the traling character may serve as such pointer, or the gallows character may, as well. Note that the character serving as a pointer does not necessarily mean that it does not contribute to the semantics of the vord. The pointer function may just be an additional function of certain characters - or of characters occupying certain position in a vord.

f) The gallows coverage may also contribute to the vord order in some way.

So... if there is the lost order, then why not go and recover it? Tongue

What I'm pretty sure of, is that approaching the VMS text as the simple left-to-right, top-to-bottom word flow is a dead-end.
Thank you, Anton: interesting idea!
Of course, since reduplication and quasi-reduplication are You are not allowed to view links. Register or Login to view. than in randomly-scrambled versions of the text, we must assume that word structure plays a role in the hypothetical re-arrangement (possibly as one of several factors).
For instance, "short vord chunks of constant length" could be sorted alphabetically or with some similar system (though I am not sure that the result would really be decipherable).

Options (a) and (b) do not explain why similar words are pushed next to each other more than by random order, so I think they can be ruled out.
Pointer/marker systems could support re-arranging words so that the encoder can deliberately induce reduplication and quasi-reduplication: I think these scenarios can be considered.
Yes, sorting is in fact the simplest form of shuffling to perform, so that's an option to consider. Any adjacent sortings would be directly visible, so that's easily (?) tested.

EDIT: The easiest start with this would be to look at first vords of lines. The fact that paragraphs start with gallows may be just the effect of vertical sorting. Looking at f2r, we observe k-d-q in the first paragraph, then again k-d-q in the second paragraph, then in f2v, second paragraph, it's k-d-d. Does this sustain further?.. Have no time to check right now.
(28-03-2021, 12:05 PM)Anton Wrote: You are not allowed to view links. Register or Login to view.Suppose you break the whole text into three- or four-word chunks, and then shuffle words randomly within those chunks.
But even if you did this, you wouldn't alter the statistics of the individual words, and nobody has been able to break down the words.
So the words must be encrypted (abbreviated, encrypted, elongated, insert the transformation method of your choice here) before shuffling; and if they are encrypted, why bother adding the additional trouble of shuffling words?
(28-03-2021, 12:56 PM)davidjackson Wrote: You are not allowed to view links. Register or Login to view.But even if you did this, you wouldn't alter the statistics of the individual words, and nobody has been able to break down the words. So the words must be encrypted (abbreviated, encrypted, elongated, insert the transformation method of your choice here) before shuffling

That's true, this must be an independent layer of encryption.

(28-03-2021, 12:56 PM)davidjackson Wrote: You are not allowed to view links. Register or Login to view.and if they are encrypted, why bother adding the additional trouble of shuffling words?

I almost think, to divert us from our life activities and to make us spend pretty plenty of time to make something of all this mess!
Anton, what you are proposing is one of the forms of scinderatio fonorum (see my blog You are not allowed to view links. Register or Login to view. about Vigilius Maro). 
One of the "scinderations" he proposes is exactly this type of shuffling.
And indeed, it's hard, if not impossible, to recover the original meaning of the text if someone used this extensively.
Unless you already know the text to begin with.
Well I just screened through the first 10 folios, looking (this time) exclusively at paragraphs which start with k, and the first-character patterns of line-initial vords look remarkable.

This vertical shuffling/sorting idea should be explored further, I think...
(28-03-2021, 03:06 PM)VViews Wrote: You are not allowed to view links. Register or Login to view.Anton, what you are proposing is one of the forms of scinderatio fonorum (see my blog You are not allowed to view links. Register or Login to view. about Vigilius Maro). 
One of the "scinderations" he proposes is exactly this type of shuffling.
And indeed, it's hard, if not impossible, to recover the original meaning of the text if someone used this extensively.
Unless you already know the text to begin with.



It's also definitely discussed by Leon Battista Alberti in his You are not allowed to view links. Register or Login to view. work (note this translation chapter access used to be free and is unfortunately isn't any longer).  Granted this is well after the VM carbon dating (1467), and as discussed by Nick, You are not allowed to view links. Register or Login to view. probably was only circulated within the Vatican staff responsible for encoding -- it can be seen as at least some discussion of what had been in use for ciphers, although it is uncertain how far back or how widespread the techniques disclosed were used.

In short, scrambling the order of the letters within words and words within lines are explicitly discussed.

 
[attachment=5416]


Here is the part that is key for this discussion:

[attachment=5415]

Poetic to be sure, but not particularly good for practical application.
Okay, so here are some quantitative results for Q1 and Q2 (haven't time for other quires yet) for paragraphs starting with k.

In Q1, 10 folios out of 16 (hereinafter I count recto/verso as two folios) have paragraphs starting with k. In total, 12 paragraphs out of 35 in this quire start with k.

In Q2, only 1 folio out of 14 (12r/v are lost) has paragraphs starting with k, or, more precisely, it's only one paragraph - out of 25 total for this quire.

We are exploring sequences of line-initial characters.

For all those paragraphs, k always precedes any other character. There is only one occurrence where a line-initial k is not paragraph-initial, that is f2v, first paragraph, and here k is the second in sequence - that is, it is placed directly after the first k.

The general sequence order is as follows (I distinguish between s and sh, c and ch):

k
y
d
o
q
s
sh

(there is also one occurrence of ch - since it's only one, we can't assume any ordering for it)

If any of these is missing in the paragraph, the relative order of remaining characters is nonetheless preserved.  In some places there are duplicates in a row, e.g. d-d.

There are of course exceptions to this rule, but not many:

f2r, first paragraph, line 5 begins with o after the fourth line began with sh. After that, the order is restored.
f2r, second paragraph, line 4 begins with y after the third line began with q. After that, the order is restored.
f3v, line 5 begins with y after the fourth line began with q. After that, the order is restored.
f5v, line 6 begins with o after the fifth line began with sh. After that, no further lines are present.
f6v, line 4 begins with y after the third line began with q. After that, the order is restored.
f13v, line 4 begins with o after the third line began with q. After that, the order is restored.

Note that the types of exceptions are fewer in number than the exceptions themselves: it is either o after sh, y after q, or o after q.

What's remarkable, is that all these exceptions occur no earlier than in the fourth line of a paragraph. What's even more remarkable, there never is more than one exception in a paragraph - that is, kinda auto-repair (of the broken order) is in place.

Two quires are perhaps too little for generalizations. I'm looking forward to check further quires.
Continuing the investigations, Quires 3 and 4 bring 12 more paragraphs starting with k. The auto-restoration principle does not hold in all cases; there are places where one could write that off to line interruptions (by images), but one clean case where the supposed order is broken and not immediately restored is f32v, 1st paragraph.

The good thing is that the first three lines of the paragraph continue to hold order - with the single exception of You are not allowed to view links. Register or Login to view. where ch in line 3 is out of order. Were it sh, it would have been in order. Maybe just a scribal error? Who knows... one needs to screen more quires!

The ch has found its place in between d and o, so the proposed order is now as follows:

k
y
d
ch
o
q
s
sh
Pages: 1 2 3