@ThomasCoon,
Thanks for the response.
(04-09-2016, 03:23 AM)ThomasCoon Wrote: You are not allowed to view links. Register or Login to view.4. As you can see, <l> is preceded by <o> so often that it is not random chance. It seemed to me that <ol> must be a combination, and there are 25 similar combinations that repeatedly appear.
(04-09-2016, 03:23 AM)ThomasCoon Wrote: You are not allowed to view links. Register or Login to view.That is a good question. I think statistics would be the answer. If l is not supposed to function with <o>or <a>, why does it almost always appear with <o> or <a>?
I agree with the particular assessment about the
ol and
al combinations. I came to a similar conclusion in my own theory. However, I think it's a stretch to apply this principle to
every glyph in the text.
If statistics is the answer, then I have an answer. For what it's worth, I once made an algorithm to find the most significant glyph combinations in the entire corpus. I don't remember how I defined "significant" here, but it was something more interesting than just identifying the most common ones. I do know that it considered the principle of "we can also see
where each character appears in relation to each other - and this is usable data". This was not part of an approach that tried to delineate
all text into combinations like yours, so not everything can be covered. Briefly, the resulting combinations are in the list below, sorted by cluster.
- qo
- cTh, cKh, cPh, te, tch, kch, kee, pch
- ee
- al, ar, ol, or, dy, ey, eey
- ai, aii
- am
This list is rather different from yours.
Just for the record, I held this finding to
describe the text properties but not
explain them.
(04-09-2016, 03:23 AM)ThomasCoon Wrote: You are not allowed to view links. Register or Login to view.the majority seem completely unrestrained to certain word positions (except for the ones like <qo> you mentioned - but for that see point 7).
That's not true. The strong relationship between glyphs and word position has been noted and analysed many times.
For example, your unit 18
aiin appears almost always at the end of words. Why? Why do we not frequently see other ordered combinations of unit 18 like
aiinol,
aiinkk or
chaiinaiin? If all word spaces are fake as you suggest, there is still the question for the system: Why do certain units like
aiin[font=Arial], but not others, [/font](almost) always precede the fake word space?
(04-09-2016, 03:23 AM)ThomasCoon Wrote: You are not allowed to view links. Register or Login to view.the <m> character apears in the exact same places that we see <y>: most importantly after <d>. Anyone who has spent time with the text knows that <dy> is everywhere, but we also have <dm>. I believe they might be the same character.
That's not true.
m appears mostly after
a.
There are lots of characters/combinations that appear in the same places that other characters/combinations appear. That applies to the Voynich Manuscript and natural languages. In English, consider vowels, or the word endings "-ing" and "-ed". Why would that mean that they're the
same thing?
In my list shortly above, the combinations in each cluster replace each other in the same context quite often (that's why they're in the same cluster). However, from that I would not conclude that they're
the same. It's just not convincing enough for such a definite conclusion.
(04-09-2016, 03:23 AM)ThomasCoon Wrote: You are not allowed to view links. Register or Login to view.Regarding #9 and #10, those are just ways to incorporate the gallows, which only appear inconsistently in the text (always in the first line of a paragraph) which means they are likely a variant of another character. No natural language puts certain sounds only at the beginning of a string of speech, so they must be another letter in the text just written differently.
The gallows don't
always appear in the first position in the line. I would link to Job's query engine to demonstrate this, but I can't find it (you can tell this is a common problem for me...)
There has already been a paper* analysing the hypothesis that the gallows characters might be capital letter equivalents of other characters. If the gallows characters are indeed capitals of other characters, which ones are they? Which ones can they replace without affecting the text statistics? The result was that the gallows characters just replace
each other. So either they are capitals/alternate forms for each other, or that type of relationship is simply not applicable here. The former fits your idea that these are really the same "unit", but note that just about every glyph exhibits the same behaviour (some examples are the combinations in my list above), and it would be reductive to say that those must all be the same unit too.
*Morningstar, J. B. (2001).
Gallows Variants as Null Characters in the Voynich Manuscript. Chapel Hill, NC, USA: University of North Carolina.
(04-09-2016, 03:23 AM)ThomasCoon Wrote: You are not allowed to view links. Register or Login to view.In the text, <m> looks like <y> with an extra loop at the top - I actually never realized this before trying to figure out what <m>'s function was.
(04-09-2016, 07:45 PM)ThomasCoon Wrote: You are not allowed to view links. Register or Login to view.But m appears in many of the same places as y and also looks similar to it in the text - and this is something I only realized after trying to figure out what combinations y could join in.
I hate to self-advertise, but my (admittedly controversial) You are not allowed to view links.
Register or
Login to view. offers an alternative explanation for this visual construction. Section 1.2 is the bit relevant to your comment here.
(04-09-2016, 07:53 AM)ReneZ Wrote: You are not allowed to view links. Register or Login to view.That last point I think is important in order to advance, namely to clearly separate the analytical part (how to separate the text into basic units) and the speculative part: what it means (cipher or language?, which language?).
I agree with Rene's statement. In your response, you mix these two things up a lot. For example in the below quote:
(04-09-2016, 03:23 AM)ThomasCoon Wrote: You are not allowed to view links. Register or Login to view.I admit that the possibility of reversing some combinations will bring up the number of possible units, but regardless whether <ar> or <ra>, for example, both should still only correlate to one plaintext Latin letter. So it's not as if I'm devising a really loose system which will spell anything I want to read into the text - I've seen the pitfalls of that in other "decryptions"! 
I was not talking about decryption there. By "fitting" I was simply referring to the first step of the analysis, which is proving that the manuscript's text can be decomposed into your list of units with a high success rate. If the word formation system is so loose and has such a large list of units (>60, not just 26) then it is easily possible for them to fit by chance.
I found You are not allowed to view links.
Register or
Login to view. that I was referring to. I think the concern about tightness/coverage near the end applies especially to this point.
(04-09-2016, 01:48 AM)ThomasCoon Wrote: You are not allowed to view links. Register or Login to view.Well, I actually think that the spaces in text are completely fake. I think that he wrote large words (ex. "vocabulary") into two words: ("vocab-ulary").
(04-09-2016, 05:17 PM)ThomasCoon Wrote: You are not allowed to view links. Register or Login to view.I believe that many ch (ee) and other 2-letter combinations are hidden: for example, one "c" is at the end of a word while the next "c" is at the beginning of the following word.
(04-09-2016, 07:44 PM)-JKP- Wrote: You are not allowed to view links. Register or Login to view.Thomas, just so you know, when I was commenting on unique word-tokens, I wasn't talking specifically about your line of thought but in general.
Taking the approach of deconstructing the text and looking across spaces to find the patterns naturally leads to going in the other direction as well... of putting in spaces where they might be intentionally missing.
(04-09-2016, 08:44 AM)Koen Gh. Wrote: You are not allowed to view links. Register or Login to view.There is more to the structure of Voynichese words, and I'm not sure if it can be explained just by word breaks.
I also agree with this sentiment. If you can ignore arbitrary word breaks whenever it is convenient to fit your system, this further increases the looseness of the system, and hence makes it more likely that they fit by chance.
(04-09-2016, 04:54 PM)Anton Wrote: You are not allowed to view links. Register or Login to view.Brian, nice to see you on the forum again.
No problem, Anton.
