04-10-2016, 09:19 PM
Statement
Some vords appear as "labels", single or double vords apparently identifying images within the manuscript. These labels have the same grammar as those vords in the main body of the corpus.
The text of the manuscript is divided up into clearly defined word-like glyph groups (dubbed vords on this forum). These glyph groups have a non-trivial internal structure which is manifest in the severe restrictions imposed upon the positioning of glyphs within the word groups.
In other words, Voynichese has a very strict phototactic structure – morphemes appear in predefined places within vords, and only there.
A morpheme is the smallest grammatical unit in a language.
Morphemes in the corpus are easily identifiable. Voynichese glyph combinations are very positional aware within vords – glyph groups are non-trivial in their internal positioning. We can identify, and have identified, a long list of suffixes and prefixes within Voynichese. We know that certain glyphs only appear as suffixes; we know that certain glyphs only appear as prefixes; and we know that other glyphs are free form. We have also identified (via the CLS theorem) that glyphs appear in a certain pattern.
We assume these are bound morphemes because they obey certain rules of positioning. (We can make no assumptions about words that do not include such bound morphemes as we are unable to identify a meaning for such unbound morphemes, but such vords are relatively few in nature).
And analysis of the labels (see links below) show that the corpus of labels has a notable level of concordance with the morpheme placement of vords in the main corpus.
Further reading
- The predictability of glyph placement within label vords is in concordance with that of vords in the main corpus
Some vords appear as "labels", single or double vords apparently identifying images within the manuscript. These labels have the same grammar as those vords in the main body of the corpus.
The text of the manuscript is divided up into clearly defined word-like glyph groups (dubbed vords on this forum). These glyph groups have a non-trivial internal structure which is manifest in the severe restrictions imposed upon the positioning of glyphs within the word groups.
In other words, Voynichese has a very strict phototactic structure – morphemes appear in predefined places within vords, and only there.
A morpheme is the smallest grammatical unit in a language.
Morphemes in the corpus are easily identifiable. Voynichese glyph combinations are very positional aware within vords – glyph groups are non-trivial in their internal positioning. We can identify, and have identified, a long list of suffixes and prefixes within Voynichese. We know that certain glyphs only appear as suffixes; we know that certain glyphs only appear as prefixes; and we know that other glyphs are free form. We have also identified (via the CLS theorem) that glyphs appear in a certain pattern.
We assume these are bound morphemes because they obey certain rules of positioning. (We can make no assumptions about words that do not include such bound morphemes as we are unable to identify a meaning for such unbound morphemes, but such vords are relatively few in nature).
And analysis of the labels (see links below) show that the corpus of labels has a notable level of concordance with the morpheme placement of vords in the main corpus.
Further reading
Quote:Summary: Marco found that almost 70% of all labels matched words in the main corpus. The rest were unique.
- VMS language DNA variations. Davidsch
Quote:My research shows visually that the labels, as defined,Stolfi notes [You are not allowed to view links. Register or Login to view.] when attempting to create a "grammar" for Voynichese that (italics mine):
follow the same rules for the letters in the remainder of the text that are not labels, with some exceptions:
You can check by You are not allowed to view links. Register or Login to view. "CAB NST" & "CAB labels only".
- 'a' occurs proportionally more in the "label text"
- the 'q' (only posA) occurs much lesser in the "label text"
- the 'h' occurs much lesser in the "label text"
- the 't' on posB is higher in the "label text"
Quote:It should be noted that that normal words [in his attempt to create a grammar] account for over 88% of all label tokens, and over 96.5% of all the tokens (word instances) in the text. The exceptions (less than 4 every 100 text words) can be ascribed to several causes, including physical "noise" and transcription errors. (Different people transcribing the same page often disagree on their reading, with roughly that same frequency.). Indeed, most "abnormal" words are still quite similar to normal words, as discussed in a You are not allowed to view links. Register or Login to view..
[..]
The words that do not fit into our paradigm [..] These words comprise 1295 tokens (3.7%) in the main text, and 127 tokens (12.4%) in the labels. The vast majority are rare words that occur only once in the whole manuscript.
- TheYou are not allowed to view links. Register or Login to view. by Brian Cham and David Jackson describes how Voynich glyphs can be divided into three categories that interact with one another in a pre-defined manner.
- Statement changed from "The morpheme construction of labels is in concordance with that of the main corpus"
- Added Davidsch to further reading