![]() |
The predictability of glyphs in labels agrees with that of vords - Printable Version +- The Voynich Ninja (https://www.voynich.ninja) +-- Forum: Voynich Tasks (https://www.voynich.ninja/forum-28.html) +--- Forum: Voynich tasks (https://www.voynich.ninja/forum-32.html) +---- Forum: Positions we can agree upon (https://www.voynich.ninja/forum-53.html) +----- Forum: Approved blocks (https://www.voynich.ninja/forum-54.html) +----- Thread: The predictability of glyphs in labels agrees with that of vords (/thread-833.html) Pages:
1
2
|
The predictability of glyphs in labels agrees with that of vords - david - 04-10-2016 Statement
Some vords appear as "labels", single or double vords apparently identifying images within the manuscript. These labels have the same grammar as those vords in the main body of the corpus. The text of the manuscript is divided up into clearly defined word-like glyph groups (dubbed vords on this forum). These glyph groups have a non-trivial internal structure which is manifest in the severe restrictions imposed upon the positioning of glyphs within the word groups. In other words, Voynichese has a very strict phototactic structure – morphemes appear in predefined places within vords, and only there. A morpheme is the smallest grammatical unit in a language. Morphemes in the corpus are easily identifiable. Voynichese glyph combinations are very positional aware within vords – glyph groups are non-trivial in their internal positioning. We can identify, and have identified, a long list of suffixes and prefixes within Voynichese. We know that certain glyphs only appear as suffixes; we know that certain glyphs only appear as prefixes; and we know that other glyphs are free form. We have also identified (via the CLS theorem) that glyphs appear in a certain pattern. We assume these are bound morphemes because they obey certain rules of positioning. (We can make no assumptions about words that do not include such bound morphemes as we are unable to identify a meaning for such unbound morphemes, but such vords are relatively few in nature). And analysis of the labels (see links below) show that the corpus of labels has a notable level of concordance with the morpheme placement of vords in the main corpus. Further reading Quote:Summary: Marco found that almost 70% of all labels matched words in the main corpus. The rest were unique.
Quote:My research shows visually that the labels, as defined,Stolfi notes [You are not allowed to view links. Register or Login to view.] when attempting to create a "grammar" for Voynichese that (italics mine): Quote:It should be noted that that normal words [in his attempt to create a grammar] account for over 88% of all label tokens, and over 96.5% of all the tokens (word instances) in the text. The exceptions (less than 4 every 100 text words) can be ascribed to several causes, including physical "noise" and transcription errors. (Different people transcribing the same page often disagree on their reading, with roughly that same frequency.). Indeed, most "abnormal" words are still quite similar to normal words, as discussed in a You are not allowed to view links. Register or Login to view..
RE: Label grammar is similar to that of the main text - Anton - 04-10-2016 Would not "orthography" or "morphology" be a better term than "grammar"? RE: Label grammar is similar to that of the main text - sidanno - 04-10-2016 POSCIT PETITAE [font=helvetica, arial, sans-serif]pag 151 POSCIT PETITAE [/font] poscit = I beg, I demand, I request, I desire. = implor, solicit, solicit, eu doresc [font=helvetica, arial, sans-serif]= Latin-Verb-third-person singular present active indicative of poscō poscō or pōscō (present infinitive poscere or pōscere, perfect active poposcī or popōscī); third conjugation, no passive petitae = sought = căutat = Latin-Participle-nominative feminine plural of petītus petītus m (feminine petīta, neuter petītum); first/second declension[/font] [font=helvetica, arial, sans-serif]You are not allowed to view links. Register or Login to view.[/font] RE: Label grammar is similar to that of the main text - -JKP- - 04-10-2016 Another poll I can't vote on. I find the word "grammar" a little too specific. As far as I can tell so far, the labels are different in some ways and the same in others. It's like a Venn diagram with about 60 to 70% of overlap if you assess several factors together (glyph combinations, distribution, length, level of repetition, and relationship to other sections in the manuscript). RE: Label grammar is similar to that of the main text - ThomasCoon - 05-10-2016 [Edit - delete] RE: Label grammar is similar to that of the main text - Sam G - 05-10-2016 (04-10-2016, 09:19 PM)david Wrote: You are not allowed to view links. Register or Login to view.A morpheme is the smallest grammatical unit in a language. I don't think it's so obvious what's a morpheme and what isn't. For instance, in English, "faster" can be broken into "fast" and "er", "singer" can be broken into "sing" and "er", but "lumber" can't be broken into "lumb" and "er" - it's just one morpheme, "lumber". That words in the VMS can be divided into subunits that recur in many different words does not necessarily mean that these subunits constitute affixes or morphemes in a grammatical sense (although I suspect that they do in many cases). Also, if there's no problem with terms like morpheme, grammar, prefix, suffix, phonotactic structure, etc. - then is it really necessary to speak of "vords" instead of simply words? RE: Label grammar is similar to that of the main text - Koen G - 05-10-2016 This is a hard one, David. It seems to me like both studies indicate that labels are not very different from the main text, but still a bit different. Basically we get 30% unique vocabulary and more than three times the amount of grammar that doesn't match. RE: Label grammar is similar to that of the main text - ThomasCoon - 05-10-2016 Sam G Wrote:I don't think it's so obvious what's a morpheme and what isn't. For instance, in English, "faster" can be broken into "fast" and "er", "singer" can be broken into "sing" and "er", but "lumber" can't be broken into "lumb" and "er" - it's just one morpheme, "lumber". That words in the VMS can be divided into subunits that recur in many different words does not necessarily mean that these subunits constitute affixes or morphemes in a grammatical sense (although I suspect that they do in many cases). I agree with Sam G - this was exactly my thought also. As David pointed out, "morpheme" means "unit that bears meaning" - but since we don't know that these bigrams / ngrams bear grammatical meaning (there are alternatives as Sam pointed out), we could easily be wrong saying that they are morphemes. RE: Label grammar is similar to that of the main text - Anton - 05-10-2016 I agree with Sam, that's a good point. A term is required here which does not have linguistic flavour. RE: Label grammar is similar to that of the main text - david - 06-10-2016 Quote:For instance, in English, "faster" can be broken into "fast" and "er", "singer" can be broken into "sing" and "er", but "lumber" can't be broken into "lumb" and "er" - it's just one morpheme, "lumber". This is true if you have a knowledge of English. However, if you don't have an understanding of the meaning of the words but are simply looking for patterns to clarify the rules of the language (imagine an alien trying to dechiper English, or JKP trying to understand Voynichese ![]() But to take another example, a bigram such as "ez" isn't a morpheme for our alien because it's usually an n-gram pattern such as in sneeze, breeze, jeeze, tweeze, freeze, etc (unless he postulates that the 4gram "eeze" is a morpheme suffix!). Quote:I agree with Sam, that's a good point. I couldn't think of one, any suggestions? ngram and its derivatives could be an alternative, but fail to convey the meaning of letters with a potential sense meaning. Quote: Basically we get 30% unique vocabulary and more than three times the amount of grammar that doesn't match.I'm going to copy and paste the comments of Prof Stolfi here: Quote:Abnormal words |