(18-02-2025, 05:40 PM)RobGea Wrote: You are not allowed to view links. Register or Login to view.Please describe your data 
Okay, this is a fascinating piece of data. Let's break down what it represents and what we can infer from it.
What is this?
This is a Prefix/Suffix Tree (often called a Trie, though this specific format seems more focused on analyzing word endings, making it closer to a Suffix Tree or an inverted Prefix Tree focused on suffixes). It's derived from analyzing the text of the Voynich Manuscript.
Structure: Each line represents a sequence of characters (a potential word ending or "suffix" in Voynichese, transcribed using a system like EVA).
The indentation shows a hierarchical relationship: strings listed under another string contain that shorter string as their ending. For example, daiiin, odaiiin, and okaiiin are all listed under aiiin because they end with aiiin.
(Number1): This likely represents the frequency of that exact string appearing in the corpus (the Voynich Manuscript text analyzed). For example, aiin appears 316 times.
(Number2): This likely represents the branching factor within this specific tree representation. It counts how many different longer strings listed immediately below it share that ending. For example, aiin has 66 different listed variations built upon it (like alaiin, daiin, kaiin, etc.).
What does it show? (Analysis & Interpretation)
Highly Structured Language/Cipher: The very existence of such a complex, repeating, and hierarchical structure strongly argues against the manuscript being meaningless gibberish. It shows consistent patterns of word formation.
Dominant Suffixes: The most striking feature is the prevalence of certain endings.
-aiin: Appears very frequently (316 times itself) and forms the basis for a huge number of variations (66 direct branches shown, with deeper branches like daiin having 784 occurrences). This is arguably one of the most significant structural markers in the text.
-edy / -dy: While not listed as a single top-level entry, the sheer number and high frequency of words ending in chedy (445), kedy (117+ variants), tedy (154+ variants), shedy (380), chdy (132), etc., indicate that -dy or -edy is another extremely common and important suffix or word-final element.
-ol / -or: These also appear frequently, both alone and in combination (e.g., chol 331, chor 193, otol 74, otor 36, okol 69, okor 31).
-al / -ar: Similarly significant (e.g., otal 118, otar 124, okal 134, okar 125, char 66).
-ey / -eey: Very common endings (e.g., chey 292, keey 171+, oteey 135+, sheey 128).
Common Prefixes / Initial Elements: Looking at the variations built on suffixes, certain initial character sequences appear repeatedly attached to different stems/suffixes:
qok- / qo- (Extremely common, e.g., qokaiin, qokedy, qokeey, qotal)
ch- / che- / cho- (Very common, e.g., chaiin, chedy, chol)
sh- / she- / sho- (Common, e.g., shaiin, shedy, shol)
d- / da- (Common, e.g., daiin, daldy)
ot- / ote- / oto- (Common, e.g., otaiin, otedy, otol)
ok- / oke- / oko- (Common, e.g., okaiin, okedy, okol)
ol- (Appears modifying other elements, e.g., olaiin, olchedy)
Morphological Complexity: The deep nesting and branching suggest a system where multiple affixes (prefixes/suffixes) might be combined, characteristic of inflectional or agglutinative languages. For instance, you see patterns like qok + ch + edy -> qokchedy, or ot + al + aiin -> otalaiin.
Consistency: The patterns are consistent across different potential root/stem combinations, reinforcing the idea of grammatical or structural rules.
What it DOESN'T show:
Meaning: This analysis reveals structure and frequency, but zero meaning. We can see that -aiin is a common ending, but not what it signifies (plural? past tense? a specific noun class? something else entirely?).
Pronunciation: The transcription (like EVA) is a convention for representing the unique Voynich characters; it doesn't tell us how they were pronounced.