Quote:Sam G Wrote:
Well, you're just completely wrong on this point: patterns like these are actually extremely common in natural languages
Your pattern for japanese is not like the network of similar words for the VMS. I didn't know for a grammatical network connecting all words used in a language and I didn't know of a natural language where similar words occur with similar frequencies.
An English text with similar features would consist of words similar to the words "the", "and" and "to". Additionally, words similar to "the" like "khe", "phe", "fhe", "tha", "tho", "thy", "thee" and "theee" would occur with frequencies similar to each other.
The point is not that some words are similar to each other. The point is that all words occurring at least four times are connected to each other. There is no problem to add some more dimensions to the grid:
aiin (469) daiin (863) okaiin (212) qokaiin (262) kaiin (65) taiin (42) otaiin (154) qotaiin (79)
ain ( 89) dain (211) okain (144) qokain (279) kain (48)[font=Courier New] tain (16) otain ( 96) qotain (64) [/font]
air ( 74) dair (106) okair ( 22) qokair ( 17) kair (14) [font=Courier New][font=Courier New]tair (13) [/font]
[font=Courier New]ot[/font]
[font=Courier New]air ( 21[/font]
[font=Courier New]) [/font]
[font=Courier New]qot[/font]
[font=Courier New]a[/font]
[font=Courier New]ir[/font]
[font=Courier New] ( 6[/font]
[font=Courier New])[/font]
[/font]
ar (350) dar (318) okar (129) qokar (152) kar (52) [font=Courier New][font=Courier New][font=Courier New]tar (43) [/font][/font]
[font=Courier New][font=Courier New]ot[/font][/font]
[font=Courier New][font=Courier New]ar (141[/font][/font]
[font=Courier New][font=Courier New]) [/font][/font]
[font=Courier New][font=Courier New]qot[/font][/font]
[font=Courier New][font=Courier New]a[/font][/font]
[font=Courier New][font=Courier New]r[/font][/font]
[font=Courier New][font=Courier New] (63[/font][/font]
[font=Courier New][font=Courier New])[/font][/font]
[font=Courier New] [/font][/font]
[font=Courier New][font=Courier New][font=Courier New]al (260) dal (253) okal (138) qokal (191) kal (23) [font=Courier New][font=Courier New][font=Courier New]tal (20) [/font][/font][/font]
[font=Courier New][font=Courier New][font=Courier New]ot[/font][/font][/font]
[font=Courier New][font=Courier New][font=Courier New]al (143[/font][/font][/font]
[font=Courier New][font=Courier New][font=Courier New]) [/font][/font][/font]
[font=Courier New][font=Courier New][font=Courier New]qot[/font][/font][/font]
[font=Courier New][font=Courier New][font=Courier New]al[/font][/font][/font]
[font=Courier New][font=Courier New][font=Courier New] (59[/font][/font][/font]
[font=Courier New][font=Courier New][font=Courier New])[/font][/font][/font][/font][/font][/font]
[font=Courier New][font=Courier New][font=Courier New][font=Courier New][font=Courier New][font=Courier New][font=Courier New][font=Courier New]am ( 88) dam ( 98) okam ( 26) qokam ( 25) kam ( 9) [/font]
[font=Courier New][font=Courier New][font=Courier New][font=Courier New]tam ( -) [/font][/font][/font][/font]
[font=Courier New][font=Courier New][font=Courier New][font=Courier New]ot[/font][/font][/font][/font]
[font=Courier New][font=Courier New][font=Courier New][font=Courier New]am ( 47[/font][/font][/font][/font]
[font=Courier New][font=Courier New][font=Courier New][font=Courier New]) [/font][/font][/font][/font]
[font=Courier New][font=Courier New][font=Courier New][font=Courier New]qot[/font][/font][/font][/font]
[font=Courier New][font=Courier New][font=Courier New][font=Courier New]am[/font][/font][/font][/font]
[font=Courier New][font=Courier New][font=Courier New][font=Courier New] (12[/font][/font][/font][/font]
[font=Courier New][font=Courier New][font=Courier New][font=Courier New])[/font][/font][/font][/font][/font][/font][/font][/font][/font][/font][/font]
or (363) dor ( 73) okor ( 34) qokor ( 36) kor (26)[font=Courier New] [font=Courier New][font=Courier New][font=Courier New]tor (23) [/font][/font][/font]
[font=Courier New][font=Courier New][font=Courier New]oto[/font][/font][/font]
[font=Courier New][font=Courier New][font=Courier New]r ( 46[/font][/font][/font]
[font=Courier New][font=Courier New][font=Courier New]) [/font][/font][/font]
[font=Courier New][font=Courier New][font=Courier New]qoto[/font][/font][/font]
[font=Courier New][font=Courier New][font=Courier New]r[/font][/font][/font]
[font=Courier New][font=Courier New][font=Courier New] (29[/font][/font][/font]
[font=Courier New][font=Courier New][font=Courier New])[/font][/font][/font]
[font=Courier New][font=Courier New] [/font][/font][/font]
ol (537) dol (117) okol ( 82) qokol (104) kol (37)[font=Courier New][font=Courier New] [/font]
[font=Courier New][font=Courier New][font=Courier New][font=Courier New]tol (48) [/font][/font][/font][/font]
[font=Courier New][font=Courier New][font=Courier New][font=Courier New]otol[/font][/font][/font][/font]
[font=Courier New][font=Courier New][font=Courier New][font=Courier New] ( 86[/font][/font][/font][/font]
[font=Courier New][font=Courier New][font=Courier New][font=Courier New]) [/font][/font][/font][/font]
[font=Courier New][font=Courier New][font=Courier New][font=Courier New]qotol[/font][/font][/font][/font]
[font=Courier New][font=Courier New][font=Courier New][font=Courier New] (47[/font][/font][/font][/font]
[font=Courier New][font=Courier New][font=Courier New][font=Courier New])[/font][/font][/font][/font][/font]
This grid alone already contains 7694 words. This is 20 % of the number of words in the VMS.
Quote:Sam G Wrote:
Surely the relations between words are interesting and relevant but I don't think they show that the text is meaningless or that it was generated via a simple procedure. Quite the opposite, in fact, as argued above.
The observed similarities alone are not enough to decide what the VMS is and if it is meaningless or not.
Quote:Sam G Wrote:
Yeah, but all these other patterns need to be accounted for as well!
Correct, a theory for the VMS should be able to explain all this patterns.
Quote:Sam G Wrote:
...that's obviously much rarer than in the main body of the text.
Why did you assume an uniform distribution for the words? No matter which words you will check they are not uniformly distributed over the VMS or over a page or within a line!