davidd > Yesterday, 03:55 AM
vordgroup 56: cheody 56 446
members: ['cheody', 'opar', 'shek', 'sheody', 'shody', 'she', 'opchedy', 'psheody', 'opchdy', 'cheky', 'tchy', 'chekaiin', 'ytedy', 'olkchedy', 'ytchey', 'ytody', 'cholky', 'chcphy', 'lkeey', 'ycheedy', 'shor', 'olky', 'sshey', 'shckhey', 'keol', 'teeody', 'shaiin', 'lkeeedy', 'ycheeo', 'cheoty', 'shekeey', 'chotal']
num members: 32
vord count: 446
groupname: cheody
lesser likely following : chedy 5.16% instead of 16.02%
more likely following : daiin 6.95% instead of 3.37%
lesser likely followed by : chedy 8.07% instead of 16.02%
more likely followed by : qokain 21.08% instead of 15.88%
coming from group <groupname> followed by <groupname> which has a relative size of <x>
---------------------------------------------------------
< other> -> 31.39% <cheody> 37.44% -> < other>
< chedy> -> 5.16% <cheody> 8.07% -> < chedy> rel size: 16.02%
< ol> -> 6.50% <cheody> 6.50% -> < ol> rel size: 12.64%
< aiin> -> 11.66% <cheody> 3.14% -> < aiin> rel size: 10.69%
< daiin> -> 6.95% <cheody> 3.59% -> < daiin> rel size: 3.37%
< qokain> -> 14.13% <cheody> 21.08% -> < qokain> rel size: 15.88%
< dar> -> 2.02% <cheody> 2.69% -> < dar> rel size: 2.14%
< okaiin> -> 0.90% <cheody> 0.90% -> < okaiin> rel size: 0.72%
< okain> -> 0.90% <cheody> 0.45% -> < okain> rel size: 0.78%
< okeey> -> 0.67% <cheody> 0.22% -> < okeey> rel size: 0.52%
< otar> -> 0.67% <cheody> 0.22% -> < otar> rel size: 0.58%
< otaiin> -> 1.57% <cheody> 3.36% -> < otaiin> rel size: 1.28%
< o> -> 6.73% <cheody> 3.36% -> < o> rel size: 4.40%
< oty> -> 2.02% <cheody> 1.35% -> < oty> rel size: 1.59%
< shol> -> 1.35% <cheody> 1.35% -> < shol> rel size: 0.81%
< am> -> 2.02% <cheody> 2.91% -> < am> rel size: 1.92%
< cheody> -> 2.24% <cheody> 2.24% -> < cheody> rel size: 1.90%
< chedaiin> -> 0.00% <cheody> 0.00% -> < chedaiin> rel size: 0.36%
< yteedy> -> 2.47% <cheody> 0.45% -> < yteedy> rel size: 0.53%
=========================================================
davidd > Yesterday, 04:00 AM
Eiríkur > Yesterday, 05:33 AM
MarcoP > Yesterday, 07:30 AM
davidd Wrote:vordgroup 1: chedy 1 3760
more likely following : qokain 22.71% instead of 15.88%
more likely followed by : qokain 35.69% instead of 15.88%
vordgroup 6: qokain 6 3726
more likely following : chedy 36.02% instead of 16.02%
more likely followed by : chedy 22.92% instead of 16.02%
vordgroup 8: dar 8 503
more likely following : qokain 23.86% instead of 15.88%
more likely followed by : ol 19.28% instead of 12.64%
davidd Wrote:vordgroup 120: yteedy 120 125
more likely following : am 4.00% instead of 1.92%
RobGea > 11 hours ago
(Yesterday, 03:55 AM)davidd Wrote: You are not allowed to view links. Register or Login to view.there may be some bugs in the searching algorithm, It is not very stable, the groups that come out are different every time.idk if this is useful to you but some causes could be.
Probably some memory in python that gives a different order every time.
davidd > 9 hours ago
(11 hours ago)RobGea Wrote: You are not allowed to view links. Register or Login to view.The goal is to discover the same structure even if the order the relationships are parsed in is different.(Yesterday, 03:55 AM)davidd Wrote: You are not allowed to view links. Register or Login to view.there may be some bugs in the searching algorithm, It is not very stable, the groups that come out are different every time.idk if this is useful to you but some causes could be.
Probably some memory in python that gives a different order every time.
Dictionaries : Python version <3.7 Dictionaries are unordered.
Sets : Always unordered.
Memory: (dont know what this is called but ive come across it). Fix : Close all instances of Python between code runs.
Scope(?): Lists, ( Dictionaries(?), watch out if dictionary value is a list as well ) , Mutable, Pass-by-reference ( out of my hobbyist league but its caused me some trouble on occasion )
Ruby Novacna > 8 hours ago
(Yesterday, 03:55 AM)davidd Wrote: You are not allowed to view links. Register or Login to view.... These results look statistically significant to my amateur eyes but ...
davidd > 7 hours ago
(Yesterday, 07:30 AM)MarcoP Wrote: You are not allowed to view links. Register or Login to view.Hi,
I find the text file somehow hard to read, so I haven’t analyzed it in detail.
I tried something similar a few years ago. I wrote two posts based on Part Of Speech POS-tagging software. I only considered Quire20 and (in the update post) Quire 13. I only experimented with a small number of word-classes (from 5 to 8, IIRC).
The process gives decent results for the English King James Genesis: but that’s a very repetitive text with particularly low MATTR. Voynichese is much tougher, so it’s not clear that we have enough coherent text to process.
You are not allowed to view links. Register or Login to view. (2019)
Here I found that groups of words for Voynichese include types that are more similar to each other than for English: if two words are similar, they behave similarly (while in English this is not necessarily the case: better/butter which/witch bare/bear etc).
I also found that Voynichese grammar tends to have loops, where words belonging to the same category follow each other (a trivial case is the consecutive repetition of identical tokens). I am not sure about how such loops are handled in your experiment. Have you noticed that there are word-classes that tend to appear with consecutive tokens? This is of course something that doesn't happen frequently in English.
You are not allowed to view links. Register or Login to view. (2021)
If one considers the newline separator as a symbol, the inferred grammar is based on LAAFU: there’s a class for line-initial words and a class for line-final words (e.g. ending -m).
I then applied some simple transformations (partly based on You are not allowed to view links. Register or Login to view.) to remove line effects (e.g. word initial s-, which typically is line-initial, was removed, assuming that saiin is a line-initial variant of aiin).
At this point, I removed the newline symbol from the input data and analyzed whole paragraphs instead.
The positive result I found is that Q20 and Q13 produce grammars that are somehow comparable. I created graphs where word-classes that often follow each-other are connected by arrows (npnp is the paragraph-separator).
It seems that there is some overlap with your results e.g.
- the bidirectional arrow between shedy/chedy and qokain/qokaiin
- the Q13 sequence qokain->dar->ol
davidd > 6 hours ago
(8 hours ago)Ruby Novacna Wrote: You are not allowed to view links. Register or Login to view.(Yesterday, 03:55 AM)davidd Wrote: You are not allowed to view links. Register or Login to view.... These results look statistically significant to my amateur eyes but ...
Hello Davidd!
I didn't manage to understand your idea, could you please present a step-by-step example with a single word of the B language, without using Python?
Ruby Novacna > 4 hours ago