This seems to be the right place to continue the discussion on the auto-copy hypothesis.
(This post starts with a general introduction; further down are some new statistics that argue against the auto-copy hypothesis.)
My thoughts are completely in line with those of Emma, stated near the start of this thread, namely that there is a fundamental difference between:
- observing that similar words tend to appear near each other
- saying that the Voynich MS was created by arbitrarily writing words similar to previous words
The observation by itself does not provide evidence of the intent of the 'actor'. We cannot judge from the statistics alone which process generated them.
It is understood that the evidence presented by Timm (and Schinner) lies with the computer algorithm for generating Voynich look-alike text, but the details of this algorithm have not been shown here, so there remain two important questions:
- how complicated is this algorithm?
- does it include any anachronistic elements?
One way to look at this is to observe the appearance of the most frequent word, daiin.
Obviously, one word has to be the most frequent, so it makes no difference which particular word it is.
However, now there are two conflicting hypotheses:
1) This word is the most frequent because it is a representation of some very frequent element (not further specified) of an encoded source text
2) This word is the most frequent because it arises most naturally under the auto-copy hypothesis, i.e. by making small modifications to earlier words.
This is something that can be tested.
I have looked at a transliteration of each individual page of the MS and compared the first occurrence of 'daiin' with the preceding text. (I did this several months ago, but had not yet reported on it.) The complete list is included at the end of this post.
Each page is indicated by a file name starting with a two-character code, which some people may recognise and many will not, but that is not important here.
Then follows a number giving the position of the word on the page: if it says 20, the first occurrence of daiin is the 20th word on this page.
The last two columns show the Levenshtein distance to the most similar previous word on this page, and what this word was. Note that all distances were computed based on Eva.
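For readers who want to reproduce this kind of check, here is a minimal sketch in Python. It is not the code that produced the list; the function names are made up for illustration, and it assumes that a page is given as a simple list of Eva words in reading order and that ties between equally close words are resolved in favour of the earliest one.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance with unit costs for insertion, deletion and substitution."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def first_occurrence_report(words, target="daiin"):
    """Hypothetical helper: (position, distance, nearest word) for the first
    occurrence of target on a page, or None if target is absent or has no
    preceding word. position is 1-based."""
    if target not in words:
        return None
    idx = words.index(target)
    preceding = words[:idx]
    if not preceding:
        return None
    # Assumption: on ties, the earliest of the closest words is reported.
    nearest = min(preceding, key=lambda w: levenshtein(w, target))
    return idx + 1, levenshtein(nearest, target), nearest
```

For example, `first_occurrence_report(["kydainy", "chol", "daiin"])` gives `(3, 4, 'kydainy')`. Because the distance is computed on the raw Eva string, a transliteration code for a rare character counts as several characters.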
Let me first include a graph of the frequency by Levenshtein distance.
daiin is a 5-character word (in Eva), and in about 1/3 of the cases the Levenshtein distance is 3 or higher, meaning that more than half of the word differs from the one it was supposedly copied from.
I can only invite people to look through the detailed list.
When I do this, I can only conclude that in a large number of cases it is not plausible that daiin, the most frequent word in the MS, was copied from a very different word earlier on the page. Rather, it appears simply because it is a frequent word for some other reason.
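The frequency by distance can be tallied directly from the list in this post. The following is only a sketch: the helper names are hypothetical, and it assumes every list line has the form "file name, colon, position, word, distance, nearest word" separated by whitespace.

```python
from collections import Counter


def parse_line(line):
    """Split e.g. 'AA_lv.txt: 60 daiin 1 dain' into its five fields."""
    name, rest = line.split(":", 1)
    position, word, distance, nearest = rest.split()
    return name.strip(), int(position), word, int(distance), nearest


def distance_histogram(lines):
    """Count how many pages fall at each Levenshtein distance."""
    return Counter(parse_line(line)[3] for line in lines if line.strip())


def share_at_least(hist, threshold):
    """Fraction of pages whose distance is at least the threshold."""
    total = sum(hist.values())
    if total == 0:
        return 0.0
    return sum(n for d, n in hist.items() if d >= threshold) / total


# Four lines taken from the list, for illustration only.
sample = [
    "AA_lv.txt: 60 daiin 1 dain",
    "AB_lv.txt: 64 daiin 2 dair",
    "AC_lv.txt: 3 daiin 4 kydainy",
    "AK_lv.txt: 10 daiin 5 foar",
]
hist = distance_histogram(sample)
print(sorted(hist.items()))
print(share_at_least(hist, 3))
```

Feeding the complete list into `distance_histogram` instead of the four sample lines gives the counts per distance, and `share_at_least(hist, 3)` gives the fraction of pages at distance 3 or higher.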
AA_lv.txt: 60 daiin 1 dain
AB_lv.txt: 64 daiin 2 dair
AC_lv.txt: 3 daiin 4 kydainy
AD_lv.txt: 14 daiin 2 dair
AE_lv.txt: 14 daiin 2 daimg
AF_lv.txt: 56 daiin 2 koaiin
AG_lv.txt: 23 daiin 1 doiin
AH_lv.txt: 19 daiin 2 chaiin
AI_lv.txt: 19 daiin 1 oaiin
AJ_lv.txt: 8 daiin 3 chodaiin
AK_lv.txt: 10 daiin 5 foar
AL_lv.txt: 48 daiin 2 odaiiin
AM_lv.txt: 26 daiin 4 fchodaiin
AN_lv.txt: 7 daiin 4 dy
AO_lv.txt: 65 daiin 1 dain
AP_lv.txt: 33 daiin 1 saiin
BA_lv.txt: 35 daiin 1 odaiin
BB_lv.txt: 5 daiin 5 opy
BC_lv.txt: 15 daiin 4 dchey
BD_lv.txt: 5 daiin 5 paiindaiin
BE_lv.txt: 12 daiin 4 shfydaiin
BF_lv.txt: 12 daiin 3 dany
BG_lv.txt: 17 daiin 4 dy
BH_lv.txt: 26 daiin 1 oaiin
BI_lv.txt: 4 daiin 3 shoiin
BJ_lv.txt: 6 daiin 4 yfodain
BK_lv.txt: 7 daiin 5 tshor
BL_lv.txt: 12 daiin 1 raiin
BM_lv.txt: 6 daiin 3 sykaiin
BN_lv.txt: 39 daiin 1 dairin
CB_lv.txt: 120 daiin 1 odaiin
CC_lv.txt: 14 daiin 3 dar
CD_lv.txt: 46 daiin 1 oaiin
CE_lv.txt: 8 daiin 4 dy
CF_lv.txt: 20 daiin 1 daiiin
CG_lv.txt: 37 daiin 2 chaiin
CH_lv.txt: 31 daiin 1 aiin
CI_lv.txt: 16 daiin 1 saiin
CJ_lv.txt: 15 daiin 1 odaiin
CK_lv.txt: 8 daiin 4 dpchy
CL_lv.txt: 9 daiin 5 pysaiinor
CM_lv.txt: 8 daiin 3 opyaiin
CN_lv.txt: 23 daiin 2 okaiin
CO_lv.txt: 6 daiin 5 chor
DA_lv.txt: 3 daiin 5 soshy
DB_lv.txt: 4 daiin 4 poeeaiin
DC_lv.txt: 10 daiin 2 odaiir
DE_lv.txt: 23 daiin 1 dain
DG_lv.txt: 5 daiin 5 shod
DH_lv.txt: 30 daiin 3 choiin
DI_lv.txt: 39 daiin 1 aiin
DJ_lv.txt: 9 daiin 3 kooiin
DK_lv.txt: 78 daiin 1 doiin
DL_lv.txt: 15 daiin 3 sheaiin
DM_lv.txt: 8 daiin 1 aiin
DN_lv.txt: 17 daiin 4 podair
DO_lv.txt: 22 daiin 1 dain
DP_lv.txt: 6 daiin 3 qotaiin
EA_lv.txt: 9 daiin 5 shdor
EB_lv.txt: 2 daiin 4 tarar
EC_lv.txt: 30 daiin 3 dar
ED_lv.txt: 42 daiin 1 aiin
EE_lv.txt: 9 daiin 4 dy
EF_lv.txt: 12 daiin 2 roaiin
EG_lv.txt: 19 daiin 2 soiin
EH_lv.txt: 8 daiin 2 otaiin
EI_lv.txt: 7 daiin 2 shaiin
EJ_lv.txt: 10 daiin 4 dor
EK_lv.txt: 26 daiin 1 odaiin
EL_lv.txt: 21 daiin 2 aiind
EM_lv.txt: 125 daiin 1 saiin
EN_lv.txt: 26 daiin 1 aiin
EO_lv.txt: 74 daiin 1 kaiin
EP_lv.txt: 21 daiin 2 okaiin
FA_lv.txt: 77 daiin 2 dalain
FB_lv.txt: 17 daiin 1 saiin
FC_lv.txt: 57 daiin 2 ofaiin
FD_lv.txt: 18 daiin 1 taiin
FE_lv.txt: 22 daiin 1 aiin
FF_lv.txt: 73 daiin 2 araiin
FG_lv.txt: 18 daiin 3 oair
FH_lv.txt: 16 daiin 3 daly
FI_lv.txt: 23 daiin 2 ykaiin
FJ_lv.txt: 12 daiin 4 dod
FK_lv.txt: 10 daiin 2 osaiin
FL_lv.txt: 10 daiin 5 ytedy
FM_lv.txt: 5 daiin 3 sheaiin
FN_lv.txt: 20 daiin 1 daiiy
FO_lv.txt: 11 daiin 2 olaiin
GA_lv.txt: 16 daiin 4 dy
GB_lv.txt: 46 daiin 2 dan
GC_lv.txt: 59 daiin 1 daiiin
GD_lv.txt: 49 daiin 1 aiin
GE_lv.txt: 14 daiin 5 dcheodaiin
GF_lv.txt: 13 daiin 2 dykaiin
GG_lv.txt: 41 daiin 3 dar
GH_lv.txt: 47 daiin 1 aiin
GJ_lv.txt: 5 daiin 5 cpho
GK_lv.txt: 74 daiin 1 daiir
GL_lv.txt: 42 daiin 1 odaiin
GM_lv.txt: 33 daiin 1 aiin
GN_lv.txt: 32 daiin 2 qoaiin
GO_lv.txt: 32 daiin 4 shokaiin
GP_lv.txt: 10 daiin 4 qokoiin
HA_lv.txt: 37 daiin 1 daiir
HB_lv.txt: 103 daiin 1 daiis
HC_lv.txt: 77 daiin 1 saiin
HD_lv.txt: 53 daiin 1 saiin
HF_lv.txt: 9 daiin 4 dy
HG_lv.txt: 141 daiin 1 saiin
HH_lv.txt: 29 daiin 3 dal
IB_lv.txt: 81 daiin 1 aiin
IC_lv.txt: 44 daiin 1 doiin
IE_lv.txt: 10 daiin 2 ain
IF_lv.txt: 43 daiin 2 dair
IJ_lv.txt: 16 daiin 4 oteodaiin
IL_lv.txt: 22 daiin 2 daiidy
IM_lv.txt: 63 daiin 2 dchaiin
IN_lv.txt: 36 daiin 1 aiin
JA_lv.txt: 11 daiin 2 opaiin
JE_lv.txt: 28 daiin 2 aiir
JG_lv.txt: 73 daiin 1 aiin
JH_lv.txt: 26 daiin 1 raiin
KA_lv.txt: 61 daiin 1 aiin
KB_lv.txt: 7 daiin 4 parar
KE_lv.txt: 21 daiin 1 aiin
KH_lv.txt: 24 daiin 5 oteos
LA_lv.txt: 67 daiin 2 opaiin
MB_lv.txt: 17 daiin 1 dain
MC_lv.txt: 380 daiin 1 dain
MD_lv.txt: 34 daiin 3 chedaiin
ME_lv.txt: 84 daiin 1 saiin
MF_lv.txt: 105 daiin 2 taiiin
MG_lv.txt: 133 daiin 1 aiin
MH_lv.txt: 93 daiin 1 dain
MI_lv.txt: 267 daiin 1 saiin
MK_lv.txt: 400 daiin 1 paiin
ML_lv.txt: 13 daiin 4 olkain
MM_lv.txt: 128 daiin 1 dain
MN_lv.txt: 23 daiin 1 saiin
MO_lv.txt: 3 daiin 6 qokeol
MP_lv.txt: 208 daiin 1 raiin
MQ_lv.txt: 23 daiin 3 qokaiin
MR_lv.txt: 43 daiin 1 dain
MS_lv.txt: 93 daiin 2 doiir
MT_lv.txt: 61 daiin 1 saiin
NB_lv.txt: 35 daiin 1 saiin
NC_lv.txt: 26 daiin 2 opaiin
ND_lv.txt: 198 daiin 1 aiin
NN_lv.txt: 60 daiin 1 odaiin
NO_lv.txt: 48 daiin 2 otaiin
NP_lv.txt: 45 daiin 1 aiin
NQ_lv.txt: 64 daiin 1 raiin
OA_lv.txt: 28 daiin 1 saiin
OB_lv.txt: 2 daiin 10 @180;cheey
OC_lv.txt: 19 daiin 3 dal
OD_lv.txt: 20 daiin 1 aiin
OF_lv.txt: 28 daiin 2 qodaiin
OG_lv.txt: 23 daiin 2 dair
OI_lv.txt: 38 daiin 1 saiin
OJ_lv.txt: 10 daiin 5 sfal
OM_lv.txt: 41 daiin 1 saiin
OO_lv.txt: 63 daiin 1 saiin
OP_lv.txt: 20 daiin 2 qodaiin
QA_lv.txt: 104 daiin 3 dal
QB_lv.txt: 16 daiin 3 shodaiin
QC_lv.txt: 37 daiin 1 daiir
QD_lv.txt: 51 daiin 2 chdaiin
QF_lv.txt: 78 daiin 1 kaiin
QG_lv.txt: 9 daiin 2 qodaiin
QI_lv.txt: 29 daiin 1 aiin
QJ_lv.txt: 86 daiin 1 dain
QK_lv.txt: 30 daiin 2 toaiin
QL_lv.txt: 20 daiin 4 odar
SA_lv.txt: 16 daiin 2 ain
SB_lv.txt: 27 daiin 1 dain
SC_lv.txt: 27 daiin 2 daiindy
SD_lv.txt: 22 daiin 3 dalsy
SE_lv.txt: 67 daiin 1 saiin
SH_lv.txt: 87 daiin 1 aiin
SL_lv.txt: 26 daiin 1 dain
SM_lv.txt: 14 daiin 2 olaiin
SO_lv.txt: 110 daiin 1 saiin
SP_lv.txt: 48 daiin 1 doiin
TA_lv.txt: 95 daiin 1 dain
TB_lv.txt: 7 daiin 4 dy
TC_lv.txt: 128 daiin 1 aiin
TD_lv.txt: 116 daiin 1 ydaiin
TE_lv.txt: 45 daiin 1 aiin
TF_lv.txt: 42 daiin 2 ykaiin
TG_lv.txt: 141 daiin 1 aiin
TH_lv.txt: 12 daiin 1 aiin
TI_lv.txt: 186 daiin 1 aiin
TJ_lv.txt: 11 daiin 2 lkaiin
TK_lv.txt: 152 daiin 1 qaiin
TL_lv.txt: 12 daiin 1 aiin
TM_lv.txt: 37 daiin 1 saiin
TN_lv.txt: 32 daiin 1 dain
TP_lv.txt: 40 daiin 2 oraiin
TQ_lv.txt: 105 daiin 1 aiin
TR_lv.txt: 149 daiin 2 ain
TS_lv.txt: 36 daiin 1 raiin
TT_lv.txt: 10 daiin 3 dar
TU_lv.txt: 99 daiin 1 aiin
TW_lv.txt: 142 daiin 1 dain