Hmm, there are 2 issues: all the percentages are different, and the count for dy is different (6952). All the other counts are the same. Currently investigating; my best guess at the moment is newlines and the EVA apostrophe ( ' ).
In case you can use it for comparison, here is my cleaned up file:
[attachment=9135]
Thanks for the file bi3mw, that helped find the issue.
<-> A drawing intrusion in the text.
<%>kchsy.chydaiin.ol<->o,l,tchey.char.cfhar,am {ZL3a-n.txt} [line 71] <f1v.1,@P0>
kchsy chydaiin olo l tchey char cfhar am {ZLall2.txt} [line 32]
kchsy chydaiin ol o l tchey char cfhar am {ZL3_W.txt} [line 32]
There are 8 of these strings "d<->y" in ZL3a-n.txt and when the drawing intrusion is removed it results in a new 'dy' string.
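To illustrate the effect, here is a minimal Python sketch. It is not the script used in this thread: the function names and the sample line are made up, and it assumes that "." and "," are word separators and that bigrams are only counted inside words. It shows how deleting "<->" outright creates an extra 'dy', while turning it into a space does not.

```python
import re

def clean(line, intrusion_as_space):
    # Replace the drawing-intrusion marker "<->" either with a space
    # (keeps the two halves apart) or with nothing (joins them).
    line = line.replace("<->", " " if intrusion_as_space else "")
    # Assumption: "." and "," both act as word separators.
    return re.sub(r"[.,]", " ", line)

def count_bigram(text, bigram):
    # Overlapping count inside each word; no bigrams across spaces.
    return sum(
        1
        for word in text.split()
        for i in range(len(word) - 1)
        if word[i:i + 2] == bigram
    )

sample = "qokeed<->y.chedy"  # made-up line, not taken from the transliteration
joined = clean(sample, intrusion_as_space=False)
split = clean(sample, intrusion_as_space=True)
print(joined, count_bigram(joined, "dy"))  # qokeedy chedy 2
print(split, count_bigram(split, "dy"))    # qokeed y chedy 1
```

With 8 such "d<->y" strings, the two ways of cleaning would differ by 8 in the 'dy' count under these assumptions.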
It is possible that you may not be using the latest version of ivtt, or that my Code::Blocks MinGW compiler has not been updated to the C17 revision.
(28-08-2024, 10:02 PM)RobGea Wrote: It is possible that you may not be using the latest version of ivtt ....
I use version 1.1, is that the latest?
The current version is v2.0.
A link to the source code is on this page: ivtt: (Intermediate) Voynich Transliteration Tool
Updated, now dy fits
No. Bigram Frequency (in %) Frequency
1 CH 7.1390 11081
2 HE 5.3267 8268
3 DY 4.4789 6952
4 AI 4.3745 6790
5 OK 3.9809 6179
6 IN 3.8939 6044
7 OL 3.6298 5634
8 EE 3.4204 5309
9 QO 3.4126 5297
10 ED 3.2960 5116
11 II 3.0699 4765
12 SH 2.9204 4533
edit: but now the other bigrams no longer all fit

Nice, exact match with Cryptool.
Digram Analysis of <ZL3_W.txt>. File size 239128 bytes.
Descending sorted on frequency.
No. Substring Frequency (in %) Frequency
1 ch 7.1390 11081
2 he 5.3267 8268
3 dy 4.4789 6952
4 ai 4.3745 6790
5 ok 3.9809 6179
6 in 3.8939 6044
7 ol 3.6298 5634
8 ee 3.4204 5309
9 qo 3.4126 5297
10 ed 3.2960 5116
11 ii 3.0699 4765
12 sh 2.9204 4533
Very nice, "onlinetoolz" also fits. The only thing that doesn't fit anymore is the output in my text editor (rather mysterious). For example, it shows 4581 for II.
Oh yeah, same here. Aha, I think it's the triples like 'daiiirihy': Notepad++ highlights them, but it only highlights 2 of the 3 i's,
whereas the bigrams would be ['da', 'ai', 'ii', 'ii', 'ir', 'ri', 'ih', 'hy'], which contains 'ii' twice.
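A quick Python sketch of the difference, assuming the editor does a non-overlapping search (which is how Python's str.count behaves too). The helper name is just for illustration.

```python
def overlapping(text, sub):
    # Slide a window one character at a time, so "iii" yields "ii" twice.
    return sum(
        1
        for i in range(len(text) - len(sub) + 1)
        if text[i:i + len(sub)] == sub
    )

word = "daiiirihy"
bigrams = [word[i:i + 2] for i in range(len(word) - 1)]
print(bigrams)                  # ['da', 'ai', 'ii', 'ii', 'ir', 'ri', 'ih', 'hy']
print(overlapping(word, "ii"))  # 2, sliding-window count
print(word.count("ii"))         # 1, non-overlapping search
```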
EDIT:
Oh no, I think I see another issue, grrr. I don't think EVA(-') is being processed properly. Oh well, it'll have to wait till tomorrow now.
You are right, if you subtract the 184 "iii(i)" from the 4765 "ii" then you get exactly 4581. So everything is fine ( hooray )
So we have fulfilled Rene's requirements (same results from cryptool and python script, see post #45).
No. Bigram Frequency (in %) Frequency
1 CH 7.1390 11081
2 HE 5.3267 8268
3 DY 4.4789 6952
4 AI 4.3745 6790
5 OK 3.9809 6179
6 IN 3.8939 6044
7 OL 3.6298 5634
8 EE 3.4204 5309 = 4889 + 420 ( 413 x EEE + 7 x EEEE )
9 QO 3.4126 5297
10 ED 3.2960 5116
11 II 3.0699 4765 = 4581 + 184 ( 183 x III + 1 x IIII )
12 SH 2.9204 4533
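The reconciliation for EE and II can be checked with a few lines of Python. This is only a sketch under two assumptions: the text editor counts non-overlapping matches, and there are no runs longer than four identical letters. A run of n identical letters then gives n - 1 overlapping bigrams but only n // 2 non-overlapping ones, so each triple and each quadruple adds exactly one extra hit. The function names are hypothetical.

```python
def extra_hits(run_length):
    # Overlapping bigrams in a run of identical letters, minus the
    # non-overlapping matches an editor search would find.
    overlapping = run_length - 1
    non_overlapping = run_length // 2
    return overlapping - non_overlapping

def reconcile(editor_count, triples, quads):
    # Rebuild the sliding-window count from the editor's count.
    return editor_count + triples * extra_hits(3) + quads * extra_hits(4)

print(reconcile(4581, 183, 1))  # II -> 4765
print(reconcile(4889, 413, 7))  # EE -> 5309
```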