nablator > 28-10-2023, 02:57 PM
(28-10-2023, 02:07 PM)oshfdk Wrote: So, as I understand it, you are saying there is reason to believe it was impossible for Voynichese to split the words using the stem, because it never splits words in a way that produces pieces not seen on their own (say, splitting a||iin or e8||y). And also if you try to reassemble these words, they either don't exist in the MS or exist as separate words. Is this correct?
bi3mw > 28-10-2023, 03:26 PM
awk '{print $NF}' Voynich_full_TT.txt | grep 'dy$' | wc -l
oshfdk > 29-10-2023, 06:53 AM
(28-10-2023, 03:26 PM)bi3mw Wrote: For what it's worth, by far the most frequent ending of the last word of a line is dy (814 times). It would be interesting to know whether it is proportionally just as frequent in the text around the plants.
> cat zl.txt | head -n 3
#=IVTFF Eva- 1.7
# ZL transliteration file, updated from EVMT project
# Version 1r of 11/04/2020
> cat zl.txt | egrep "^<f[^>]+\." | egrep "dy$" | wc -l
717
> cat zl.txt | egrep "^<f[^>]+\." | wc -l
5389
> echo "scale=2; 100 * 717 / 5389" | bc
13.30
# we use -o to count repeated occurrences in a single line
> cat zl.txt | egrep "^<f[^>]+\." | egrep -o "\.<->" | wc -l
751
> cat zl.txt | egrep "^<f[^>]+\." | egrep -o "dy\.<->" | wc -l
141
> echo "scale=2; 100 * 141 / 751" | bc
18.77
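The last three commands can be folded into one helper, so the same check can be repeated for other endings. This is only a minimal sketch: it assumes the same conventions as the commands above (locus lines start with `<f...>`, words are separated by `.`, and `<->` marks a break inside a line), and the function name `pct_before_break` is made up for illustration. It uses awk instead of bc, so the percentage is rounded rather than truncated.

```shell
# pct_before_break FILE ENDING
# Share of ".<->" breaks whose preceding word ends in ENDING
# (ENDING is an extended-regex fragment, e.g. dy).
pct_before_break() {
    file=$1
    ending=$2
    # Keep only locus lines, then count every ".<->" (several per line possible).
    total=$(grep -E '^<f[^>]+\.' "$file" | grep -E -o '\.<->' | wc -l)
    # Same count, restricted to breaks directly preceded by ENDING.
    hits=$(grep -E '^<f[^>]+\.' "$file" | grep -E -o "${ending}\.<->" | wc -l)
    awk -v h="$hits" -v t="$total" 'BEGIN {
        if (t == 0) { print "no breaks found"; exit 1 }
        printf "%d / %d = %.2f%%\n", h, t, 100 * h / t
    }'
}
```

Usage would be `pct_before_break zl.txt dy`; swapping `dy` for another ending (say `iin`) gives a comparison point for the same position.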
MarcoP > 29-10-2023, 08:13 AM
# no end-of-paragraph
cat ZL_ivtff_1r.txt | egrep "^<f[^>]+\." | egrep "dy$" | wc -l
717
# with end-of-paragraph
cat ZL_ivtff_1r.txt | egrep "^<f[^>]+\." | egrep "dy[^a-z]*$" | wc -l
856
cat ZL_ivtff_1r.txt | egrep "^<f[^>]+\." | wc -l
5389
echo "scale=2; 100 * 856 / 5389" | bc
15.88
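The gap between the two counts (856 - 717 = 139 lines) is made up of lines where something other than a lowercase letter follows the final dy. One way to see what that trailing material actually is would be to tally it. A sketch only, assuming the same file and the same locus-line filter as above; `trailing_after_dy` is a made-up helper name, and it uses `+` instead of `*` so that only lines with something after dy are listed.

```shell
# trailing_after_dy FILE
# For locus lines whose last word ends in dy followed by non-letter material,
# print how often each trailing string occurs, most frequent first.
trailing_after_dy() {
    grep -E '^<f[^>]+\.' "$1" |
        grep -E -o 'dy[^a-z]+$' |   # final dy plus whatever follows it
        sed 's/^dy//' |             # keep only the trailing part
        awk '{ n[$0]++ } END { for (s in n) print n[s], s }' |
        sort -rn
}
```

Running `trailing_after_dy ZL_ivtff_1r.txt` should show whether the extra lines are all end-of-paragraph markers or whether other markup is being picked up by `[^a-z]*` as well.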
oshfdk > 29-10-2023, 09:42 AM
(29-10-2023, 08:13 AM)MarcoP Wrote: End-of-paragraph "dy" was not counted in the previous post (while end-of-paragraph lines are counted in the 5389 total).
Antonio García Jiménez > 30-10-2023, 04:52 PM
Antonio García Jiménez > 04-11-2023, 01:33 PM
Antonio García Jiménez > 06-11-2023, 08:10 PM
Antonio García Jiménez > 12-11-2023, 07:30 PM
Antonio García Jiménez > 18-11-2023, 11:52 AM