04-05-2022, 04:19 PM
(04-05-2022, 01:32 PM)Koen G Wrote: You are not allowed to view links. Register or Login to view.(04-05-2022, 12:22 PM)Bernd Wrote: You are not allowed to view links. Register or Login to view.Koen, do any of the latin texts you analyzed contain scribal abbreviations e.g. symbols for -us, -um, -bus, et?One thing I can try is take a normalized Latin text and introduce abbreviation symbols by replacing certain letter groups with numerals.
1 = con, com, cun, cum
2 = tur, ur
3 = us, os
4 = ris, tis, cis
Doing this will remove some information from the text, because when we now see "4", we must guess from context whether it represents ris, cis or tis. Therefore, we could hypothesize that some entropy stat will be reduced. However, they are all increased.
h0: 4.64 -> 4.86
h1: 4.01 -> 4.15
h2: 3.31 -> 3.38
It was to be expected that h1 would increase, since we introduce several new, frequent symbols.
H2 increases as well, probably in part because the non-abbreviated parts of the Latin text still behave like normal. Moreover, abbreviation condenses the text, which is also likely to increase h2.
Hi, Koen:
Thanks for doing this additional investigation. Your results definitely confirm the conclusions provided by Lindemann and Bowern You are not allowed to view links. Register or Login to view. that Marco discussed in the parallel entropy thread.
I think at this point there is quite strong support that hypothesized abbreviation in the text is not going to help "normalize" the very low conditional entropy seen in Voynichese, particularly if such abbreviation shares parallels with scribal abbreviations used in medieval Latin. Because this was the overwhelmingly most popular kind of abbreviation used at the time, this would be, in my opinion, the most likely approach adopted if such abbreviation is used.
Further, I am having trouble trying to imagine what kind of manipulation of an underlying natural language that would be termed "abbreviation" would have the desired effect. Of course, this doesn't eliminate the possibility of abbreviation -- it just eliminates it as a central cause of the issue that we are trying to understand -- namely the low conditional entropy.
Thanks again,
Michelle