Koen G > 09-12-2019, 11:28 PM
Torsten > 10-12-2019, 02:18 AM
(09-12-2019, 02:26 PM)MarcoP Wrote: You are not allowed to view links. Register or Login to view.
MarcoP > 10-12-2019, 03:32 PM
(09-12-2019, 03:12 PM)ReneZ Wrote: You are not allowed to view links. Register or Login to view.Another point I had been wondering about is whether the B language could be seen as A language with additional words. The fact that B-language pages tend to have much more text than A-language pages could be just an effect of this 'adding words'.
This was suggested (probably in quite a cryptic manner) by the last bullet above 'Suggestions for further study' on this page .
(09-12-2019, 06:52 PM)ReneZ Wrote: You are not allowed to view links. Register or Login to view.An alternative to the verbose cipher would be a number theory. If the Voynich MS words are like a numbering or enumeration system, a similar progression could be expected. Just compare it with Roman numerals. D only starts appearing after 500 words and M only after 1000.
In such a case, there is no mapping, but there is a 'generating algorithm' that explains the dialects.
ReneZ > 10-12-2019, 04:18 PM
-JKP- > 10-12-2019, 05:34 PM
ReneZ > 10-12-2019, 06:33 PM
Torsten > 11-12-2019, 12:39 AM
(10-12-2019, 03:32 PM)MarcoP Wrote: You are not allowed to view links. Register or Login to view.the fact that Currier B adds new word types to Currier A is certainly central to the whole phenomenon. I don't think that it can explain everything, since there are fluctuations in the frequencies of words that appear everywhere that must have another explanation.
For instance the occurrences of chol/cheey vary from 192/15=12.8 in Herbal_A to 12/36=0.3 in Bio. The frequency of chol in Herbal_A is 2.5%: comparable with the frequency of the most frequent word in English. Even if tokens belonging to the new B-types are frequent, I don't think they are enough to explain how a word drops from 2.5% to 0.2%.
nablator > 11-12-2019, 06:20 PM
(09-12-2019, 03:12 PM)ReneZ Wrote: You are not allowed to view links. Register or Login to view.Another point I had been wondering about is whether the B language could be seen as A language with additional words.
MarcoP > 13-12-2019, 11:38 AM
nablator > 13-12-2019, 12:59 PM
(13-12-2019, 11:38 AM)MarcoP Wrote: You are not allowed to view links. Register or Login to view.There could be other substitutions worth commenting upon, but my main interest now is understanding how to proceed. A possibility could be re-formulating this search into something like "simulated annealing", rather than this simple brute-force approach.You need to investigate multiple substitutions at the same time. One substitution at a time may not be enough to detect an improvement in your metric. This of course makes the search space huge. I would try to "hill climb" first. Some problems are well suited to the hill climbing algorithm (just swap the target part of two randomly selected rules or enable a rule and disable the other and retest: if the result is worse, backtrack). If you are out of luck you get a different sub-optimal "solution" each time, and you need a better algorithm.