(25-09-2020, 02:19 PM)ReneZ Wrote: You are not allowed to view links. Register or Login to view.To be a bit more specific about this, when using Eva, the average line length is (approx) 160,000 / 5700 = 28 characters. This point can be indicated in the graph from Schinner, as follows (blue lines):
Hi Rene,
if I understand correctly, the X axis corresponds to positions in the binary path, not in the original character sequence. Since Schinner (like Schenkel) maps characters to 5 bits patterns, the position on the graph corresponding to 28 characters is 28*5=140. But, since the change in direction is so "slow", I don't think that this shift makes a great difference.
(25-09-2020, 02:34 PM)MichelleL11 Wrote: You are not allowed to view links. Register or Login to view.Would there be a similar internal structure that could be paralleled for the Bible in English? I know there are discussions of its repetitiveness. But why would that be special for English compared with translations of the same text in other languages?
(25-09-2020, 02:15 PM)MichelleL11 Wrote: You are not allowed to view links. Register or Login to view.Since your results are comparable to each other (internally consistent, unlike Schinner’s which found the VMS to be an outlier) yet not completely comparable to Schinner’s results as a whole - do you think variations in method of preparation between data sets could be the explanation for all of it?
Hi Michelle,
as I said, I don't fully understand the implications of these measures. They seem to me to be very far from anything directly related to the structure of the text. It could also be that my code is bugged, so I wouldn't take it too seriously. If you check Schenkel's paper, you might find some more information about how different texts behave. For instance, in Fig.5 he shows plots of the actual random walks for the texts he discusses: he corrects the plots for the overall linear drift, so that the last position is made to have Y=0 like the first one.
These are the plots I get for the four texts in my plot.
[
attachment=4801]
It could be that the larger Y-range in these plot is the "cause" of the greater alpha values, but I am not sure.
As you suggest, data preparation could also be relevant. For instance:
Kokol et al Wrote:Each character is then transformed into a six bit long binary representation according to a fixed code table. It has been shown by Schenkel (Schenkel 1993) that the selection of the code table does not influence the results as long as all possible codes (i.e. we have 64 different codes for the six bit representation – in our case we assigned 56 codes for the letters and the remaining codes for special symbols like period, comma, mathematical operators, etc) are used.
Schinner relaxed Kokol's requirement (emphasis mine):
Schinner Wrote:As a first step it is necessary to encode the characters of the texts under investigation to bit sequences. It has been shown that the actual definition of this code table has negligible influence on the interesting quantities, as long as all (or at least almost all) possible bit patterns are used
Schinner used 5 bit codes (vs Kokol's 6 bits) with only 32 possible patterns. But EVA only uses about 22 characters, so I don't think that "almost all possible bit patterns" were used. This is also the case for the other texts, since Schinner only processes lower-case characters, with no punctuation.
But this could be irrelevant as well: I don't understand enough to say anything meaningful, I am sorry.
(25-09-2020, 02:15 PM)MichelleL11 Wrote: You are not allowed to view links. Register or Login to view.I agree with you that the switch to fiction prose for English rather than sticking with the Bible feels like cherry picking.
If "cherry picking" implies intentionally twisting data, I don't think this is the case. I believe it is just confirmation bias at work: Schinner's reading of Schenkel and Kokol's works happened through the lenses of the idea that Voynichese is gibberish. He was unable to see those parts that point in a different direction.