(22-10-2023, 03:43 PM)MarcoP Wrote: You are not allowed to view links. Register or Login to view.You are not allowed to view links. Register or Login to view.
This post is too long for me to read in full. I read the conclusions at the end and skimmed through the rest.
Thank you for checking it out, I actually didn't expect anyone to read it in full. I wouldn't track this research myself, at least not until some important results appear.
When answering the questions below, I provide some relevant quotes from the text, just to double-check myself.
Quote:I don't know much about statistics, but I'd like to comment on this statement:
Quote:Using the same logic as above, we take two locations for ytoa and compute the probability of yfa to appear in one of remaining 6 slots in either location. 2 * 6 * 2 / (1033 - 1 - 7) = 2.3%. Which is still very low.
I don't know why 2.3% should be regarded as "very low". When one cherry-picks complex patterns from anything that is not totally trivial, it's easy to find patterns that do not occur frequently. By making the patterns more and more specific, probabilities can be made arbitrarily low.
Relevant quotes from the text: "When assessing how likely certain arrangement is, you always need to consider it against all other arrangements or all other attempts." and "Before we proceed with the calculations, it's very important to note that the choice of f67r2 was not random or a result of any brute-force enumeration. It was a result of tentative identification of planet correspondences that we made earlier. Should our analysis involve comparison of all sequences in the manuscript to all other sequences in the manuscript, we'd have to correct for the number of attempts that we make, but we don't have to do this if we start with a particular sequence in the manuscript."
Cherry-picking refers to a process where you use a large sample of relationships and pick only those that are of interest to you. This cannot apply here, since we started from 7 labels in the text that we tentatively identified before and started looking for statistically relevant patterns related to these 7 labels specifically in the rest of the manuscript. The methodological reason for picking these 7 labels was that we have tentatively attributed some meaning to them in my past article (we couldn't check for further meaningful connections if we don't attribute some meaning in the first place). Then we computed the probabilities considering all other labels in the text.
That's why 2.3% is the expected probability of this structural feature to occur by random chance and it definitely looks very low. If we started by examining the whole of the manuscript looking for patterns, as you probably interpreted it, we'd have to adjust for the total number of the relationships we examined, so instead of 2.3% we'd have something closer to 1 - (1 - 0.023) ^ (1033 / 7) = 96.77% of this relationship being random, given there could be ~ 1033 / 7 blocks of 7 labels in the manuscript. This is definitely not the right way to compute this precisely, but a good upper bound.
Quote:The post discusses these two sequences of Voynichese words (this the order in which they appear in the post, which is different from the order in the manuscript):
f67r2: ytoaiin, yfain, opcholdy, ofar.oeoldan, okain.am, okal, dolchsody
f72r3: ytoar.shar, yfary, opalal, oraiiral, oletal, aral, octho
The conclusion is:
Quote:There is 99.42% probability that two sets of 7 labels on f67r2 and f73r3 are textually related to one another (from the fact that their probability to co-occur by chance is only 0.58%).
Some observations:
- Stating that "textually related" follows from "unlikely to occur by chance" is a non-sequitur. Many non-random patterns have been spotted in the Voynich manuscript, e.g. word structure (Stolfi), line patterns, word repetition, different "dialects" for the different sections (Currier). We know very well that Voynichese is not random and has structure at some level: this does not imply that the text is meaningful.
You probably interpret "textually related" as "meaningfully related". In the article we talk about textual structures as statistically significant regularities in the text, the text treated as a sequence of characters, defined by the transliteration. No connection to meaning is implied, quoting: "The structure lets you make predictions about the text using the information in the text itself, but when it doesn't relate to any external objects, properties or behaviors, the structure doesn't convey meaning."
Quote:- It's not clear why the similarities between the two lists are discussed, but not the differences (e.g. in most cases word lengths are markedly different, in 3 out of seven cases the number of words in the labels are different, suffixes only match in one case).
The differences are discussed, e.g., "If we assume there is a meaningful link between two charts, the imperfect match of labels on f67r2 and f72r3 could imply one of two things: either a one-to-many cipher is used or two sets of labels refer to parallel sets of objects with similar names or in similar but different languages."
The basis of my arguments is built on statistical significance of the matching parts, the features of parts that don't match do not affect it in any way.
Quote:- The seven labels from f72r3 are part of a set of 30 labelled stars for the Cancer page. The page is part of a sequence of 10 zodiac signs (one page is missing), each containing 30 labelled stars. Therefore (contrary to f67r2) those seven labels do not look likely to have anything to do with the seven planets but seem more likely to be related with the thirty degrees that make up each zodiac sign (see Panofsky's reference to ms BAV Reg.lat.1283a).
I'm not sure I understand the specific counterpoint in this comment, sorry. Could you rephrase maybe? If this is not a critique of any aspect of my text, but just a reference to an alternative interpretation, thank you for the reference!
On a general note, arguing what things look like and not look like in the Voynich manuscript appears to be of low use, given that very few things in the manuscript were specifically identified with high certainty in a universally accepted way. As I joked on Twitter, plant images could very well be schematics for alien machinery and bathing ladies represent organic chemistry. I don't think we should give strong weight to any interpretations based just on what things look like, as, in my opinion, they have been largely fruitless so far when it comes to explaining what the manuscript is and how, why and by whom it was created.
Quote:- The presence of several circle-gallows prefixes is typical of Voynichese labels (where circles are o a y and gallows t k p f). See also histogram You are not allowed to view links. Register or Login to view.. While only about 16% of paragraph words start with such sequences, the rate is 48% for labels. It seems that this wider phenomenon could help in the analysis of the prefixes in the two sets of seven labels.
Again, if this is a specific counterpoint, I'm not sure I understand it. The prefix statistics that I compute use the set of all labels, including the features that you mention. The result I get is computed against the full distribution of the actual label prefixes in the text.
Thanks again for your comments and looking forward for more, if you have time and desire.