(15-02-2021, 12:31 AM)Stephen Carlson Wrote: You are not allowed to view links. Register or Login to view.To stay on topic, the most disappointing aspect of the paper that T&S are responding is the oversight of their work, because it is relatively more plausible than the other hoax alteratives and a good paper ought to dispose of its strongest alternatives, not its weakest. That said, I'm not sure that linguists would have any particular insight into the potential defects of the T&S proposal.
I must emphasize that we do not criticize the fact that Bowern and Lindemann obviously see our work as completely irrelevant. In our eyes this doesn't make any difference. But we think that it is important to cover also counter arguments. Also Bowern and Lindemann share this view since they criticize other authors for "omitting any information that does not fit the theory they propose." Therefore we criticize places where the authors only state what they believe and places where important counter arguments are overseen.
Since it was necessary to prioritize the flaws I would like to share some additional points here:
1) "There are some exceptions, in particular with the word 'daiin', which is common in every position except paragraph-initially"
Since paragraph-initially words usually start with a gallow glyph it is no surprise if words not starting with a gallow glpyh are uncommon there.
2) On p.4 the paper rules out a polyalphabetical cipher since such a cipher "would lead to identical words being encoded differently in different parts of the manuscripts".
However if it comes to a statistical outlier the paper argues in the opposite direction: "It is possible that chedy and shedy represent the same word as they are distinguished only by whether there is a plume stroke over the bench character. If we make this assumption Voynich B is less of an outlier." Later it is even argued "A comprehensive linguistic analysis needs to take seriously the possibility that, for example, paiin, saiin, aiin, and am are all positional variants of the same word." But if 'chedy'/'shedy' or 'paiin'/'saiin'/'aiin' represent the same word what does this mean for word pairs like 'cheedy'/'sheedy', 'chdy'/'shdy', 'taiin'/'daiin' or 'chaiin', 'shaiin'?
3) The paper refers to Tiltman (1967), Stolfi (2000) and Reddy & Knight (2011) for the hypothesis that "Voynich words consist of three separate 'fields,' with particular symbols occurring at the beginning, middle, or end of the word".
* However Tiltman did in fact "divided words into ... 'roots' and 'suffixes'" (Tiltman 1967, p. 7).
* Stolfi on the other hand tried to parse Voynich words into prefix, midfix and suffix (see Stolfi 1997, You are not allowed to view links.
Register or
Login to view.)
But the outcome of this experiment was "that there is a surprisingly small number of prefixes and suffixes with significative frequency." Later Stolfi published a model using three nested layers for parsing Voynich word tokens. (see Stolfi 2000, You are not allowed to view links.
Register or
Login to view.)
* Reddy and Knight used an MDL-based algorithm to segment "words into prefix+stem+suffix, and extracts ‘signatures’, sets of affixes that attach to the same set of stems."
This way Reddy and Knight illustrate that some glyph sequences like 'ol+', 'ot+' and '+dy', '+y' occur predominantly at the beginning or end of a word.
Reddy and Knight concluded "that stems in the same signature tend to have some structural similarities." (Reddy & Knight 2011, p. 82).
* Moreover beside the common word 'aiin' also words like 'daiin', 'kaiin', 'taiin' exists. For some unknown reason 'd' is listed as prefix but 'k' and 't' are listed as midfixes. In the same way the common glyph groups 'ol'/'or'/'al'/'ar' are split into 'o'/'a' as midfixes and 'l'/'r' as suffixes.
4) On p. 18 the paper concludes "All of these observations lead to generalizations which appear to be typographical rather than linguistic in nature."
But later the paper argues that "the word and line level metrics show it to be regular natural language."
5) "Syntax describes the ways in which words fit together in a hierarchical structure, and generalizations about word and phrase combinations can explicate this structure. Syntax has been studied less systematically than character- and word-level patterns in the Voynich Manuscript."
The lack of repetitive phrases is in fact the most often used argument against the natural language hypotheses. Even some of the papers cited do in fact cover the observation that phrases are missing.
* Tiltmann wrote in 1976 "My analysis, I believe, shows that the text cannot be the result of substituting single symbols for letters in the natural order. Languages simply do not behave in this way. ... And yet I am not aware of any long repetitions of more than 2 or 3 words in succession, as might be expected for instance in the text under the botanical drawings" (Tiltmann 1976).
* D'Imperio wrote in 1978: "Also the strange lack of parallel context surrounding different occurrences off the 'same' word as shown by word indexes. In the words of several researchers ' the text just doesn't act like natural language'" (D'Imperio 1976, p. 30).
6) The paper argues that the Voynich manuscript is encoded using a cipher that preserves language like structure "and in addition create predictability in the writing system."
Ciphers normally use randomness to hide language structures. Therefore simple ciphers do not change the predictability whereas more advanced ciphers do decrease the predictability.
This means a cipher can either preserve the text structure (for instance by using a dictionary cipher) or it can add predictability (for instance by omitting information or by encoding each plain text letter with a whole cipher word). In the first case it would be possible to detect typical phrases and in the second case the hypothesis that a Voynich word does represent a word in a natural language must be wrong.
Also some basic facts are wrong:
7) The paper does already start with a mistake: "The Voynich Manuscript has 116 folios (i.e., 232 pages)".
Since some folios are missing there are less than 116 folios and since the Voynich manuscript has foldouts there are not twice as many pages as folios. Actually the Voynich manuscript contains only 102 folios and "including blank pages and pages without text there are 240 pages. 225 pages include text" (Reddy & Knight 2011, p. 78).
8) "The sequence qo is represented as qo in large part because q never appears in the text except before o" (p. 291).
There are numerous counter examples. There are for instance 66 instances of 'qe', 23 instances of 'qc', and 7 instances of 'qa'.
9) "even though 'h' is never found separately'. (p. 291).
There are 85 instances of 'hh' like in 'chcphhy' on folio 7v and 'chckhhy' on folio 15r. There is 'tohedy' on folio 39r, 'okehdar' on folio f67r1, and 'olohy' on folio f83v. And there is also 'qoiheey' on folio 73r and 'chcthihy' on folio 85r.
10) "For example, p/f p/f are never followed by e" (p. 302).
At least <shefeeedy> on folio 48v and <qopeeedar> on folio 50r exists.
11) The paper refers multiple times to Sterneck & Bowern (2020). But such a paper was not published until today.