10-12-2019, 10:44 PM
I was recently thinking about Palindromes (words that read the same backwards and forwards).
All European languages (that I know of) have single word palindromes, but this effect seems to be almost absent in the VM.
Most of the ones that do occur are simple three letter words.
The only longer palindromes seem to be unique in all cases with the exception of occo, which appears three times in the manuscript.
Here are the ones I've spotted (quite possible I've missed some, this was only a quick count)
dydyd (f1)
seees (f3)
oeeo (f6v, f72v2, f101v2)
ykaky (f55v)
yekey (f69v)
ylaly (f73v)
lolol (f72v)
shchs (f113r)
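A quick count like the one above can be automated. The following is only a minimal sketch: it assumes the transcription is already available as a flat list of words, and it compares words character by character, so multi-character glyphs such as ch or sh are not treated as single units. The function name `find_palindromes` and the sample words are made up for illustration and are not real transcription data.

```python
from collections import Counter

def find_palindromes(words, min_len=4):
    """Count words of at least min_len characters that read the same reversed."""
    counts = Counter(words)
    return {w: n for w, n in counts.items()
            if len(w) >= min_len and w == w[::-1]}

# Hypothetical sample, not taken from a real transcription file.
sample = "daiin lolol ol chol oeeo dydyd oeeo shol".split()
palindromes = find_palindromes(sample)
for word, n in sorted(palindromes.items()):
    print(word, n)
```

Lowering `min_len` to 3 would also pick up the three letter words, which is why the threshold is left as a parameter.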
The low number of palindromes is of course to be expected, due to the position awareness of glyphs.
It's possible that such palindromes are actually the result of misspellings. If so, they would give us concrete examples of scribal errors within the corpus, which we could then correct to reduce the number of erroneous words in the transcription.
For example, taken at random because it made me laugh out loud, lolol: lo appears 15 times by itself and 182 times word-initially, while ol appears 3052 times word-finally and 538 times as a word by itself. But lo*ol only gives two results, lolol and lolkeol. The second is more likely to be two words run together, as both lol and keol are common words. This suggests to me that lo and ol have well defined functions but shouldn't be used together: the scribe made a mistake with lolol, and missed out a space in lolkeol.
What was the mistake in lolol? Well, the prior word is checkho, which is unique. If we move the first l over to it, we get checkhol, which appears twice in the corpus. We now have checkhol appearing three times, followed by olol, which appears 18 times in the corpus.
So we have now removed three unique words from the corpus in a logical manner!
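The counts used in the lolol example (standalone word, word-initial, word-final, and a wildcard such as lo*ol) could be reproduced along these lines. This is a sketch under assumptions: the word list is hypothetical, the helper names `affix_counts` and `wildcard_types` are invented here, and the initial and final counts exclude the standalone word, which may or may not match how the figures quoted above were obtained.

```python
from fnmatch import fnmatchcase

def affix_counts(words, fragment):
    """Count a fragment as a whole word, as a word start, and as a word end."""
    return {
        "whole": sum(w == fragment for w in words),
        "initial": sum(w != fragment and w.startswith(fragment) for w in words),
        "final": sum(w != fragment and w.endswith(fragment) for w in words),
    }

def wildcard_types(words, pattern):
    """Return the distinct words matching a shell-style pattern such as 'lo*ol'."""
    return sorted({w for w in words if fnmatchcase(w, pattern)})

# Hypothetical sample, not taken from a real transcription file.
sample = "lo ol lolol lolkeol chol lol keol ol lochey".split()
lo_counts = affix_counts(sample, "lo")
ol_counts = affix_counts(sample, "ol")
lo_ol_words = wildcard_types(sample, "lo*ol")
print(lo_counts, ol_counts, lo_ol_words)
```

Note that lol does not match lo*ol here, because the pattern needs at least four characters (lo followed by ol).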
No idea if we can do this with the rest of them; it's getting late and I'm tired now. Has anyone done any research into this angle, or into reducing the number of unique words by seeing if they can be exploded and reassembled with adjoining words?
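One way to try the "explode and reassemble" idea across a whole text is to scan adjacent word pairs and test whether moving a glyph or two across the space turns a pair containing a unique word into two words attested elsewhere. The sketch below is only that, a sketch: `boundary_shifts` is a made-up name, the input is assumed to be a flat list of words from a single line (so pairs never straddle a line break), and the sample reuses the words from the example above with invented frequencies.

```python
from collections import Counter

def boundary_shifts(words, max_shift=2):
    """Propose re-splits of adjacent pairs where at least one word is unique."""
    counts = Counter(words)
    proposals = []
    for i in range(len(words) - 1):
        left, right = words[i], words[i + 1]
        if counts[left] > 1 and counts[right] > 1:
            continue  # both words are already attested more than once

        def attested(word):
            # Occurrences outside the pair currently being re-split.
            return counts[word] - (word == left) - (word == right) > 0

        for k in range(1, max_shift + 1):
            candidates = []
            if len(right) > k:  # move k characters from right word to left
                candidates.append((left + right[:k], right[k:]))
            if len(left) > k:   # move k characters from left word to right
                candidates.append((left[:-k], left[-k:] + right))
            for new_left, new_right in candidates:
                if attested(new_left) and attested(new_right):
                    proposals.append((i, left, right, new_left, new_right))
    return proposals

# Hypothetical sample built from the words discussed above.
sample = "checkho lolol checkhol olol checkhol olol daiin".split()
proposals = boundary_shifts(sample)
for p in proposals:
    print(p)
```

Any proposal from a scan like this would still need checking by eye against the page, since a re-split that produces two attested words is only a candidate correction, not proof of a scribal error.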