Bernd > 05-07-2022, 08:26 PM
(04-07-2022, 03:07 PM)ReneZ Wrote: Between the two views:
- ignoring sample text length
- dividing by sample text length
both views are imperfect. It is not clear to me which one of the two is the more indicative one.
Certainly, dictionary size increases very non-linearly with sample text size.
It is only a problem when text lengths are significantly different. That is the case here of course.
Large differences in sample text size are certainly a problem but it can be addressed by subsampling.
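A minimal sketch of the two views discussed above (raw dictionary size versus dictionary size divided by text length), computed on growing prefixes of one sample. The function name `vocabulary_views` and the toy vord list are illustrative assumptions, not taken from any real transliteration.

```python
def vocabulary_views(vords):
    """Return (dictionary size, dictionary size / text length) for a list of vords."""
    tokens = len(vords)
    types = len(set(vords))
    ratio = types / tokens if tokens else 0.0
    return types, ratio

# Hypothetical toy sample; replace with vords from a real transliteration.
sample = ["daiin", "chedy", "ol", "daiin", "shedy", "ol", "qokeedy", "daiin"]

# Both measures change with prefix length, and neither changes linearly.
for n in (2, 4, 8):
    types, ratio = vocabulary_views(sample[:n])
    print(f"length={n} types={types} types/length={ratio:.3f}")
```

Because repeated vords add to the length but not to the dictionary, the raw count favors long samples and the ratio favors short ones, which is why neither view is a fair comparison when lengths differ a lot.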
(05-07-2022, 12:16 PM)Torsten Wrote: If illustrations do indicate topics, some common terms specific to a particular type of illustration or topic should exist. However such terms don't exist.
This hypothesis implies a simple encoding in which 'vords' always translate 1:1 into plaintext words regardless of context, which is highly questionable considering the strange properties of 'Voynichese'. If it was that easy, the VM would have been deciphered long ago.
ReneZ > 05-07-2022, 09:04 PM
(05-07-2022, 08:26 PM)Bernd Wrote: Large differences in sample text size are certainly a problem but it can be addressed by subsampling.
You can split each section into chunks of ~1000 vords each (maybe rounded to full pages) to match text length to the smallest and 2nd smallest ones (Astro and Zodiac).
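The subsampling idea can be sketched roughly as follows. This is only an illustration under assumptions: the helper names `chunk` and `types_per_chunk` are made up, the sections are random synthetic data rather than real Voynich text, and the "rounded to full pages" refinement is not implemented.

```python
import random

def chunk(vords, size=1000):
    """Split a vord list into consecutive chunks of exactly `size` vords.

    A trailing remainder shorter than `size` is dropped so that every
    chunk has the same length.
    """
    return [vords[i:i + size] for i in range(0, len(vords) - size + 1, size)]

def types_per_chunk(vords, size=1000):
    """Dictionary size (number of distinct vords) of each equal-length chunk."""
    return [len(set(c)) for c in chunk(vords, size)]

# Synthetic stand-ins for a long and a short section (assumption, not real data).
rng = random.Random(0)
lexicon = [f"w{i}" for i in range(500)]
sections = {
    "large": [rng.choice(lexicon) for _ in range(5000)],
    "small": [rng.choice(lexicon) for _ in range(1200)],
}

for name, vords in sections.items():
    print(name, types_per_chunk(vords))
```

With equal-length chunks, the spread of values within one section gives a baseline against which differences between sections can be judged.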
Bernd > 05-07-2022, 09:45 PM
Torsten > 06-07-2022, 12:24 AM
(05-07-2022, 09:45 PM)Bernd Wrote: Still I'd be interested if it is possible to distinguish between subsamples and samples of different sections and how much the length of the text fragment matters.
Torsten > 08-07-2022, 09:05 AM
(05-07-2022, 08:26 PM)Bernd Wrote: This hypothesis implies a simple encoding in which 'vords' always translate 1:1 into plaintext words regardless of context, which is highly questionable considering the strange properties of 'Voynichese'. If it was that easy, the VM would have been deciphered long ago.
Indeed.
(05-07-2022, 08:26 PM)Bernd Wrote: But still so far there is little (if any?) evidence of a connection between text and imagery. Please correct me if I'm wrong.
Juan_Sali > 08-07-2022, 12:37 PM
(08-07-2022, 09:05 AM)Torsten Wrote: To illustrate my point I have used three different colors to mark all instances of vords containing the sequences 'ed' (plum), 'ho' (green), and 'in' (yellow): [link]
I consider 'in', and even 'ain' and 'aiin', to be a single character. That explains their abundance.
Torsten > 08-07-2022, 03:23 PM
(08-07-2022, 12:37 PM)Juan_Sali Wrote: I consider 'in', and even 'ain' and 'aiin', to be a single character. That explains their abundance.
'ho' is also part of the trigrams 'hor' and 'hol' when they are not separated by a space. A separate analysis is needed for all three of them.
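One way to separate the three cases is to count vords containing 'hor', vords containing 'hol', and vords where 'ho' occurs without either trigram. The sketch below is an assumption about how such a count could be set up; the function names and the toy vord list are hypothetical, and matching is done inside single vords only, so sequences split by a space are not counted.

```python
from collections import Counter

def count_sequences(vords, sequences=("ed", "ho", "in", "hor", "hol")):
    """Count how many vords contain each sequence at least once."""
    counts = Counter()
    for vord in vords:
        for seq in sequences:
            if seq in vord:
                counts[seq] += 1
    return counts

def ho_breakdown(vords):
    """Split vords containing 'ho' into 'hor', 'hol', and 'ho' without either."""
    out = Counter()
    for vord in vords:
        has_hor = "hor" in vord
        has_hol = "hol" in vord
        if has_hor:
            out["hor"] += 1
        if has_hol:
            out["hol"] += 1
        if "ho" in vord and not (has_hor or has_hol):
            out["ho only"] += 1
    return out

# Hypothetical toy sample; replace with vords from a real transliteration.
sample = ["chol", "chor", "shedy", "daiin", "cho", "qokeedy", "chodaiin"]

print(count_sequences(sample))
print(ho_breakdown(sample))
```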