![]() |
|
List of all line-initial words - Printable Version +- The Voynich Ninja (https://www.voynich.ninja) +-- Forum: Voynich Research (https://www.voynich.ninja/forum-27.html) +--- Forum: Analysis of the text (https://www.voynich.ninja/forum-41.html) +--- Thread: List of all line-initial words (/thread-1612.html) |
List of all line-initial words - Oocephalus - 08-03-2017 I promised to do this already a few days ago, but couldn't find time for it. Sorry! So here is a list of all line-initial words in the VMS. It's taken from the Takahashi transcription (I haven't checked the accuracy of anything, so there are probably some errors), excluding single-word "lines", most of which are actually labels. From left to right, the columns are: - the word type - number of line-initial occurrences - number of all occurrences - proportion of occurrences which are line-initial There are in total 1958 word types, including: - 1000 unique ones - 114 others that occur only line-initially - 844 that are not exclusively line-initial I hope this will be helpful. RE: List of all line-initial words - Davidsch - 08-03-2017 I've done this already a week ago and much much more (and published it. Since there are no people interested and I also I've made some conclusions, I decided to take it offline) Strange numbers you have. I've used only lines not labels, because you can not call such a text "a line" I have: 3905 filled lines of which from the unique list of words overall 1771 words of which with only 1 repeat: 1299 The words that are position-unique were discussed already. RE: List of all line-initial words - Oocephalus - 08-03-2017 Davidsch: I have 4354 lines. Maybe the difference is because there are some labels consisting of several words, which my script would have counted. RE: List of all line-initial words - Davidsch - 09-03-2017 I checked the top of the file and it is identical to my results, with respect to the previous remark. RE: List of all line-initial words - Addsamuels - 28-01-2023 This point is indeed interesting and also it seems that the first word of each page seems to be of less frequent than average |