• List of all line-initial words
  • List of all line-initial words

    Oocephalus > 08-03-2017, 02:52 PM

    I promised to do this a few days ago but couldn't find the time. Sorry!
    So here is a list of all line-initial words in the VMS. It is taken from the Takahashi transcription (I haven't checked it for accuracy, so there are probably some errors) and excludes single-word "lines", most of which are actually labels.
    From left to right, the columns are:
    - the word type
    - number of line-initial occurrences
    - number of all occurrences
    - proportion of occurrences which are line-initial
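
    A minimal sketch of how such a table could be computed, not the script actually used for the list. It assumes the transcription has already been split into lines and words; the function name `line_initial_table` and the toy words are illustrative only. It also assumes single-word lines are dropped from both counts, which the post does not specify.

```python
from collections import Counter


def line_initial_table(lines):
    """Build (word, initial_count, total_count, proportion) rows.

    `lines` is an iterable of word lists, one list per transcription line.
    Assumption: lines with fewer than two words (mostly labels) are skipped
    entirely, so they contribute to neither count.
    """
    initial = Counter()
    total = Counter()
    for words in lines:
        if len(words) < 2:
            continue  # skip empty and single-word "lines"
        initial[words[0]] += 1
        total.update(words)

    rows = [
        (word, n_initial, total[word], n_initial / total[word])
        for word, n_initial in initial.items()
    ]
    # Most frequent line-initial words first, ties broken alphabetically.
    rows.sort(key=lambda row: (-row[1], row[0]))
    return rows


# Toy input, not real transcription data.
sample = [
    ["daiin", "chol", "daiin"],
    ["chol", "daiin"],
    ["daiin", "shol"],
    ["otol"],  # single-word line, ignored
]
table = line_initial_table(sample)
```

    Each row of `table` has the same four columns as the list: word type, line-initial count, total count, and the proportion of occurrences that are line-initial.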

    There are 1958 word types in total, comprising:
    - 1000 unique ones
    - 114 others that occur only line-initially
    - 844 that are not exclusively line-initial
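
    The three groups add up to the total (1000 + 114 + 844 = 1958). A rough sketch of how word types could be sorted into these groups, assuming "unique" means a word that occurs exactly once in the whole text; the function `classify_types` and its input format are hypothetical, not taken from the original script.

```python
def classify_types(counts):
    """Count line-initial word types per group.

    `counts` maps word -> (line_initial_count, total_count).
    Assumption: a "unique" word is one with total_count == 1.
    """
    summary = {"unique": 0, "only_initial": 0, "mixed": 0}
    for initial, total in counts.values():
        if initial == 0:
            continue  # never line-initial, so not part of the list
        if total == 1:
            summary["unique"] += 1
        elif initial == total:
            summary["only_initial"] += 1  # repeated, but always line-initial
        else:
            summary["mixed"] += 1  # also occurs in other positions
    return summary


# Toy counts, not real transcription data.
example = {
    "a": (1, 1),  # unique
    "b": (2, 2),  # only line-initial
    "c": (1, 5),  # not exclusively line-initial
    "d": (0, 3),  # never line-initial
}
groups = classify_types(example)
```

    The values in `groups` should sum to the number of line-initial word types, which is a quick consistency check on the counts.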

    I hope this will be helpful.
  • RE: List of all line-initial words

    Davidsch > 08-03-2017, 04:23 PM

    I already did this a week ago, and much more, and published it. Since nobody was interested, and since I had also drawn some conclusions from it, I decided to take it offline.

    Your numbers look strange. I used only lines, not labels, because a label cannot really be called "a line".

    I have:

    3905 non-empty lines
    1771 distinct words among them (from the overall list of unique words)
    of which 1299 occur only once

    The words that are position-unique were discussed already.
  • RE: List of all line-initial words

    Oocephalus > 08-03-2017, 04:32 PM

    Davidsch: I have 4354 lines. The difference may be because some labels consist of several words, which my script would have counted as lines.
  • RE: List of all line-initial words

    Davidsch > 09-03-2017, 12:29 AM

    I checked the top of the file and it matches my results, allowing for the difference mentioned in the previous remark.
  • RE: List of all line-initial words

    Addsamuels > 28-01-2023, 07:40 PM

    This point is indeed interesting. It also seems that the first word of each page tends to be less frequent than average.