According to the same VQP data, there are 116 lines starting with
k. This means that 66.4% of those fall onto paragraph starts.
Following that, here's some quick data that I can share right now.
Below is the breakdown of line-initial characters in
k-initial paragraphs:
k (paragraph-initial included) - 80
o - 79
d - 70
q - 60
y - 58
s - 32
sh - 29
t - 26
ch - 22
p - 7
l - 4
r - 1
and also one occurrence falls onto the strange character which is like the English letter "e" (see You are not allowed to view links.
Register or
Login to view. p4).
The sum of the above is 469, so I still miss one of 470 total lines somewhere, but I'm too sleepy to seek.
However, if we consider only the first three lines of paragraphs, then the breakdown is as follows:
k (paragraph-initial included) - 78
d - 38
o - 30
y - 26
q - 23
t - 9
ch - 8
sh - 7
s - 6
l - 2
r - 0
p - 0
In other words, the second and the third line begin mostly with one of four characters -
d,
o,
y and
q (78% of 2nd and 3rd lines vs 68% of all lines except the 1st).
One passing observation is that
a appears to be exceedingly rare in the line starting position. It never occurs as such in k-initial paragraphs, and the overall count is only 25 (VQP). This given that there are 1962
a-initial vords (which is even more than k-initial).
Considering the order proposed earlier:
[font=Eva]k t y d ch o l q r s sh p[/font], 72,7% of k-initial paragraphs match this order. Moving
sh to the position between
ch and
o increases this figure to 74,0%. Furthermore, if we consider gallows as the symbol "resetting" the order (only
t occurs within the first three lines along with
k), then the figure rises to 81,8%.