Jorge_Stolfi > 29-09-2025, 11:11 PM
(29-09-2025, 10:50 PM)SherriMM Wrote: You are not allowed to view links. Register or Login to view.Also my statistics only include the 18 line-initial characters, not any amount of random letters. Of the 18 characters, should I compute based on frequency?
Jorge_Stolfi > 06-10-2025, 05:43 AM
(30-09-2025, 12:25 AM)RobGea Wrote: You are not allowed to view links. Register or Login to view.Code:First_letter_of_a_line counts from RF1b-er.txt
Total first_letter_of_a_line character Tokens: 4130
d 640 15.4964% rf: 0.155
s 624 15.109% rf: 0.1511
y 610 14.77% rf: 0.1477
o 544 13.1719% rf: 0.1317
q 531 12.8571% rf: 0.1286
...
RadioFM > 06-10-2025, 10:52 PM
Jorge_Stolfi > 07-10-2025, 10:44 AM
(06-10-2025, 05:43 AM)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.if the line-initial letters had been chosen randomly and independently with those frequencies
Magical Raven > 07-10-2025, 09:05 PM
dashstofsk > 08-10-2025, 11:49 AM
Jorge_Stolfi > 09-10-2025, 08:34 AM
(08-10-2025, 11:49 AM)dashstofsk Wrote: You are not allowed to view links. Register or Login to view.I think you have raised something of genuine significance. But perhaps the way to analyse this further is not to look at 3 or higher character string repeats but to look at the distribution of 2-character repeats.
nt nh nb word
-- -- -- ---------
16 00 16 chedy
15 00 15 shedy
12 00 12 qokedy
09 00 09 lchedy
07 00 07 qoky
07 01 06 qokeedy
05 00 05 chey
05 00 05 qokaiin
04 00 04 aiin
05 01 04 dal
04 00 04 ol
04 00 04 qokal
04 00 04 qokeey
04 00 04 qoteedy
04 00 04 shey
03 00 03 otedy
03 00 03 qokchdy
03 00 03 r
03 00 03 s
02 00 02 checthy
02 00 02 cheey
02 00 02 dar
02 00 02 l
02 00 02 lkedy
02 00 02 lo
02 00 02 lsheedy
02 00 02 olchedy
02 00 02 oldy
02 00 02 otaiin
02 00 02 qokey
02 00 02 qotal
02 00 02 qotedy
02 00 02 sain
02 00 02 shckhedy
02 00 02 shcthy
02 00 02 shecthy
02 00 02 shedal
02 00 02 sheedynt nh nb word
-- -- -- ---------
01 00 01 air
01 00 01 altedy
01 00 01 ar
01 00 01 atal
01 00 01 chary
01 00 01 chckhal
01 00 01 chckhdy
01 00 01 chcthdy
01 00 01 chdal
01 00 01 chdy
01 00 01 cheal
01 00 01 chealror
01 00 01 checkhy
01 00 01 checphedy
01 00 01 ched
01 00 01 chedaiin
01 00 01 chedain
01 00 01 chedchy
01 00 01 cheeb
01 00 01 cheedar
01 00 01 cheeety
01 00 01 cheeky
01 00 01 cheg
01 00 01 cheky
01 00 01 cheol
01 00 01 chepol
01 00 01 chety
01 00 01 chkain
01 00 01 chkedy
01 00 01 chkeedy
01 00 01 chky
01 00 01 chldaiin
01 00 01 ckal
01 00 01 cthal
01 00 01 cthalsaiin
01 00 01 daiiin
01 00 01 dairydy
01 00 01 dalchdy
01 01 00 dcheokedy
01 00 01 dolshed
01 01 00 dsheey
01 00 01 kain
01 00 01 kesd
01 00 01 lal
01 00 01 lchcphedy
01 00 01 lchdy
01 00 01 lchs
01 00 01 ldalor
01 00 01 ldar
01 00 01 ldy
01 00 01 lkeed
01 00 01 lol
01 00 01 lpchedy
01 00 01 lshedy
01 01 00 ocheol
01 00 01 ockhey
01 00 01 okain
01 00 01 okair
01 00 01 okedy
01 00 01 okeedyldy
01 00 01 okeeol
01 01 00 olkeedy
01 01 00 olkeey
01 00 01 ols
01 00 01 olsaly
01 00 01 opshcdy
01 00 01 opshedy
01 00 01 oqol
01 01 00 or
01 00 01 otal
01 00 01 otar
01 01 00 otchdy
01 00 01 otchedy
01 01 00 otchey
01 00 01 otor
01 00 01 otshdy
01 01 00 pchedal
01 00 01 pchedar
01 01 00 pchor
01 01 00 pdalshdy
01 00 01 qcphhedy
01 00 01 qeeedy
01 00 01 qekeiiin
01 00 01 qekey
01 00 01 qetal
01 00 01 qockhey
01 01 00 qockhol
01 00 01 qody
01 00 01 qofshdy
01 00 01 qokchy
01 00 01 qokeal
01 00 01 qokedal
01 00 01 qokol
01 00 01 qokylddy
01 00 01 qolchey
01 00 01 qolkain
01 00 01 qopchedy
01 00 01 qopy
01 00 01 qot
01 00 01 qotaiin
01 00 01 qotar
01 00 01 qotchedy
01 00 01 qotedaiin
01 00 01 qoty
01 00 01 raly
01 00 01 rchedy
01 00 01 rches
01 00 01 rchs
01 00 01 sam
01 01 00 schedair
01 01 00 schedy
01 00 01 schety
01 00 01 shckhey
01 00 01 shcthey
01 00 01 shdy
01 00 01 shechy
01 00 01 sheckhdy
01 00 01 sheckhy
01 00 01 shecthedchy
01 00 01 shecthedy
01 00 01 shedaiin
01 00 01 shedkedy
01 00 01 sheekchy
01 00 01 sheey
01 00 01 sheol
01 00 01 shetar
01 01 00 shky
01 00 01 shocphedy
01 00 01 shol
01 00 01 shy
01 00 01 skal
01 01 00 skar
01 01 00 soiiin
01 01 00 sokeedy
01 01 00 solche'dy
01 00 01 solchedy
01 01 00 solcheol
01 01 00 solchkal
01 00 01 soldy
01 01 00 solkeey
01 01 00 solkey
01 01 00 solshed
01 00 01 tain
01 00 01 tal
01 00 01 tar
01 01 00 techedy
01 00 01 tol
01 00 01 ytaiindashstofsk > 09-10-2025, 09:39 AM
ReneZ > 09-10-2025, 12:22 PM
(09-10-2025, 09:39 AM)dashstofsk Wrote: You are not allowed to view links. Register or Login to view.I have just recalled that user 'tavie' posted something similar about line starting characters. He used the term 'vertical pair' which sounds more suitable. In particular, he looked at the frequency of vertical pairs within different sections of the manuscript.