10-11-2022, 04:53 AM
(09-11-2022, 02:00 AM)Emma May Smith Wrote: You are not allowed to view links. Register or Login to view.I'm currently rechecking my model for predictions/rules. Would you be interested in testing them?
The 'leapfrog' pattern from my previous post is unusual because it can easily be implemented as a functional operator on the big correlation matrix. Other schemes, of greater linguistic merit, may be roundabout or impossible (for me!) to check in that way.
A practical strategy is to generate the statistics on a case-by-case basis. We first formulate the rule scheme as a statement containing two ordered character variables, such as the proven-productive question
- To what extent does the final glyph of a word predict the initial glyph of its sequel?
...(fist glyph) + (space) + (second glyph)...
and calculate correlations between the variable characters. For the whole of IT paragraph text (without consolidating any EVA glyphs, and respecting line breaks) the matrix of correlation probabilities looks like this
[attachment=6947]
...again with the first character in rows, and the second in columns. Following the procedure of Smith and Ponzi 2019, we can divide this matrix by one generated from the same text sample, except with the words within each line scrambled. It shows the factor by which line break combinations deviate from statistical expectation:
[attachment=6950]
...which is just a visualization of your Tables 3.1-3.8 that highlights the positively deviant combinations, but suppresses their absolute frequency. Our numerical values are in satisfactory agreement (considering haste and shortcuts). It is to be hoped that the human mind can find patterns in the pixels. Is the general anisotropy of the picture telling us anything?
It would be most interesting to test some some non-obvious rule schemes. The work of dropping a new string pattern into the graphics code is minimal.