pfeaster > 04-11-2022, 04:21 PM
Juan_Sali > 04-11-2022, 08:19 PM
(04-11-2022, 04:21 PM)pfeaster Wrote: You are not allowed to view links. Register or Login to view.I'm less sure about transitional probability matrices being able to account for the patterns exposed by slot-based word morphologies. But I wouldn't rule out that they could.To make an analisys of bigrams within vords I took data from You are not allowed to view links. Register or Login to view.
A while ago I calculated matrices for each bigram in Currier B (e.g., what the next single glyph would be after each pair of glyphs, such as [k] after [qo]) and generated some text randomly based on them. Of course I had to make some working assumptions about what a "glyph" is, all of which are open to challenge, and I don't think analysis of preceding bigrams goes deep enough. But the results, which I may have quoted here before, came out looking like this (with spaces inserted between any two glyphs that are more often separated by a space than not):
Hermes777 > 04-11-2022, 10:13 PM
(04-11-2022, 01:05 PM)pfeaster Wrote: You are not allowed to view links. Register or Login to view.(03-11-2022, 11:36 PM)Hermes777 Wrote: You are not allowed to view links. Register or Login to view.The sequence is continuous because:
Qokeedy – qokaain = the word break is [y.k]
Qokaiin – olkaiin = the word break is [n.o]
Olkaiin – Olkeedy = the word break is [n.o]
Thus: qokeedy.qokaiin.olkaiin.olkeedy. And so on. The line of least resistance.
Do you see evidence of a sequence of those four forms recurring continuously in that specific order, or could a model like the following one also express the pattern you have in mind?
obelus > 05-11-2022, 09:18 PM
(04-11-2022, 04:21 PM)pfeaster Wrote: You are not allowed to view links. Register or Login to view.I'm less sure about transitional probability matrices being able to account for the patterns exposed by slot-based word morphologies. But I wouldn't rule out that they could.
MarcoP > 06-11-2022, 01:04 PM
pfeaster > 07-11-2022, 03:46 PM
(05-11-2022, 09:18 PM)obelus Wrote: You are not allowed to view links. Register or Login to view.Fully 142/167 of the word tokens are present verbatim in the source. The remainders are
Shekchdy, tolor, rchdy, omolShey, olkear, otchorol, karaly, dchedeey, Shedyo, otaiil, olalchkalches, qoteedair, darorar, pychedy, tchdar, aldShedy, laiikam, qoetchy, tcheokedeedy, tary, okolkeey, lShlchep, okaoy, rSheody, qoto
(05-11-2022, 09:18 PM)obelus Wrote: You are not allowed to view links. Register or Login to view.In this rather dry sense, the higher-order matrices descriptively account for all patterns, and provide no explanations.
nablator > 07-11-2022, 05:38 PM
(07-11-2022, 03:46 PM)pfeaster Wrote: You are not allowed to view links. Register or Login to view.I'm not sure why the percentage goes down from my third-order sample to obelus's fourth-order sample -- I'd have guessed it would be the other way around.
Torsten > 07-11-2022, 11:16 PM
(04-11-2022, 04:21 PM)pfeaster Wrote: You are not allowed to view links. Register or Login to view.Torsten's [ol], [chedy], and [daiin] prototypes correspond rather closely to closed loops [cholcholchol...], [qokeedyqokeedyqokeedy...], and [daiindaiindaiin...], made up of the highest-probability sequences of individual graphemes in particular parts of the manuscript, such that if transitional probability matrices have high predictive power in themselves, we should also expect to see word frequencies correlate with "edit distance" as Torsten has found -- though not exactly, since the words [ol] and [chedy] will often represent partial cycles in contexts such as [~r.ol~] and [~l.chedy~]. So I think it might be possible to predict the "network" patterns on the basis of transitional probability matrices, with no need to factor in "edit distance" as such. Perhaps we could think of a statistical test that would yield a different result depending on which model is more effective.
obelus > 08-11-2022, 04:57 AM
(07-11-2022, 03:46 PM)pfeaster Wrote: You are not allowed to view links. Register or Login to view.Like any raw data, they need analysis and interpretation. Examples of the kind of "rule" we might infer from them could look something like this (ignoring spaces):
- In Currier B, a glyph preceded by e is more likely than usual to be followed by y.
- In Currier B, d is more likely to be followed by a than by y if it is preceded by y, r, p, n, or l.
Emma May Smith > 09-11-2022, 02:00 AM