Jorge_Stolfi > 23-09-2025, 05:04 AM
(22-09-2025, 10:27 PM)quimqu Wrote: The four plots contain the same data but are ordered differently. Each color is a topic. For each folio (think of a single vertical column), the vertical stack shows the proportions of topics on that folio. If a folio is a single color, all its paragraphs fall into the same topic; if it’s multicolored, different paragraphs are assigned to different topics.
quimqu > 23-09-2025, 09:43 AM
(23-09-2025, 05:04 AM)Jorge_Stolfi Wrote: What got me confused is that the plots show gradual transitions (slanted lines) between topics. But those slanted lines are an artifact of the plotting routine. For instance, from [link] to f17v the plot suggests a gradual transition, but the transition is actually abrupt: [link] is 100% "brown", [link] is 100% "green", then [link] is again 100% "brown". Right?
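To make the point about abrupt transitions concrete, here is a minimal sketch of how per-folio topic proportions could be computed from paragraph-level assignments. The folio names (`fA` to `fD`), the topic labels, and the helper `folio_proportions` are all hypothetical, and this is not necessarily how the plots in this thread were produced.

```python
from collections import Counter

# Hypothetical paragraph-level assignments: (folio, topic). Not real data.
assignments = [
    ("fA", "brown"), ("fA", "brown"),
    ("fB", "green"), ("fB", "green"), ("fB", "green"),
    ("fC", "brown"),
    ("fD", "brown"), ("fD", "green"),
]

def folio_proportions(pairs):
    """Return {folio: {topic: share}}, keeping folios in first-seen order."""
    counts = {}
    for folio, topic in pairs:
        counts.setdefault(folio, Counter())[topic] += 1
    return {
        folio: {topic: n / sum(c.values()) for topic, n in c.items()}
        for folio, c in counts.items()
    }

props = folio_proportions(assignments)
for folio, shares in props.items():
    print(folio, shares)
```

Each folio's column is computed on its own, so fA, fB and fC go 100% brown, 100% green, 100% brown with nothing in between. Slanted lines only appear if the plotting routine connects neighbouring columns with straight segments; drawing one stacked bar per folio would avoid that impression.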
Bernd > 23-09-2025, 11:32 AM
quimqu > 23-09-2025, 12:24 PM
quimqu > Yesterday, 10:23 PM
nablator > Yesterday, 10:33 PM
(Yesterday, 10:23 PM)quimqu Wrote: Let's check one of the folios where the detected language is not the Currier language indicated, [link]
[link] has 10 "edy" in its second paragraph, none in the first. What is the detected Currier language of the 2nd paragraph in isolation?
quimqu > 8 hours ago
(Yesterday, 10:33 PM)nablator Wrote: (Yesterday, 10:23 PM)quimqu Wrote: Let's check one of the folios where the detected language is not the Currier language indicated, [link]
[link] has 10 "edy" in its second paragraph, none in the first. What is the detected Currier language of the 2nd paragraph in isolation?
What about [link]? It has 5 "edy" in its second paragraph, none in the first, so there seems to be a transition from A to B there too.
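For anyone who wants to repeat this kind of check, a minimal sketch of a per-paragraph "edy" count follows. It assumes that "edy" means words ending in "edy" in an EVA-style transliteration with spaces or periods between words; the posts above do not say exactly how the counts were made. The two sample paragraphs are illustrative strings, not transcribed from any folio, and `count_suffix` is a hypothetical helper.

```python
def count_suffix(paragraph, suffix="edy"):
    """Count words (split on whitespace or periods) that end with `suffix`."""
    words = paragraph.replace(".", " ").split()
    return sum(1 for word in words if word.endswith(suffix))

# Illustrative paragraphs only; not taken from any real folio.
paragraphs = [
    "daiin chol shol chor cthy",
    "qokeedy shedy chedy daiin otedy",
]

counts = [count_suffix(p) for p in paragraphs]
for number, n in enumerate(counts, start=1):
    print(f"paragraph {number}: {n} words ending in 'edy'")
```

A jump from zero in one paragraph to several in the next is the kind of pattern described above. Counting substrings instead of word endings would be a different assumption and could give different numbers.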
nablator > 7 hours ago
(8 hours ago)quimqu Wrote: the punctuation for language A is 3 times the punctuation for language B
quimqu > 6 hours ago
(7 hours ago)nablator Wrote: (8 hours ago)quimqu Wrote: the punctuation for language A is 3 times the punctuation for language B
Punctuation? Do you mean cumulative weight or number of words or some other metric?
quimqu > 3 hours ago
Currier | Pred A | Pred B |
---|---|---|
Language A | 109 | 1 |
Language B | 16 | 77 |
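
As a quick reading aid for the table above, here is a small sketch that derives overall agreement and per-language recall from the four counts. It assumes rows are the Currier label and columns are the predicted label, as the header suggests; the table does not say whether the units are folios or paragraphs, so the code makes no claim about that either.

```python
# Counts copied from the table: confusion[currier_label][predicted_label].
confusion = {
    "A": {"A": 109, "B": 1},
    "B": {"A": 16, "B": 77},
}

total = sum(sum(row.values()) for row in confusion.values())
correct = sum(confusion[label][label] for label in confusion)
accuracy = correct / total

# Recall: share of each Currier language that was predicted as itself.
recall = {
    label: row[label] / sum(row.values())
    for label, row in confusion.items()
}

print(f"items: {total}, agreement: {accuracy:.3f}")
for label, value in recall.items():
    print(f"recall {label}: {value:.3f}")
```

With these counts, 186 of 203 items agree with the Currier label (about 0.916). Almost all of the disagreement is on one side: 16 Currier B items are predicted as A, against a single Currier A item predicted as B.
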

Section | Dominant |
---|---|
Astronomical, Pharmaceutical, Zodiac | A |
Biological (balneological), Marginal stars, Text-only | B |
Herbal, Cosmological | Mixed |

Currier hand | %A | %B |
---|---|---|
1, 4 | ~100 | 0 |
2 | 43 | 57 |
3, 5, X, Y | 0–18 | 82–100 |

writting_hand | %A (folios) | %B (folios) | %A (paragraphs) | %B (paragraphs) | Observation |
---|---|---|---|---|---|
1 | 1.00 | 0.00 | 0.984 | 0.016 | Clearly A |
2 | 0.00 | 1.00 | 0.078 | 0.922 | Clearly B |
3 | 0.097 | 0.903 | 0.081 | 0.919 | Consistently B |
4 | 0.933 | 0.067 | 0.805 | 0.195 | Mostly A, with some B contamination |
5 | 0.143 | 0.857 | 0.263 | 0.737 | Clearly B but with more mixing than 2–3 |
@ | 0.00 | 1.00 | 0.231 | 0.769 | Clearly B, with slight local variation |
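
The folio and paragraph columns in the last table can differ for the same hand (for example 0.00 vs 0.078 for hand 2). One way that can happen is sketched below, under a loud assumption: that a folio is labelled by the majority language of its paragraphs, while the paragraph columns count paragraphs directly. The thread does not confirm this is how the table was built, and the rows, hand name `h1`, folio names and the function `language_shares` are all hypothetical.

```python
from collections import Counter, defaultdict

# Hypothetical rows: (hand, folio, predicted language of one paragraph).
rows = [
    ("h1", "f1", "B"), ("h1", "f1", "B"), ("h1", "f1", "A"),
    ("h1", "f2", "B"), ("h1", "f2", "B"),
]

def language_shares(rows):
    """Share of language A per hand, at folio level and at paragraph level."""
    per_hand_paragraphs = defaultdict(Counter)
    per_folio = defaultdict(Counter)
    for hand, folio, lang in rows:
        per_hand_paragraphs[hand][lang] += 1
        per_folio[(hand, folio)][lang] += 1

    # Assumption: a folio takes the majority label of its paragraphs
    # (on a tie, the label seen first wins).
    per_hand_folios = defaultdict(Counter)
    for (hand, _folio), langs in per_folio.items():
        majority = langs.most_common(1)[0][0]
        per_hand_folios[hand][majority] += 1

    return {
        hand: {
            "A_folios": per_hand_folios[hand]["A"] / sum(per_hand_folios[hand].values()),
            "A_paragraphs": paragraphs["A"] / sum(paragraphs.values()),
        }
        for hand, paragraphs in per_hand_paragraphs.items()
    }

result = language_shares(rows)
print(result)
```

In this toy case every folio is majority B, so the folio-level share of A is 0, yet one paragraph in five is A. Under that assumption a hand can look "clearly B" by folio and still show a small A share by paragraph.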