Bernd > 02-11-2025, 11:05 PM
quimqu > 03-11-2025, 08:21 AM
(01-11-2025, 06:58 PM)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.You are sure that the other special characters [µß^~+`'°] are not deleted, remapped, or interpreted as separators at some point?
Jorge_Stolfi > 04-11-2025, 09:21 AM
Trithemius > 04-11-2025, 07:32 PM
quimqu > 04-11-2025, 08:45 PM
(04-11-2025, 07:32 PM)Trithemius Wrote: You are not allowed to view links. Register or Login to view.Hey quimqu,
Nice work--
I have also done some graph analysis, but slightly different. In my graph, each node is a folio and an edge connects two pages when they share one or more words that appear at least twice in the entire text and are 4 or more characters long. I then did cosine normalization to prevent pages with more words from dominating so that the connections would reflect shared content rather than just... content/word count.
Anyway what I found was that the manuscript seems to cluster into two clear groups. The left is (I think) pretty clearly herbal A, and the right is the balneological section and the final text-only pages. I think from this graph we can infer that the final pages concern themselves with the balneological matters, rather than the herbal or astrological ones.
Trithemius > 04-11-2025, 10:11 PM
(04-11-2025, 08:45 PM)quimqu Wrote: You are not allowed to view links. Register or Login to view.(04-11-2025, 07:32 PM)Trithemius Wrote: You are not allowed to view links. Register or Login to view.Hey quimqu,
Nice work--
I have also done some graph analysis, but slightly different. In my graph, each node is a folio and an edge connects two pages when they share one or more words that appear at least twice in the entire text and are 4 or more characters long. I then did cosine normalization to prevent pages with more words from dominating so that the connections would reflect shared content rather than just... content/word count.
Anyway what I found was that the manuscript seems to cluster into two clear groups. The left is (I think) pretty clearly herbal A, and the right is the balneological section and the final text-only pages. I think from this graph we can infer that the final pages concern themselves with the balneological matters, rather than the herbal or astrological ones.
Nice work!
Yes, you can take a look at my thread about You are not allowed to view links. Register or Login to view.. There, it is also clear that the last pages "talk" about balneological, but also about herbal. You can see how.tje automated found topics are.disteibuted.throughout the MS.
Jorge_Stolfi > 05-11-2025, 09:09 AM
(04-11-2025, 07:32 PM)Trithemius Wrote: You are not allowed to view links. Register or Login to view.Anyway what I found was that the manuscript seems to cluster into two clear groups. The left is (I think) pretty clearly herbal A, and the right is the balneological section and the final text-only pages. I think from this graph we can infer that the final pages concern themselves with the balneological matters, rather than the herbal or astrological ones.
quimqu > 05-11-2025, 01:42 PM
Jorge_Stolfi > 05-11-2025, 04:59 PM
(05-11-2025, 01:42 PM)quimqu Wrote: You are not allowed to view links. Register or Login to view.Here are the plots with Jorge Stolfi's "Dom Casmurro's"
Quote:Compared with Lazarillo de Tormes, Dom Casmurro is more uniform and modern in structure. Lazarillo shows greater lexical variety and less repetition, giving it a sparser, more modular network. This can be the difference between current Spanish and older, as the older has a less standardized syntax (16th-century Spanish) and it has a heavier use of subordinate clauses.
Quote:The Manx text is clearly different (but very away from the Voynich). It forms a compact network with shorter paths and higher clustering. It seems to be a morphologically rich language.
quimqu > 05-11-2025, 05:22 PM
(05-11-2025, 04:59 PM)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.Is Lazarillo the work of a single author, or a collection of stories by different authors that aggregated over a long time?