Jorge_Stolfi > 07-11-2025, 06:00 AM
(07-11-2025, 12:46 AM)ReneZ Wrote: You are not allowed to view links. Register or Login to view.I have also played with my own:
You are not allowed to view links. Register or Login to view.
The 'fun' results are in Annex A.
MarcoP > 07-11-2025, 07:00 AM
ol(green)->shedy/chedy(purple)->qokedy(blue)->qokedy(blue,loop)
|
V
shedy/chedy(purple)magnesium > 07-11-2025, 09:47 PM
(01-11-2025, 12:16 AM)quimqu Wrote: You are not allowed to view links. Register or Login to view.Again, a quite dense post. So, I summarize here first the findings, and then, if you are interested, you can deepen into the dense part of the post.
quimqu > 07-11-2025, 11:24 PM
| KPI | Natural texts (min–max) | Voynich (min–max) | Artificial (Naibe, Timm) | Description | Voynich vs Natural / Artificial |
|---|---|---|---|---|---|
| H0_mean_lifetime | 0.79–0.97 | 0.81–0.85 | 0.79–0.89 | Mean persistence of connected components. Measures graph cohesion at 0-dim topology. | Voynich is slightly lower than natural texts, close to artificial, suggesting weaker component stability. |
| H1_frac_inf | ≈0.00 (none) | ≈0.00 (none) | ≈0.00 (none) | Fraction of infinite 1-dim holes. Reflects whether loops persist indefinitely. | No difference; all show finite loops only. |
| avg_clustering | 0.38–0.53 | 0.50–0.57 | 0.45–0.82 | Local density of triangles. Measures how often neighbors are connected. | Voynich has moderate clustering, slightly higher than natural texts but lower than some artificial graphs. |
| spectral_gap | 1.6–5.9 | 2.9–4.5 | 5.9 | Second Laplacian eigenvalue gap. Indicates overall graph connectivity and mixing speed. | Voynich falls mid-range; less connected than natural, less random than artificial. |
| kcore_max | 4–17 | 5–12 | 8–17 | Maximal k-core index. Captures how strongly nodes are mutually connected in dense cores. | Voynich has mid-core density, lower than typical artificial graphs. |
| flow_hierarchy | 0.76–0.91 | 0.58–0.65 | 0.51–0.64 | Fraction of edges in acyclic subgraphs. Quantifies directionality and information flow. | Voynich shows weaker directionality, closer to artificial graphs → more isotropic structure. |
| edge_reciprocity | 0.08–0.24 | 0.34–0.42 | 0.36–0.48 | Proportion of mutual (bidirectional) edges. Measures how often relations are symmetric. | Voynich has high reciprocity, like artificial ciphers, unlike natural directional flow. |
| entropy_rate | 0.53–1.59 | 0.89–1.87 | 1.74–2.01 | Average information rate of random walk transitions. Higher = less predictable. | Voynich entropy sits between natural and artificial → partially random transition dynamics. |
| degree_assortativity | -0.10–0.15 | ≈0.00 | ≈0.00 | Correlation between node degrees. Positive = hubs connect to hubs. | Voynich similar to artificial: nearly neutral assortativity (no social-like structure). |
| avg_shortest_path | 2.0–2.9 | 2.1–2.2 | 2.0–2.1 | Mean minimal steps between nodes. Reflects overall navigability. | Voynich is very close to artificial; both slightly shorter paths than natural languages. |
Rafal > 08-11-2025, 12:21 AM
quimqu > 08-11-2025, 09:22 AM
(08-11-2025, 12:21 AM)Rafal Wrote: You are not allowed to view links. Register or Login to view.Are you able to interpret pca1 and pca2, the new 2 dimensions that emerged from your analysis? What features of text do they describe?
Rafal > 17-11-2025, 12:29 PM
Jorge_Stolfi > 17-11-2025, 02:50 PM
quimqu > 17-11-2025, 03:47 PM
(17-11-2025, 12:29 PM)Rafal Wrote: You are not allowed to view links. Register or Login to view.Now as we are comparing different texts in different languages we don't know the source of differences.
The good candidate would be Bible as we can easily get it in most languages.
(17-11-2025, 02:50 PM)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.The Chinese texts, in particular, use an old encoding with two bytes per
Chinese character. There is a Linux program "autogb -i gb -o utf8" that
converts them to the modern Unicode/UTF8.
nablator > 17-11-2025, 05:19 PM
(17-11-2025, 02:50 PM)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.The Chinese texts, in particular, use an old encoding with two bytes per
Chinese character. There is a Linux program "autogb -i gb -o utf8" that
converts them to the modern Unicode/UTF8.