Jorge_Stolfi > 18-11-2025, 07:52 PM
(18-11-2025, 04:49 PM)quimqu Wrote: You are not allowed to view links. Register or Login to view.But this needs to be tested and proofed... I really don't know if this can be so easy... (Guess not).
quimqu > 19-11-2025, 09:21 AM
| corpus | C | L | modularity | assort_degree | reciprocity | TTR | gini_deg | zipf_slope | betweenness_avg | resilience_frac_to_half |
|---|---|---|---|---|---|---|---|---|---|---|
| English | 0.7666 | 2.0371 | 0.7124 | 0.0198 | 0.1373 | 0.0298 | 0.6996 | -1.1380 | 0.0002 | 0.2199 |
| French | 0.7739 | 2.1227 | 0.7270 | 0.0349 | 0.1119 | 0.0256 | 0.6927 | -1.1320 | 0.0001 | 0.1908 |
| Latin | 0.6570 | 2.2485 | 0.8165 | 0.0159 | 0.0801 | 0.0089 | 0.6299 | -1.2245 | 0.0001 | 0.2200 |
| Mandarin Other | 0.7207 | 1.9919 | 0.1300 | 0.4068 | 0.1650 | 0.0115 | 0.6303 | -1.0823 | 0.0004 | 0.3652 |
| Mandarin Union | 0.7135 | 1.9913 | 0.1380 | 0.4069 | 0.1757 | 0.0107 | 0.6189 | -1.0910 | 0.0004 | 0.3737 |
| Russian | 0.6752 | 2.2441 | 0.7944 | 0.0270 | 0.0835 | 0.0104 | 0.6744 | -1.2164 | 0.0001 | 0.2222 |
| Spanish | 0.7691 | 2.0930 | 0.7474 | 0.0216 | 0.1242 | 0.0119 | 0.6941 | -1.1710 | 0.0001 | 0.1753 |
| Vietnamese | 0.7649 | 2.1641 | 0.7484 | -0.3592 | 0.1712 | 0.0275 | 0.7068 | -1.1176 | 0.0001 | 0.2500 |
| German | 0.7590 | 2.0996 | 0.7915 | 0.0163 | 0.1262 | 0.0129 | 0.6992 | -1.1534 | 0.0001 | 0.1987 |
| Hebrew | 0.6145 | 2.8124 | 0.3923 | -0.0871 | 0.0195 | 0.0045 | 0.5409 | -1.3817 | 0.0001 | 0.2203 |
Rafal > 19-11-2025, 01:31 PM
Quote:The ELHM is morphologically plural ("gods") but seems to be grammatically singular and now assumed to be a conventional/honorific way to refer to (the single) God.
It really matters if you write Hebrew without vowels (like in ancient Biblical times) or with them like currently. Adding vowels makes the number of unique words grow several times. You seem to use vowel included transcription here. And Hebrew has declension (I was earlier wrong). So the data may be correct after all.quimqu > 19-11-2025, 01:59 PM
(19-11-2025, 01:31 PM)Rafal Wrote: You are not allowed to view links. Register or Login to view.PCA2 captures how robust or centralized the graph is.
I have problems with imagining it. Quimqu, are you able to give as example a sample of text which is robust and which isn't. Or does such feature emerge only for big texts?
(19-11-2025, 01:31 PM)Rafal Wrote: You are not allowed to view links. Register or Login to view.And going back to Voynich Manuscript... How would you say, what is VM position on PCA1 and PCA2?
Rafal > 19-11-2025, 02:20 PM
Quote:But you can see how the MS behaves compared with simmilar corpus (in length) and different languages in this You are not allowed to view links. Register or Login to view.
quimqu > 19-11-2025, 02:30 PM
(19-11-2025, 02:20 PM)Rafal Wrote: You are not allowed to view links. Register or Login to view.Do I think correctly that dimensions and their interpretation in Principal Component Analysis emerge from the used data and so are different each time?
So is the meaning of PCA1 and PCA2 the same on both graphs?
quimqu > 20-11-2025, 09:39 AM
| Voynich variant | Language 1 | Language 2 | Language 3 | Language 4 | Language 5 | Language 6 | Language 7 | Language 8 | Language 9 | Language 10 |
|---|---|---|---|---|---|---|---|---|---|---|
| Voynich CUVA | Mandarin Union (15516.18) | English (17416.91) | Mandarin Other (20211.07) | Vietnamese (36435.13) | Spanish (63245.70) | French (67399.46) | German (74894.93) | Hebrew (125298.75) | Russian (126951.13) | Latin (140980.96) |
| Voynich EVA | Mandarin Union (14941.14) | English (16929.77) | Mandarin Other (19719.66) | Vietnamese (36023.59) | Spanish (63012.04) | French (67169.80) | German (74703.50) | Hebrew (125042.78) | Russian (126656.67) | Latin (140810.12) |
| Voynich EVA A | Mandarin Union (102035.97) | English (106230.68) | Mandarin Other (107906.38) | Vietnamese (125713.50) | Spanish (153639.03) | French (155921.74) | German (165153.76) | Hebrew (215671.55) | Russian (217425.71) | Latin (230164.48) |
| Voynich EVA B | Mandarin Union (51741.69) | English (55704.18) | Mandarin Other (57564.85) | Vietnamese (75215.28) | Spanish (102978.62) | French (107093.42) | German (114510.12) | Hebrew (165138.20) | Russian (166790.26) | Latin (180836.49) |
Jorge_Stolfi > 20-11-2025, 10:19 AM
(20-11-2025, 09:39 AM)quimqu Wrote: You are not allowed to view links. Register or Login to view.the differences are very large and indicate a completely different behavior from any natural language.ces, but it does not imply that the Voynich behaves like or unlike any specific language in terms of meaning or topic.
Philipp Harland > 20-11-2025, 10:26 AM
(18-11-2025, 12:48 AM)Philipp Harland Wrote: You are not allowed to view links. Register or Login to view.Seems like a very interesting method. I don't know if it's in the literature or not but it's fascinating nonetheless.Is it truly research-grade, i.e. can it produce non-trivial results that couldn't be produced without it? It seems like it's working pretty well for the VMS.
nablator > 20-11-2025, 10:40 AM
(07-11-2025, 11:24 PM)quimqu Wrote: You are not allowed to view links. Register or Login to view.This first part of the analysis is based on directed word-to-word graphs, where each edge connects a token A → B if word B follows A in the text. This approach keeps the natural direction of information flow, unlike undirected co-occurrence graphs that only record proximity.