13-12-2025, 12:07 AM
(12-12-2025, 01:51 PM) mxv456 wrote: The whole analysis is based on IT2a-n.txt from [link]. Is this the correct choice? As far as I understand, it's a version of the TT transcription, but I don't know what the state of the art is.
A more recent, more accurate and more complete transliteration is this file: RF1b-er.txt
It is in the same format, so you should in principle be able to repeat the same analysis by just swapping the file.
At the highest level the two are very similar, so I would not expect to see any significant difference.
At the same time, comparing the results from the two files gives a good indication of the error (or uncertainty) in the input data.
Some statistics are far more sensitive to these changes/errors, for example those based on word counts.
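To put a number on that sensitivity, one option is to run the same statistics on both files and measure how far the results drift apart. The following is only a minimal sketch: it assumes each transliteration has already been parsed into a flat list of word tokens (the parsing itself is not shown), and the function names and toy words are hypothetical, not taken from the original analysis.

```python
from collections import Counter


def relative_frequencies(counts):
    """Turn raw counts into relative frequencies (empty input gives {})."""
    total = sum(counts.values())
    if total == 0:
        return {}
    return {key: n / total for key, n in counts.items()}


def total_variation(p, q):
    """Half the L1 distance between two discrete distributions.

    0.0 means identical, 1.0 means no overlap at all.
    """
    keys = set(p) | set(q)
    return 0.5 * sum(abs(p.get(k, 0.0) - q.get(k, 0.0)) for k in keys)


def compare_transliterations(words_a, words_b):
    """Compare two token lists on word-length and word-frequency statistics."""
    freq_a, freq_b = Counter(words_a), Counter(words_b)
    len_a = Counter(len(w) for w in words_a)
    len_b = Counter(len(w) for w in words_b)
    return {
        "token_count": (len(words_a), len(words_b)),
        "type_count": (len(freq_a), len(freq_b)),
        "word_length_tv": total_variation(
            relative_frequencies(len_a), relative_frequencies(len_b)
        ),
        "word_freq_tv": total_variation(
            relative_frequencies(freq_a), relative_frequencies(freq_b)
        ),
    }


# Toy input standing in for the two parsed files (NOT real data):
# the lists differ in a single reading, "daiin" versus "dain".
toy_a = ["daiin", "ol", "chedy", "daiin"]
toy_b = ["daiin", "ol", "chedy", "dain"]
result = compare_transliterations(toy_a, toy_b)
print(result)
```

A single differing reading already moves both distances in this toy case, which is the kind of effect to look for when swapping one file for the other. The size of the distances between the two real files then serves as a rough noise floor: differences between Currier A and B smaller than that are hard to distinguish from transliteration uncertainty.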
![word_length_stats_corrected_currier.png](https://github.com/Marvel4U/Voynich_semantic_exploration/blob/master/plots/word_length_stats_corrected_currier.png?raw=true)
![Zipf_stats_corrected_currier.png](https://github.com/Marvel4U/Voynich_semantic_exploration/blob/master/plots/Zipf_stats_corrected_currier.png?raw=true)
![bigram_heatmap_a_b_corrected_currier.png](https://github.com/Marvel4U/Voynich_semantic_exploration/blob/master/plots/bigram_heatmap_a_b_corrected_currier.png?raw=true)
![word_end_trigrams_a_b_corrected_currier.png](https://github.com/Marvel4U/Voynich_semantic_exploration/raw/master/plots/word_end_trigrams_a_b_corrected_currier.png)