Urtx13 > 24-04-2025, 01:53 PM
(24-04-2025, 01:44 PM)nablator Wrote: You are not allowed to view links. Register or Login to view.(24-04-2025, 01:00 PM)Urtx13 Wrote: You are not allowed to view links. Register or Login to view.Do you know what a seed is?
A seed is just a number that lets you start the pseudo-random process from the same place every time. Without it, you’d get different results every run, like someone running 1 km, but always starting from a different city.
Yes, absolutely clear. I'm a software engineer.I've been doing this in my programs for decades, setting the random generator seed to be able to get the same result in a later run of course.
My problem is conceptual/philosophical: let me try to explain it.
If any calculation based on a sequence of random numbers produces something that compares amazingly well with reality this is a property of the sequence of random numbers, not a property of reality.
It's like discovering the entire US Declaration of Independence coded in the decimals of pi at offset 14287468794577131. Yes, it's there somewhere, but what does the discovery mean? Absolutely nothing.
Urtx13 > 24-04-2025, 01:56 PM
(24-04-2025, 01:43 PM)ReneZ Wrote: You are not allowed to view links. Register or Login to view.A random number generator (RNG) creates an arbitrary sequence of numbers.
When two people run an experiment that includes a random number generator, they will not get exactly the same results. When the process that is being experimented is well-behaved (not chaotic), the results will be similar but not the same.
If one wants to verify an experiment on a process, then one wants to make sure that two people get exactly the same results. This can be done in case both people set the same 'seed' (starting point) for the RNG. (Note that there are also RNG's for which this will not work).
So people here are invited to use the seed 1405 in order to get exactly the same result. Using other seeds can be interesting in order to check that it creates similar results.
Urtx13 > 24-04-2025, 02:04 PM
(24-04-2025, 01:53 PM)oshfdk Wrote: You are not allowed to view links. Register or Login to view.(24-04-2025, 01:38 PM)Urtx13 Wrote: You are not allowed to view links. Register or Login to view.Mathematical rigor? The analysis includes entropy metrics, LDA modeling, supervised classification, permutation tests, ablation studies, cross-checking...
Actually, the assortment of tools is one of the things that makes the whole pipeline hard to evaluate for me. From my point of view, mathematical rigor implies using the minimum number of tools and transformations to clearly validate your hypothesis. The fact that there was a need to stack a number of models and analytical tools on top of each other only makes it harder to identify potential problems.
I wonder if cyclical nature of the folios was detected in the past research on topic modeling in the Voynich Manuscript? E.g., You are not allowed to view links. Register or Login to view.
RadioFM > 24-04-2025, 05:33 PM
Urtx13 > 24-04-2025, 10:40 PM
(24-04-2025, 05:33 PM)RadioFM Wrote: You are not allowed to view links. Register or Login to view.Just because seed no. 3,247,442 produces a paragraph of Shakespeare doesn't mean RandomShuffler.py has encoded Macbeth with that key LMAO
tavie > 24-04-2025, 10:48 PM
Urtx13 > 24-04-2025, 11:27 PM
(24-04-2025, 10:48 PM)tavie Wrote: You are not allowed to view links. Register or Login to view.For the few of us who only just learned this meaning of "seed" today, could you please explain your findings in layman's terms? What are the characteristics of each of the four phases? Could you list a folio for each phase and explain why it falls under that category?
RadioFM > 25-04-2025, 03:46 AM
Urtx13 > 25-04-2025, 05:19 AM
(25-04-2025, 03:46 AM)RadioFM Wrote: You are not allowed to view links. Register or Login to view.EDIT: Your deterministic mapping from Lunar_Angle to Phase in generate_lunar_angles_seed.py makes the Lunar_Angle a trivial predictor to Phase - is Lunar_Angle a feature fed into your ML model?
Regardless of the previous EDIT, would you mind answering these questions?
- The sweet spot is 4 phases; any other number doesn't yield such good results?
- How's the performance with a single decision tree instead of a random forest? Unless I'm doing something wrong, scores seem to be good.
- You're splitting the data 75% for training and 25% for testing during the shuffle/ablation/etc. validation step, using the same seed as during training, correct?
nablator > 25-04-2025, 09:09 AM
(24-04-2025, 11:50 AM)Urtx13 Wrote: You are not allowed to view links. Register or Login to view.The real breakthrough is realizing that the Voynich text, when processed this way, contains a cyclical rhythm that aligns with that specific structure — something we wouldn’t expect from a meaningless or random text.