Urtx13 > 24-04-2025, 08:20 AM
oshfdk > 24-04-2025, 09:25 AM
nablator > 24-04-2025, 09:37 AM
Urtx13 > 24-04-2025, 09:43 AM
(24-04-2025, 09:25 AM)oshfdk Wrote: Hi and welcome!
requirements.txt is missing from the repository
Could you give a high-level, plain-text description of what this is? I found your other post that explains the idea.
But I'm not sure I understand it. Is the analysis based on word token distributions?
Also, how was this seed 1405 selected?
nablator > 24-04-2025, 09:52 AM
oshfdk > 24-04-2025, 10:20 AM
(24-04-2025, 09:43 AM)Urtx13 Wrote: What the project does (in plain terms)
Urtx13 > 24-04-2025, 10:33 AM
(24-04-2025, 09:52 AM)nablator Wrote: I installed Python 3.13.2 from the Microsoft Store (the only version offered; I don't know if it's suitable for this project).
pip install -r requirements.txt
ended with:
..\meson.build:78:0: ERROR: Unknown compiler(s): [['ifort'], ['gfortran'], ['flang-new'], ['flang'], ['pgfortran'], ['g95']]
I guess I need a Fortran compiler.
---
The new requirements.txt (with different content and version numbers) fails sooner, with a long stack trace:
pip install -r requirements.txt
Defaulting to user installation because normal site-packages is not writeable
Collecting numpy==1.24.4 (from -r requirements.txt (line 1))
Using cached numpy-1.24.4.tar.gz (10.9 MB)
Installing build dependencies ... done
Getting requirements to build wheel ... done
ERROR: Exception:
Traceback (most recent call last):
File "C:\Program Files\WindowsApps\PythonSoftwareFoundation.Python.3.13_3.13.1008.0_x64__qbz5n2kfra8p0\Lib\site-packages\pip\_internal\cli\base_command.py", line 106, in _run_wrapper
status = _inner_run()
...
pip._vendor.pyproject_hooks._impl.BackendUnavailable: Cannot import 'setuptools.build_meta'
---
So I installed gfortran (from gcc-14.2.0-64.exe), removed the version numbers from requirements.txt, and now I get a different error:
Run-time dependency pybind11 found: YES 2.12.1
Run-time dependency scipy-openblas found: NO (tried pkgconfig)
Run-time dependency openblas found: NO (tried pkgconfig and cmake)
Run-time dependency openblas found: NO
..\scipy\meson.build:163:9: ERROR: Dependency lookup for OpenBLAS with method 'pkgconfig' failed: Pkg-config for machine host machine not found. Giving up.
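A note on the logs above: pip is shown fetching numpy-1.24.4.tar.gz, which is a source archive, so it looks like no prebuilt wheel matched Python 3.13 and pip fell back to compiling. NumPy 1.24.x lists support for Python 3.8 to 3.11. A minimal sketch of a pre-install check under that assumption follows; the function name `wheel_likely_available` and the constant `NUMPY_1_24_RANGE` are made up for illustration and are not part of the project.

```python
import sys

# Assumption: NumPy 1.24.x supports CPython 3.8 through 3.11, so prebuilt
# wheels are expected only for those minor versions.
NUMPY_1_24_RANGE = ((3, 8), (3, 11))


def wheel_likely_available(version=None, supported=NUMPY_1_24_RANGE):
    """Return True if the (major, minor) version falls inside the supported range."""
    major_minor = tuple((version or sys.version_info)[:2])
    low, high = supported
    return low <= major_minor <= high


if wheel_likely_available():
    print("This interpreter should get a prebuilt numpy 1.24.x wheel.")
else:
    print("Expect a source build (compilers needed); consider Python 3.8 to 3.11.")
```

If the check fails, installing one of the listed Python versions is probably simpler than setting up gfortran, pkg-config and OpenBLAS on Windows.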
Urtx13 > 24-04-2025, 10:41 AM
(24-04-2025, 10:20 AM)oshfdk Wrote: (24-04-2025, 09:43 AM)Urtx13 Wrote: What the project does (in plain terms)
Sorry, but this is probably not plain enough for me; I'm still stumped. Could you try explaining it in simpler terms, or, maybe better, let's try some Q&A?
1) You mention cycles, but I don't understand how these cycles relate to the text. Is it topics cycling through folios? Do we have to assume the present ordering of folios for these cycles to make sense? Are there cycles within folios?
2) Basically, as far as I can see, the model splits the text into 4 topics (the number of topics is imposed on the model), and then it is tested whether the model can correctly identify the topic based on the tokens. I'm not sure what this proves exactly. I would assume that if you take any text separated into chunks and ask a model to split the chunks into 4 topics, the model will find some split based on token frequencies and will then successfully sort new chunks according to that split. I think I'm missing something here.
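The concern in point 2 can be illustrated with a toy experiment. This is only a sketch of the objection, not a reproduction of the project's pipeline: k-means stands in for the topic model (an assumption, since the thread does not say which model is used), the "text" is uniformly random tokens with no topics at all, and every name here (`VOCAB`, `circular_accuracy`, and so on) is hypothetical. Because the invented topic labels are a deterministic function of token frequencies, a classifier given the same frequencies can recover them on unseen chunks.

```python
import random
from collections import Counter

VOCAB = [f"w{i}" for i in range(12)]  # hypothetical 12-word vocabulary


def freq_vector(tokens):
    """Relative frequency of each vocabulary word in one chunk."""
    counts = Counter(tokens)
    return [counts[w] / len(tokens) for w in VOCAB]


def nearest(vec, centroids):
    """Index of the centroid closest to vec (squared Euclidean distance)."""
    dists = [sum((a - b) ** 2 for a, b in zip(vec, c)) for c in centroids]
    return dists.index(min(dists))


def mean(vectors):
    """Component-wise mean of a non-empty list of vectors."""
    return [sum(col) / len(vectors) for col in zip(*vectors)]


def kmeans(vectors, k, rng, max_iter=100):
    """Plain k-means; returns the final centroids."""
    centroids = rng.sample(vectors, k)
    labels = None
    for _ in range(max_iter):
        new_labels = [nearest(v, centroids) for v in vectors]
        if new_labels == labels:  # assignments stable: converged
            break
        labels = new_labels
        for j in range(k):
            members = [v for v, l in zip(vectors, labels) if l == j]
            if members:  # keep the old centroid if a cluster empties
                centroids[j] = mean(members)
    return centroids


def circular_accuracy(n_chunks=200, chunk_len=40, k=4, seed=0):
    rng = random.Random(seed)
    # Structureless "text": every token is drawn uniformly at random.
    chunks = [[rng.choice(VOCAB) for _ in range(chunk_len)] for _ in range(n_chunks)]
    vectors = [freq_vector(c) for c in chunks]
    split = int(0.75 * n_chunks)
    train, held_out = vectors[:split], vectors[split:]

    # Step 1: the stand-in "topic model" invents k topics from the training chunks.
    topic_centroids = kmeans(train, k, rng)
    train_labels = [nearest(v, topic_centroids) for v in train]
    held_out_labels = [nearest(v, topic_centroids) for v in held_out]

    # Step 2: a nearest-centroid classifier learns the invented labels from
    # the training chunks only, then "predicts" topics for unseen chunks.
    clf_centroids = [
        mean([v for v, l in zip(train, train_labels) if l == j] or [topic_centroids[j]])
        for j in range(k)
    ]
    predictions = [nearest(v, clf_centroids) for v in held_out]
    hits = sum(p == t for p, t in zip(predictions, held_out_labels))
    return hits / len(held_out)


if __name__ == "__main__":
    print(f"accuracy on random text: {circular_accuracy():.2f} (chance is 0.25)")
```

High accuracy here would say nothing about real structure, which is why a comparison against shuffled or synthetic control text seems necessary before reading meaning into the classifier's score.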
nablator > 24-04-2025, 10:46 AM
(24-04-2025, 10:33 AM)Urtx13 Wrote: The requirements.txt has been updated and works correctly under those versions.
Let me know if switching Python solves it — happy to help further if needed!
oshfdk > 24-04-2025, 10:48 AM
(24-04-2025, 10:41 AM)Urtx13 Wrote: There’s a 4-phase cycle repeating through the folios, like seasons.
We detect it using statistics, such as token entropy, topic modeling, and FFT (like finding beats in music), among others.
These cycles show up between folios and inside them, and they break when shuffled. So it’s not just the order — it’s baked into the content.
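To make the quoted description concrete, here is a minimal sketch of the kind of check it seems to describe: compute one entropy value per folio, measure the spectral power at a period of 4 folios, and compare against shuffled folio orders. It is not the project's code. The folio data is synthetic, a single discrete Fourier bin is used instead of a full FFT, and all names (`token_entropy`, `power_at_period`, `shuffle_p_value`) are hypothetical.

```python
import cmath
import math
import random
from collections import Counter


def token_entropy(tokens):
    """Shannon entropy (in bits) of the token frequency distribution."""
    counts = Counter(tokens)
    total = sum(counts.values())
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def power_at_period(series, period):
    """Squared magnitude of the Fourier component at the given period,
    computed after removing the mean of the series."""
    n = len(series)
    avg = sum(series) / n
    k = n / period  # frequency index for this period
    acc = sum(
        (x - avg) * cmath.exp(-2j * math.pi * k * i / n)
        for i, x in enumerate(series)
    )
    return abs(acc) ** 2


def shuffle_p_value(series, period=4, n_shuffles=200, seed=0):
    """Fraction of shuffled orderings whose power at `period` is at least
    as large as the observed one (with the usual +1 correction)."""
    rng = random.Random(seed)
    observed = power_at_period(series, period)
    shuffled = list(series)
    hits = 0
    for _ in range(n_shuffles):
        rng.shuffle(shuffled)
        if power_at_period(shuffled, period) >= observed:
            hits += 1
    return (hits + 1) / (n_shuffles + 1)


if __name__ == "__main__":
    # Synthetic folios: vocabulary size cycles 2, 4, 8, 16, so the entropy
    # per folio cycles through 1, 2, 3, 4 bits with period 4.
    folios = [[f"w{t}" for t in range(2 ** (1 + j % 4))] for j in range(32)]
    series = [token_entropy(f) for f in folios]
    print("power at period 4:", power_at_period(series, 4))
    print("shuffle p-value:", shuffle_p_value(series))
```

Note that shuffling the folio order can only test cycles between folios. A claim about cycles inside folios would need a separate control, for example shuffling tokens or lines within each folio.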