Rafal > 09-06-2025, 11:04 PM
davidd > 10-06-2025, 09:23 AM
(09-06-2025, 11:04 PM)Rafal Wrote: You are not allowed to view links. Register or Login to view.Davidd, would you like to try Genesis in Latin?score: 280
I believe it may be interesting and important test which may give different result than English version.
If anybody wonders - Latin has declension which is mostly absent in English. While in English it is always for example "Egypt", in Latin it may be "Aegypto", "Aegypti", "Aegyptum" etc.
I believe your model is not "clever" enough to know that it is the same word and will treat these variants as totally different words.
I wonder how it may impact the found word groups and their "quality" which you count with your statistics. And I am not sure.
I include Vulgate version of Genesis if you needed it.
dashstofsk > 10-06-2025, 10:27 AM
Rafal > 10-06-2025, 11:28 AM
Quote:You've lost me a bit. What is it that these numbers represent?
davidd > 10-06-2025, 11:34 AM
davidma > 10-06-2025, 12:06 PM
(10-06-2025, 11:44 AM)davidd Wrote: You are not allowed to view links. Register or Login to view.Did a second run of vulgata
score 248
You are not allowed to view links. Register or Login to view.
dashstofsk > 10-06-2025, 12:40 PM
(10-06-2025, 12:06 PM)davidma Wrote: You are not allowed to view links. Register or Login to view.follow Zattera's "sub-languages"
davidma > 10-06-2025, 01:46 PM
(10-06-2025, 12:40 PM)dashstofsk Wrote: You are not allowed to view links. Register or Login to view.(10-06-2025, 12:06 PM)davidma Wrote: You are not allowed to view links. Register or Login to view.follow Zattera's "sub-languages"
I believe this suggestion will lead you down a false path.
dashstofsk > 10-06-2025, 02:40 PM
(10-06-2025, 01:46 PM)davidma Wrote: You are not allowed to view links. Register or Login to view.enough vord tokens to still produce some meaningful analysis no?