Mauro > 25-06-2025, 07:02 PM
(25-06-2025, 06:49 PM)oshfdk Wrote: You are not allowed to view links. Register or Login to view.(25-06-2025, 06:14 PM)Mauro Wrote: You are not allowed to view links. Register or Login to view.(25-06-2025, 04:33 PM)oshfdk Wrote: You are not allowed to view links. Register or Login to view.If I encode a character using the combination of the word length and the second character in the word (say, qOkedy is O-6 or S, cHedy is H-5 or A, oTey is T-4 or Y, pOchey is O-6 or S again), this is non-deterministic, but perfectly reversible.
Hmm sorry but I don't understand. It seems to me your encoding method is deterministic (no random numbers are involved) but non-reversible (is O-6 'qokedy' or 'qokain'?)
It's non deterministic because you can freely choose any word as long as the second letter is the one you need, and the whole word is of the required length. It's reversible, because the plaintext letter is uniquely determined by the word length and the second letter of the word.
Torsten > 25-06-2025, 11:21 PM
(24-06-2025, 11:07 AM)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.
- Binomial word-length distribution: Since you cited my 2002 webpage, you should know that East Asian monosyllabic languages do have a binomial word-length distribution.
- A single network of similar word forms: Monosyllabic languages have such "networks of similar forms" because of the fixed structure of the syllables
- Absence of semantic categories (e.g., nouns vs. verbs): Like most East Asian languages.
- No identifiable function words: Like most East Asian languages.
(24-06-2025, 11:07 AM)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.
- Lack of clear word order or repeated phrases: Again, this is a property of the text, not of the language. But East Asian languages do give the impression of having no fixed word order.
(24-06-2025, 11:07 AM)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.
- Deep correlation between frequency, similarity, and spatial proximity What do you mean, precisely? Those terms can describe normal features of texts in natural languages.
(24-06-2025, 11:07 AM)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.
- Random walk-like statistical behavior and long-range correlations: What do you mean, precisely? Texts in natural languages generally have "random-walk like behavior" and "long-range correlations".
(24-06-2025, 11:07 AM)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.
- Systematic shifts from Currier A to Currier B: Those "language shifts" are perfectly compatible with each section having been copied or summarized from a different book on a different topic -- even if the books were all in the same natural language.
(24-06-2025, 11:07 AM)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.
- Lines function as structural units: There are simple explanations for why the Scribe could have created such patterns. Like stretching, squeezing, or abbreviating the text to get line breaks falling on sentence boundaries when possible.
- No corrections or deleted sequences: Surely it was not written directly on vellum, even if it was a hoax. The Author wrote a final draft on paper and recruited a Scribe to copy it onto vellum. But in fact there are many instances of corrections and uncorrected errors made by the Scribe.
- Line-endings fit precisely into available space: Again, a normal product of a minimally competent Scribe.
(24-06-2025, 11:07 AM)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.
- Context-dependent self-similarity: Again, it is quite possible that a meaningful text in any language would have this feature.
nablator > 25-06-2025, 11:42 PM
(23-06-2025, 06:47 PM)dashstofsk Wrote: You are not allowed to view links. Register or Login to view.This takes time, work and effort. It wasn't 'garbage'.
Jorge_Stolfi > 26-06-2025, 07:21 AM
(25-06-2025, 04:42 PM)bi3mw Wrote: You are not allowed to view links. Register or Login to view.The binomial word length distribution of a text generated with Torsten`s SelfCitationTextgenerator ( 34,980 words, 198,497 characters , comparable to VMS ). The result is better than expected, at least the basic structure is recognizable.
You are not allowed to view links. Register or Login to view.
Jorge_Stolfi > 26-06-2025, 01:07 PM
(25-06-2025, 11:21 PM)Torsten Wrote: You are not allowed to view links. Register or Login to view.Language, by definition, is a system for conveying information, and word order is one of the primary mechanisms for signaling relationships between words — such as who is doing what to whom.
Quote:[natural laguages have] Preferred or default orders (e.g., Subject-Verb-Object in English, Subject-Object-Verb in Japanese). [...] This also holds true for East Asian languages. See for instance these articles about You are not allowed to view links. Register or Login to view.
Quote:[or natural languages will have] Morphological markers (like case endings) to compensate for word order flexibility.
Quote:[natural languages also have] Discourse patterns, where word order reflects emphasis or focus rather than grammar alone.
Quote:The respective frequency counts confirm the general principle: [in the VMS] high-frequency tokens also tend to have high numbers of similar words.
Quote:I disagree [about the Scribe clean-copying from a draft]. The Voynich text clearly responds to its physical container — the manuscript page. This suggests the text was generated during the act of writing, not copied mechanically from a pre-existing source. The scribe would need to actively choose, at minimum, the words at the end of each line to ensure that the text consistently fits neatly within the available space. That level of precise layout control isn't possible if the text was fully predetermined elsewhere.
Quote:[in the VMS] pairs of frequently used words with high mutual similarity appear. The exact cooccurrences may vary: there are pages where <daiin> is paired with <dain>, but also pages where it is frequently used together with <aiin> (f41v, f46r, f55v, f89v2, v105v and f114r) or <saiin> (f2r, f16r, and f90r2)."
Torsten > 26-06-2025, 03:20 PM
(26-06-2025, 01:07 PM)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.That does not mean that "in every language, certain words can only appear at the beginning/middle/end of sentences"
(26-06-2025, 01:07 PM)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.The division of the lexicon into four largely disjoint sets -- nouns, verbs, adjectives, and adverbs -- with well-distinguished grammatical roles is a feature of Indo-European languages, not a "linguistic universal". (II seem to recall Jacques Guy saying that linguists are still desperately trying to find one of these.) Even English, a creole language that lost most of the characteristic IE features, will often use the same word as any of those categories: "this is a stone", "this is a stone chisel", "stone him", "it is stone hard".
(26-06-2025, 01:07 PM)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.Quote:The respective frequency counts confirm the general principle: [in the VMS] high-frequency tokens also tend to have high numbers of similar words.
This is true in any natural language, no? Even in English? "cat" "bat" "fat" "hat" "mat" "pat" "rat" "sat" "vat" "kit" "cot" "cut" "cab" "cam" "can" "cap" "car" ... but not so much for "however" or "equinox" ...
Quote:[in the VMS] pairs of frequently used words with high mutual similarity appear. The exact cooccurrences may vary: there are pages where <daiin> is paired with <dain>, but also pages where it is frequently used together with <aiin> (f41v, f46r, f55v, f89v2, v105v and f114r) or <saiin> (f2r, f16r, and f90r2)."
Such occurrences are expected if the words are single syllables. Even in the random 5-sentence above there is a "mài shì zài".
(26-06-2025, 01:07 PM)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.When copying running text, Medieval European Scribes (like today's word processors) routinely disregarded line breaks in the draft and inserted line breaks, abbreviations, capitals, flourishes on their own. Fitting the lines neatly between margins was part of their basic skill set, just as preparing ink and shaping the pen.
Jorge_Stolfi > 26-06-2025, 05:59 PM
(26-06-2025, 03:20 PM)Torsten Wrote: You are not allowed to view links. Register or Login to view.In linguistics, word order refers to the structured arrangement of words in a sentence that signals grammatical relationships. It does not imply that specific words are rigidly restricted to the beginning, middle, or end of sentences.
Quote:Chinese provides a great example of how word order functions in a language that has minimal inflection (i.e., very few endings or case markers) and relies heavily on strict word order to convey meaning. Changing the word order can either make the sentence ungrammatical or completely change its meaning.
(26-06-2025, 01:07 PM)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.The division of the lexicon into four largely disjoint sets -- nouns, verbs, adjectives, and adverbs -- with well-distinguished grammatical roles is a feature of Indo-European languages, not a "linguistic universal". (II seem to recall Jacques Guy saying that linguists are still desperately trying to find one of these.) Even English, a creole language that lost most of the characteristic IE features, will often use the same word as any of those categories: "this is a stone", "this is a stone chisel", "stone him", "it is stone hard".
Quote:No, all known natural languages use for instance some form of function words or grammatical markers, though the way they appear can vary dramatically.
Quote:Function words are words that don't carry lexical meaning themselves, but instead serve a grammatical role — structuring the sentence and clarifying relationships between words.
Quote:There is no known natural language without function words or functional equivalents.
Quote:If the Voynich text represents plain natural language it should be easy to identify function words or some common markers used for indicating the relationships between words.
Quote:Sorry, but I’m not sure how your statement explains the specific situation in the Voynich Manuscript, where the line itself behaves as a functional unit, with both line-start and line-end patterns being consistently observable.
Torsten > 26-06-2025, 07:32 PM
(26-06-2025, 05:59 PM)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.Quote:Sorry, but I’m not sure how your statement explains the specific situation in the Voynich Manuscript, where the line itself behaves as a functional unit, with both line-start and line-end patterns being consistently observable.
"line starts and line end patterns are clearly observable" does not imply "lines are functional units". This is an hypothesis that needs good additional evidence to become likely.
If the line is a "functional unit", how do you explain that most of those "functional units" happen to have the right number of glyphs to precisely span the width of the text? And why is the last "functional unit" of a page almost always shorter, ending at some random point between the text rails (and, in some sections, it is the only "functional unit" that is like that)?
(26-06-2025, 05:59 PM)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.On page f112r, after writing the first "functional unit" at full width, the Scribe apparently noticed that there was something wrong with he vellum in that area (perhaps oily, or too absorbent, or too rough, or whatever) and thus reduced the text width by ~30% for "functional units" 2 to 20, and returned to normal width after that. Was that layout planned by the Author, too?
On page f112v, the same vellum problem apparently caused the Scribe to indent "functional units" 1-10 by ~3 cm, then exdent "functional unit" 11 by ~1 cm, and continue to the end of the page with a left rail that was quite a bit slanted away from vertical; while the right rail was all straight and vertical. Semantically significant too?
Torsten > 26-06-2025, 07:53 PM
(26-06-2025, 05:59 PM)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.Quote:Function words are words that don't carry lexical meaning themselves, but instead serve a grammatical role — structuring the sentence and clarifying relationships between words.
Sure. But here is what the dictionary says about the first function word of Mandarin that I picked at random with Google,
[font=-apple-system, BlinkMacSystemFont, Roboto, Helvetica, Arial, sans-serif]You are not allowed to view links. Register or Login to view. , [/font]yán:
- along [preposition]
- [font='Palatino Linotype', 'Book Antiqua', Palatino, serif]to follow (a line, tradition etc) [verb][/font]
- [font='Palatino Linotype', 'Book Antiqua', Palatino, serif]to carry on [verb][/font]
- [font='Palatino Linotype', 'Book Antiqua', Palatino, serif]to trim (a border with braid, tape etc) [verb][/font]
- [font='Palatino Linotype', 'Book Antiqua', Palatino, serif]border [noun? adjective?][/font]
- [font='Palatino Linotype', 'Book Antiqua', Palatino, serif]edge [noun][/font]
ReneZ > 27-06-2025, 01:49 AM
(26-06-2025, 05:59 PM)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.Sure. But here is what the dictionary says about the first function word of Mandarin that I picked at random with Google,
You are not allowed to view links. Register or Login to view. , yán:
- along [preposition]
- to follow (a line, tradition etc) [verb]
- to carry on [verb]
- to trim (a border with braid, tape etc) [verb]
- border [noun? adjective?]
- edge [noun]