rikforto > 25-06-2026, 09:24 PM
Quote: domus militis regis fenestram rotundam habet.Ah, of course, the house of the military dictator standing in until the king comes of age has fat windows. I am being a little difficult here, but the thing I am being worst about (rotund as "fat") is offset by the great unlikelihood that illiterate English speakers would readily recognize "rotund", "fenestra", and perhaps even the "dom-" root, though I will not be so forceful about that. The roots only get you so far, and can actively mislead. Japanese is the analogy here if you imagine English doublets, and that's the Eastern language have the most formal study of, and on'yomi-kun'yomi correspondences are as perilous as they are helpful. Opening up my kanji old textbook, I see a correspondence between "polish" and "toilet"; "reply" and "turn"; "run" and "to write hastily". In each case, seeing how they got associated makes this seem inevitable, but at the same time, you illustrate that having "ball" and "round" share a word could well lead people astray.
domicile military regent fenestra rotund have
JoeyB > 26-06-2026, 03:01 AM
JoeyB > 26-06-2026, 03:05 AM
Jorge_Stolfi > 26-06-2026, 10:17 AM
(25-06-2026, 03:18 PM)JoJo_Jost Wrote: You are not allowed to view links. Register or Login to view.My point was not about raw character frequencies! It was about a conditional dependency across a visible token boundary: the internal form before final y changes the probability that the next token begins with qo.
Quote:If qo is “and”, the distribution should mainly follow syntax or enumeration structure, not the internal shape of the previous token!!!
Quote:If y is a pitch marker, the model still has to explain why the material before y changes the following qo probability so strongly. what is the concrete mechanism that connects the internal pre-y form with the next token’s qo?
rikforto > 26-06-2026, 11:30 AM
Jorge_Stolfi > 26-06-2026, 04:12 PM
(25-06-2026, 09:24 PM)rikforto Wrote: You are not allowed to view links. Register or Login to view.So, I at least now understand where the misunderstanding is. To use the Suassarian formulation, Chinese characters are not the signified, they are the signifier. So in any given language it's not that a character represents the lexeme of "house, home, dwelling, abode" it's that it represents one morpheme. ... And if the passage is Latin, not English, they will probably scoff at the idea that it says "house" even if the reading exists in an an English character dictionary and they are correct because only an idiot would think a Latin passage had the signifier "house". ...
Quote:For instance, all of "maison", "domus", and "casa" would have different characters, and all those characters would be in all three of French, Latin, and Spanish because all three of those morphemes exist in each language.
Quote:an English speaker would have "house" available as a reading on some Romance word---probably the character for <domus> thinking about the history of Latin in England [...] And if the passage is Latin, not English, they will probably scoff at the idea that it says "house" even if the reading exists in an an English character dictionary and they are correct because only an idiot would think a Latin passage had the signifier "house".
Quote:an English speaker educated in Latin logographs for writing English but not Latin would have some clues....That reading would be totally silly for a number of reasons. First, the English native would almost certainly not know Latin. He would not even know that whoever wrote that sentence first thought of it as Latin. Even if he knew, he would not know which English words are derived from Latin words --- that "house" does not but "domicile" does, etc..
Quote:domicile military regent fenestra rotund have
igajkgko > 26-06-2026, 05:44 PM
(26-06-2026, 04:12 PM)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.But that is not true for the Chinese script -- because it is not phonetic! In Chinese writing, each symbol (or pair of symbols) represents directly a concept. Each symbol also represents a sound; but it represents a different sound for the speakers of different languages, while the concept is essentially the same for all Sinitic languages (and even, to some extent, for some non-Sinitic ones).
Jorge_Stolfi > 26-06-2026, 08:13 PM
(26-06-2026, 05:44 PM)igajkgko Wrote: You are not allowed to view links. Register or Login to view.I'm not sure if it's relevant to your argument in the end, but this seems like a simplification. While I'm no expert in Chinese, I'm reminded of You are not allowed to view links. Register or Login to view. which I had a passing interest in once upon a time as a student of Japanese.
Quote:the Chinese have a system of writing similar to the hieroglyphic signs of the Egyptians ... they do not express their concepts by writing, like most of the world, with a few alphabetic signs, but they paint as many symbols as there are words. ... each word has its own hieroglyphic character, ... there are no fewer symbols than words.Except of course that we now know that the Egyptian hierglyphs were mostly phonetic, not ideogaphic or "lexographic".
ReneZ > 27-06-2026, 01:06 AM
(26-06-2026, 04:12 PM)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.But that is not true for the Chinese script -- because it is not phonetic! In Chinese writing, each symbol (or pair of symbols) represents directly a concept.
pfeaster > 27-06-2026, 02:34 AM
(26-06-2026, 10:17 AM)Jorge_Stolfi Wrote: You are not allowed to view links. Register or Login to view.Is that because of some mysterious rule that makes the word 'and' attract the letter 's'? Not quite. Here are the counts for the words that occur before the word 'and':
262 vertues!and
95 bruised!and
87 wine!and
83 liver!and
79 them!and [...]