10-02-2026, 11:41 PM
(10-02-2026, 09:57 PM)MHTamdgidi_(Behrooz) Wrote: You are not allowed to view links. Register or Login to view.It seems “guess” work has helped the charts quite a bit. “In the transcription files, the boundaries between those hypothetical parags had to be guessed based on dubious clues like the spacing of the lines; but they are almost certain to be wrong, because the actual breaks must be in the middle of lines. Besides those two large blocks, there are a few smaller suspicious text blocks on other pages, each being probably two or three parags merged into one. Misplacing the parag breaks would not affect the average parag size, but would affect its deviation and the shape of the histogram.” (p. 6). All this sounds like arbitrarily introducing line breaks or modifications,
That is a good point. After people claimed that the histograms were not similar enough for their liking, I spent easily 100 hours carefully re-transcribing the entire SPS and re-checking the parag breaks. But that effort was done without looking at the SBJ, and they involved only a fraction of the SPS (basically two out of the 22.6 pages), and in the end neither the breaks nor the transcription changed much relative to Rene's transcription. And did not improve the match very much.
You can check You are not allowed to view links. Register or Login to view., page by page, and the You are not allowed to view links. Register or Login to view. here. In this directory you will find You are not allowed to view links. Register or Login to view. (~20 MB each) to help understanding those notes. (I should have usd JPG instead of PNG to make them smaller. And You are not allowed to view links. Register or Login to view. is low res, sorry. Will fix these problems eventually...) You will see that, apart from those two pages (f108v and f111r), most of the page breaks are obvious and not "cheatable".
All that work done, I did try to redo the histograms after excluding all the blocks of lines where the parag divisions were not 100% obvious, hence subjective and probably wrong. That left 243 parags instead of the full 330. It improved the histogram match a bit, but not enough to justify the hassle of having to explain the filtering and arguing that it was not cherry-picking, so I left that attempt out of the paper. Here is the SBJ histogram in red, that 243 "good parags only" histogram in graysish green, and the raw 330 all-parags histogram in dark blue. Note that the vertical scale is different. The last royal-blue histogram is the "good parags only" again but measuring the size of a parag by the EVA character count instead of the word count.
Quote:All the enormous graphics in the VM are then just made up to make the Chinese text plausible, in oddly new writing alphabet?
Sorry, I don't understand the question. Which "enormous graphics"?
Quote:Are you saying, the traveler could not find a scribe in the whole of China to simply copy the original in Chinese?
Learning spoken Chinese is not much harder than learning any other language. In fact it may be easier, because Chinese has no inflections for gender, number, person, mood, tense, etc, which are the bane of learners of languages like Latin, German, of Finnish.
Leraning the written language is another matter. One must memorize ~4000 characters in order to be basically literate. Chinese (and Japanese) students learn a couple hundred per year, and reach that level only at the end of high school. (Japanese manga books used to display on the cover the number of years of schooling one needed in order to read them.) Some Jesuit missionaries managed to become literate, after years of intense effort, because they knew that had to in order to pursue their mission of converting the ruling class to Catholicism. Even them found that, in order to make the language more accessible to their Western colleagues, they had to invent a phonetic script for it. That is why they invented pinyin and the modern Vietnamese script, even though those languages already had their own scripts.
Thus, copies of the books in Chinese characters would be quite useless for him after he got home, because he would be unable to read them. With the phonetic script, he could get the sound of every word, and his knowledge of the spoken language would have been enough to get at least some of the sense out.
Quote:If he did not even read Chinese, how could he know the value of a purely textual manuscript?
If he had asked any doctor in the Chinese area of influence "what is the most important book of medicine you have", the answer would quite probably have been "here, this one, the Shennong Bencaojin".
(And, by that time, the Chinese already were printing books in large numbers with full-page carved wood blocks. The technical name, IIUC, is "incunabula". Such books were produced in Europe too for a hundred years or so, using copper plates, until Gutenberg invented moveable type. (Which apparently was already in use in Korea before that.))
Quote:If he did, or he could have someone translate it for him, why not write it in his own language?
Because no one, not even him, could have done that. No one in "China" knew Latin or German or whatever. He could have translated the most common terms, but very few of the 365 remedy names and the thousands of disease names. He could only hope that he could somehow assemble a glossary of those before returning.
Quote:If somebody had found something of value in the original Chinese, why not just bring the original and share it in 1400s, so it could become known then centuries earlier?
But that is precisely what I believe he hoped to do!
Europeans reached Southeast Asia by sea a bit after 1500, after which Europe quickly got to know Chinese culture. By 1700 the Jesuits already had many people who could read Chinese characters and many more who could speak the language. The first Chinese-Latin dictionary, using pinyin for the Chinese, was printed in Rome around 1580 -- by that same Jesuit who managed to become literate and who invented pinyin.
All the best, --stolfi

