28-09-2019, 08:11 AM
(26-09-2019, 02:13 PM)RenegadeHealer Wrote: You are not allowed to view links. Register or Login to view.Marco, was the raw data for this study one of the ingredients for the project you and Emma published earlier this year?
Hi RH,
the subject is quite similar to You are not allowed to view links. Register or Login to view., but the data presented here were computed from scratch. The Zandbergen-Landini transcription was indeed used for our paper too.
(26-09-2019, 02:13 PM)RenegadeHealer Wrote: You are not allowed to view links. Register or Login to view.I was immediately reminded of Julian Bunn's demonstration that the different plant-delineated text columns on You are not allowed to view links. Register or Login to view. show different ink density, suggesting that they were written separately, and thus are true columns functionally and methodologically. Your data here supports the idea that text before and after a plant drawing constitutes two separate lines, not one interrupted line. As someone who has looked at a lot of old manuscripts, how precedented was this style of column composition in medieval manuscripts of the time?
I was not familiar with that page, thank you for mentioning it! Bunn credits the idea to Jim Reeds. Personally, I don't find it very convincing, though everything is possible of course. I believe the matter should ideally be assessed by a palaeographer. As far as other manuscripts go, similar layouts to the VMS herbal are not frequent. Some examples were discussed by Koen You are not allowed to view links. Register or Login to view.. I am not 100% sure about the Greek manuscripts, but the Latin and Italian manuscripts he mentions all have lines continuing across illustrations. The exception is Munich Cim. 79, which however is entirely arranged in two accurately defined columns.
Something comparable with what Reeds thinks happens in the bilingual page I discussed You are not allowed to view links. Register or Login to view. (two distinct columns at the sides of the image, a single column at the bottom). I believe this case is quite exceptional. Moreover the text, however long, should be regarded as a "marginal" later addition (this is also the case for some of Koen's Greek manuscripts).
(25-09-2019, 08:06 PM)Koen G Wrote: You are not allowed to view links. Register or Login to view.When a word encounters an image and the text wants to continue on the other side, there are a few options:
1. the word is split
2. the final complete word before the image is squeezed in (or the scribe takes into account his spacing to make it fit well)
3. the final word before an image is truncated
There are probably more options, but some of those are less likely. For example that an image splits text into various columns like in a newspaper.
You say that words before an image are shorter, while words after an image are the same. This rules out (1), because in that case words after the image would also be shorter.
Hi Koen,
I don't think it's granted that the normal length of words after the image break excludes that words can sometimes (or even often) be split by the image. I computed some statistics on a modern English text with hyphenation (You are not allowed to view links. Register or Login to view.).
I vaguely remember running a similar experiment years ago, with a different text: possibly I posted it here, but I am far from sure. In this text, about 1/6 of the lines end with a hyphen. The result is that the first word of a line is averagely longer.
Average lengths:
all 4.224
first 4.632
last 4.194
[attachment=3376]
I believe this counter-intuitive result is due to two reasons:
1. short words can be easily forced into the end of a line, so they tend to appear there instead of at the beginning of the next line;
2. (in English) short words tend to alternate with long words: if short words frequently appear at the end of a line, the first word of the next line is more likely to be long. (This is rather speculative: I think it should be possible to compute numbers confirming or rejecting the idea, but I did not try to)
The histogram of individual word-lengths shows that that the first word of a line has both fewer short types and more long types than average. The last word of a line has more short types. Something similar might happen with Voynichese words split across an image.
(26-09-2019, 07:31 PM)Emma May Smith Wrote: You are not allowed to view links. Register or Login to view.The occurrence of [s] alone before a mid-break are interesting in relation to line start patterns. We see an increase at the line start of words beginning [s] which is often followed by [a] or [o]. It would be good to learn is these lone [s] matched up with [o] or [a] on the other side.
If so (and I do not know if it is) it would make an stronger link to line start patterns.
Hi Emma,
since there only are 24 occurrences of stand-alone s before an image-break, the statistics are not very reliable. Here the blue bar corresponds to the first character of the word after a stand-alone [font=Eva]s[/font] followed by an image-break ([font=Eva][font=Eva]s[/font][/font] <-> X-); the orange bar is the first character for all words after an image break (<-> X-); the yellow bar is the first character in any word (X-), the green bar is the the first line-initial character (NewLine X-), the brown bar is the first character after a line-initial [font=Eva]s[/font] (NewLine sX-).
[attachment=3380]
It seems that o- and a- are frequent after [font=Eva]s[/font]+imageBreak, with ch-, d-, s- also being more frequent in this case than on average (blue vs yellow bars). Due to the limited size of the data-set, the actual occurrences of [font=Eva]s[/font]<->a that make up 8% of the cases are only 2.
The graph is computed on the corpus of pages with at least an image-break (results on the whole manuscript will be slightly different).
(26-09-2019, 07:31 PM)Emma May Smith Wrote: You are not allowed to view links. Register or Login to view.Likewise, it would be interesting to know the second glyph in words immediately after mid-breaks which start with [y]. Both [yk] and [yt] are line start patterns.
[attachment=3378]
yt- and yk- are very frequent after an image break. On the other hand, ych- is common at line start, but after <-> 'ch' appears with no additional prefix. A major difference from line-start patterns is the almost total absence of gallows immediately after <-> (see also the previous graph). Another huge difference it the behaviour of qo-: maybe these image breaks can help us understand more of this elusive prefix?
These results appear to be somehow mixed to me: there could be an overlap with line-start patterns, but there also are differences.