008348dc760f858fd668476b75fb6f > 28-01-2025, 10:26 AM
ReneZ > 28-01-2025, 11:11 AM
008348dc760f858fd668476b75fb6f > 28-01-2025, 11:17 AM
(28-01-2025, 11:11 AM)ReneZ Wrote: You are not allowed to view links. Register or Login to view.Your result looks very much like mine!
Which transliteration (text) of the MS did you use?
oshfdk > 28-01-2025, 11:34 AM
(28-01-2025, 10:26 AM)008348dc760f858fd668476b75fb6f Wrote: You are not allowed to view links. Register or Login to view.I wrangled a python script that goes through each folio, takes each individual word and counts how many times it appears in any other folio. To balance the fact that different folios will have a different number of words I divide the final score by the number of words in the comparison page. The idea is that we are balancing the count against the event opportunity (although I admit this is likely not a *clean* scoring method).
008348dc760f858fd668476b75fb6f > 28-01-2025, 12:10 PM
folioScore = sum([list(set(compFolio)).count(w) for w in set(baseFolioWords)])
ret.append(str( folioScore / (len(set(baseFolioWords)) + len(set(compFolio))) ))
008348dc760f858fd668476b75fb6f > 28-01-2025, 12:20 PM
oshfdk > 28-01-2025, 12:48 PM
(28-01-2025, 12:10 PM)008348dc760f858fd668476b75fb6f Wrote: You are not allowed to view links. Register or Login to view.I'm not super convinced that the differing output based on the "hand" writing would contradict the idea...
Quote:Can we establish correlation between folios and sections based on the number of words shared.
Quote:It helps us isolate sections to work on, there is a lot of text in the manuscript and reducing the attack surface can help.
Quote:We can also use this to validate folio and quire order, for example I've always been confused by location of quire 19 and suspected it belonged somewhere else, but looking at its position on the sheet it shares the same correlations as its neighbors, the bathing section and the recipe section, meaning it likely belongs where it is.
Quote:The current status of the "is it a cipher or not" debate boils down to either "no its the product of a generative method that produces pseudo-language" or "yes but we don't know how or what" or "something else". If this research shows anything it shows that the manuscript text shares a logical correlation to its pictographic themes and even the physical structure (the quires). If this is pseud-language then it follows themes and relations. Also, I don't think these concepts are mutually exclusive, I think you absolutely can use a generative method as an encoding or enciphering mechanism. If I was to state it in a simple way it would be, "you hide a needle in a bunch of haystacks, but you're going to need to manufacture those haystacks". I think the generative mechanism takes words as a seed and produces pseudo-language as an obscured output. I also think there is a sort of pun going on here, it's not just the text being generated, the characters and plants are too, hence why we see so many unrecognisable plants.
The plants are gathered, cut up thrown into a device and turned into something new, like the words.
Quote:If I'd say anything, my theory is that a generative mechanism turns a word into many words, perhaps letter by letter. It adds pre/suff-ixes, modifies letters, adds false root etc so that we end up with what we have, a text poor in individual characters and overburdened with words.
008348dc760f858fd668476b75fb6f > 28-01-2025, 01:08 PM
Quote:I'm not sure I understand what exactly "the idea" is. Sorry, I can be quite slow when it comes to understanding various conceptsNo worries and no reason to apologize, the idea is that a reusable encoding/enciphering mechanism is used to generate text as a way to obscure the underlying message. This mechanism can be setup to produce different output based on different settings. I say that different hands using different words is not an issue in this scheme.
Quote:Is it mainly about the correlation between folios and folios, between sections and sections or correlation between sections and folios (as in "this page correlates with balneology, etc")?It kind of ends up being the same thing, the starting point was correlation between folios but as it turns out this confers correlation between sections and also quires.
Quote:To me this is not very obvious from the chart. Unless I'm reading it wrong, it correlates a bit with balneology and with a number of herbal pages, but doesn't correlate particularly well with the end section.I think this is an issue of resolution, i.e the correlation between the bathing and recipe sections looks so strong that it blots out everything else. Essentially anything that isn't white is an above average correlation and is notable.
Quote:I think this is a cool idea (that plants and figures are generated according to some mechanistic process). It is reminiscent of various theories that "plants are not actually plants, but some schematic representation", but also explains why the plants look weird and many nymphs are in strange poses. Not sure if there is any relation to the study of correlations here.
Quote:As far as I know, this is a very popular idea and over the years a lot of systems and devices explaining how the words were produced have been proposed. Again, does the study of correlations play any particular role here?
oshfdk > 28-01-2025, 03:27 PM
(28-01-2025, 01:08 PM)008348dc760f858fd668476b75fb6f Wrote: You are not allowed to view links. Register or Login to view.Yes, the correlation suggests that the text adheres to the themes of the folio. If so, the mechanism that generates the text is not a random stream of generated text but text generated to a theme. That theme could be text related to that specific folio or section. The "vine" is a good example, it appears on 3 different folios and these 3 different folios have above average word correlation.
Quote:TA nice example of this is the "vine" plant that appears in f17v, f96v, f99. It's correlated through theme and word correlation.
(28-01-2025, 01:08 PM)008348dc760f858fd668476b75fb6f Wrote: You are not allowed to view links. Register or Login to view.The question around generative text is usually positioned as "the manuscript is either pseudo-language or a cipher...", my point is that it can easily be both.