bi3mw > 31-05-2019, 10:44 AM
-JKP- > 31-05-2019, 06:13 PM
Torsten > 31-05-2019, 06:57 PM
(26-05-2019, 09:55 PM)Emma May Smith Wrote: You are not allowed to view links. Register or Login to view.The occurrences of [daiin] for each page of the first quire are: 7 1 3 5 2 1 5 4 2 2 5 5 4. Considering that You are not allowed to view links. Register or Login to view. has more words than any of the other pages, I don't see how this is anomalous.
(26-05-2019, 09:55 PM)Emma May Smith Wrote: You are not allowed to view links. Register or Login to view.Indeed, You are not allowed to view links. Register or Login to view. has 5 [daiin] over eight lines compared with You are not allowed to view links. Register or Login to view. having 7 [daiin] over 24 lines. Yet you give the signature word pair for You are not allowed to view links. Register or Login to view. as [chol, chor], which also have 5 tokens, simply because that page doesn't have any strong matches to [daiin].
(26-05-2019, 09:55 PM)Emma May Smith Wrote: You are not allowed to view links. Register or Login to view.Why did the writer choose to copy some words more than others on any given page?
(26-05-2019, 09:55 PM)Emma May Smith Wrote: You are not allowed to view links. Register or Login to view.Why does the choice of words and word pairs seem unpredictable aside from their overall frequency? Why did the writer choose to write [daiin] a lot, but [dain] rather less, and [daiir] or [daiiin] a lot less?
(26-05-2019, 09:55 PM)Emma May Smith Wrote: You are not allowed to view links. Register or Login to view.The page f89r2 has a massive 19 tokens of [daiin], yet zero [dain], one [daiiin], and zero [daiir]. He wrote [daiin] three times in a row!Linguistic theories are expected to account for this, but how can you? Did the writer not catch what he was writing? Was he suffering from monotony? Did he forget that he could spice things up a bit by dropping or adding an [i]? Was he simply insane? Blind? Experiencing dementia?
(31-05-2019, 06:13 PM)-JKP- Wrote: You are not allowed to view links. Register or Login to view.Torsten, could you post three or four lines of text generated with your software? Many of the people on the forum have never seen the output from your program.
The programmers on the forum (this includes me) know how to go to github, download it and get it running, but not everyone knows how to do that and they are basically being excluded from the discussion by not being able to see the algorithmic output that everyone is talking about.
-JKP- > 31-05-2019, 07:50 PM
Torsten > 31-05-2019, 07:58 PM
(31-05-2019, 07:50 PM)-JKP- Wrote: You are not allowed to view links. Register or Login to view.Those are dead links (or maybe the person has to be registered to see them).
nablator > 07-06-2019, 03:29 PM
(26-05-2019, 11:18 AM)Emma May Smith Wrote: You are not allowed to view links. Register or Login to view.I think it's best now to turn to Figure 4 (page 8) as a lot of the argument hinges on this. The graph shows that the edit distance between words is lower the nearer the words occur. Words are more similar by up to 0.3 of an edit distance within 30 (Herbal A) or 60 lines (Quire 20), before reaching a stable edit distance beyond those numbers of lines. A comparison with a text in English (Alice in Wonderland) is made.
(26-05-2019, 10:07 PM)Torsten Wrote: You are not allowed to view links. Register or Login to view.This would mean that every page has it's own topic.And its own commentary. Maybe.
Torsten > 07-06-2019, 07:53 PM
(07-06-2019, 03:29 PM)nablator Wrote: You are not allowed to view links. Register or Login to view.(26-05-2019, 11:18 AM)Emma May Smith Wrote: You are not allowed to view links. Register or Login to view.I think it's best now to turn to Figure 4 (page 8) as a lot of the argument hinges on this. The graph shows that the edit distance between words is lower the nearer the words occur. Words are more similar by up to 0.3 of an edit distance within 30 (Herbal A) or 60 lines (Quire 20), before reaching a stable edit distance beyond those numbers of lines. A comparison with a text in English (Alice in Wonderland) is made.
A better choice than Alice in Wonderland for comparison with Herbal A would be the Latin translation of Dioscorides with Mattioli's commentary (1554). This is book 4, with titles removed to avoid spurious local repetitions of words.
Horizontally: distance in lines (standardized to 60-80 characters), vertically: average edit distance between words of these lines.
The highly thematic text causes some repetitions of words and a significant part of the vocabulary of the Dioscorides text is repeated in the commentary, so some local similarities are not unexpected, but the effect is much less pronounced than in the VMS.
nablator > 07-06-2019, 09:11 PM
(07-06-2019, 07:53 PM)Torsten Wrote: You are not allowed to view links. Register or Login to view.This is an interesting example. Is the text online available?
Quote:Could you post a link to the analyzed text sample?
Quote:Word repetitions are not enough to explain the effect for the VMS.
Antonio García Jiménez > 07-06-2019, 11:15 PM
Emma May Smith > 08-06-2019, 10:14 AM
(07-06-2019, 03:29 PM)nablator Wrote: You are not allowed to view links. Register or Login to view.(26-05-2019, 11:18 AM)Emma May Smith Wrote: You are not allowed to view links. Register or Login to view.I think it's best now to turn to Figure 4 (page 8) as a lot of the argument hinges on this. The graph shows that the edit distance between words is lower the nearer the words occur. Words are more similar by up to 0.3 of an edit distance within 30 (Herbal A) or 60 lines (Quire 20), before reaching a stable edit distance beyond those numbers of lines. A comparison with a text in English (Alice in Wonderland) is made.
A better choice than Alice in Wonderland for comparison with Herbal A would be the Latin translation of Dioscorides with Mattioli's commentary (1554). This is book 4, with titles removed to avoid spurious local repetitions of words.
Horizontally: distance in lines (standardized to 60-80 characters), vertically: average edit distance between words of these lines.
The highly thematic text causes some repetitions of words and a significant part of the vocabulary of the Dioscorides text is repeated in the commentary, so some local similarities are not unexpected, but the effect is much less pronounced than in the VMS.