Ruby Novacna > 7 hours ago
davidd > 7 hours ago
(7 hours ago) Ruby Novacna wrote:
(11 hours ago) davidd wrote: Possibly: test grammar findings against other 15th-century texts in known languages.
In my opinion, it would be really useful to take your first steps on a well-known text from the same period.
Do you have some suggestions? I need a flat text file, or something from which I can easily copy and paste the whole text into a simple text file.
MarcoP > 6 hours ago
davidd > 3 hours ago
(6 hours ago) MarcoP wrote: Personally, I would first try something as simple as 10k words from the King James Bible. The first step is to check that the method works on a straightforward text.
Later, you can move to one of the Chaucer manuscripts transcribed here: [link hidden from unregistered users]
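For anyone wanting to try the 10k-word suggestion on a flat text file, here is a minimal sketch of how the sample could be cut. The function name `first_words` and the tokenising rule (lowercase, letters and apostrophes only) are my own assumptions, not anything davidd's script is known to do.

```python
import re

def first_words(text, limit=10_000):
    """Lowercase the text and return its first `limit` word tokens.

    Tokenising rule (an assumption): runs of a-z and apostrophes count
    as words; punctuation, digits and whitespace are separators.
    """
    tokens = re.findall(r"[a-z']+", text.lower())
    return tokens[:limit]

sample = "In the beginning God created the heaven and the earth."
print(first_words(sample, limit=5))
# ['in', 'the', 'beginning', 'god', 'created']
```

With a real file, you would read the contents first (for example `open(path, encoding="utf-8").read()`, where `path` is whatever plain-text copy you saved) and pass the resulting string in.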
davidd > 47 minutes ago
Quote:
members: ['they', 'he', 'she', 'levi', 'afterward', 'whatsoever', 'goeth', 'offered', 'pursued', 'judah', 'reuben', 'lot', 'onan', 'laid', 'korah', 'trade', 'cain', 'pharaoh', 'joseph', 'laban', 'abimelech', 'jacob', 'isaac', 'leah', 'abraham', 'israel', 'god', 'noah', 'abram', 'rebekah', 'there', 'it', 'adam', 'esau', 'sarai', 'rachel', 'sarah', 'shelah', 'multiply', 'cainan', 'mahalaleel', 'jared', 'enos', 'methuselah', 'eber', 'serug', 'reu', 'arphaxad', 'salah', 'peleg', 'lamech', 'enoch', 'terah', 'benjamin', 'i', 'we', 'ye', 'here', 'neither', 'zebulun', 'who', 'what', 'why', 'thou', 'whither', 'how', 'arise', 'year', 'zerah', 'adah', 'shechem', 'issachar', 'gomorrah', 'truly', 'naphtali', 'japheth', 'menservants', 'spotted', 'rest', 'wept', 'therefore', 'milcah', 'begat', 'lo', 'kissed', 'tamar', 'hagar']
num members: 87
vord count: 4376
top3: he i it
groupname: he
lesser likely following : the 1.33% instead of 14.54%
more likely following : and 53.88% instead of 14.34%
lesser likely followed by : lord 1.17% instead of 13.90%
more likely followed by : said 55.69% instead of 11.33%
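For readers trying to reproduce figures like the "following" and "followed by" percentages above, here is a rough sketch of one way to compute them. It is not davidd's actual script: the function name `neighbour_stats` is hypothetical, and the output does not say how the "instead of" baseline is derived. This sketch simply uses the word's overall share of all tokens as the baseline, which is an assumption and will probably not match the numbers quoted.

```python
def neighbour_stats(tokens, members, word, side="before"):
    """Return (group_pct, baseline_pct) for `word` next to a word group.

    side="before": how often `word` immediately precedes a group member.
    side="after":  how often `word` immediately follows a group member.
    baseline_pct is the share of `word` among all tokens (an assumption;
    the original output does not state how its baseline is computed).
    """
    members = set(members)
    offset = -1 if side == "before" else 1
    hits = total = 0
    for i, tok in enumerate(tokens):
        j = i + offset
        # Only count group members that actually have a neighbour on that side.
        if tok in members and 0 <= j < len(tokens):
            total += 1
            hits += tokens[j] == word
    group_pct = 100.0 * hits / total if total else 0.0
    baseline_pct = 100.0 * tokens.count(word) / len(tokens) if tokens else 0.0
    return group_pct, baseline_pct

tokens = "and he said and she said the lord said".split()
g, b = neighbour_stats(tokens, {"he", "she"}, "and", side="before")
print(f"and {g:.2f}% instead of {b:.2f}%")
# and 100.00% instead of 22.22%
```

Calling it with `side="after"` and `word="said"` gives the "followed by" figure for the same group.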