MarcoP > 19-09-2016, 08:23 AM
(18-09-2016, 01:47 PM)Davidsch Wrote: You are not allowed to view links. Register or Login to view.This is exactly what i am trying to do for several months (last year)
also i find it incredible that Marco did the exact same things I already did and published.
Davidsch > 10-10-2016, 09:20 AM
MarcoP > 10-10-2016, 06:22 PM
Davidsch > 29-11-2016, 05:20 PM
MarcoP > 20-08-2017, 10:11 AM
Quote:The theory explains labelese—the different word statistics associated with label words—by putting such words effectively outside the transformation the text undergoes. If transformation works by altering words according to their environment then labels, which have no environment because they are usually isolated, should not be transformed. Labels are then the normal text. Of course, some labelese words are found in the main text, but there is no reason why every word in the transformed text must be altered, if the environment does not cause it.
MarcoP > 20-08-2017, 11:23 AM
Emma May Smith > 20-08-2017, 01:14 PM
MarcoP > 20-08-2017, 04:22 PM
(20-08-2017, 01:14 PM)Emma May Smith Wrote: You are not allowed to view links. Register or Login to view.There are some interesting stats, Marco, and I've certainly never seen labelese so clearly described. Researchers often talk about how it differs and so it's good to see the numbers.
(20-08-2017, 01:14 PM)Emma May Smith Wrote: You are not allowed to view links. Register or Login to view.I'll add my thoughts, which may duplicate some of yours.
Word starts:
[qo]: it's no surprise to see this at lower levels, as this is the classic mark of labelese.
(20-08-2017, 01:14 PM)Emma May Smith Wrote: You are not allowed to view links. Register or Login to view.[ch, sh]: the lower levels of these two is interesting, as I can think of two different kinds of word which begin [ch, sh]. Some are short, like [chy] and [sheo], others are long with a gallows, such as [chckhy] and [chety]. I would guess it's the former which is changing the numbers and they're more common. But if so, what do these short words mean? Are they not in the labels because they're grammatical?
(20-08-2017, 01:14 PM)Emma May Smith Wrote: You are not allowed to view links. Register or Login to view.[ot, ok]: the increase in these should be a result of the lower [qo]. I note that for all labels the levels are similar and increase similarly, though for the small plants labels [ot] start much lower and grow much further.
(20-08-2017, 01:14 PM)Emma May Smith Wrote: You are not allowed to view links. Register or Login to view.[yt, yk, sa, so]: this is the biggest surprise! These are line start patterns, so to see them here is interesting. This is a link between labels and line patterns, though it's hard to see immediately how.
(20-08-2017, 01:14 PM)Emma May Smith Wrote: You are not allowed to view links. Register or Login to view.Word ends:
...
[ly, ry]: it might seem that these endings account for the lower [ol, or] by the addition of [y], but I think more of these are [aly, ary].
MarcoP > 20-08-2017, 04:48 PM
MarcoP > 22-08-2017, 05:20 PM