dvallis > 22-08-2020, 05:54 PM
ReneZ > 22-08-2020, 07:20 PM
Quote:The language vectors really are 10,000-dimensional, so it's impossible to visualize them in 3D before PCA. You've just got to remember that the vectors for the letters a-z and space are all orthogonal. That means any given trigram will be unique in the 10K-dim space.
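A quick way to check the orthogonality claim is sketched below. It assumes the 27 symbol vectors are drawn at random as 0/1 vectors, which the thread does not state explicitly; the names `item_memory` and `hamming` are made up for this illustration. For binary vectors, "orthogonal" corresponds to a normalized Hamming distance of about 0.5, and random vectors only meet that approximately, not exactly.

```python
import numpy as np

rng = np.random.default_rng(0)  # fixed seed so the sketch is repeatable
DIM = 10_000
SYMBOLS = "abcdefghijklmnopqrstuvwxyz "  # a-z plus space, 27 symbols

# Assumption: one random binary vector per symbol.
item_memory = {ch: rng.integers(0, 2, size=DIM, dtype=np.uint8) for ch in SYMBOLS}

def hamming(u, v):
    """Fraction of positions where two binary vectors differ."""
    return float(np.mean(u != v))

# Distance between every pair of distinct symbols.
distances = [
    hamming(item_memory[a], item_memory[b])
    for i, a in enumerate(SYMBOLS)
    for b in SYMBOLS[i + 1:]
]

print(f"pairs checked: {len(distances)}")
print(f"min distance:  {min(distances):.4f}")
print(f"max distance:  {max(distances):.4f}")
```

All pairwise distances should cluster tightly around 0.5, which is what "nearly orthogonal" means in this setting.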
Quote:Vector multiplication is bitwise XOR
0 XOR 0 = 0
0 XOR 1 = 1
1 XOR 0 = 1
1 XOR 1 = 0
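The XOR "multiplication" above can be tried out in a few lines. This is only a sketch: the vectors are random stand-ins, and the trigram encoding (rotate each letter by its position, then XOR) is an assumption borrowed from common hyperdimensional-computing practice, not something confirmed in this thread. The helper names `bind`, `hamming` and `trigram` are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(1)
DIM = 10_000

# Three random binary vectors standing in for three letters.
a, b, c = (rng.integers(0, 2, size=DIM, dtype=np.uint8) for _ in range(3))

def bind(u, v):
    """Multiply two binary vectors: element-wise XOR."""
    return np.bitwise_xor(u, v)

def hamming(u, v):
    """Fraction of positions where two binary vectors differ."""
    return float(np.mean(u != v))

# The product looks unrelated to either factor ...
ab = bind(a, b)
# ... but XOR is its own inverse, so binding with b again recovers a.
recovered = bind(ab, b)

def trigram(x, y, z):
    """Assumed encoding: rotate by position, then XOR everything together."""
    return bind(bind(np.roll(x, 2), np.roll(y, 1)), z)

abc = trigram(a, b, c)
cba = trigram(c, b, a)

print("a recovered exactly:", bool(np.array_equal(recovered, a)))
print(f"distance(ab, a):    {hamming(ab, a):.4f}")
print(f"distance(abc, cba): {hamming(abc, cba):.4f}")
```

The rotation is there so that letter order matters: without it, XOR is commutative and "abc" and "cba" would map to the same vector.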
dvallis > 23-08-2020, 04:20 AM
Alin_J > 24-08-2020, 04:34 PM
(22-08-2020, 05:54 PM)dvallis Wrote: If enough trigrams are similar the language vectors will be -mathematically- similar.
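To make the quoted claim concrete, here is a rough sketch of how shared trigrams turn into similar language vectors. Everything in it is an assumption for illustration: the random symbol vectors, the rotate-and-XOR trigram encoding, the way trigram vectors are summed, and the three toy strings, which are not data from the thread.

```python
import numpy as np

rng = np.random.default_rng(2)
DIM = 10_000
SYMBOLS = "abcdefghijklmnopqrstuvwxyz "
item_memory = {ch: rng.integers(0, 2, size=DIM, dtype=np.uint8) for ch in SYMBOLS}

def trigram_vector(tri):
    """Assumed encoding: rotate each letter by its position, then XOR."""
    x, y, z = (item_memory[ch] for ch in tri)
    return np.roll(x, 2) ^ np.roll(y, 1) ^ z

def language_vector(text):
    """Sum the trigram vectors of a text, mapping bits 0/1 to -1/+1 first."""
    total = np.zeros(DIM, dtype=np.int64)
    for i in range(len(text) - 2):
        bits = trigram_vector(text[i:i + 3])
        total += 2 * bits.astype(np.int64) - 1
    return total

def cosine(u, v):
    return float(u @ v) / float(np.linalg.norm(u) * np.linalg.norm(v))

# Toy strings: the first two share almost all trigrams, the third shares none.
text_a = "the quick brown fox jumps over the lazy dog"
text_b = "the lazy dog jumps over the quick brown fox"
text_c = "zzxq vvkw qqjx wwzk xxvj kkqz jjwx"

vec_a, vec_b, vec_c = (language_vector(t) for t in (text_a, text_b, text_c))
sim_ab = cosine(vec_a, vec_b)
sim_ac = cosine(vec_a, vec_c)

print(f"similar trigram sets:  {sim_ab:.3f}")
print(f"disjoint trigram sets: {sim_ac:.3f}")
```

The similarity here reflects overlap in trigram counts and nothing else, so a high value says the two texts have similar letter statistics, not that they are the same language.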
ReneZ > 25-08-2020, 08:10 AM
dvallis > 28-08-2020, 03:50 PM
ReneZ > 28-08-2020, 07:44 PM
dvallis > 30-08-2020, 07:22 PM
ReneZ > 31-08-2020, 10:47 AM
Alin_J > 31-08-2020, 04:35 PM
(22-08-2020, 07:20 PM)ReneZ Wrote: That is right, but PCA only gives you the first two of 10,000 dimensions, and it should then be clear that one has to be very careful about drawing conclusions from only these two dimensions.
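One way to act on this caution is to look at how much of the total variance the first two principal components actually carry before reading anything into a 2D plot. The sketch below does that with plain NumPy on synthetic random vectors. The data is made up purely for illustration and is not the set of language vectors discussed in this thread, so the number it prints says nothing about the real plots.

```python
import numpy as np

rng = np.random.default_rng(3)
n_samples, dim = 20, 10_000

# Synthetic stand-in data: 20 random +1/-1 vectors of dimension 10,000.
X = rng.choice([-1.0, 1.0], size=(n_samples, dim))

# PCA via SVD: center the data, then take the singular values.
Xc = X - X.mean(axis=0)
singular_values = np.linalg.svd(Xc, compute_uv=False)

# Share of the total variance carried by each principal component.
explained = singular_values**2 / np.sum(singular_values**2)
top2 = float(explained[:2].sum())

print(f"variance explained by the first two components: {top2:.1%}")
```

Unstructured random data like this is close to the worst case, with the variance spread almost evenly over all components. Real language vectors have structure and may concentrate much more variance in the first two, which is exactly why the ratio is worth checking for the actual data.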