elieD > 02-03-2020, 07:36 PM
davidjackson > 02-03-2020, 08:35 PM
DonaldFisk > 03-03-2020, 01:36 AM
elieD > 03-03-2020, 12:52 PM
Quote:What is the advantage of t-SNE over other dimensional reduction techniques such as PCA?
Quote:Also, did you try to recognize parts of speech, either through their position relative to nearby words or through their inflection patterns?
Quote:Finally, it would be good to exclude languages which are impossible given the manuscript's known provenance, such as Guarani, and very improbable, such as Yoruba or Japanese.
Quote:There are a few minor mistakes in the paper. Scottish Gaelic, which I can read, is a modern language (it has its own television channel, BBC Alba) and didn't even exist in the Middle Ages: it's a direct descendant of Middle Irish. It and its close relative Irish are unusual in two important respects: their grammar, which is superficially closer to that of the Semitic languages than to that of the non-Celtic European languages, and their spelling system. So, not surprisingly, it's a linguistic outlier. Also, Tagalog is spoken in the Philippines, quite a long way from Australia.
Quote:Perhaps your N-gram approach partially addressed this