(19-03-2018, 07:49 PM)bi3mw Wrote: a) Is it possible to generate, from the word types of a plain text, a text that shows characteristics similar to the VMS and from which these words can be decrypted?
b) Basic conditions that must be met are:
The total number of words in the generated text should not deviate too much from the source text. Also, the ratio of word types to word tokens should not change significantly. The plaintext consists of 25014 words, the generated text of 23264. In the VMS, the ratio of word types to word tokens is 8114 to 37919, i.e. 1 : 4.6732807493221. In the encrypted comparison text, the ratio is 5653 to 23002. Starting from the word types, one would expect a total text of 26418 words at best (5653 × 4.673 ≈ 26418). In my opinion, the deviation from the VMS is acceptable.
c) The generated text must be highly repetitive. In the encrypted text, 5391 word types face 17873 word tokens. All 17873 are generated from the 262 new word types. This condition is certainly fulfilled.
d) The average word length must be comparable. It is probably 5.6 in the VMS. The encrypted comparison text has an average word length of 5.8.
e) Just to avoid misunderstanding, I do not claim that the manuscript was made this way. But it seems to be possible.
f) As for filling with nulls, I tried that in another attempt, but it generated words that were far too long.
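For anyone who wants to check the arithmetic in b), here is a minimal Python sketch. The counts are the ones quoted above; the helper `text_stats` and its whitespace tokenisation are my own assumptions for illustration, not the method used to produce those counts.

```python
def text_stats(text: str) -> tuple[int, int, float]:
    """Return (word types, word tokens, average word length).

    Assumption: words are separated by whitespace.
    """
    words = text.split()
    return len(set(words)), len(words), sum(map(len, words)) / len(words)


# Counts quoted in the post.
vms_types, vms_tokens = 8114, 37919
enc_types, enc_tokens = 5653, 23002

# Tokens per type in the VMS (the "1 : 4.67..." ratio).
tokens_per_type = vms_tokens / vms_types

# Tokens one would expect for the encrypted text at the VMS ratio.
expected_tokens = round(enc_types * tokens_per_type)

# Relative shortfall of the actual encrypted text against that expectation.
shortfall = (expected_tokens - enc_tokens) / expected_tokens

print(f"tokens per type: {tokens_per_type:.4f}")
print(f"expected tokens: {expected_tokens}")
print(f"shortfall:       {shortfall:.1%}")
```

The same `text_stats` helper can be run on any transcription or generated text to get the type count, token count and average word length in one go.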
Nice to talk to you guys on specifics!!
a) This is far too complex and the information is far too delicate to be discussed here in the open.
If you are stuck on a specific angle, perhaps I can help; I will try. Or e-mail me directly (do not use the forum bbmail).
b) I do not quite agree that this ratio should be used as a preset when determining your information.
Yes, afterwards it can be used, but you can also simply use the ratio: vowel/consonant.
c) Here also, repetition is not a preset. If you have a text, repetition must occur on its own.
d) Yes, this is important as a preset.
e) It is.
f) I do not claim anything, but in my opinion nulls have been added.
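To make the vowel/consonant ratio from b) concrete, here is a minimal Python sketch. It assumes a Latin-alphabet plaintext and treats a, e, i, o, u as the vowels; both the function name and that vowel set are assumptions for illustration only, and nothing here says which VMS glyphs would count as vowels.

```python
# Assumption: Latin vowels; pass a different set for another alphabet.
VOWELS = frozenset("aeiou")


def vowel_consonant_ratio(text: str, vowels: frozenset = VOWELS) -> float:
    """Return vowels divided by consonants, counting alphabetic characters only."""
    letters = [ch for ch in text.lower() if ch.isalpha()]
    n_vowels = sum(ch in vowels for ch in letters)
    n_consonants = len(letters) - n_vowels
    if n_consonants == 0:
        return float("inf")
    return n_vowels / n_consonants


ratio = vowel_consonant_ratio("lorem ipsum")
print(f"vowel/consonant ratio: {ratio:.3f}")
```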
Again, this is perhaps cryptic, but I do not want to give away my fresh research, which looks promising and is, after two years, still not finished.
Regarding Phsillycyber's comment on "edit distances": I have investigated every "horizontal edit distance" possibility, but the results were not positive; they pointed far more against it. I understand this is a hobby site, but if anyone is sincerely interested in that research, please invest the time to understand the principle, my research, and the [link] so we can build a genuine discussion.
Anyway, this thread is about "[link]", and that is something entirely different; please do not pollute the thread with that and make a new one.
PS: If I am wrong or if I wrote anything wrong, I apologize; I do not want to get a 100% warning level, whatever that means anyway.