RenegadeHealer > 16-03-2020, 11:19 PM
(14-03-2020, 08:45 PM)Ben Trovato Wrote: You are not allowed to view links. Register or Login to view.(10-03-2020, 04:42 PM)Ben Trovato Wrote: You are not allowed to view links. Register or Login to view.It is actually the goal of this thread to find out if there is interest for a project like this.... but unfortunately, that does not appear to be the case.
Ben Trovato > 16-03-2020, 11:48 PM
Alin_J > 17-03-2020, 07:14 AM
(16-03-2020, 11:48 PM)Ben Trovato Wrote: You are not allowed to view links. Register or Login to view.Thank you for the encouragement and your commitment! But as I said, the problem is not mainly the text sample - which I would happily compile - but its analysis. Here I would would like to have a commitment from a person competent in computational analysis. Then I would go to work and create the Walla file.
If there is more textual sources you can supply, great! But I'd say let's approach this project step by step. I have no idea on how to include more (and different) text, methodologically. And then, oh, the Corona situation. My wife is running a pharmacy, so the next weeks could be tough for us. Nevertheless I like to keep this in my head...
Ben Trovato > 17-03-2020, 12:18 PM
Alin_J > 17-03-2020, 05:20 PM
(17-03-2020, 12:18 PM)Ben Trovato Wrote: You are not allowed to view links. Register or Login to view.That sounds very promising, thank you!
I would also need some advice on how to prepare the data. Is capitalization important? Should line breaks be included? Should voids in the lines (due to the structure of the painting) be marked? Which kind of metadata should be provided? That's what I would think of, but there's probably more...
RenegadeHealer > 17-03-2020, 05:33 PM
(16-03-2020, 11:48 PM)Ben Trovato Wrote: You are not allowed to view links. Register or Login to view.Thank you for the encouragement and your commitment! But as I said, the problem is not mainly the text sample - which I would happily compile - but its analysis. Here I would would like to have a commitment from a person competent in computational analysis. Then I would go to work and create the Walla file.
If there is more textual sources you can supply, great! But I'd say let's approach this project step by step. I have no idea on how to include more (and different) text, methodologically. And then, oh, the Corona situation. My wife is running a pharmacy, so the next weeks could be tough for us. Nevertheless I like to keep this in my head...
Ben Trovato > 22-03-2020, 07:22 PM
Alin_J > 23-03-2020, 05:26 PM
(22-03-2020, 07:22 PM)Ben Trovato Wrote: You are not allowed to view links. Register or Login to view.So I was thinking about how to prepare the text sample. To demonstrate the scheme, I take the "SAÄRILL"-picture posted above. The text file would look like this:
[1-Np98#163-????-G0]
Walla August.!
[@]BIN DER HEILIGE
GOTT SAÄRILL.!
[@]GOTT,
SAÄRILL.!
[@]I 1[font=Tahoma, Verdana, Arial, sans-serif][@]II 2[font=Tahoma, Verdana, Arial, sans-serif][@]I 1[/font][/font][font=Tahoma, Verdana, Arial, sans-serif][font=Tahoma, Verdana, Arial, sans-serif][font=Tahoma, Verdana, Arial, sans-serif][@]III 3[/font][/font][/font]
[@]Sonne hinterm
Berg Bisasmberg
in den Bezirk
Korneuburg an
der Donau?
[font=Tahoma, Verdana, Arial, sans-serif][@]KPÖ[/font]
[font=Tahoma, Verdana, Arial, sans-serif][font=Tahoma, Verdana, Arial, sans-serif][@]GOTT[/font][/font]
[font=Tahoma, Verdana, Arial, sans-serif][font=Tahoma, Verdana, Arial, sans-serif]ALLAH.[/font][/font]
[font=Tahoma, Verdana, Arial, sans-serif][font=Tahoma, Verdana, Arial, sans-serif][font=Tahoma, Verdana, Arial, sans-serif][@] SARARILL[/font][/font][/font]
[font=Tahoma, Verdana, Arial, sans-serif][font=Tahoma, Verdana, Arial, sans-serif][font=Tahoma, Verdana, Arial, sans-serif]UND, ALLAH.![/font][/font][/font]
[font=Tahoma, Verdana, Arial, sans-serif][font=Tahoma, Verdana, Arial, sans-serif][font=Tahoma, Verdana, Arial, sans-serif]Line 1: squared brackets are not used in Walla's work, so I can take them to indicate metadata: here, a consecutive number, source (book/page/figure number), the year of origin, ???? unknown in this case, and the "genre". I would propose to have the "genres" of "understandable" text (like this one), fake languages (like "Turkish" and "latin" in the samples in my initial post, samples #7 and #8) and plain gibberish (sample #6). My goal would be to collect text from the last two genres.[/font][/font][/font]
[font=Tahoma, Verdana, Arial, sans-serif][font=Tahoma, Verdana, Arial, sans-serif][font=Tahoma, Verdana, Arial, sans-serif]The transcription moves from top-left to bottom-right. Line breaks are line breaks. [font=Tahoma, Verdana, Arial, sans-serif][@] indicates that the text is interrupted by parts of the drawings, while [font=Tahoma, Verdana, Arial, sans-serif][@] plus a line break means that the text is from a different area in the picture. (@ is just another symbol Walla did not happen to know...)[/font][/font][/font][/font][/font]
[font=Tahoma, Verdana, Arial, sans-serif][font=Tahoma, Verdana, Arial, sans-serif][font=Tahoma, Verdana, Arial, sans-serif][font=Tahoma, Verdana, Arial, sans-serif][font=Tahoma, Verdana, Arial, sans-serif]Then I would simply type this into a text file, one after another...[/font][/font][/font][/font][/font]
[font=Tahoma, Verdana, Arial, sans-serif][font=Tahoma, Verdana, Arial, sans-serif][font=Tahoma, Verdana, Arial, sans-serif][font=Tahoma, Verdana, Arial, sans-serif][font=Tahoma, Verdana, Arial, sans-serif]@Alin_J, does that sound reasonable for you, or should things be done differently? (And what about the other mighty wizards of computational science?) [/font][/font][/font][/font][/font]
[font=Tahoma, Verdana, Arial, sans-serif][font=Tahoma, Verdana, Arial, sans-serif][font=Tahoma, Verdana, Arial, sans-serif][font=Tahoma, Verdana, Arial, sans-serif][font=Tahoma, Verdana, Arial, sans-serif]I hope I get some books next week, so I could start working on this...[/font][/font][/font][/font][/font]
Tobias > 26-03-2020, 03:22 PM
Ben Trovato > 26-03-2020, 06:17 PM