The Voynich Ninja

Full Version: Matching Plant Images Internally
You're currently viewing a stripped down version of our content. View the full version with proper formatting.
Pages: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17
I think a simple way to perform a Inner Join on the table is just to sort it by Folio 1 as then the the folios with root and leaf matches will be next to each other by row.
I generated the following list from our spreadsheet:

f101r[1,7], f102v2[3,3]
f99r[4,1], f100v[2,5]
f99r[1,7], f101v[1,3]
f101v1[3,8], f101v1[2,7]
f102v1[2,2], f100v[1,4]
f102r1[3,2], f88r[2,4]
f89v2[2,2], f100r[1,5]
f89v2[1,3], f100r[1,1]
f88r[1,5], f89r1[2,2]
f99v[1,4], f89r1[1,4]
f101v2[1,4], f89r1[3,2]
f89r2[4,1], f99v[3,2]
f89r2[1,3], f88v[1,4]
f102r1[3,1], f101v[2,5]
f101v[1,5], f95r2
f99v[1,8], f100v[1,1]
f89r1[1,1], f101r[2,1]
f102v2[1,6], f88v[2,1]
f89v2[1,5], f102v1[2,3]
f99r[3,1], f101r[1,2]
f101r2[2,9], f100v[2,1]
f101r1[3,3], f102r1[2,1]
f89v2[1,4], f102v2[3,5]
f88r[1,4], f102v2[3,1]
f99v[1,7], f101v[1,8]
f99r[2,1], f100r[2,1]
f99r[2,2], f100r[1,2], f99r[5,1]
f102r2[2,1], f101r1[1,6], f102r2[2,4]

It needs tidying up a bit. I guess I will sort it by the left column. I will also remove all entries for which there is no label.
I hope that the Google doc (Excel) will not disappear, and that a link to it will be kept in some easily found location.

My 'scores' are initial and I may review them.
They were based on the assumption that there may well be lost (or never drawn) large plant and pharma pages, so not every item has to match an existing one.

I also have some additional possible matches, but not sure whether these should be added to the same table, and if so, how to distinguish them from Mark's list.
The file will remain, it is in my google Drive and I am reliable :) 
Of course it won't hurt if someone keeps a backup.

I'll add a link to the first post in this thread, as well as in the other thread I made. 

Regarding additional matches, we certainly need those. Maybe add them as R1, R2 etc? Or discuss on the forum first which other matches are possible, then add them? 
I also think that those matches which have been rated 0 by all members but Mark should probably be removed for simplification.
(03-10-2023, 06:49 AM)ReneZ Wrote: You are not allowed to view links. Register or Login to view.I hope that the Google doc (Excel) will not disappear, and that a link to it will be kept in some easily found location.

Here is the link again, just to keep it recently posted in this thread:

You are not allowed to view links. Register or Login to view.

I've been in and out of the spreadsheet so many times now, I remember I can always find it in You are not allowed to view links. Register or Login to view. of this thread.  Rolleyes

Speaking of, I'm making progress adding in the links to the Folio Browser comparisons, assuming that is still helpful. More than halfway done. And I'm checking the folio numbers as I go but I haven't been checking the folio 2 positions. I'm sure Rene got them all right though.

I mentioned earlier hitting a snag in the links - You are not allowed to view links. Register or Login to view. is missing - so I got sidetracked with a side project checking the Folio Browser. Turns out only You are not allowed to view links. Register or Login to view. is missing from it and it only appears in the spreadsheet in two rows so in those rows, I added the links from JasonDavies for the individual folios to their respective cells in column A and column B instead of in column E.

And speaking of that, I have also added a data filter to the spreadsheet but filters in Google Sheets are not like they are in Excel. To apply the filter, click on the Data menu, scroll to the Filter Views option, then in the submenu, click on Filter 1. The spreadsheet will change to the filter view and you will see filter down arrows similar to Excel in each of the column headers. To filter to see only a specific folio, like f87, in column A or B, click the filter arrow for the column desired. In the Filter menu, under Filter by values, click the Clear link, then type f87 in the Search field, then select any matching options that appear below the field then click OK at the bottom of the menu (you may have to use the Filter menu scroll bar to get to it). Once you're done with the filter view, you have to first clear the filter unless you want it saved for the next time you use Filter Views. To clear it, in the Filter menu, under Filter by values, click the Select All link then click OK. Once the filter is cleared, click on Data then Filter Views then Exit View to return to normal view.
The more suggested matches that there are the better, so please list any not included by me.

When matching I compare the large plants in the herbal section with the small plants in the botanical section. However one thing that I didn't do directly was compare small plants in the botanical section with other small plants in this section to see if they can be matched up. I have scanned these briefly for matches, but not compared them systematically. I don't expect that there are many such matches, however I believe that Vladimir found one such match.

I feel like it would be useful to have a note against each match, which lists the reasons for and against that being a match. Ultimately deciding on what to treat as a match, probable match, possible match or conceivable match is down to the individual, but a justification for or against a match could be useful.
So now towards comparing labels

Plant 1 Position, Plant 2 Position, Plant 3 Position, Plant 1 Label, Plant 2 Label, Plant 3 Label
f88r[1/4], f102v2[3/1],
f88r[1/5], f89r1[2/2],
f89r1[1/1], f101r[2/1],
f89r2[1/3], f88v[1/4],
f89r2[4/1], f99v[3/2],
f89v2[1/3], f100r[1/1],
f89v2[1/4], f102v2[3/5],
f89v2[1/5], f102v1[2/3],
f89v2[2/2], f100r[1/5],
f99r[1/7], f101v[1/3],
f99r[2/1], f100r[2/1],
f99r[2/2], f100r[1/2], f99r[5/1]
f99r[3/1], f101r[1/2],
f99r[4/1], f100v[2/5],
f99v[1/4], f89r1[1/4],
f99v[1/7], f101v[1/8],
f99v[1/8], f100v[1/1],
f101r[1/7], f102v2[3/3],
f101r1[3/3], f102r1[2/1],
f101r2[2/9], f100v[2/1],
f101v[1/5], f95r2,
f101v1[3/8], f101v1[2/7],
f101v2[1/4], f89r1[3/2],
f102r1[3/1], f101v[2/5],
f102r1[3/2], f88r[2/4],
f102r2[2/1], f101r1[1/6], f102r2[2/4]
f102v1[2/2], f100v[1/4],
f102v2[1/6], f88v[2/1],


I probably want to eliminate the Plant 3 column and those items without labels. Having sorted by Plant 1 Position column it should make it easier to locate the Plant 1 Labels in the text. Then sorting by Plant 2 Position column I should be able to track down the Plant 2 Labels. That should leave us with many pairs of labels for comparison. Whether there is a relationship between the two or no relationship at all, I think the results would be interesting. Comparing small plant labels with associated large Plant paragraph text is a separate task to be carried out.
Our suggestions for changes to the spreadsheet seem to be getting lost in the trail of posts in this thread. So, I made some modifications to the Google workbook today as an improvement for that process.

I have renamed the main sheet as Plant Match Ratings instead of Sheet 1. Then I added a new spreadsheet named Ratings Sheet Wish List where we can input our suggestions for changes to the workbook in general or the ratings sheet. I also added, based on Mark's comments here today, sample rows to give an idea how we might use the wish list. Approval or Rejection of any changes noted in the wish list could either be per Mark or discussed first here in this thread. Approved changes could then be made by the person who suggested them or by another participant.

I forget how many times but the suggestion has been made more than once now that it might be helpful for participants to be able to add comments about their individual ratings. So, I have also added a separate comment sheet for each current participant for this purpose. I won't repeat here but you can see in the wish list where I have completed this change and my explanatory comments about it, including noting the example links and comment I added to one of my ratings.

Let me know if this helps.
Is there an easy of grabbing the label text from the small plant reference? Would it be better to grab the image of the label or a text string as EVA? Maybe the image of the label would be more reliable to work with.

Obviously I will ideally want to focus on the most reliable probable label pair matches. I can check with the original spreadsheet scores. How many label text pair matches I can have some confidence in I do not know.

Of course one can ask what those pairs mean if anything.(I have a theory that many Voynichese words are null) If they have a meaning, I think the first expectation would be the name of the plant (or part of plant) in the illustration. So one might expect the label text pair to match, as for example, the name on the label next to the root of the plant might be the same as the name on the label next to the leaf of the same plant on a different folio.
Folio Browser links are done now.
Pages: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17