Mark Knowles > 02-05-2026, 04:12 PM
(02-05-2026, 03:29 PM)DG97EEB Wrote: You are not allowed to view links. Register or Login to view.(02-05-2026, 03:16 PM)Mark Knowles Wrote: You are not allowed to view links. Register or Login to view.(02-05-2026, 02:42 PM)DG97EEB Wrote: You are not allowed to view links. Register or Login to view.(02-05-2026, 02:16 PM)Mark Knowles Wrote: You are not allowed to view links. Register or Login to view.It would be nice if the AIs could do some kind of real-time OCR so that typed(or even handwritten) documents that haven't read digitised could be read and analysed.
It can do it fairly well now.. Gemini is the best model, but really one page at a time.
One page at a time is really a problem.
I didn't explain clearly. I mean as part of its search or maybe Google could automatically OCR all documents found. I don't know how far we are now from that being computationally feasible.
There are quite a lot of typed and scanned, but not digitised inventories which are a lot of effort to read manually and for which being able to search inside would be really helpful. (Obviously, being able to read handwritten documents as well would be amazing).
My view is the first person to build an algorithm and robot to scan manuscripts at volume will make some serious money... There's OCR for typed documents and HTR (Handwriting text Recognition) for written. Transkribus is the gold standard, but the frontier models are catching up fast... But there are 10s of thousands of Manuscripts that no one has even looked at...
JoJo_Jost > 03-05-2026, 07:25 AM