In Facebook messenger two days ago:
Dear Mr. Danh Hong,
my name is Moritz Raguschat. I am from Germany.
I have heard/read about your great work of creating OCR and spell-checking software for the benefit of the public, offered at nextspell.com.
The way I came to know of it, and the reason I am contacting you, is via a Buddhist monk, the venerable Johann, residing near Phnom Aural currently, who was wondering about possible use of this software for the Sangha. (See request in the online forum/online "Wat" at http://forum.sangham.net/index.php/topic,9657.msg21395.html#msg21395)
Since abstaining from taking what is not given, and not using Facebook, Bhante was wondering about possible ways to contact. So I am writing to you here in Facebook. It would be great if you could come directly into contact via the online forum/online monastery http://forum.sangham.net.
Many thanks, and may you have a good day!
Moritz
Ok. Thank you
Not sure yet what else to say, possibly not made clear enough and too much other worries to think about (in both heads involved here possibly
).
I will try again later.
Meanwhile, I have learnt about the software, it seems to be based on the open-source OCR machine-learning software "Tesseract", which can also be easily installed on the new server, and a web-frontend built for it. I could do that. Just would need time, like for many other things.
The open-source Tesseract software already had some Khmer recognition for a longer time, but was possibly not yet trained very well yet.
The KhmerOCR project led by Mr. Danh Hong, was for the purpose of improving its accuracy, by feeding it with more training data.
Not sure if that improved training data has been integrated back into the original Tesseract as open-source data, or might be kept private for now.
I will try to ask for it, if it could be given to use for the Sangha.