Sunday, April 22, 2012

Detecting Multiple Hanzi in a Single Image

The following image shows the results of running the character detector on the sign13.jpg image with the classifier trained for 5 of the characters present in the image ( 向,前,小,大,文 ). Note that it fails to detect 文. Also, this set of bounding boxes is selected using a hand-picked threshhold value. The next image shows that with another value for threshold, the detector returns a lot of false detections, and a lot of noise (extra bounding boxes at different scales) for true detections.

So, it's apparent that I need to figure both a) how to "tune" the detector and b) select the "best" candidate bounding box.

threshold hand-picked for good results
results with poorly chosen threshold
results with previous "poorly chosen" threshold, and NMS applied


No comments:

Post a Comment