I don’t want to share a lot of details, but I’ve been working on a project that uses OpenCV for basic object detection (mostly through template matching) and Tesseract OCR to read text, all in a video game. Many of the objects are nearly identical 2D video game assets, so template matching in OpenCV is very accurate.
I’ve been exploring whether TensorFlow would be appropriate for my use case.
Below is the pattern I’m looking to implement. I’d like your opinion on whether TensorFlow is an appropriate framework for it, or whether I should continue to use OpenCV/Tesseract:
- Image classification – I need to first determine which type of image I’m looking at so I know what kind of processing to do. There are 3-4 classes of images I’m interested in.
- There’s one class of image where I need to perform both object detection and OCR.
  - Object detection/tracking would be for half a dozen objects on the screen at any given time.
  - OCR can be done using Tesseract if necessary, as the TensorFlow OCR implementations I’ve experimented with have had pretty poor accuracy.
  - Text is always expected to be in a certain format and in the same region of the screen.
- There’s another class of image where I need to perform only OCR. The OCR properties of the previous image class remain true:
  - OCR can be done using Tesseract if necessary, as the TensorFlow OCR implementations I’ve experimented with have had pretty poor accuracy.
  - Text is always expected to be in a certain format and in the same region of the screen.
- Finally, the last image class tells me that no object detection or OCR needs to be performed.
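For concreteness, the flow above could be sketched roughly like this. It is only an illustration under assumptions: `FrameKind`, `process_frame`, `TEXT_PATTERN`, and the pluggable `classify`/`detect`/`ocr` callables are names made up for this sketch, the text format is a placeholder, and the three kinds shown stand in for the 3-4 real classes. Each callable could wrap OpenCV, Tesseract, or a TensorFlow model; the sketch does not assume which.

```python
import re
from enum import Enum, auto
from typing import Any, Callable, Dict, List


class FrameKind(Enum):
    """Hypothetical image classes, mirroring the list above."""
    DETECT_AND_OCR = auto()  # object detection plus OCR
    OCR_ONLY = auto()        # OCR only
    IGNORE = auto()          # nothing to do


# Placeholder for the "certain format" the text is expected to have.
# The real format is not stated; this pattern is an assumption.
TEXT_PATTERN = re.compile(r"\d{1,3}/\d{1,3}")


def process_frame(
    frame: Any,
    classify: Callable[[Any], FrameKind],
    detect: Callable[[Any], List[Any]],
    ocr: Callable[[Any], str],
) -> Dict[str, Any]:
    """Classify the frame first, then run only the steps that class needs."""
    kind = classify(frame)
    result: Dict[str, Any] = {"kind": kind, "objects": [], "text": None}

    if kind is FrameKind.IGNORE:
        return result  # skip detection and OCR entirely

    if kind is FrameKind.DETECT_AND_OCR:
        result["objects"] = detect(frame)

    # Both remaining classes need OCR; reject text that is not in the expected format.
    raw = ocr(frame).strip()
    result["text"] = raw if TEXT_PATTERN.fullmatch(raw) else None
    return result
```

With this shape, the classifier is the only piece that has to run on every frame, and any single step can be swapped (say, template matching for a trained detector) without touching the dispatch logic.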
My questions are:
- Is TensorFlow right for me? Why or why not? What are the tradeoffs (besides time to tag custom datasets)?
- If TensorFlow is right for me, how would I model the above logic? Should I just detect everything and program business logic based on what’s detected, or is there a good way to model this pipeline in TensorFlow?
Feedback is appreciated. Thanks.
submitted by /u/UtesCartman