johnGettings OP t1_j7lqkj2 wrote on February 7, 2023 at 6:41 PM

Reply to comment by GufyTheLire in Hi-ResNet: High resolution image classifier. (448, 896, 1792 sq.px.) by johnGettings

Yes, definitely agree. The project started as one thing, then turned into another, then another. I was only doing the coin grading for fun and wasn't planning on actually implementing it anywhere. So I switched gears and just focused on building a high resolution ResNet, regardless of what would be best for the actual coin grading.

There are probably better solutions, especially for this size of a dataset, and maybe a sliding window is necessary to achieve very high accuracy.

But I think this model can still be useful and preferable for some datasets of large images with fine patterns. Or at the very least preferred for simplicitys sake.

johnGettings OP t1_j7l45fw wrote on February 7, 2023 at 4:16 PM

Reply to comment by thelibrarian101 in Hi-ResNet: High resolution image classifier. (448, 896, 1792 sq.px.) by johnGettings

Excel lol

johnGettings OP t1_j7hwa3w wrote on February 6, 2023 at 10:27 PM

Reply to comment by jimtoberfest in Hi-ResNet: High resolution image classifier. (448, 896, 1792 sq.px.) by johnGettings

I didn't try it. I decided to just bust this out and move on to the next project. It was fun though.

johnGettings OP t1_j7gpg3x wrote on February 6, 2023 at 5:56 PM

Reply to Hi-ResNet: High resolution image classifier. (448, 896, 1792 sq.px.) by johnGettings

A 224x224 image is sufficient for most classification tasks, but there are instances where the fine details of a large image need to be analyzed. Hi-ResNet is the ResNet50 architecture expanded (with the same rules from the paper) to allow for higher resolution images.

I was working on a coin grading project and found that accuracy could not surpass 30% because the image size completely obscured the necessary details of the coin. One option is to tile the image, run them each through a classifier, and combine the outputs. Another is to just try a classifier with a higher resolution input, which is actually kind of difficult to find. Maybe I did not look hard enough, but I figured it would be a good exercise to build this out regardless.

It may come in handy for you later. It's a very simple function with 3 arguments that returns a Hi-ResNet Tensorflow model.