Viewing a single comment thread. View all comments

dwightsrus t1_iwu9b2p wrote

I am a noob to ML. How do you suggest I go about converting pdf with restaurant menu and pricing into structured data in json format? Are they ready to use models/websites/services?

1

InitialWalrus t1_iwuetbz wrote

https://pypi.org/project/PyPDF2/ This python library will allow you to convert the pdf to a string (assuming it is text readable. If it's not text readable you'll need to look into OCR, optical character recognition).

2

dwightsrus t1_iwuq4um wrote

Thanks for the suggestion. My challenge is that each pdf is not structured the same way. Would love to get a bunch of them go through a ML training model that spits out the data in the format I need.

1