Comments
There's unfortunately not much to read here yet...
Follow the full discussion on Reddit.
I have a table document(3-columns fixed) which when passed through an ocr gives me the text ,mid x-coordinate and mid y-coordinate of the corresponding text. Apparently when all the text is detected i use simple distance based metric to make a table structure using the top 3 header coordinate and then classifying all the points into respective columns. Now when all the text is not detected(say the ocr fails to get 1 text in column B) i have a different length and a dataframe could not be formed.
There's unfortunately not much to read here yet...
Ever having issues keeping up with everything that's going on in Machine Learning? That's where we help. We're sending out a weekly digest, highlighting the Best of Machine Learning.
Discover the best guides, books, papers and news in Machine Learning, once per week.