How I found nearly 300,000 errors in MS COCO

Follow the full discussion on Reddit.
Hi folks! I've made a new technique for finding errors in object detection datasets, using new explainable AI techniques from my PhD. I was frankly pretty surprised to be able to find about 275k errors in MS COCO's training set (which has around 700k labels). This includes things like incorrectly drawn bounding boxes (shown below, about 55k), missing background labels (178k), and missing labels that overlap with existing labels (40k).

Comments

There's unfortunately not much to read here yet...

Discover the Best of Machine Learning.

Ever having issues keeping up with everything that's going on in Machine Learning? That's where we help. We're sending out a weekly digest, highlighting the Best of Machine Learning.

Join over 900 Machine Learning Engineers receiving our weekly digest.

Best of Machine LearningBest of Machine Learning

Discover the best guides, books, papers and news in Machine Learning, once per week.

Twitter