Hi, over the summer I worked on a project whose goal is to remove people or objects from photos. Person-remover uses a pretrained YOLO to detect the people or objects and then feeds the resulting bounding boxes to the generator of a pix2pix model, which I trained from scratch on the Paris dataset. Even though the generator wasn't trained specifically to fill in person-shaped regions, the results are pretty great and seem to generalize well to unseen photos and video.
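
To give a rough idea of the flow (detect, blank out the boxes, let the generator fill them in), here is a minimal sketch in plain NumPy. It is an illustration under assumptions, not the project's actual code: `mask_boxes`, `remove_regions` and the `generator` callable are hypothetical names, boxes are assumed to be `(x1, y1, x2, y2)` pixel coordinates, and filling the boxes with a constant value before inpainting is an assumption as well. In the real pipeline the boxes would come from YOLO and `generator` would be the trained pix2pix generator.

```python
import numpy as np


def mask_boxes(image, boxes, fill=1.0):
    """Return a copy of `image` with every box blanked out, plus a boolean mask.

    `boxes` is an iterable of (x1, y1, x2, y2) pixel coordinates (assumed format).
    """
    height, width = image.shape[:2]
    mask = np.zeros((height, width), dtype=bool)
    for x1, y1, x2, y2 in boxes:
        # Clip to the image so out-of-range detections neither wrap nor raise.
        x1, x2 = max(0, x1), min(width, x2)
        y1, y2 = max(0, y1), min(height, y2)
        if x2 > x1 and y2 > y1:
            mask[y1:y2, x1:x2] = True
    masked = image.copy()
    masked[mask] = fill  # constant fill value is an assumption
    return masked, mask


def remove_regions(image, boxes, generator):
    """Blank out the detected boxes and fill them with the generator's output.

    `generator` is any callable mapping an image array to an array of the same
    shape; here it stands in for a trained inpainting model.
    """
    masked, mask = mask_boxes(image, boxes)
    generated = generator(masked)
    result = image.copy()
    # Only pixels inside the boxes are replaced; the rest stays untouched.
    result[mask] = generated[mask]
    return result


if __name__ == "__main__":
    photo = np.zeros((8, 8, 3), dtype=np.float32)   # stand-in photo
    detections = [(2, 2, 5, 5)]                      # stand-in for YOLO boxes
    fake_generator = lambda img: np.full_like(img, 0.5)  # stand-in for pix2pix
    cleaned = remove_regions(photo, detections, fake_generator)
    print(cleaned.shape)
```

Copying back only the pixels inside the boxes keeps everything outside the detections identical to the input, so any artifacts the generator produces elsewhere in the frame never reach the output.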


