DenseCap: Fully Convolutional Localization Networks for Dense Captioningcs.stanford.edu6 pointsvkhuc10 years ago