100 likes | 209 Vues
Co-translator for Heterogeneous T ransfer L earning. Problem. Input n images with k labeled instances , k << n. (target domain) A set of English documents ; (source domain) Or a set of audio data . (source domain) Output Image classification function
E N D
Problem • Input • n images with k labeled instances, k << n. (target domain) • A set of English documents; (source domain) • Or a set of audio data. (source domain) • Output • Image classification function • Question : Can we leverage knowledge in heterogeneous sources for improving performance in target?
Previous Work HTLIC : Find a lower image representation by using knowledge in documents and a translator. TTI : learn a similarity function between documents and images by a translator. We define a translator as a set of pairs, each of which contains a image and several words. The translator can be constructed from crowd-sourced web site, such as Flickr, where texts and images co-occur.
Issues in the translator • The crowd-sourced translator is unreliable • (Sparse) Vocabularies used in annotation and documents are too large, while the number of tags for one images is small. It makes the translator sparse; • (Noise); • (Biased) words used in tags are different from that in formal documents.
Solution Let’s express the issues in the other way. That is, the text-image pairs in translator are unreliable. If we can find more reliable pairs, then we will get a high quality translator. And the prediction performance is improved.