Multimodal Visual Concept Learning with Weakly Supervised Techniques