Classifying Community QA Questions That Contain an Image - Yahoo! JAPAN Research and Development

Conference (domestic): Classifying Community QA Questions That Contain an Image

玉置賢太 (Waseda University), 富樫陸, 藤田澄男, 河東宗祐 (Waseda University), 前田英行, 酒井哲也 (Waseda University)

The 10th Forum on Data Engineering and Information Management (DEIM 2019)

2018.3.31

We consider the problem of automatically assigning a category to a given question posted to a Community Question Answering (CQA) site, where the question contains not only text but also an image. For example, CQA users may post a photograph of a dress and ask the community “Is this appropriate for a wedding?” where the appropriate category for this question might be “Manners, Ceremonial occasions.” We tackle this problem using Convolutional Neural Networks (CNNs) with Multimodal Compact Bilinear (MCB) pooling for combining the image and text networks. Our experiments with real data from a major CQA site and crowdsourced gold-standard categories show that our method, which combines MCB with a simple sum and element-wise product approach, outperforms a text-only baseline with statistical significance at alpha = 0.10 (p-value: 0.091; effect size: 0.046).
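As a rough illustration of the fusion operations named in the abstract, the following NumPy sketch shows the commonly described form of MCB pooling (Count Sketch projection of each feature vector, followed by an element-wise product in the frequency domain) next to a plain sum and element-wise product fusion. It is not the authors' implementation. The function names, the output dimension `d`, the random seed, and the requirement that both feature vectors have equal length for the sum/product fusion are all assumptions made for illustration, and the abstract does not specify how the paper combines the two kinds of fusion.

```python
import numpy as np


def count_sketch(x, h, s, d):
    """Count Sketch projection: y[h[i]] += s[i] * x[i] for every i."""
    y = np.zeros(d)
    np.add.at(y, h, s * x)  # unbuffered add, so colliding indices accumulate
    return y


def mcb_pool(img_feat, txt_feat, d=1024, seed=0):
    """Hypothetical MCB pooling of one image vector and one text vector.

    Assumption: each modality gets its own random hash h and sign s,
    fixed by `seed` so the projection is reproducible.
    """
    rng = np.random.default_rng(seed)
    sketches = []
    for v in (img_feat, txt_feat):
        h = rng.integers(0, d, size=v.shape[0])       # bucket index per input dim
        s = rng.choice([-1.0, 1.0], size=v.shape[0])  # random sign per input dim
        sketches.append(count_sketch(v, h, s, d))
    # Circular convolution of the two sketches, computed as an
    # element-wise product of their Fourier transforms.
    spectrum = np.fft.rfft(sketches[0]) * np.fft.rfft(sketches[1])
    return np.fft.irfft(spectrum, n=d)


def sum_and_product(img_feat, txt_feat):
    """Simple fusion: concatenate the element-wise sum and product.

    Assumption: both vectors were already projected to the same length.
    """
    return np.concatenate([img_feat + txt_feat, img_feat * txt_feat])
```

In a classifier along these lines, the fused vector would be passed to a final layer that scores each candidate category; that step is omitted here because the abstract gives no details about it.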

Paper: Classifying Community QA Questions That Contain an Image (external site)