Abstract
This paper proposes learning algorithms for the problem of multimodal document classification. Specifically, we develop classifiers that automatically assign documents to categories by exploiting features from both text as well as image content. In particular, we use meta-classifiers that combine state-of-the-art text and image based classifiers into making joint decisions. The two meta classifiers we choose are based on support vector machines and Adaboost. Experiments on real-world databases from Wikipedia demonstrate the benefits of a joint exploitation of these modalities.
Original language | English (US) |
---|---|
Title of host publication | 2009 IEEE International Workshop on Multimedia Signal Processing, MMSP '09 |
DOIs | |
State | Published - 2009 |
Event | 2009 IEEE International Workshop on Multimedia Signal Processing, MMSP '09 - Rio De Janeiro, Brazil Duration: Oct 5 2009 → Oct 7 2009 |
Other
Other | 2009 IEEE International Workshop on Multimedia Signal Processing, MMSP '09 |
---|---|
Country/Territory | Brazil |
City | Rio De Janeiro |
Period | 10/5/09 → 10/7/09 |
All Science Journal Classification (ASJC) codes
- Artificial Intelligence
- Computer Networks and Communications
- Computer Vision and Pattern Recognition
- Signal Processing