Abstract
The need for figure-ground segmentation in video arises in many vision problems like tracker initialization, accurate object shape representation and drift-free appearance model adaptation. This paper uses a 3D spatio-temporal Conditional Random Field (CRF) to combine different segmentation cues while enforcing temporal coherence. Without supervised parameter training, the weighting factors for different data potential functions in the CRF model are adapted online to reflect changes in object appearance and environment. To get an accurate boundary based on the 3D CRF segmentation result, edge pixels are classified into three classes: foreground, background and boundary. The final foreground region bitmask is constructed from the foreground and boundary edge pixels. The effectiveness of our approach is demonstrated on several airborne videos with large appearance change and heavy occlusion.
Original language | English (US) |
---|---|
DOIs | |
State | Published - 2008 |
Event | 2008 19th British Machine Vision Conference, BMVC 2008 - Leeds, United Kingdom Duration: Sep 1 2008 → Sep 4 2008 |
Other
Other | 2008 19th British Machine Vision Conference, BMVC 2008 |
---|---|
Country/Territory | United Kingdom |
City | Leeds |
Period | 9/1/08 → 9/4/08 |
All Science Journal Classification (ASJC) codes
- Computer Vision and Pattern Recognition