TY - GEN
T1 - Disharmony
T2 - 2025 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, WACVW 2025
AU - Shin, Philip Wootaek
AU - Sampson, Jack
AU - Narayanan, Vijaykrishnan
AU - Marquez, Andres
AU - Halappanavar, Mahantesh
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Content generation and manipulation approaches based on deep learning methods have seen significant advancements, leading to an increased need for techniques to detect whether an image has been generated or edited. Another area of research focuses on the insertion and harmonization of objects within images. In this study, we explore the potential of using harmonization data in conjunction with a segmentation model to enhance the detection of edited image regions. These edits can be either manually crafted or generated using deep learning methods. Our findings demonstrate that this approach can effectively identify such edits. Existing forensic models often overlook the detection of harmonized objects in relation to the background, but our proposed Disharmony Network addresses this gap. By utilizing an aggregated dataset of harmonization techniques, our model outperforms existing forensic networks in identi-fying harmonized objects integrated into their backgrounds, and shows potential for detecting various forms of edits, in-cluding virtual try-on tasks.
AB - Content generation and manipulation approaches based on deep learning methods have seen significant advancements, leading to an increased need for techniques to detect whether an image has been generated or edited. Another area of research focuses on the insertion and harmonization of objects within images. In this study, we explore the potential of using harmonization data in conjunction with a segmentation model to enhance the detection of edited image regions. These edits can be either manually crafted or generated using deep learning methods. Our findings demonstrate that this approach can effectively identify such edits. Existing forensic models often overlook the detection of harmonized objects in relation to the background, but our proposed Disharmony Network addresses this gap. By utilizing an aggregated dataset of harmonization techniques, our model outperforms existing forensic networks in identi-fying harmonized objects integrated into their backgrounds, and shows potential for detecting various forms of edits, in-cluding virtual try-on tasks.
UR - https://www.scopus.com/pages/publications/105005027678
UR - https://www.scopus.com/pages/publications/105005027678#tab=citedBy
U2 - 10.1109/WACVW65960.2025.00085
DO - 10.1109/WACVW65960.2025.00085
M3 - Conference contribution
AN - SCOPUS:105005027678
T3 - Proceedings - 2025 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, WACVW 2025
SP - 708
EP - 717
BT - Proceedings - 2025 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, WACVW 2025
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 28 February 2025 through 4 March 2025
ER -