Skip to main navigation Skip to search Skip to main content

Hierarchical Query Classification in E-commerce Search

  • Bing He
  • , Sreyashi Nag
  • , Limeng Cui
  • , Suhang Wang
  • , Zheng Li
  • , Rahul Goutam
  • , Zhen Li
  • , Haiyang Zhang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

E-commerce platforms typically store and structure product information and search data in a hierarchy. Efficiently categorizing user search queries into a similar hierarchical structure is paramount in enhancing user experience on e-commerce platforms as well as news curation and academic research. The significance of this task is amplified when dealing with sensitive query categorization or critical information dissemination, where inaccuracies can lead to considerable negative impacts. The inherent complexity of hierarchical query classification is compounded by two primary challenges: (1) the pronounced class imbalance that skews towards dominant categories, and (2) the inherent brevity and ambiguity of search queries that hinder accurate classification. To address these challenges, we introduce a novel framework that leverages hierarchical information through (i) enhanced representation learning that utilizes the contrastive loss to discern fine-grained instance relationships within the hierarchy, called “instance hierarchy”, and (ii) a nuanced hierarchical classification loss that attends to the intrinsic label taxonomy, named “label hierarchy”. Additionally, based on our observation that certain unlabeled queries share typographical similarities with labeled queries, we propose a neighborhood-aware sampling technique to intelligently select these unlabeled queries to boost the classification performance. Extensive experiments demonstrate that our proposed method is better than state-of-the-art (SOTA) on the proprietary Amazon dataset, and comparable to SOTA on the public datasets of Web of Science and RCV1-V2. These results underscore the efficacy of our proposed solution, and pave the path toward the next generation of hierarchy-aware query classification systems.

Original languageEnglish (US)
Title of host publicationWWW 2024 Companion - Companion Proceedings of the ACM Web Conference
PublisherAssociation for Computing Machinery, Inc
Pages338-345
Number of pages8
ISBN (Electronic)9798400701726
DOIs
StatePublished - May 13 2024
Event33rd Companion of the ACM World Wide Web Conference, WWW 2023 - Singapore, Singapore
Duration: May 13 2024May 17 2024

Publication series

NameWWW 2024 Companion - Companion Proceedings of the ACM Web Conference

Conference

Conference33rd Companion of the ACM World Wide Web Conference, WWW 2023
Country/TerritorySingapore
CitySingapore
Period5/13/245/17/24

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
  • Software

Fingerprint

Dive into the research topics of 'Hierarchical Query Classification in E-commerce Search'. Together they form a unique fingerprint.

Cite this