Structure Learning of H-colorings

Antonio Blanca, Zongchen Chen, Eric Vigoda, Daniel Štefankovič

Research output: Contribution to journalConference articlepeer-review

3 Scopus citations

Abstract

We study the structure learning problem for H-colorings, an important class of Markov random fields that capture key combinatorial structures on graphs, including proper colorings and independent sets, as well as spin systems from statistical physics. The learning problem is as follows: for a fixed (and known) constraint graph H with q colors and an unknown graph G = (V, E) with n vertices, given uniformly random H-colorings of G, how many samples are required to learn the edges of the unknown graph G? We give a characterization of H for which the problem is identifiable for every G, i.e., we can learn G with an infinite number of samples. We also show that there are identifiable constraint graphs for which one cannot hope to learn every graph G efficiently. We focus particular attention on the case of proper vertex q-colorings of graphs of maximum degree d where intriguing connections to statistical physics phase transitions appear. We prove that in the tree uniqueness region (i.e., when q > d) the problem is identifiable and we can learn G in poly(d, q) × O(n2 log n) time. In contrast for soft-constraint systems, such as the Ising model, the best possible running time is exponential in d. In the tree non-uniqueness region (i.e., when q ≤ d) we prove_ that the problem is not identifiable and thus G cannot be learned. Moreover, when q < d − d + Θ(1) we prove that even learning an equivalent graph (any graph with the same set of H-colorings) is computationally hard—sample complexity is exponential in n in the worst case. We further explore the connection between the efficiency/hardness of the structure learning problem and the uniqueness/non-uniqueness phase transition for general H-colorings and prove that under a well-known condition in statistical physics, known as the Dobrushin uniqueness condition, we can learn G in poly(d, q) × O(n2 log n) time.

Original languageEnglish (US)
Pages (from-to)152-185
Number of pages34
JournalProceedings of Machine Learning Research
Volume83
StatePublished - 2018
Event29th International Conference on Algorithmic Learning Theory, ALT 2018 - Lanzarote, Spain
Duration: Apr 7 2018Apr 9 2018

All Science Journal Classification (ASJC) codes

  • Artificial Intelligence
  • Software
  • Control and Systems Engineering
  • Statistics and Probability

Fingerprint

Dive into the research topics of 'Structure Learning of H-colorings'. Together they form a unique fingerprint.

Cite this