TY - JOUR
T1 - Laboratory earthquake forecasting
T2 - A machine learning competition
AU - Johnson, Paul A.
AU - Rouet-Leduc, Bertrand
AU - Pyrak-Nolte, Laura J.
AU - Beroza, Gregory C.
AU - Marone, Chris J.
AU - Hulbert, Claudia
AU - Howard, Addison
AU - Singer, Philipp
AU - Gordeev, Dmitry
AU - Karaflos, Dimosthenis
AU - Levinson, Corey J.
AU - Pfeiffer, Pascal
AU - Puk, Kin Ming
AU - Reade, Walter
N1 - Publisher Copyright:
© 2021 National Academy of Sciences. All rights reserved.
PY - 2021/2/2
Y1 - 2021/2/2
N2 - Earthquake prediction, the long-sought holy grail of earthquake science, continues to confound Earth scientists. Could we make advances by crowdsourcing, drawing from the vast knowledge and creativity of the machine learning (ML) community? We used Google’s ML competition platform, Kaggle, to engage the worldwide ML community with a competition to develop and improve data analysis approaches on a forecasting problem that uses laboratory earthquake data. The competitors were tasked with predicting the time remaining before the next earthquake of successive laboratory quake events, based on only a small portion of the laboratory seismic data. The more than 4,500 participating teams created and shared more than 400 computer programs in openly accessible notebooks. Complementing the now well-known features of seismic data that map to fault criticality in the laboratory, the winning teams employed unexpected strategies based on rescaling failure times as a fraction of the seismic cycle and comparing input distribution of training and testing data. In addition to yielding scientific insights into fault processes in the laboratory and their relation with the evolution of the statistical properties of the associated seismic data, the competition serves as a pedagogical tool for teaching ML in geophysics. The approach may provide a model for other competitions in geosciences or other domains of study to help engage the ML community on problems of significance.
AB - Earthquake prediction, the long-sought holy grail of earthquake science, continues to confound Earth scientists. Could we make advances by crowdsourcing, drawing from the vast knowledge and creativity of the machine learning (ML) community? We used Google’s ML competition platform, Kaggle, to engage the worldwide ML community with a competition to develop and improve data analysis approaches on a forecasting problem that uses laboratory earthquake data. The competitors were tasked with predicting the time remaining before the next earthquake of successive laboratory quake events, based on only a small portion of the laboratory seismic data. The more than 4,500 participating teams created and shared more than 400 computer programs in openly accessible notebooks. Complementing the now well-known features of seismic data that map to fault criticality in the laboratory, the winning teams employed unexpected strategies based on rescaling failure times as a fraction of the seismic cycle and comparing input distribution of training and testing data. In addition to yielding scientific insights into fault processes in the laboratory and their relation with the evolution of the statistical properties of the associated seismic data, the competition serves as a pedagogical tool for teaching ML in geophysics. The approach may provide a model for other competitions in geosciences or other domains of study to help engage the ML community on problems of significance.
UR - http://www.scopus.com/inward/record.url?scp=85100015106&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85100015106&partnerID=8YFLogxK
U2 - 10.1073/pnas.2011362118
DO - 10.1073/pnas.2011362118
M3 - Review article
C2 - 33495346
AN - SCOPUS:85100015106
SN - 0027-8424
VL - 118
JO - Proceedings of the National Academy of Sciences of the United States of America
JF - Proceedings of the National Academy of Sciences of the United States of America
IS - 5
M1 - e2011362118
ER -