TY - JOUR
T1 - Use of multiensemble track clustering to inform medium-range tropical cyclone forecasts
AU - Kowaleski, Alex M.
AU - Evans, Jenni L.
N1 - Funding Information:
Acknowledgments. We thank Judith Berner, whose ideas motivated some of the research in this paper. We are also grateful to Julian Heming at the Met Office for providing deterministic UKMET track forecasts. We acknowledge high-performance computing support from the Penn State Institute for Computational and Data Sciences (ICDS) Advanced CyberInfrastructure (ICDS-ACI). Data accessibility statement: Ensemble track forecasts used in this study may be accessed via the UCAR TIGGE database: https://rda.ucar.edu/datasets/ds330.3. The curve clustering toolbox used in this study can be downloaded via the CC Toolbox website at http:// www.datalab.uci.edu/resources/CCT/#Down. Best track data can be obtained from HURDAT 2 (https:// www.aoml.noaa.gov/hrd/hurdat/Data_Storm.html), the Japan Meteorological Agency (https://www.jma.go.jp/jma/ jma-eng/jma-center/rsmc-hp-pub-eg/trackarchives.html), and from IBTrACS via the University of North Carolina Ashville (http://ibtracs.unca.edu/). Some track data used in this study were provisional and have since been updated online; the provisional data will be provided upon request.
Publisher Copyright:
© 2020 American Meteorological Society.
PY - 2020/8
Y1 - 2020/8
N2 - Tropical cyclone ensemble track forecasts from 153 initialization times during 2017–18 are clustered using regression mixture models. Clustering is performed on a four-ensemble dataset [ECMWF + GEFS + UKMET + CMC (EGUC)], and a three-ensemble dataset that excludes the CMC (EGU). For both datasets, five-cluster partitions are selected to analyze, and the relationship between cluster properties (size, ensemble composition) and 96–144-h cluster-mean error is evaluated. For both datasets, small clusters produce very large errors, with the least populous cluster producing the largest error in more than 50% of forecasts. The mean of the most populous EGUC cluster outperforms the most accurate (EGU) ensemble mean in only 43% of forecasts; however, when the most populous EGUC cluster from each forecast contains ≥30% of the ensemble population, its average cluster-mean error is significantly reduced compared to when the most populous cluster is smaller. Forecasts with a highly populous EGUC cluster also appear to have smaller EGUC-, EGU-, and ECMWF-mean errors. Cluster-mean errors also vary substantially by the ensembles composing the cluster. The most accurate clusters are EGUC clusters that contain threshold memberships of ECMWF, GEFS, and UKMET, but not CMC. The elevated accuracy of EGUC CMC-excluding clusters indicates the potential utility of including the CMC in clustering, despite its large ensemble-mean errors. Pruning ensembles by removing members that belong to small clusters reduces 96–144-h forecast errors for both EGUC and EGU clustering. For five-cluster partitions, a pruning threshold of 10% affects 49% and 35% of EGUC and EGU ensembles, respectively, improving 69%–74% of the forecasts affected by pruning.
AB - Tropical cyclone ensemble track forecasts from 153 initialization times during 2017–18 are clustered using regression mixture models. Clustering is performed on a four-ensemble dataset [ECMWF + GEFS + UKMET + CMC (EGUC)], and a three-ensemble dataset that excludes the CMC (EGU). For both datasets, five-cluster partitions are selected to analyze, and the relationship between cluster properties (size, ensemble composition) and 96–144-h cluster-mean error is evaluated. For both datasets, small clusters produce very large errors, with the least populous cluster producing the largest error in more than 50% of forecasts. The mean of the most populous EGUC cluster outperforms the most accurate (EGU) ensemble mean in only 43% of forecasts; however, when the most populous EGUC cluster from each forecast contains ≥30% of the ensemble population, its average cluster-mean error is significantly reduced compared to when the most populous cluster is smaller. Forecasts with a highly populous EGUC cluster also appear to have smaller EGUC-, EGU-, and ECMWF-mean errors. Cluster-mean errors also vary substantially by the ensembles composing the cluster. The most accurate clusters are EGUC clusters that contain threshold memberships of ECMWF, GEFS, and UKMET, but not CMC. The elevated accuracy of EGUC CMC-excluding clusters indicates the potential utility of including the CMC in clustering, despite its large ensemble-mean errors. Pruning ensembles by removing members that belong to small clusters reduces 96–144-h forecast errors for both EGUC and EGU clustering. For five-cluster partitions, a pruning threshold of 10% affects 49% and 35% of EGUC and EGU ensembles, respectively, improving 69%–74% of the forecasts affected by pruning.
UR - http://www.scopus.com/inward/record.url?scp=85089531330&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85089531330&partnerID=8YFLogxK
U2 - 10.1175/WAF-D-20-0003.1
DO - 10.1175/WAF-D-20-0003.1
M3 - Article
AN - SCOPUS:85089531330
SN - 0882-8156
VL - 35
SP - 1407
EP - 1426
JO - Weather and Forecasting
JF - Weather and Forecasting
IS - 4
ER -