Please use this identifier to cite or link to this item: http://hdl.handle.net/2122/15221
Authors: Jozinović, Dario* 
Lomax, Anthony* 
Štajduhar, Ivan* 
Michelini, Alberto* 
Title: Transfer learning: Improving neural network based prediction of earthquake ground shaking for an area with insufficient training data
Journal: Geophysical Journal International 
Series/Report no.: 1/229 (2022)
Publisher: Oxford University Press
Issue Date: 2022
DOI: 10.1093/gji/ggab488
URL: https://academic.oup.com/gji/advance-article/doi/10.1093/gji/ggab488/6447541
Keywords: Physics - Geophysics; Physics - Geophysics
machine learning
ground motion prediction
Subject Classificationseismology
Abstract: In a recent study (Jozinovi\'c et al, 2020) we showed that convolutional neural networks (CNNs) applied to network seismic traces can be used for rapid prediction of earthquake peak ground motion intensity measures (IMs) at distant stations using only recordings from stations near the epicenter. The predictions are made without any previous knowledge concerning the earthquake location and magnitude. This approach differs from the standard procedure adopted by earthquake early warning systems (EEWSs) that rely on location and magnitude information. In the previous study, we used 10 s, raw, multistation waveforms for the 2016 earthquake sequence in central Italy for 915 events (CI dataset). The CI dataset has a large number of spatially concentrated earthquakes and a dense station network. In this work, we applied the CNN model to an area around the VIRGO gravitational waves observatory sited near Pisa, Italy. In our initial application of the technique, we used a dataset consisting of 266 earthquakes recorded by 39 stations. We found that the CNN model trained using this smaller dataset performed worse compared to the results presented in the original study by Jozinovi\'c et al. (2020). To counter the lack of data, we adopted transfer learning (TL) using two approaches: first, by using a pre-trained model built on the CI dataset and, next, by using a pre-trained model built on a different (seismological) problem that has a larger dataset available for training. We show that the use of TL improves the results in terms of outliers, bias, and variability of the residuals between predicted and true IMs values. We also demonstrate that adding knowledge of station positions as an additional layer in the neural network improves the results. The possible use for EEW is demonstrated by the times for the warnings that would be received at the station PII.
Description: This article has been accepted for publication in Geophysical Journal International ©: The Authors 2021. Published by Oxford University Press on behalf of the Royal Astronomical Society. All rights reserved. Uploaded in accordance with the publisher's self-archiving policy.
Appears in Collections:Article published / in press

Files in This Item:
File Description SizeFormat
ggab488.pdfOpen Access published article3.63 MBAdobe PDFView/Open
Show full item record

Page view(s)

103
checked on Apr 27, 2024

Download(s)

43
checked on Apr 27, 2024

Google ScholarTM

Check

Altmetric