Missing data is the first major problem that appears in many database fields for a set of reasons. It has ">

Use of Long-Short Term Memory network (LSTM) in the reconstruction of missing water level data in the Seine River. 

crossref(2023)

引用 0|浏览1
暂无评分
摘要
<div> <p><span data-contrast="none">Missing data is the first major problem that appears in many database fields for a set of reasons. It has always been necessary to fill them, which becomes unavoidable and more complicated when the missing periods are longer. Several machine-learning-based approaches have been introduced to deal with this problem.</span><span data-ccp-props="{&quot;201341983&quot;:0,&quot;335551550&quot;:6,&quot;335551620&quot;:6,&quot;335559731&quot;:230,&quot;335559739&quot;:160,&quot;335559740&quot;:480}">&#160;</span></p> </div> <div> <p><span data-contrast="none">The purpose of this paper is to discuss the effectiveness of a new methodology added prior to the LSTM deep learning algorithm to fill in the missing data in the hourly surface water level time series of some stations installed along the Seine River in Normandy-France. In our study, due to a lack of data, a challenging situation was faced where only the water level data in the same station, which contain many missing parts, were used as input and output variables to fill the station itself in a self-learning approach. This contrasts with the common work on imputing missing data, where several features are available to take advantage of in a multivariate and spatiotemporally way, e.g.: using the same variable from other stations or exploiting other physical variables and metrological data, etc. The reconstruction accuracy of the proposed method depends on both the size of the available/missing data and the parameters of the networks. Therefore, we performed sensitivity analyses on both the properties of the networks and the structuring of the input and output data to better determine the appropriate strategy. During this analysis process, a data preprocessing method was developed and added prior to the LSTM model. This data processing method was discovered by presenting many scenarios, each of which was an updated version of the last one. Along with these scenarios, limitations were also addressed and overcome. Finally, the last model version was able to impute missing values that may reach one year of hourly data with high accuracy (One-year RMSE = 0.14 m) regardless of neither the location of the missing part in the series nor its size.&#160;</span><span data-ccp-props="{&quot;201341983&quot;:0,&quot;335551550&quot;:6,&quot;335551620&quot;:6,&quot;335559731&quot;:230,&quot;335559739&quot;:160,&quot;335559740&quot;:480}">&#160;</span></p> </div>
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要