Guoqiang Wang
Please Note
6 records found
1
Data assimilation (DA) combines numerical model simulations with observed data to obtain the best possible description of a dynamical system and its uncertainty. Incorrect modeling assumptions can lead to filter divergence, making model identification an important issue in the field of DA. Variations in dynamic model structures can result in differences in parameter dimensions, complicating the resampling step in PFs. To meet this challenge, the Sequential Hierarchical Bayesian Model (SHBM) is proposed in this paper, which integrates the evolution model along with observation model from the DA scheme, and the hierarchical parameter model. A two-step resampling method are also proposed to estimate the SHBM: the first step uses the resampling scheme in the bootstrap filter to resample new particles based on weights, which may produce some duplicate particles; the second step utilizes the Reversible Jump Markov Chain Monte Carlo (RJMCMC) methods to draw new particles from the target distribution. This approach ensures particle diversity, with the first step aiming at avoiding particle degeneracy, and the second step intends to prevent the sample impoverishment. The performance in the Advection Equation example and Lorenz 96 example demonstrates the effectiveness of the proposed method.
In financial time series analysis, the dynamic conditional correlation model is the most popular method for estimating the conditional covariance matrix, which represents financial risk and is critical for risk management, portfolio optimization, and asset pricing. Traditional covariance matrix estimation is often constrained by the rigid parameter settings and the assumption of the normal distribution, leading to the estimation biases when the markets are not normally distributed. To address these limitations, this paper proposes a Bayesian Non-parametric Dynamic Conditional Angular Correlation model based on the Fractionally Integrated GARCH model (BNDCAC-FIGARCH) that incorporates the asymmetric parameter and the student’s t-distribution to increase the adaptability and flexibility. Simulation experiments demonstrate that under overall correlation paths shaped as the sine or ramp functions, our model provides more accurate estimates, showcasing its effectiveness and stability. Empirical studies apply real stock market data, which includes DAX 40, FTSE 100, SSE 50, and CSI 100, to construct the portfolio optimization. The results demonstrate the superiority of the proposed model in terms of both portfolio returns and the reduction of parameter uncertainty. Furthermore, the results indicate that CSI 100 exhibits the weaker asymmetry compared to the other indices, likely due to its higher liquidity and a more accurate reflection of improved economic conditions resulting from national policies.
Machine learning algorithms have demonstrated outstanding performance for disease diagnosis. Kernel function selection plays a crucial role in effectively transforming the nonlinear nature of input data. To enhance breast cancer diagnosis, we propose a novel ensemble algorithm, namely, AUC-Ada- (Formula presented.) MKL-WSVM, which integrates Weighted Support Vector Machines (WSVM), AdaBoost, and Multi-Kernel Learning (MKL). This ensemble algorithm introduces two main innovations. First, it simultaneously updates the weights of training samples and the combined kernel function during classification. Second, it incorporates an AUC-based approach to adjust training sample weights, effectively controlling the growth rate of misclassified sample weights in AdaBoost. Experimental results are provided to demonstrate the effectiveness of our method, which achieves an AUC score of 97.21% and an accuracy of 97.64% on the WDBC dataset, and an AUC of 97.53% and an accuracy of 97.46% on the WBC dataset. Comparative analysis further confirms that our ensemble algorithm outperforms four benchmark models in classification accuracy.
Air quality warning and forecasting systems are usually based on numerical chemical transport models (CTMs). Those dynamic models perform predictions by simulating the life cycles of the atmospheric components, including emission, transport and removal. However, the accuracy of these CTMs are still limited because of many imperfections, e.g., uncertainties in the input sources such as emission inventories, wind fields, boundary conditions, as well as insufficient knowledge about the atmospheric dynamics themselves. All these will mislead the CTM prediction constantly, or in a systematic way. In this paper, an approach based on machine learning is applied to predict model bias in the CTM. It is then combined with the CTM for formulating a hybrid forecast system. To our knowledge, it is the first time that machine learning methods are used in this way. The hybrid system is tested on the fine particular matter (PM2.5) prediction in Shanghai, China. The results showed that machine learning can be an effective tool to improve the accuracy of CTM prediction. In case of short term PM2.5 forecast (forecast length less than 12 h), statistical metrics of the root mean square error, mean absolute error, mean absolute percentage error as well as the air quality rank predicted accuracy all show the forecast skill is remarkably improved; while for long term prediction, improvement is not ensured.