SL

S. Li

info

Please Note

3 records found

Journal article (2025) - Shixiang Li, Sheng Fu, Dianpeng Wang, Piao Chen
Process capability analysis plays a critical role in quality control by evaluating how well manufacturing processes meet defined specifications. However, traditional process capability indices (PCIs) rely on assumptions of symmetric tolerances and normally distributed data, which often do not hold in real-world applications and can lead to misleading conclusions. To overcome these limitations, we propose two novel classes of PCIs designed specifically for asymmetric tolerances, complemented by parametric estimation procedures and asymptotic confidence limits. To address the issue of non-normal data, we further employ an inverse transformation via constrained B-spline regression, which removes the need for the normality assumption. We demonstrate that our proposed PCIs reduce to traditional indices under symmetric conditions and normal data while extending applicability to a broader range of cases. Numerical simulations and a real-world application in an electronics company confirm the effectiveness and practical utility of our approach. ...
Doctoral thesis (2025) - S. Li, G. Jongbloed, Y. Tian, P. Chen
The main concern of this thesis is applying statistical methods to tackle some issues that arise from specific cases where data are hard to obtain, spatiotemporal, or non-normal. The range of such potential applications is huge. These hard-to-solve problems often necessitate innovative strategies, prompting the development of novel statistical models and methods. While the subjects covered in this thesis are diverse, they consistently revolve around the modeling methods in Statistics for complicated data structures, where traditional methods fail to provide viable solutions. Among the topics presented, we have the optimization for a computationally expensive black-box function based on limited data; predicting a spatiotemporal flow at a new input using the data stored as a tensor; the evaluation of a manufacturing process considering non-normal data and asymmetric tolerances; and, in the presence of dependence structures, defining the joint distribution of multivariate Poisson counts.

In this thesis, we separate all topics into two kinds of issues and propose several tailored statistical methods. First, we consider the case where the process of acquiring data is intricate.
Two specific examples are considered, including the optimization for black-box functions and the prediction for spatiotemporal flows. In these two cases, data are both expensive and time-consuming to obtain. Therefore, the key is to achieve specific goals while minimizing collection costs. We design proper inputs to feed and apply the Gaussian process model as a surrogate model. Then, some customized methods are employed. For black-box functions, where no explicit expression is available, the optimal approach is to select inputs that are as close as possible to the true optimum. Thus, a sequential design based on Gaussian process models is conducted.
It will explore the whole feasible region and find a local area close to the true optimum, only based on limited inputs. Then, more points will be chosen over this region to find the optimum.
When predicting flows, the scenario is notably distinct due to the multi-scale nature of the limited data in both spatial and temporal dimensions, which are typically collected and represented as a tensor. Directly applying surrogate models is not feasible because there are plenty of data observations for training, leading to extremely high computational complexity.
To handle such data, a tailored multi-output Gaussian process via a compression method is used to extract spatial and temporal basis functions, along with low-dimensional input-dependent coefficients, respectively. Then, surrogate modelling is implemented based on input-dependent features, allowing for accurate and fast predictions of unobserved flows.

Next, we address the challenge of analysing non-normal data, which includes both continuous and discrete scenarios. The former occurs when assessing the manufacturing processes, and the latter involves analysis of multiple dependent Poisson counts. We handle non-normal distributions based on the following fact: applying the cumulative distribution function (CDF) of a random variable to that random variable yields a uniformly distributed random variable on $[0, 1]$, assuming that this CDF is continuous. When evaluating a manufacturing process, some data on a characteristic of interest are collected first. Then, the process capability indices (PCIs) can be calculated for the quality control. We focus on the case where there are asymmetric tolerances to define PCIs, and the characteristic follows an unknown continuous univariate distribution.
Two tailored PCIs are proposed, with an inverse transformation introduced to handle non-normal data. We estimate the underlying CDF of the data via B-splines, transforming the data to be normal. Then, our PCIs can be used to assess the capability of an in-control production process in manufacturing conforming products based on non-normal data. Furthermore, when dealing with multiple Poisson counts, it is essential for both the marginal and joint model specifications to accurately capture the random behaviour of the variables. Although there has been a growing interest in this type of data across various scientific fields, models that account for multivariate Poisson distributions remain relatively uncommon. To address that, we introduce a novel multivariate Poisson distribution, leveraging multivariate reduction technology and copula methods, which offers unprecedented flexibility in modelling joint distributions and dependence structures. Various probabilistic properties and several estimation techniques are also explored.

In summary, this thesis makes significant contributions to the field by advancing statistical modelling techniques, particularly in addressing complex and challenging data scenarios. ...
Conference paper (2023) - S. Li
Multivariate distributions are indispensable tools for modeling complex data structures with multiple dependent variables. Despite extensive research on discrete multivariate distributions, the multivariate Poisson distribution remains inadequately defined. However, multivariate Poisson counts are not rare and have gained considerable attention in scientific fields such as reliability engineering. Accurately specifying the dependence structure presents a significant challenge in analyzing such data. Although several methods have been proposed in the literature to address this issue, they have limitations in satisfying all feasible correlations. Currently, there is an outstanding question regarding the development of a multivariate Poisson model that is easily interpretable and effectively handles dependent Poisson counts. In this study, we present a novel multivariate Poisson model that leverages multivariate reduction techniques (MRT) to enable greater flexibility in the dependence structure, particularly for negative correlations, than classical constructions. Our proposed model generalizes existing MRT-based methods by having the same parameters when some of our model's parameters are preset. We demonstrate the feasible regions of correlations and show that our model overcomes the limitations of previous methods, making it ideal for analyzing multivariate Poisson counts. Furthermore, we establish several probabilistic properties, including the probability mass function, the probability-generating function, and the Pearson correlation coefficient. We also provide a detailed discussion of maximum likelihood estimation and an algorithm for generating multivariate Poisson random variables. Our model's superiority is demonstrated through simulations and a real-world example. ...