A.A. Nunez Vicencio
Please Note
21 records found
1
Mapping Out-of-Gauge Rail Freight Capacity
A Method for Identifying, Prioritising and Addressing UIC-Gauge Obstructions
The proposed method utilises existing ProRail measurement and asset data stored in the Sigma application, including track geometry and recorded gauge infringements. Following data cleaning, enrichment, and quality filtering, infrastructure margins are calculated in accordance with the static gauging principles of NEN‑EN 15273. These margins account for track irregularities, curvature and cant effects, and dynamic allowances, enabling the estimation of available swept-envelope space for rolling stock and cargo. The first methodological module generates location-specific gauge possibilities and geospatial visualisations, supporting corridor-level diagnostics and revealing spatial concentrations of violations.
To translate technical results into actionable decision support, a bottleneck prioritisation method is applied using a Bottleneck Priority Number (BNPN), combining exceedance magnitude, observation frequency, and transport-demand scenarios. By integrating BNPN rankings with mapped violations, the methodology distinguishes between low-effort adjustments and more extensive redesign measures, supporting phased mitigation planning and the development of a clearance database for proactive and reactive operational use.
A case study demonstrates that the method successfully identifies decisive bottlenecks, prioritisation outcomes, and solution strategies, while highlighting recurring issues related to signage and station environments. Although further case studies are required to confirm statistical robustness, the results indicate strong potential for improving rail network resilience and out-of-gauge freight routing capability. ...
The proposed method utilises existing ProRail measurement and asset data stored in the Sigma application, including track geometry and recorded gauge infringements. Following data cleaning, enrichment, and quality filtering, infrastructure margins are calculated in accordance with the static gauging principles of NEN‑EN 15273. These margins account for track irregularities, curvature and cant effects, and dynamic allowances, enabling the estimation of available swept-envelope space for rolling stock and cargo. The first methodological module generates location-specific gauge possibilities and geospatial visualisations, supporting corridor-level diagnostics and revealing spatial concentrations of violations.
To translate technical results into actionable decision support, a bottleneck prioritisation method is applied using a Bottleneck Priority Number (BNPN), combining exceedance magnitude, observation frequency, and transport-demand scenarios. By integrating BNPN rankings with mapped violations, the methodology distinguishes between low-effort adjustments and more extensive redesign measures, supporting phased mitigation planning and the development of a clearance database for proactive and reactive operational use.
A case study demonstrates that the method successfully identifies decisive bottlenecks, prioritisation outcomes, and solution strategies, while highlighting recurring issues related to signage and station environments. Although further case studies are required to confirm statistical robustness, the results indicate strong potential for improving rail network resilience and out-of-gauge freight routing capability.
The Link Between the Rail Wear Rate and Rolling Contact Fatigue
From Bog Data Analysis to Lab Research
Monitoring the states of a control system is important to ensure the behavior of the system is achieving the control objectives. This can be achieved, among others, by using state observers that estimate the states of the systems regularly. First, we present a literature review of observer design methods for distributed parameter systems. In general, the design requires a dimension-reduction approach to implement the observer. From the dimension reduction, the design approaches can be classified into late and early lumping. In the late lumping perspective, model reduction is performed at the end of the observer design. In the early lumping perspective, dimension reduction is applied to the model of the system. We incorporate both approaches in our literature review.
State observer design requires the model of the systems. This thesis also presents a system identification method for distributed-parameter systems. The identification of such systems typically requires spatially dense and regular measurements, followed by selecting sensors that provide significant measurements to the model to reduce the model complexity. However, these requirements may be challenging to fulfill. In case the sensor locations are irregular and sparse in space, we propose the use of lumped-parameter system identification.
For models with a large number of regressors, we propose a method for reducing the number of regressors using a tree representation. The tree is a way to list models with different numbers of regressors. From all possible regressors for the model, the proposed method builds the tree from the simplest models, i.e., models with one regressor. The number of regressors in the models is incrementally increased to one or more models with the best performance. The addition is repeated until the tree contains models with the desired maximum number of regressors.
System identification is typically performed using a complete data set, i.e., for each input sample, there is an associated output sample available. However, there are cases in which some output samples are not recorded in the data set, making the identification data incomplete. This thesis also considers the problem of incomplete data for Takagi-Sugeno (TS) fuzzy system identification using the product space clustering method. This method comprises two steps: fuzzy clustering and rules construction. The first proposed method enables the use of incomplete system identification data to fuzzy c-means clustering algorithm developed for incomplete classification, which yields different estimates for a missing sample. This can be achieved by fusing those different values into a single value. The second proposed method treats missing samples as optimization variables during the identification process. The optimization is repeated until the change of all optimization variables is small.
...
Monitoring the states of a control system is important to ensure the behavior of the system is achieving the control objectives. This can be achieved, among others, by using state observers that estimate the states of the systems regularly. First, we present a literature review of observer design methods for distributed parameter systems. In general, the design requires a dimension-reduction approach to implement the observer. From the dimension reduction, the design approaches can be classified into late and early lumping. In the late lumping perspective, model reduction is performed at the end of the observer design. In the early lumping perspective, dimension reduction is applied to the model of the system. We incorporate both approaches in our literature review.
State observer design requires the model of the systems. This thesis also presents a system identification method for distributed-parameter systems. The identification of such systems typically requires spatially dense and regular measurements, followed by selecting sensors that provide significant measurements to the model to reduce the model complexity. However, these requirements may be challenging to fulfill. In case the sensor locations are irregular and sparse in space, we propose the use of lumped-parameter system identification.
For models with a large number of regressors, we propose a method for reducing the number of regressors using a tree representation. The tree is a way to list models with different numbers of regressors. From all possible regressors for the model, the proposed method builds the tree from the simplest models, i.e., models with one regressor. The number of regressors in the models is incrementally increased to one or more models with the best performance. The addition is repeated until the tree contains models with the desired maximum number of regressors.
System identification is typically performed using a complete data set, i.e., for each input sample, there is an associated output sample available. However, there are cases in which some output samples are not recorded in the data set, making the identification data incomplete. This thesis also considers the problem of incomplete data for Takagi-Sugeno (TS) fuzzy system identification using the product space clustering method. This method comprises two steps: fuzzy clustering and rules construction. The first proposed method enables the use of incomplete system identification data to fuzzy c-means clustering algorithm developed for incomplete classification, which yields different estimates for a missing sample. This can be achieved by fusing those different values into a single value. The second proposed method treats missing samples as optimization variables during the identification process. The optimization is repeated until the change of all optimization variables is small.
The objective of this thesis is to perform a structural analysis of a geometrically non-linear Timoshenko beam using a physics informed neural network. The network is built using a variational principle (the principle of virtual work) and a force residual. Furthermore, two optimization algorithms, the adaptive weight loss algorithm and the adaptive activation function are separately used in conjunction with the model to examine the potential improvements on the convergence rate.
It is found that the geometrically non-linear Timoshenko beam can be accurately modeled (relative error of below 2% with respect to the finite element output) with a physics informed neural network. This accuracy can be achieved with a model possessing a relatively shallow size of four hidden layers containing eight neurons each. The adaptive weight loss algorithm and the adaptive activation algorithm both improve the convergence rate of the model, though they are not necessary to maintain the practicality of the model, as the convergence rate is adequate without these. It is recommended that the hyperbolic tangent function is utilized in conjunction with the Adam optimizer. The adaptive activation function can be incorporated into the model to improve the convergence rate significantly without substantially increasing the computational cost of the model. ...
The objective of this thesis is to perform a structural analysis of a geometrically non-linear Timoshenko beam using a physics informed neural network. The network is built using a variational principle (the principle of virtual work) and a force residual. Furthermore, two optimization algorithms, the adaptive weight loss algorithm and the adaptive activation function are separately used in conjunction with the model to examine the potential improvements on the convergence rate.
It is found that the geometrically non-linear Timoshenko beam can be accurately modeled (relative error of below 2% with respect to the finite element output) with a physics informed neural network. This accuracy can be achieved with a model possessing a relatively shallow size of four hidden layers containing eight neurons each. The adaptive weight loss algorithm and the adaptive activation algorithm both improve the convergence rate of the model, though they are not necessary to maintain the practicality of the model, as the convergence rate is adequate without these. It is recommended that the hyperbolic tangent function is utilized in conjunction with the Adam optimizer. The adaptive activation function can be incorporated into the model to improve the convergence rate significantly without substantially increasing the computational cost of the model.
These PDEs could be leveraged to simulate the underlying scenarios. The dissertation introduces physics-informed machine learning (PIML) based approaches tailored to simulate the dynamics of beam structures. The aim is to incorporate the physical laws in the neural networks training for more accurate and realistic simulations, handle noisy data effectively, and improve prediction accuracy while mitigating challenges such as multiscale problems and generalization. Chapter 1 outlines the primary challenges tackled in the dissertation. Chapters 2 through 5 detail the methodologies developed to address each challenge.
Chapter 2 presents a physics-informed neural network (PINN) based methodology to simulate complex beam systems with real-world mate- rial properties. In addition, inverse problems are solved in the presence of noisy data to predict unknown parameters, including force acting on the beam systems. It is essential to consider the real-world material parameters to simulate the dynamics of the modeled system and ensure the digital model represents the ground truth. However, incorporating material characteristics leads to multiscale PDE coefficients in the physical model, posing difficulty in training for PINNs. Subsequently, a frame- work is proposed to incorporate nondimensional PDEs into the PINN loss function. This approach facilitates efficient forward and inverse simulations while robust to noise and uncertainty in measurement data. The efficacy of this approach is demonstrated through simulations of Euler- Bernoulli and Timoshenko beam systems, contributing to the challenge of simulating large-scale systems with multiple interconnected components.
Chapter 3 investigates beam dynamic simulations on Winkler foundations for large spatiotemporal domains using PIML. Predictions on expansive spatiotemporal domains are vital for structural integrity, design optimization, and control mechanisms. A causality-respecting PINN frame- work is introduced, enhancing prediction accuracy. Furthermore, integrating transfer learning addresses the need to re-train the network for different initial conditions and computational domains. Numerical experiments based on Euler-Bernoulli and Timoshenko theories validate the methodology for respecting the causality and generalizing the beam dynamics across similar problems. The approach efficiently predicts beam dynamics under diverse engineering scenarios, reducing computational costs and improving convergence.
Chapter 4 explores the generalization abilities of PIML, essential for practical applications requiring accurate predictions in unexplored regions. The proposed framework exploits the inherent causality in the PDE solutions by merging PIML models with recurrent neural architectures, namely neural oscillators. The neural ordinary differential equations in the form of neural oscillators effectively handle long-time dependencies and address gradient-related issues, fostering improved generalization in PIML tasks. Benchmark equations like viscous Burgers, Allen-Cahn, Schrödinger, and biharmonic Euler-Bernoulli beam equations are used to demonstrate the effectiveness of the proposed approach. Through ex- tensive experimentation with time-dependent nonlinear PDEs, the study showcases superior performance compared to existing state-of-the-art methods. The proposed method provides accurate solutions for extrapolation and prediction beyond the training data by enhancing the generalization capabilities of PIML, promising advancements in complex system simulations.
Chapter 5 follows up on generalization of beam dynamics beyond PIML- based approaches. Computer-aided simulations are crucial for advancing engineering industries, but existing simulators often struggle to generalize beyond their training domain. The chapter proposes a two-stage methodology to tackle this challenge. Firstly, it utilizes specialized simulators tailored to the application, such as causal PINNs and black-box finite element simulations. Secondly, it integrates predictions from the first stage into a recurrent neural architecture, incorporating ordinary differential equations to capture intrinsic dynamics and enhance generalization. The approach efficiently captures causality and generalizes dynamics across various data sources. Numerical experiments cover fundamental structural engineering scenarios, including real-world catenary contact wire uplift predictions, and demonstrate superior performance compared to conventional methods, and promise for diverse industrial applications. This dissertation concludes with Chapter 6.
In particular, this dissertation introduces PIML methodologies for simulating complex beam structures, addressing key challenges such as incorporating real material properties, handling noisy data, and improving prediction accuracy. Chapter 2 introduces a PINN-based methodology that efficiently simulates beam systems and predicts unknown parameters, mitigating the difficulties posed by multiscale PDE coefficients. Chapter 3 tackles the challenge of large-domain beam dynamics predictions on the Winkler foundations by using causality-respecting PINNs and integrating transfer learning to reduce computational costs. Chapter 4 addresses the challenge of out-of-domain predictions in PIML by introducing neural oscillators. Chapter 5 proposes a two-stage methodology to generalize beam dynamics simulations, integrating beam dynamics solvers and recurrent neural-based architectures, showcasing its efficacy in real-world applications such as catenary contact wire uplift predictions.
...
These PDEs could be leveraged to simulate the underlying scenarios. The dissertation introduces physics-informed machine learning (PIML) based approaches tailored to simulate the dynamics of beam structures. The aim is to incorporate the physical laws in the neural networks training for more accurate and realistic simulations, handle noisy data effectively, and improve prediction accuracy while mitigating challenges such as multiscale problems and generalization. Chapter 1 outlines the primary challenges tackled in the dissertation. Chapters 2 through 5 detail the methodologies developed to address each challenge.
Chapter 2 presents a physics-informed neural network (PINN) based methodology to simulate complex beam systems with real-world mate- rial properties. In addition, inverse problems are solved in the presence of noisy data to predict unknown parameters, including force acting on the beam systems. It is essential to consider the real-world material parameters to simulate the dynamics of the modeled system and ensure the digital model represents the ground truth. However, incorporating material characteristics leads to multiscale PDE coefficients in the physical model, posing difficulty in training for PINNs. Subsequently, a frame- work is proposed to incorporate nondimensional PDEs into the PINN loss function. This approach facilitates efficient forward and inverse simulations while robust to noise and uncertainty in measurement data. The efficacy of this approach is demonstrated through simulations of Euler- Bernoulli and Timoshenko beam systems, contributing to the challenge of simulating large-scale systems with multiple interconnected components.
Chapter 3 investigates beam dynamic simulations on Winkler foundations for large spatiotemporal domains using PIML. Predictions on expansive spatiotemporal domains are vital for structural integrity, design optimization, and control mechanisms. A causality-respecting PINN frame- work is introduced, enhancing prediction accuracy. Furthermore, integrating transfer learning addresses the need to re-train the network for different initial conditions and computational domains. Numerical experiments based on Euler-Bernoulli and Timoshenko theories validate the methodology for respecting the causality and generalizing the beam dynamics across similar problems. The approach efficiently predicts beam dynamics under diverse engineering scenarios, reducing computational costs and improving convergence.
Chapter 4 explores the generalization abilities of PIML, essential for practical applications requiring accurate predictions in unexplored regions. The proposed framework exploits the inherent causality in the PDE solutions by merging PIML models with recurrent neural architectures, namely neural oscillators. The neural ordinary differential equations in the form of neural oscillators effectively handle long-time dependencies and address gradient-related issues, fostering improved generalization in PIML tasks. Benchmark equations like viscous Burgers, Allen-Cahn, Schrödinger, and biharmonic Euler-Bernoulli beam equations are used to demonstrate the effectiveness of the proposed approach. Through ex- tensive experimentation with time-dependent nonlinear PDEs, the study showcases superior performance compared to existing state-of-the-art methods. The proposed method provides accurate solutions for extrapolation and prediction beyond the training data by enhancing the generalization capabilities of PIML, promising advancements in complex system simulations.
Chapter 5 follows up on generalization of beam dynamics beyond PIML- based approaches. Computer-aided simulations are crucial for advancing engineering industries, but existing simulators often struggle to generalize beyond their training domain. The chapter proposes a two-stage methodology to tackle this challenge. Firstly, it utilizes specialized simulators tailored to the application, such as causal PINNs and black-box finite element simulations. Secondly, it integrates predictions from the first stage into a recurrent neural architecture, incorporating ordinary differential equations to capture intrinsic dynamics and enhance generalization. The approach efficiently captures causality and generalizes dynamics across various data sources. Numerical experiments cover fundamental structural engineering scenarios, including real-world catenary contact wire uplift predictions, and demonstrate superior performance compared to conventional methods, and promise for diverse industrial applications. This dissertation concludes with Chapter 6.
In particular, this dissertation introduces PIML methodologies for simulating complex beam structures, addressing key challenges such as incorporating real material properties, handling noisy data, and improving prediction accuracy. Chapter 2 introduces a PINN-based methodology that efficiently simulates beam systems and predicts unknown parameters, mitigating the difficulties posed by multiscale PDE coefficients. Chapter 3 tackles the challenge of large-domain beam dynamics predictions on the Winkler foundations by using causality-respecting PINNs and integrating transfer learning to reduce computational costs. Chapter 4 addresses the challenge of out-of-domain predictions in PIML by introducing neural oscillators. Chapter 5 proposes a two-stage methodology to generalize beam dynamics simulations, integrating beam dynamics solvers and recurrent neural-based architectures, showcasing its efficacy in real-world applications such as catenary contact wire uplift predictions.
Assessing The Resilience of Railway Organisation For Unexpected External Events
Development of a Semi-Qualitative Assessment Tool
This dissertation develops a new technology based on train-borne LDV for measuring the vibration and load-response relationship of railway tracks over a wide frequency range…
...
This dissertation develops a new technology based on train-borne LDV for measuring the vibration and load-response relationship of railway tracks over a wide frequency range…
The second problem is that the wagon list is not 100% correct due to human errors. Random checks show that the wagon list does not always turn out to be correct and that mistakes are made in creating the list. Therefore, smart camera techniques have been developed to detect which wagon contains which dangerous goods on arrival to confirm the wagon list. This opens up new possibilities concerning the sorting strategy of wagons from the arrival track onto the classification tracks in accordance with their destination.
Research is done to find out what this new sorting strategy should look like. Therefore, the following research question is formulated: ‘What kind of method can distribute wagons, including wagons carrying dangerous goods, on a rail yard, considering safety?’. In order to answer the research question, qualitative research was carried out into the various existing strategies and methods that could be applied. A model is created in which wagons are sorted over classification tracks. The model chooses the track on which the wagons should be placed based on the destination. Specific attention is given to wagons with dangerous goods, for which the model ensures that a specific distance is kept from other wagons on the (neighboring) classification tracks. The model then determines the most appropriate departure time, and the train departs. Different scenarios are tested that the model has to deal with. A distinction is made here between whether the number of classification tracks is equal or not equal to the number of destinations, and whether the rate of dangerous goods over the destinations is equal or not. This makes a total of four scenarios that have been tested.
The effect of using a buffer on the track is examined so that a wagon carrying dangerous goods never stands next to another wagon carrying dangerous goods of a different class if they are not allowed to stand together. This is tested in the model, and the output showed the placement of the wagons and the processing times.
...
The second problem is that the wagon list is not 100% correct due to human errors. Random checks show that the wagon list does not always turn out to be correct and that mistakes are made in creating the list. Therefore, smart camera techniques have been developed to detect which wagon contains which dangerous goods on arrival to confirm the wagon list. This opens up new possibilities concerning the sorting strategy of wagons from the arrival track onto the classification tracks in accordance with their destination.
Research is done to find out what this new sorting strategy should look like. Therefore, the following research question is formulated: ‘What kind of method can distribute wagons, including wagons carrying dangerous goods, on a rail yard, considering safety?’. In order to answer the research question, qualitative research was carried out into the various existing strategies and methods that could be applied. A model is created in which wagons are sorted over classification tracks. The model chooses the track on which the wagons should be placed based on the destination. Specific attention is given to wagons with dangerous goods, for which the model ensures that a specific distance is kept from other wagons on the (neighboring) classification tracks. The model then determines the most appropriate departure time, and the train departs. Different scenarios are tested that the model has to deal with. A distinction is made here between whether the number of classification tracks is equal or not equal to the number of destinations, and whether the rate of dangerous goods over the destinations is equal or not. This makes a total of four scenarios that have been tested.
The effect of using a buffer on the track is examined so that a wagon carrying dangerous goods never stands next to another wagon carrying dangerous goods of a different class if they are not allowed to stand together. This is tested in the model, and the output showed the placement of the wagons and the processing times.
In this report, a moving horizon optimization approach is presented as a conceptual model to improve the efficiency of maintenance of a road network, compared to the currently used maintenance approach.
...
In this report, a moving horizon optimization approach is presented as a conceptual model to improve the efficiency of maintenance of a road network, compared to the currently used maintenance approach.
In the present research, 2 machine learning methods are implemented to classify 3D point cloud data of railway environments into 7 categories of rails, sleepers, track bed, masts, overhead wires, trees and other. To this end, the ensemble method of Random Forest, and the deep learning method of DGCNN (Dynamic Graph Convolutional Neural Network) are implemented. While Random Forest is a simple and handy ensemble algorithm used for a wide range of applications, DGCNN is a deep learning method based on PointNet, the pioneer method on raw point clouds, and graph CNNs. The methods are validated under 3 case studies, produced by structure from motion photogrammetry, airborne laser scanning and terrestrial laser scanning, and locating in 2 different areas within the Netherlands. For each method, 2 scenarios are developed for classifying colored and uncolored point clouds, respectively. Finally, the 2 methods are combined for the first scenario, as a first attempt to further improve the final results.
The obtained results show that, in this work, DGCNN performs better than Random Forest, and both methods approximate state-of-the-art performance. The contribution of colors is important to improve both the overall accuracy of the models, as well as the classification results of the individual classes. More specifically, Random Forest scenario 1, Random Forest scenario 2, DGCNN scenario 1, DGCNN scenario 2, and the combination of Random Forest and DGCNN scenario 1 result in an overall accuracy of 88.65%, 83.39%, 89.19%, 88.20% and 90.57%, respectively. The corresponding per class F1-scores for all methods and scenarios range between 43% and 94%. Both methods meet difficulties in generalizing on data from different sensor systems and different areas with very different point density and missing data. Nonetheless, the individual methods are already very promising, while they are able to achieve the required accuracy of more than 90% when combined. ...
In the present research, 2 machine learning methods are implemented to classify 3D point cloud data of railway environments into 7 categories of rails, sleepers, track bed, masts, overhead wires, trees and other. To this end, the ensemble method of Random Forest, and the deep learning method of DGCNN (Dynamic Graph Convolutional Neural Network) are implemented. While Random Forest is a simple and handy ensemble algorithm used for a wide range of applications, DGCNN is a deep learning method based on PointNet, the pioneer method on raw point clouds, and graph CNNs. The methods are validated under 3 case studies, produced by structure from motion photogrammetry, airborne laser scanning and terrestrial laser scanning, and locating in 2 different areas within the Netherlands. For each method, 2 scenarios are developed for classifying colored and uncolored point clouds, respectively. Finally, the 2 methods are combined for the first scenario, as a first attempt to further improve the final results.
The obtained results show that, in this work, DGCNN performs better than Random Forest, and both methods approximate state-of-the-art performance. The contribution of colors is important to improve both the overall accuracy of the models, as well as the classification results of the individual classes. More specifically, Random Forest scenario 1, Random Forest scenario 2, DGCNN scenario 1, DGCNN scenario 2, and the combination of Random Forest and DGCNN scenario 1 result in an overall accuracy of 88.65%, 83.39%, 89.19%, 88.20% and 90.57%, respectively. The corresponding per class F1-scores for all methods and scenarios range between 43% and 94%. Both methods meet difficulties in generalizing on data from different sensor systems and different areas with very different point density and missing data. Nonetheless, the individual methods are already very promising, while they are able to achieve the required accuracy of more than 90% when combined.
First, a literature review was conducted to assess knowledge on this issue and analyse methods to model train-track systems. Next, based on the literature review, parameters were identified which could influence the rail wear process, e.g. tram velocity, primary yaw stiffness or rail hardness. An evaluation of available data on those parameters at GVB was made. Data that was available and deemed essential was aligned and later used as input for a rail wear prediction model for the tramway of Amsterdam. The essential data analyses are the velocity analysis and wheel qR decay analysis. A tool was developed to obtain a detailed profile of driven velocities at the tram network, based on massive daily data from all the trams. Wheel measurement data from all the trams in the network were analysed to assess the wear of the tram wheels. Statistics were obtained over six years of wheel measurement data measured twice per year per tram. Besides, the usage data processed for inclusion in the model are tonnage, vehicle type distribution, amount of coupled vehicles and average vehicle loading.
Furthermore, more than six hundred simulations were performed at DEKRA rail with GVB tram models in VAMPIRE, to obtain more insight into tram curving behaviour. Based on the simulation outcomes, relationships between energy dissipation (\(T_{\gamma}\)) and radius, velocity, flange angle, vehicle loading and -type were derived. Those relationships were obtained per wheel and aggregated per tram passage. Finally, a rail wear model was made, which combines the relationships derived from the simulations, track characteristics and usage data about the curves. The energy dissipation (\(T_{\gamma}\)) was used as a wear indicator by the prediction model.
From the velocity analysis, trams generally keep to the speed limits, but trams drive too fast at specific curves. The wheel analysis showed qR at the tram's front wheels decays considerably faster than other wheels. Combino tram type's wheel decay was poorer than the older BN tram type. From the simulations, wheel wear has the most adverse effect on rail wear in curves. Increased vehicle velocity or amount of passengers also has a considerable negative influence on rail wear. The expected wear rises exponentially if the curve radius decreases. Also, the dependency on velocity increases exponentially when the curve radius decreases…
...
First, a literature review was conducted to assess knowledge on this issue and analyse methods to model train-track systems. Next, based on the literature review, parameters were identified which could influence the rail wear process, e.g. tram velocity, primary yaw stiffness or rail hardness. An evaluation of available data on those parameters at GVB was made. Data that was available and deemed essential was aligned and later used as input for a rail wear prediction model for the tramway of Amsterdam. The essential data analyses are the velocity analysis and wheel qR decay analysis. A tool was developed to obtain a detailed profile of driven velocities at the tram network, based on massive daily data from all the trams. Wheel measurement data from all the trams in the network were analysed to assess the wear of the tram wheels. Statistics were obtained over six years of wheel measurement data measured twice per year per tram. Besides, the usage data processed for inclusion in the model are tonnage, vehicle type distribution, amount of coupled vehicles and average vehicle loading.
Furthermore, more than six hundred simulations were performed at DEKRA rail with GVB tram models in VAMPIRE, to obtain more insight into tram curving behaviour. Based on the simulation outcomes, relationships between energy dissipation (\(T_{\gamma}\)) and radius, velocity, flange angle, vehicle loading and -type were derived. Those relationships were obtained per wheel and aggregated per tram passage. Finally, a rail wear model was made, which combines the relationships derived from the simulations, track characteristics and usage data about the curves. The energy dissipation (\(T_{\gamma}\)) was used as a wear indicator by the prediction model.
From the velocity analysis, trams generally keep to the speed limits, but trams drive too fast at specific curves. The wheel analysis showed qR at the tram's front wheels decays considerably faster than other wheels. Combino tram type's wheel decay was poorer than the older BN tram type. From the simulations, wheel wear has the most adverse effect on rail wear in curves. Increased vehicle velocity or amount of passengers also has a considerable negative influence on rail wear. The expected wear rises exponentially if the curve radius decreases. Also, the dependency on velocity increases exponentially when the curve radius decreases…
Classification of Damages on Aircraft Inspection Images Using Convolutional Neural Networks
Kick-starting a Deep Learning project with limited data
feasibility of using a deep Convolutional Neural Network (CNN) for (semi-)automated damage detection, with the goal of achieving a high recall (low False Negative Rate (FNR)) on the small damages. The problem is framed as a classification problem on limited and imbalanced data. However, it is not pre-defined if single-label or multi-label classification should be used,
and both approaches are investigated.
The main contribution of this work is to show experimentally how common deep CNN architectures and Deep Learning practices can be used to train classifiers that recognize damages in a specialized domain, with application-specific metrics. We present methods for synthesizing, pre-processing and re-sampling of the necessary dataset. It is shown that pre-trained, parameter-efficient CNN architectures that implement skip-connections, complemented by
global max-pooling before the final layer, are well suited for that dataset. The Xception architecture has been chosen as backbone for the classifier due to its high recall and fast convergence. To mitigate the detrimental influence of imbalanced training data, training data re-sampling that equalizes the class distribution is implemented. It has a positive effect recall, especially
when applied to multi-label classification. When using re-sampling and data augmentation, the performance of multi-label and single-label classification can be brought to the same level. However, the best achieved FNR is 5.4%, with a softmax classifier, combining all regularization methods.
Finally, we investigated how regularization can be used to increase generalization capability with limited training data. Data augmentation is the most effective regularization method, even though its full potential has not been explored yet. Dropout benefits single-label classification
but not multi-label classification. L2-regularization has a moderate positive effect on both. Naively combining the regularization techniques without an exhaustive grid search or automated search on average does not yield any additional gains and shows the limit of manual hyper-parameter tuning. ...
feasibility of using a deep Convolutional Neural Network (CNN) for (semi-)automated damage detection, with the goal of achieving a high recall (low False Negative Rate (FNR)) on the small damages. The problem is framed as a classification problem on limited and imbalanced data. However, it is not pre-defined if single-label or multi-label classification should be used,
and both approaches are investigated.
The main contribution of this work is to show experimentally how common deep CNN architectures and Deep Learning practices can be used to train classifiers that recognize damages in a specialized domain, with application-specific metrics. We present methods for synthesizing, pre-processing and re-sampling of the necessary dataset. It is shown that pre-trained, parameter-efficient CNN architectures that implement skip-connections, complemented by
global max-pooling before the final layer, are well suited for that dataset. The Xception architecture has been chosen as backbone for the classifier due to its high recall and fast convergence. To mitigate the detrimental influence of imbalanced training data, training data re-sampling that equalizes the class distribution is implemented. It has a positive effect recall, especially
when applied to multi-label classification. When using re-sampling and data augmentation, the performance of multi-label and single-label classification can be brought to the same level. However, the best achieved FNR is 5.4%, with a softmax classifier, combining all regularization methods.
Finally, we investigated how regularization can be used to increase generalization capability with limited training data. Data augmentation is the most effective regularization method, even though its full potential has not been explored yet. Dropout benefits single-label classification
but not multi-label classification. L2-regularization has a moderate positive effect on both. Naively combining the regularization techniques without an exhaustive grid search or automated search on average does not yield any additional gains and shows the limit of manual hyper-parameter tuning.
To reduce the life cycle cost and failure rate of catenary in practice, planned and predictive maintenance is desired based on the condition monitoring of catenary. However, the monitoring data are underutilized to effectively assess the catenary condition and facilitate maintenance decision-making. This dissertation contributes in improving the dynamic condition assessment of catenary using the data from condition monitoring. New performance indicators (PIs) of catenary are defined in a way that is adaptive to the variations of monitoring data measured under different circumstances, such as the changes of catenary structure, pantograph type and train speed. The relationship between the monitoring data and the contact wire irregularities is studied using historical data and simulations. Data-based approaches are developed for the quantitative assessment of dynamic catenary condition.
First, an intrinsic wavelength contained in the pantograph-catenary contact force is identified and defined as the catenary structure wavelength (CSW). It is caused by the periodic variations of contact wire stiffness attributed to the cyclic structure of catenary that must regulate the height of contact wire in every spans and interdropper distances. An approach that adaptively extracts the CSWs of pantograph-catenary contact force is proposed based on the empirical mode decomposition algorithm. It extracts the CSW signals corresponding to the span lengths and interdropper distances, respectively, summing to form a characteristic signal of CSWs. The residual signal of the contact force excluding the CSWs is regarded as the non-CSW signal. The mean and standard deviation of the CSWs signal are used as PIs to indicate the condition of the main catenary geometric parameters. A PI based on the quadratic time-frequency representation of the non-CSW signal is proposed for detecting and localizing the local irregularities of contact wire. The proposed PIs are tested by simulation and measurement data and proven effective and adaptive owning to the use of CSWs and non-CSW signal.
Second, the concept of CSW is expanded to the pantograph head acceleration from which the CSWs and non-CSW signal can also be extracted using the same approach developed for the contact force. Considering the characteristics of pantograph head acceleration, the wavelet packet entropy of the CSWs and non-CSW signal is proposed as PIs for detecting contact wire irregularities with different lengths. The entropy of CSWs is used for detecting irregularities with a length longer than 5 m, while the entropy of non-CSW signal is for the short-length local irregularities. An approach to detect and verify contact wire irregularities using the measurement data of pantograph head vertical acceleration from frequent inspections is proposed. The approach is tested using historical inspection data from which irregularities at all lengths are detected and verified. Maintenance resources can thus be specifically allocated to verified detection results to save cost and time.
Third, through analyzing historical inspection data and data-based simulation results, it is found that while the contact wire irregularity deteriorates the pantograph-catenary interaction, the formation of irregularity is also associated with the effects of the interaction like variations of contact and friction forces. Concretely, the contact wire height irregularity with an amplitude of 8 mm can cause considerable increase in the standard deviation of pantograph-catenary contact force. In addition, the irregularity with a certain wavelength can induce the dynamic response with the same wavelength in the contact force. This in turn makes the irregularity part deteriorating faster than the other parts of catenary. At a smaller scale, when the wear irregularity of contact wire has an average wire thickness loss of about 1.5 mm, it can also increase the standard deviation of contact force by more than 5%. Due to the fixing effect at the registration arms and droppers, the wear irregularity commonly contains structural wavelengths of catenary including span lengths and interdropper distances. It is also found that the wear irregularity tends to grow and spread toward in the common or dominant running direction of trains in the specific line. Nevertheless, an existing defect may not affect every pantograph passage and every type of data measured. It is thus advised to measure multiple types of data and perform more frequent inspections to avoid undetected defects.
Last, a data-driven approach using the Bayesian network (BN) to fuse the available inspection data of catenary into an integrated PI is proposed. The BN topology is first structured based on the physical relations between five data types including the train speed, dynamic stagger and height of contact wire, pantograph head acceleration, and pantograph-catenary contact force. Then, tailored PIs are individually defined and extracted from the five types of data as the BN input. As the output of BN, an integrated PI is defined as the overall condition level of catenary considering all defects that can be reflected by the five types of data. Finally, using historical inspections data and maintenance records from a section of high-speed line, the BN parameters are estimated to establish a probabilistic relationship between the input and the output PI. By testing the BN-based approach using new inspection data from the same railway line, it is shown that the integrated PI can adequately represent the catenary condition, leading to considerable reduction in the false alarm rate of catenary defect detection compared with the current practice. The approach can also work acceptably with noisy or partly missing data.
In summary, this dissertation answers how to adequately transform the condition monitoring data of catenary into quantitative assessments of the dynamic catenary condition. The proposed approaches are intended for generic implementations in railway catenaries worldwide. ...
To reduce the life cycle cost and failure rate of catenary in practice, planned and predictive maintenance is desired based on the condition monitoring of catenary. However, the monitoring data are underutilized to effectively assess the catenary condition and facilitate maintenance decision-making. This dissertation contributes in improving the dynamic condition assessment of catenary using the data from condition monitoring. New performance indicators (PIs) of catenary are defined in a way that is adaptive to the variations of monitoring data measured under different circumstances, such as the changes of catenary structure, pantograph type and train speed. The relationship between the monitoring data and the contact wire irregularities is studied using historical data and simulations. Data-based approaches are developed for the quantitative assessment of dynamic catenary condition.
First, an intrinsic wavelength contained in the pantograph-catenary contact force is identified and defined as the catenary structure wavelength (CSW). It is caused by the periodic variations of contact wire stiffness attributed to the cyclic structure of catenary that must regulate the height of contact wire in every spans and interdropper distances. An approach that adaptively extracts the CSWs of pantograph-catenary contact force is proposed based on the empirical mode decomposition algorithm. It extracts the CSW signals corresponding to the span lengths and interdropper distances, respectively, summing to form a characteristic signal of CSWs. The residual signal of the contact force excluding the CSWs is regarded as the non-CSW signal. The mean and standard deviation of the CSWs signal are used as PIs to indicate the condition of the main catenary geometric parameters. A PI based on the quadratic time-frequency representation of the non-CSW signal is proposed for detecting and localizing the local irregularities of contact wire. The proposed PIs are tested by simulation and measurement data and proven effective and adaptive owning to the use of CSWs and non-CSW signal.
Second, the concept of CSW is expanded to the pantograph head acceleration from which the CSWs and non-CSW signal can also be extracted using the same approach developed for the contact force. Considering the characteristics of pantograph head acceleration, the wavelet packet entropy of the CSWs and non-CSW signal is proposed as PIs for detecting contact wire irregularities with different lengths. The entropy of CSWs is used for detecting irregularities with a length longer than 5 m, while the entropy of non-CSW signal is for the short-length local irregularities. An approach to detect and verify contact wire irregularities using the measurement data of pantograph head vertical acceleration from frequent inspections is proposed. The approach is tested using historical inspection data from which irregularities at all lengths are detected and verified. Maintenance resources can thus be specifically allocated to verified detection results to save cost and time.
Third, through analyzing historical inspection data and data-based simulation results, it is found that while the contact wire irregularity deteriorates the pantograph-catenary interaction, the formation of irregularity is also associated with the effects of the interaction like variations of contact and friction forces. Concretely, the contact wire height irregularity with an amplitude of 8 mm can cause considerable increase in the standard deviation of pantograph-catenary contact force. In addition, the irregularity with a certain wavelength can induce the dynamic response with the same wavelength in the contact force. This in turn makes the irregularity part deteriorating faster than the other parts of catenary. At a smaller scale, when the wear irregularity of contact wire has an average wire thickness loss of about 1.5 mm, it can also increase the standard deviation of contact force by more than 5%. Due to the fixing effect at the registration arms and droppers, the wear irregularity commonly contains structural wavelengths of catenary including span lengths and interdropper distances. It is also found that the wear irregularity tends to grow and spread toward in the common or dominant running direction of trains in the specific line. Nevertheless, an existing defect may not affect every pantograph passage and every type of data measured. It is thus advised to measure multiple types of data and perform more frequent inspections to avoid undetected defects.
Last, a data-driven approach using the Bayesian network (BN) to fuse the available inspection data of catenary into an integrated PI is proposed. The BN topology is first structured based on the physical relations between five data types including the train speed, dynamic stagger and height of contact wire, pantograph head acceleration, and pantograph-catenary contact force. Then, tailored PIs are individually defined and extracted from the five types of data as the BN input. As the output of BN, an integrated PI is defined as the overall condition level of catenary considering all defects that can be reflected by the five types of data. Finally, using historical inspections data and maintenance records from a section of high-speed line, the BN parameters are estimated to establish a probabilistic relationship between the input and the output PI. By testing the BN-based approach using new inspection data from the same railway line, it is shown that the integrated PI can adequately represent the catenary condition, leading to considerable reduction in the false alarm rate of catenary defect detection compared with the current practice. The approach can also work acceptably with noisy or partly missing data.
In summary, this dissertation answers how to adequately transform the condition monitoring data of catenary into quantitative assessments of the dynamic catenary condition. The proposed approaches are intended for generic implementations in railway catenaries worldwide.
Simultaneous Multi-Robot Task Scheduling and Path Planning
An integrated approach to task scheduling and path planning for mobile robots in production environments
In scientific literature, path planning and task scheduling are treated as separate problems. Typically, path planners use initial and goal positions as a given and task schedulers consider tasks of fixed duration. In this thesis, the integration of task scheduling and path planning is treated, resulting in three contributions. First, a novel cost function to minimise the duration of paths is proposed for an existing Mixed-Integer Linear Programming (MILP) formulation of the multi-robot path planning problem. Second, a new MILP formulation is developed for the scheduling problem of tasks with deadlines to be completed by mobile robots. Third, two methods for integrating the path planner and task scheduler are proposed. Through simulations these methods are compared with an integration method that uses rule-based scheduling, which assigns tasks to robots in a predefined order.
Simulation results show that the novel path planning objective function improves upon that from the literature by striking a balance between duration, distance, energy use and computation time. Furthermore, integration methods using the MILP scheduler have trouble outperforming a rule-based approach, caused by a practical limit on the computation time. Moreover, compared with single-robot path planning the use of multi-robot path planning is shown to improve performance at the cost of higher computation time. However, this improvement is negligible for environments with a lot of free space, where robots can easily avoid each other without making detours.
These results lead to the conclusion that the MILP path planner with the new path planning objective function performs well when integrated with a task scheduler. For restrictive physical environments, multi-robot path planning can further improve results. On the other hand, for practical applications a scheduling method other than the MILP scheduler should be employed to integrate task scheduling and path planning. The scheduling method should at least outperform rule-based scheduling with a practical limit on computation time, even if an optimal solution is not guaranteed. ...
In scientific literature, path planning and task scheduling are treated as separate problems. Typically, path planners use initial and goal positions as a given and task schedulers consider tasks of fixed duration. In this thesis, the integration of task scheduling and path planning is treated, resulting in three contributions. First, a novel cost function to minimise the duration of paths is proposed for an existing Mixed-Integer Linear Programming (MILP) formulation of the multi-robot path planning problem. Second, a new MILP formulation is developed for the scheduling problem of tasks with deadlines to be completed by mobile robots. Third, two methods for integrating the path planner and task scheduler are proposed. Through simulations these methods are compared with an integration method that uses rule-based scheduling, which assigns tasks to robots in a predefined order.
Simulation results show that the novel path planning objective function improves upon that from the literature by striking a balance between duration, distance, energy use and computation time. Furthermore, integration methods using the MILP scheduler have trouble outperforming a rule-based approach, caused by a practical limit on the computation time. Moreover, compared with single-robot path planning the use of multi-robot path planning is shown to improve performance at the cost of higher computation time. However, this improvement is negligible for environments with a lot of free space, where robots can easily avoid each other without making detours.
These results lead to the conclusion that the MILP path planner with the new path planning objective function performs well when integrated with a task scheduler. For restrictive physical environments, multi-robot path planning can further improve results. On the other hand, for practical applications a scheduling method other than the MILP scheduler should be employed to integrate task scheduling and path planning. The scheduling method should at least outperform rule-based scheduling with a practical limit on computation time, even if an optimal solution is not guaranteed.
However, a railway network typically consists of multiple track sections, each of them with different degradation level and parameters. Hence, the optimization of track maintenance can be considered as a large-scale problem which has a large number of decision variables. For such kind of problem, the conventional centralized optimization is very difficult or even not tractable to solve due to limitations on the computational time and resources. One way to overcome this issue is by applying the so-called distributed optimization scheme. In such approach, the original optimization problem is partitioned into multiple smaller, tractable subproblems. Therefore, the optimization is tractable and more preferred for real-life implementations.
This thesis develops distributed optimization approaches for track maintenance operations planning problem. Three different schemes are compared: Parallel Augmented Lagrangian Relaxation (PALR), Alternating Direction Method of Multipliers (ADMM), and Distributed Robust Safe But Knowledgeable (DRSBK). As these distributed approaches basically designed for convex problems, extension techniques to handle non-convex nature of the proposed optimization problem are implemented. Furthermore, some case studies are defined to evaluate the algorithms from both performance and numerical perspectives. In simulations of small, medium, and large-scale instances, it is shown that in most cases, DRSBK can outperform the other distributed approaches, by providing the closest-to-optimum solution to the centralized optimization problem while having the shortest computation time.
...
However, a railway network typically consists of multiple track sections, each of them with different degradation level and parameters. Hence, the optimization of track maintenance can be considered as a large-scale problem which has a large number of decision variables. For such kind of problem, the conventional centralized optimization is very difficult or even not tractable to solve due to limitations on the computational time and resources. One way to overcome this issue is by applying the so-called distributed optimization scheme. In such approach, the original optimization problem is partitioned into multiple smaller, tractable subproblems. Therefore, the optimization is tractable and more preferred for real-life implementations.
This thesis develops distributed optimization approaches for track maintenance operations planning problem. Three different schemes are compared: Parallel Augmented Lagrangian Relaxation (PALR), Alternating Direction Method of Multipliers (ADMM), and Distributed Robust Safe But Knowledgeable (DRSBK). As these distributed approaches basically designed for convex problems, extension techniques to handle non-convex nature of the proposed optimization problem are implemented. Furthermore, some case studies are defined to evaluate the algorithms from both performance and numerical perspectives. In simulations of small, medium, and large-scale instances, it is shown that in most cases, DRSBK can outperform the other distributed approaches, by providing the closest-to-optimum solution to the centralized optimization problem while having the shortest computation time.
In this thesis the optimization algorithms will be investigated with respect to the reduction of the cost function as well as the computation time needed compute the (sub)optimal solution. The cost function can consist of the Total Time Spent (TTS) by the vehicles inside the network, the Total Emissions (TE) the vehicles exhaust while inside the network, or a combination of both.
We make use of Matlab for implementation of the models and the optimization algorithms and SUMO as a traffic simulator, which will be used to simulate a real-time traffic network.
The traffic flow model and the emission model are both non-smooth and non-convex, which in general would require the use of a global optimization algorithm together with multiple starting points. We also use a smoothening function on the models to work with a local optimization algorithm that works with derivatives of the cost function and both models. By using multiple starting points for this method, we hope to obtain similar results. The optimization algorithms that are implemented and investigated in this thesis are: the Genetic Algorithm (GA), the Simulated Annealing (SA) algorithm, the Pattern Search (PS) algorithm, and the Resilient backPROPagation (RPROP) algorithm.
To compare the results of the four optimization algorithms, four different scenarios are considered. The final results show that all control methods perform better than the FT controller. Overall the GA algorithm performs best without using a multi-start approach, with PS and SA having similar results. The RPROP method is either close to the other methods (0-2%) or quite far off (10-15%), depending on the scenario. When looking at computation time, PS is the fastest. It is twice as fast compared to the GA in most scenarios. SA however takes around fifteen times the amount of computation time compared to the GA. RPROP has varying results again compared to the GA algorithm. A better cost function for the TTS as well as a more optimized algorithm for the RPROP method can resolve both of these issues but future work might need to prove this. ...
In this thesis the optimization algorithms will be investigated with respect to the reduction of the cost function as well as the computation time needed compute the (sub)optimal solution. The cost function can consist of the Total Time Spent (TTS) by the vehicles inside the network, the Total Emissions (TE) the vehicles exhaust while inside the network, or a combination of both.
We make use of Matlab for implementation of the models and the optimization algorithms and SUMO as a traffic simulator, which will be used to simulate a real-time traffic network.
The traffic flow model and the emission model are both non-smooth and non-convex, which in general would require the use of a global optimization algorithm together with multiple starting points. We also use a smoothening function on the models to work with a local optimization algorithm that works with derivatives of the cost function and both models. By using multiple starting points for this method, we hope to obtain similar results. The optimization algorithms that are implemented and investigated in this thesis are: the Genetic Algorithm (GA), the Simulated Annealing (SA) algorithm, the Pattern Search (PS) algorithm, and the Resilient backPROPagation (RPROP) algorithm.
To compare the results of the four optimization algorithms, four different scenarios are considered. The final results show that all control methods perform better than the FT controller. Overall the GA algorithm performs best without using a multi-start approach, with PS and SA having similar results. The RPROP method is either close to the other methods (0-2%) or quite far off (10-15%), depending on the scenario. When looking at computation time, PS is the fastest. It is twice as fast compared to the GA in most scenarios. SA however takes around fifteen times the amount of computation time compared to the GA. RPROP has varying results again compared to the GA algorithm. A better cost function for the TTS as well as a more optimized algorithm for the RPROP method can resolve both of these issues but future work might need to prove this.