Y Chen | TU Delft Repository

Building segmentation from airborne vhr images using mask r-cnn

Journal article (2019) - K. Zhou, Y. Chen, I. Smal, R. Lindenbergh

Up-to-date 3D building models are important for many applications. Airborne very high resolution (VHR) images often acquired annually give an opportunity to create an up-to-date 3D model. Building segmentation is often the first and utmost step. Convolutional neural networks (CNNs) draw lots of attention in interpreting VHR images as they can learn very effective features for very complex scenes. This paper employs Mask R-CNN to address two problems in building segmentation: detecting different scales of building and segmenting buildings to have accurately segmented edges. Mask R-CNN starts from feature pyramid network (FPN) to create different scales of semantically rich features. FPN is integrated with region proposal network (RPN) to generate objects with various scales with the corresponding optimal scale of features. The features with high and low levels of information are further used for better object classification of small objects and for mask prediction of edges. The method is tested on ISPRS benchmark dataset by comparing results with the fully convolutional networks (FCN), which merge high and low level features by a skip-layer to create a single feature for semantic segmentation. The results show that Mask R-CNN outperforms FCN with around 15% in detecting objects, especially in detecting small objects. Moreover, Mask R-CNN has much better results in edge region than FCN. The results also show that choosing the range of anchor scales in Mask R-CNN is a critical factor in segmenting different scale of objects. This paper provides an insight into how a good anchor scale for different dataset should be chosen. ...

Surface multiple leakage extraction using local primary- and-multiple orthogonalization

Conference paper (2019) - D. Zhang, E. Verschuur, Y. Chen

Surface-related multiple elimination (SRME) is a solid and effective approach for primary estimation. However, due to the imperfections in data and method (e.g. coarsely-sampled dataset and balancing effect of adaptive subtraction) multiple energy leakage is commonly seen in the results of SRME-predicted primaries. Assuming that the primaries and multiples do not correlate locally in the time-space domain, we are able to extract the leaked multiples from the initially estimated primaries using local primary-and-multiple orthogonalization. The proposed framework consists of two steps: an initial primary/multiple estimation step and a multiple-leakage extraction step. The initial step corresponds to SRME, which produces the initial estimated primary and multiple models. The second step is based on local primary-and-multiple orthogonalization to retrieve the leaked multiples, which can be seen as a remedy for correcting the initial estimated primary and multiple models. Thus, we can obtain a better primary output which has much less leaked multiple energy. We demonstrate a good performance of our proposed framework on both synthetic and field data, where it repairs the leakage of standard adaptive subtraction. ...

Building classification of VHR airborne stereo images using fully convolutional networks and free training samples

Journal article (2018) - Y. Chen, W. Gao, E. Widyaningrum, M. Zheng, Kaixuan Zhou

Semantic segmentation, especially for buildings, from the very high resolution (VHR) airborne images is an important task in urban mapping applications. Nowadays, the deep learning has significantly improved and applied in computer vision applications. Fully Convolutional Networks (FCN) is one of the tops voted method due to their good performance and high computational efficiency. However, the state-of-art results of deep nets depend on the training on large-scale benchmark datasets. Unfortunately, the benchmarks of VHR images are limited and have less generalization capability to another area of interest. As existing high precision base maps are easily available and objects are not changed dramatically in an urban area, the map information can be used to label images for training samples. Apart from object changes between maps and images due to time differences, the maps often cannot perfectly match with images. In this study, the main mislabeling sources are considered and addressed by utilizing stereo images, such as relief displacement, different representation between the base map and the image, and occlusion areas in the image. These free training samples are then fed to a pre-trained FCN. To find the better result, we applied fine-tuning with different learning rates and freezing different layers. We further improved the results by introducing atrous convolution. By using free training samples, we achieve a promising building classification with 85.6% overall accuracy and 83.77% F1 score, while the result from ISPRS benchmark by using manual labels has 92.02% overall accuracy and 84.06% F1 score, due to the building complexities in our study area. ...

Full waveform inversion using an automatic directional total variation constraint

Conference paper (2017) - S. Qu, D. J. Verschuur, Y. Chen

As full waveform inversion (FWI) is a non-unique and typically ill-posed inversion problem, it needs proper regularization. To optimally include geologic information, we propose full waveform inversion with directional total variation as an L1-norm regularization. We design the directional total variation (TV) operator based on the local dip field, derived from the seismic image, instead of ignoring the local structural direction and only using horizontal/vertical gradients in traditional TV. With a synthetic example with complex structures based on the Marmousi model, we demonstrate that the proposed method is much more effective compared to both FWI without regularization and FWI with conventional TV regularization. ...

Panta Rhei 2013-2015

Global perspectives on hydrology, society and change

Journal article (2016) - H McMillan, A Montanari, G Di Baldassarre, Y. Huang, D Mazvimavi, M Rogger, B Sivakumar, Tatiana Bibikova, A Castellarin, Y Chen, DAvid Finger, A Gelfan, C Cudennec, David M. Hannah, AY Hoekstra, H Li, S Maskey, T Mathevet, Ana Mijic, Adrian Pedrozo Acuna, M.J. Polo, Victor Rosales, Paul Smith, Huub Savenije, A Viglione, V Srinivasan, E Toth, Ronald van Nooijen, Jun Xia, Heidi Kreibich, T Krueger, J Liu, Alfonso Mejia, A van Loon, H. Aksoy

In 2013, the International Association of Hydrological Sciences (lAHS) launched the hydrological decade 2013-2022 with the theme "Panta Rhei; Change in Hydrology and Society". The decade recognizes the urgency of hydrological research to understand and predict the interactions of society and water, to support sustainable water resource use under changing climatic and environmental conditions. This paper reports on the first Panta Rhei biennium 2013-2015, providing a comprehensive resource that describes the scope and direction of Panta Rhei. We bring together the knowledge of all the Panta Rhei working groups, to summarize the most pressing research questions and how the hydrological community is progressing towards those goals. We draw out interconnections between different strands of research, and reflect on the need to take a global view on hydrology in the current era of human impacts and environmental change. Finally, we look back to the six driving science questions identified at the outset of Panta Rhei, to quantify progress towards those aims. ...

Rope Caging and Grasping

Conference paper (2016) - T.H. Kwok, W. Wan, J. Pan, Charlie Wang, J. Yuan, K Harada, Y. Chen

We present a novel method for caging grasps in this paper by stretching ropes on the surface of a 3D object. Both topology and shape of a model to be grasped has been
considered in our approach. Our algorithm can guarantee generating local minimal rings on every topological branches of a given model with the help of a Reeb graph. Cages and
grasps can then be computed from these rings, and physical experimental tests have been conducted to verify the robustness of our approach.
...

Breakdown phenomenon study in the bidirectional pedestrian flow

Conference paper (2014) - X Yang, W Daamen, SP Hoogendoorn, Y Chen, H Dong

An approach to scenario analysis, generation and evaluation

Conference paper (2014) - Y Chen, HJ van Zuylen

Dynamic origin-destination matrix estimation using probe vehicle data as a priori information

Book chapter (2010) - R Asmundsdottir, Y Chen, HJ van Zuylen