Q. Tao
Please Note
35 records found
1
Pretrained segmentation models for cardiac magnetic resonance imaging (MRI) struggle to generalize across different imaging sequences due to significant variations in image contrast. These variations arise from changes in imaging protocols, yet the same fundamental spin properties, including proton density, T1, and T2 values, govern all acquired images. With this core principle, we introduce Reverse Imaging, a novel physics-driven method for cardiac MRI data augmentation and domain adaptation to fundamentally solve the generalization problem. Our method reversely infers the underlying spin properties from observed cardiac MRI images, by solving ill-posed nonlinear inverse problems regularized by the prior distribution of spin properties. We acquire this “spin prior” by learning a generative diffusion model from the multiparametric SAturation-recovery single-SHot acquisition sequence (mSASHA) dataset, which offers joint cardiac T1 and T2 maps. Our method enables approximate but meaningful spin-property estimates from MR images, which provide an interpretable “latent variable” that lead to highly flexible image synthesis of arbitrary novel sequences. We show that Reverse Imaging enables highly accurate segmentation across vastly different image contrasts and imaging protocols, realizing wide-spectrum generalization of cardiac MRI segmentation.
Diffusion boost
Leveraging diffusion model for groupwise registration in myocardial T1 mapping
Background: As a quantitative magnetic resonance imaging (MRI) technique, myocardial T1 mapping plays a crucial role in the diagnosis and treatment of cardiovascular diseases. In practice, involuntary cardiac and respiratory motion often results in reduced accuracy and precision in T1 estimation. Therefore, image registration remains crucial for accurate and precise myocardial T1 mapping. Compared with pairwise registration that warps each baseline image to a predefined template, groupwise registration aligns all images from one sequence simultaneously without the need for a template. However, a persistent challenge is the difficulty of extracting the structural representation of T1 mapping data that contains vastly varying contrast, which severely undermines the performance of image registration. Purpose: The purpose of this study is to incorporate the learning capabilities of the diffusion model to tackle the main challenge encountered in the registration of myocardial T1 mapping. Our goal is to align all images within an image series simultaneously in a groupwise manner. Methods: In this article, we propose a novel template-free groupwise registration framework that can align one T1-weighted image series through a single forward propagation. Notably, we introduce the diffusion process to effectively boost the structural information extraction under the drastic contrast changes for reliable image registration. Furthermore, we design a Hybrid Attention Feature Fusion (HAFF) module to promote the multi-scale feature fusion from diffusion to registration. To evaluate the registration performance of the proposed model, experiments are conducted on a publicly available myocardial T1 mapping dataset comprising 210 consecutive patients, using an independent test set for comparison experiments and ablation studies. Results: Experimental results demonstrated the great superiority of our proposed method in the registration of myocardial T1 mapping. Quantitatively, the proposed method resulted in a Dice score of 0.839, groupwise Dice score of 0.601, Hausdorff distance of 10.389 mm, and T1 mapping error of 11.372 ms, surpassing the current state-of-the-art approaches. Conclusions: Our proposed framework realizes robust groupwise registration for myocardial T1 mapping by leveraging the state-of-the-art diffusion model, demonstrating its strong feature extraction capacity for image registration, beyond image generation.
The low degree of labeling and limited photon count of fluorescent emitters in single molecule localization microscopy results in poor quality images of macro-molecular complexes. Particle fusion provides a single reconstruction with high signal-to-noise ratio by combining many single molecule localization microscopy images of the same structure. The underlying assumption of homogeneity is not always valid, heterogeneity can arise due to geometrical shape variations or distinct conformational states. We introduce a Point Cloud Variational Auto-Encoder that works directly on 2D and 3D localization data, to detect multiple modes of variation in such datasets. The computing time is on the order of a few minutes, enabled by the linear scaling with dataset size, and fast network training in just four epochs. The use of lists of localization data instead of pixelated images leads to just minor differences in computational burden between 2D and 3D cases. With the proposed method, we detected radius variation in 2D Nuclear Pore Complex data, height variations in 3D DNA origami tetrahedron data, and both radius and height variations in 3D Nuclear Pore Complex data. In all cases, the detected variations were on the few nanometer scale.
The state-of-the-art in cardiac MRI reconstruction
Results of the CMRxRecon challenge in MICCAI 2023
Advanced Cardiac MRI
Integrating AI and Machine Learning for Post Imaging Reconstruction
Purpose of Review: This review explores the advancements in deep learning (DL)-based cardiac magnetic resonance (CMR) reconstruction, focusing on its role in accelerating imaging, denoising, super-resolution, motion artifact correction, and quantitative mapping. It highlights the transition from parallel imaging and compressed sensing to artificial intelligence (AI)-driven approaches that enhance image quality and diagnostic accuracy. Recent Findings: Supervised and self-supervised DL models can significantly reduce scan times, enabling high-fidelity reconstructions from undersampled data. Generative adversarial network (GAN)-based super-resolution techniques enhance spatial resolution, while denoising networks improve signal-to-noise ratio. Motion correction strategies, including spatiotemporal learning, have enhanced free-breathing acquisitions. Physics-guided models incorporate MRI signal constraints for improved T1/T2 mapping and myocardial tissue characterization. Summary: DL-driven CMR reconstruction optimizes imaging speed, quality, and artifact suppression. Despite challenges in dataset standardization and clinical validation, AI is advancing real-time, high-fidelity CMR, facilitating broader clinical adoption.
Cardiac T1 mapping by magnetic resonance imaging (MRI) is an important clinical tool for the diagnosis and treatment of cardiovascular diseases. In practice, involuntary cardiac and respiratory motion often results in reduced accuracy and precision in T1 estimation. Motion correction is an essential preprocessing step, however, with intensive contrast changes among baseline images, both optimization-based and deep-learning (DL)-based registration methods still struggle to estimate structural similarity between images, especially when image contrast is poor and displacement is large. In this work, we propose a novel registration metric that is highly insensitive to large contrast changes, based on modified modality independent neighborhood descriptor (mo-MIND). To accommodate severe motions, we further propose pre-deformation as an augmentation strategy at the training stage. We combine the proposed mo-MIND-based metric and the augmentation strategy in a U-Net architecture to tackle the challenges of motion correction for cardiac T1 mapping. Experimental results and ablation studies demonstrated that our method achieved improved registration performance compared to several established baselines, leading to significantly reduced T1 mapping error and improved landmark stability.
Objectives: To evaluate a deep learning (DL) model for reducing the agent dose of contrast-enhanced T1-weighted MRI (T1ce) of the cerebellopontine angle (CPA) cistern. Materials and methods: In this multicenter retrospective study, T1 and T1ce of vestibular schwannoma (VS) patients were used to simulate low-dose T1ce with varying reductions of contrast agent dose. DL models were trained to restore standard-dose T1ce from the low-dose simulation. The image quality and segmentation performance of the DL-restored T1ce were evaluated. A head and neck radiologist was asked to rate DL-restored images in multiple aspects, including image quality and diagnostic characterization. Results: 203 MRI studies from 72 VS patients (mean age, 58.51 ± 14.73, 39 men) were evaluated. As the input dose increased, the structural similarity index measure of the restored T1ce increased from 0.639 ± 0.113 to 0.993 ± 0.009, and the peak signal-to-noise ratio increased from 21.6 ± 3.73 dB to 41.4 ± 4.84 dB. At a 10% input dose, using DL-restored T1ce for segmentation improved the Dice from 0.673 to 0.734, the 95% Hausdorff distance from 2.38 mm to 2.07 mm, and the average surface distance from 1.00 mm to 0.59 mm. Both DL-restored T1ce from 10% and 30% input doses showed excellent image quality (3, interquartile range (IQR) [Q3-Q1] = 3–3 and 3, IQR [Q3-Q1] = 4–3), with the latter being considered more informative (4, IQR [Q3-Q1] = 4–3). Conclusion: The DL model improved the image quality of low-dose MRI of the CPA cistern, which makes lesion detection and diagnostic characterization possible with 10–30% of the standard dose. Key Points: Question Deep learning models that aid in the reduction of contrast agent dose are not extensively evaluated for MRI of the cerebellopontine angle cistern. Findings Deep learning models restored the low-dose MRI of the cerebellopontine angle cistern, yielding images sufficient for vestibular schwannoma diagnosis and management. Clinical relevance Deep learning models make it possible to reduce the use of gadolinium-based contrast agents for contrast-enhanced MRI of the cerebellopontine angle cistern.
This study presents a novel image-based machine learning (ML) method for automating I–V parameter extraction in gallium nitride (GaN) devices. Using Ampleon’s GEAR model, a dataset of 100000 simulated I–V curves are converted into I–V images through specifically designed transfer functions to train a convolutional neural network. The proposed method outperforms the existing ML method based on a fully connected neural network, particularly for I–V curves in the subthreshold region. Validation with measured pulse I–V data shows its superior accuracy, achieving a normalized mean square error (NMSE) of −30 dB compared with −24 dB with the existing ML method. The proposed method demonstrates a strong potential to accelerate the extraction and enhance the accuracy of GaN device modeling.
We extensively evaluated RIIR on brain MRI, lung CT, and quantitative cardiac MRI datasets, in terms of both registration accuracy and training data efficiency. Our experiments showed that RIIR outperformed a range of deep learning-based methods, even with only 5% of the training data, demonstrating high data efficiency. Key findings from our ablation studies highlighted the important added value of the hidden states introduced in the recurrent inference framework for meta-learning. Our proposed RIIR offers a highly data-efficient framework for deep learning-based medical image registration. ...
We extensively evaluated RIIR on brain MRI, lung CT, and quantitative cardiac MRI datasets, in terms of both registration accuracy and training data efficiency. Our experiments showed that RIIR outperformed a range of deep learning-based methods, even with only 5% of the training data, demonstrating high data efficiency. Key findings from our ablation studies highlighted the important added value of the hidden states introduced in the recurrent inference framework for meta-learning. Our proposed RIIR offers a highly data-efficient framework for deep learning-based medical image registration.
Methods: DIR-labeling was implemented for double ECG-gated myoASLsequences and compared with conventional Flow-sensitive Alternating Inversion Recovery (FAIR) labeling using single inversions. In DIR-preparations, the FAIR-inversion pulses were immediately followed by an identical reinversion pulse, applied either slice-selectively or nonselectively. Bloch-equation-based simulation and phantom experiments were performed to evaluate the PN and SNR across a range of HR variabilities. Data from six healthy subjects were acquired to evaluate myocardial blood flow (MBF), PN, and SNR in vivo.
Results: Simulation experiments showed that the averageMBFvalues remained nearly constant across the range of HR variabilities and were comparable across all three sequences. However, DIR-labeling allowed for greater recovery of the myocardial background signal, which mitigates the sensitivity to HR-dependent changes in the inversion time. Consequently, PN in the presence of HR variability was substantially reduced with DIR-labeling. For HR variabilities corresponding to the mean value observed in vivo, this resulted in a simulated SNR gain of 1.79 ± 0.90 for selective and 1.55 ± 0.77 for nonselective DIR-labeling. In vivo, DIR-labeling showed reduced PN, with 53% (p < 0.05)/44% (p = 0.16) less PN compared with conventional FAIR-myoASL, leading to an average SNR gain of 1.47 ± 0.63 (p = 0.09)/1.32 ± 0.57 (p = 0.84) with selective/nonselective reinversions.
Conclusion: The proposed DIR-preparations reduce sensitivity to HR variations and alleviate PN in double ECG-gated myoASL, improving the precision of myoASL-based perfusion quantification.
...
Methods: DIR-labeling was implemented for double ECG-gated myoASLsequences and compared with conventional Flow-sensitive Alternating Inversion Recovery (FAIR) labeling using single inversions. In DIR-preparations, the FAIR-inversion pulses were immediately followed by an identical reinversion pulse, applied either slice-selectively or nonselectively. Bloch-equation-based simulation and phantom experiments were performed to evaluate the PN and SNR across a range of HR variabilities. Data from six healthy subjects were acquired to evaluate myocardial blood flow (MBF), PN, and SNR in vivo.
Results: Simulation experiments showed that the averageMBFvalues remained nearly constant across the range of HR variabilities and were comparable across all three sequences. However, DIR-labeling allowed for greater recovery of the myocardial background signal, which mitigates the sensitivity to HR-dependent changes in the inversion time. Consequently, PN in the presence of HR variability was substantially reduced with DIR-labeling. For HR variabilities corresponding to the mean value observed in vivo, this resulted in a simulated SNR gain of 1.79 ± 0.90 for selective and 1.55 ± 0.77 for nonselective DIR-labeling. In vivo, DIR-labeling showed reduced PN, with 53% (p < 0.05)/44% (p = 0.16) less PN compared with conventional FAIR-myoASL, leading to an average SNR gain of 1.47 ± 0.63 (p = 0.09)/1.32 ± 0.57 (p = 0.84) with selective/nonselective reinversions.
Conclusion: The proposed DIR-preparations reduce sensitivity to HR variations and alleviate PN in double ECG-gated myoASL, improving the precision of myoASL-based perfusion quantification.
Dynamic brain data are becoming increasingly accessible, providing a gateway to understanding the inner workings of the brain in living participants. However, the size and complexity of the data pose a challenge in extracting meaningful information across various data sources. Here we introduce a generalizable unsupervised deep manifold learning for exploration of neurocognitive and behavioral patterns. Unlike existing methods that extract patterns directly from the input data, the proposed brain-dynamic convolutional-network-based embedding (BCNE) captures brain-state trajectories by analyzing temporospatial correlations within the data and applying manifold learning. The results demonstrate that BCNE effectively delineates scene transitions, underscores the involvement of different brain regions in memory and narrative processing, distinguishes dynamic learning processes and identifies differences between active and passive behaviors. BCNE provides an effective tool for exploring general neuroscience inquiries or individual-specific patterns.
Methods: A longitudinal magnetization preparation module was designed to encode |B+1 |. After magnetization tip-down, off-resonant Fermi pulses, placed symmetrically around two refocusing pulses, induced BSS, followed by tipping back of the magnetization. Bloch simulations were used to optimize refocusing pulse parameters and to assess the mapping sensitivity. Relaxation-induced B+1 error was simulated for various T 1 /T 2 times. The effective mapping range was determined in phantom experiments, and |B+1 | maps were compared to the conventional BSS method and subadiabatic hyperbolic-secant 8 (HS8) pulse-sensitized method. Cardiac B+1 maps were acquired in healthy subjects, and evaluated for repeatability and imaging plane intersection consistency. The technique was modified for three-dimensional (3D) acquisition of the whole heart in a single breath-hold, and compared to two-dimensional (2D) acquisition.
Results: Simulations indicate that the proposed preparation can be tailored to achieve high mapping sensitivity across various B+1 ranges, with maximum sensitivity at the upper B+1 range. T 1 /T 2 -induced bias did not exceed 5.2%. Experimentally reproduced B+1 sensitization closely matched simulations for B+1 ≥ 0.3B+1, max (mean difference 0.031±0.022, compared to 0.018±0.025 in the HS8-sensitized method), and showed 20-fold reduction in the standard deviation of repeated scans, compared with conventional BSS B+1 mapping, and an equivalent 2-fold reduction compared with HS8-sensitization. Robust cardiac B+1 map quality was obtained, with an average test-retest variability of 0.027±0.043 relative to normalized B+1 magnitude, and plane intersection bias of 0.052±0.031. 3D acquisitions showed good agreement with2D scans (mean absolute deviation 0.055±0.061).
Conclusion: BSS-based preparations enable robust and tailorable 2D/3D cardiac B+1 mapping at 3 T in a single breath-hold. ...
Methods: A longitudinal magnetization preparation module was designed to encode |B+1 |. After magnetization tip-down, off-resonant Fermi pulses, placed symmetrically around two refocusing pulses, induced BSS, followed by tipping back of the magnetization. Bloch simulations were used to optimize refocusing pulse parameters and to assess the mapping sensitivity. Relaxation-induced B+1 error was simulated for various T 1 /T 2 times. The effective mapping range was determined in phantom experiments, and |B+1 | maps were compared to the conventional BSS method and subadiabatic hyperbolic-secant 8 (HS8) pulse-sensitized method. Cardiac B+1 maps were acquired in healthy subjects, and evaluated for repeatability and imaging plane intersection consistency. The technique was modified for three-dimensional (3D) acquisition of the whole heart in a single breath-hold, and compared to two-dimensional (2D) acquisition.
Results: Simulations indicate that the proposed preparation can be tailored to achieve high mapping sensitivity across various B+1 ranges, with maximum sensitivity at the upper B+1 range. T 1 /T 2 -induced bias did not exceed 5.2%. Experimentally reproduced B+1 sensitization closely matched simulations for B+1 ≥ 0.3B+1, max (mean difference 0.031±0.022, compared to 0.018±0.025 in the HS8-sensitized method), and showed 20-fold reduction in the standard deviation of repeated scans, compared with conventional BSS B+1 mapping, and an equivalent 2-fold reduction compared with HS8-sensitization. Robust cardiac B+1 map quality was obtained, with an average test-retest variability of 0.027±0.043 relative to normalized B+1 magnitude, and plane intersection bias of 0.052±0.031. 3D acquisitions showed good agreement with2D scans (mean absolute deviation 0.055±0.061).
Conclusion: BSS-based preparations enable robust and tailorable 2D/3D cardiac B+1 mapping at 3 T in a single breath-hold.
Quantitative cardiac magnetic resonance imaging (MRI) is an increasingly important diagnostic tool for cardiovascular diseases. Yet, co-registration of all baseline images within the quantitative MRI sequence is essential for the accuracy and precision of quantitative maps. However, co-registering all baseline images from a quantitative cardiac MRI sequence remains a nontrivial task because of the simultaneous changes in intensity and contrast, in combination with cardiac and respiratory motion. To address the challenge, we propose a novel motion correction framework based on robust principle component analysis (rPCA) that decomposes quantitative cardiac MRI into low-rank and sparse components, and we integrate the groupwise CNN-based registration backbone within the rPCA framework. The low-rank component of rPCA corresponds to the quantitative mapping (i.e. limited degree of freedom in variation), while the sparse component corresponds to the residual motion, making it easier to formulate and solve the groupwise registration problem. We evaluated our proposed method on cardiac T1 mapping by the modified Look-Locker inversion recovery (MOLLI) sequence, both before and after the Gadolinium contrast agent administration. Our experiments showed that our method effectively improved registration performance over baseline methods without introducing rPCA, and reduced quantitative mapping error in both in-domain (pre-contrast MOLLI) and out-of-domain (post-contrast MOLLI) inference. The proposed rPCA framework is generic and can be integrated with other registration backbones.
Deep learning-based methods have achieved prestigious performance for magnetic resonance imaging (MRI) reconstruction, enabling fast imaging for many clinical applications. Previous methods employ convolutional networks to learn the image prior as the regularization term. In quantitative MRI, the physical model of nuclear magnetic resonance relaxometry is known, providing additional prior knowledge for image reconstruction. However, traditional reconstruction networks are limited to learning the spatial domain prior knowledge, ignoring the relaxometry prior. Therefore, we propose a relaxometry-guided quantitative MRI reconstruction framework to learn the spatial prior from data and the relaxometry prior from MRI physics. Additionally, we also evaluated the performance of two popular reconstruction backbones, namely, recurrent variational networks (RVN) and variational networks (VN) with U-Net. Experiments demonstrate that the proposed method achieves highly promising results in quantitative MRI reconstruction.
Lost in Tracking
Uncertainty-Guided Cardiac Cine MRI Segmentation at Right Ventricle Base