Skip to main content

Showing 1–17 of 17 results for author: Willcocks, C G

  1. arXiv:2406.18422  [pdf, other

    cs.CV eess.IV

    Repeat and Concatenate: 2D to 3D Image Translation with 3D to 3D Generative Modeling

    Authors: Abril Corona-Figueroa, Hubert P. H. Shum, Chris G. Willcocks

    Abstract: This paper investigates a 2D to 3D image translation method with a straightforward technique, enabling correlated 2D X-ray to 3D CT-like reconstruction. We observe that existing approaches, which integrate information across multiple 2D views in the latent space, lose valuable signal information during latent encoding. Instead, we simply repeat and concatenate the 2D views into higher-channel 3D v… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: CVPRW 2024 - DCA in MI; Best Paper Award

  2. arXiv:2308.14152  [pdf, other

    cs.CV

    Unaligned 2D to 3D Translation with Conditional Vector-Quantized Code Diffusion using Transformers

    Authors: Abril Corona-Figueroa, Sam Bond-Taylor, Neelanjan Bhowmik, Yona Falinie A. Gaus, Toby P. Breckon, Hubert P. H. Shum, Chris G. Willcocks

    Abstract: Generating 3D images of complex objects conditionally from a few 2D views is a difficult synthesis problem, compounded by issues such as domain gap and geometric misalignment. For instance, a unified framework such as Generative Adversarial Networks cannot achieve this unless they explicitly define both a domain-invariant and geometric-invariant joint latent distribution, whereas Neural Radiance F… ▽ More

    Submitted 27 August, 2023; originally announced August 2023.

    Comments: Camera-ready version for ICCV 2023

  3. arXiv:2303.18242  [pdf, other

    cs.LG cs.CV

    $\infty$-Diff: Infinite Resolution Diffusion with Subsampled Mollified States

    Authors: Sam Bond-Taylor, Chris G. Willcocks

    Abstract: This paper introduces $\infty$-Diff, a generative diffusion model defined in an infinite-dimensional Hilbert space, which can model infinite resolution data. By training on randomly sampled subsets of coordinates and denoising content only at those locations, we learn a continuous function for arbitrary resolution sampling. Unlike prior neural field-based infinite-dimensional models, which use poi… ▽ More

    Submitted 1 March, 2024; v1 submitted 31 March, 2023; originally announced March 2023.

    Comments: Accepted at ICLR 2024

  4. arXiv:2211.12285  [pdf, other

    cs.CV cs.GR

    Exact-NeRF: An Exploration of a Precise Volumetric Parameterization for Neural Radiance Fields

    Authors: Brian K. S. Isaac-Medina, Chris G. Willcocks, Toby P. Breckon

    Abstract: Neural Radiance Fields (NeRF) have attracted significant attention due to their ability to synthesize novel scene views with great accuracy. However, inherent to their underlying formulation, the sampling of points along a ray with zero width may result in ambiguous representations that lead to further rendering artifacts such as aliasing in the final scene. To address this issue, the recent varia… ▽ More

    Submitted 25 March, 2023; v1 submitted 22 November, 2022; originally announced November 2022.

    Comments: 15 pages,10 figures

  5. arXiv:2206.12351  [pdf, other

    cs.CV cs.LG

    Megapixel Image Generation with Step-Unrolled Denoising Autoencoders

    Authors: Alex F. McKinney, Chris G. Willcocks

    Abstract: An ongoing trend in generative modelling research has been to push sample resolutions higher whilst simultaneously reducing computational requirements for training and sampling. We aim to push this trend further via the combination of techniques - each component representing the current pinnacle of efficiency in their respective areas. These include vector-quantized GAN (VQ-GAN), a vector-quantiza… ▽ More

    Submitted 24 June, 2022; originally announced June 2022.

    Comments: 17 pages + 9 appendix pages. 20 figures

  6. arXiv:2202.01020  [pdf, other

    eess.IV cs.CV

    MedNeRF: Medical Neural Radiance Fields for Reconstructing 3D-aware CT-Projections from a Single X-ray

    Authors: Abril Corona-Figueroa, Jonathan Frawley, Sam Bond-Taylor, Sarath Bethapudi, Hubert P. H. Shum, Chris G. Willcocks

    Abstract: Computed tomography (CT) is an effective medical imaging modality, widely used in the field of clinical medicine for the diagnosis of various pathologies. Advances in Multidetector CT imaging technology have enabled additional functionalities, including generation of thin slice multiplanar cross-sectional body imaging and 3D reconstructions. However, this involves patients being exposed to a consi… ▽ More

    Submitted 8 April, 2022; v1 submitted 2 February, 2022; originally announced February 2022.

    Comments: 6 pages, 4 figures, accepted at IEEE EMBC 2022

    ACM Class: I.4; J.7

  7. arXiv:2111.12701  [pdf, other

    cs.CV cs.LG

    Unleashing Transformers: Parallel Token Prediction with Discrete Absorbing Diffusion for Fast High-Resolution Image Generation from Vector-Quantized Codes

    Authors: Sam Bond-Taylor, Peter Hessey, Hiroshi Sasaki, Toby P. Breckon, Chris G. Willcocks

    Abstract: Whilst diffusion probabilistic models can generate high quality image content, key limitations remain in terms of both generating high-resolution imagery and their associated high computational requirements. Recent Vector-Quantized image models have overcome this limitation of image resolution but are prohibitively slow and unidirectional as they generate tokens via element-wise autoregressive sam… ▽ More

    Submitted 24 November, 2021; originally announced November 2021.

    Comments: 19 pages, 14 figures

    MSC Class: 68T01 (Primary); 68T07 (Secondary) ACM Class: I.5.0; I.4.0; G.3

  8. arXiv:2104.05358  [pdf, other

    cs.CV cs.LG eess.IV

    UNIT-DDPM: UNpaired Image Translation with Denoising Diffusion Probabilistic Models

    Authors: Hiroshi Sasaki, Chris G. Willcocks, Toby P. Breckon

    Abstract: We propose a novel unpaired image-to-image translation method that uses denoising diffusion probabilistic models without requiring adversarial training. Our method, UNpaired Image Translation with Denoising Diffusion Probabilistic Models (UNIT-DDPM), trains a generative model to infer the joint distribution of images over both domains as a Markov chain by minimising a denoising score matching obje… ▽ More

    Submitted 12 April, 2021; originally announced April 2021.

    Comments: 10 pages, 8 figures

  9. Unmanned Aerial Vehicle Visual Detection and Tracking using Deep Neural Networks: A Performance Benchmark

    Authors: Brian K. S. Isaac-Medina, Matt Poyser, Daniel Organisciak, Chris G. Willcocks, Toby P. Breckon, Hubert P. H. Shum

    Abstract: Unmanned Aerial Vehicles (UAV) can pose a major risk for aviation safety, due to both negligent and malicious use. For this reason, the automated detection and tracking of UAV is a fundamental task in aerial security systems. Common technologies for UAV detection include visible-band and thermal infrared imaging, radio frequency and radar. Recent advances in deep neural networks (DNNs) for image-b… ▽ More

    Submitted 18 August, 2021; v1 submitted 25 March, 2021; originally announced March 2021.

  10. arXiv:2103.04922  [pdf, other

    cs.LG cs.CV stat.ML

    Deep Generative Modelling: A Comparative Review of VAEs, GANs, Normalizing Flows, Energy-Based and Autoregressive Models

    Authors: Sam Bond-Taylor, Adam Leach, Yang Long, Chris G. Willcocks

    Abstract: Deep generative models are a class of techniques that train deep neural networks to model the distribution of training samples. Research has fragmented into various interconnected approaches, each of which make trade-offs including run-time, diversity, and architectural restrictions. In particular, this compendium covers energy-based models, variational autoencoders, generative adversarial network… ▽ More

    Submitted 28 March, 2022; v1 submitted 8 March, 2021; originally announced March 2021.

    Comments: 20 pages, 9 figures, will appear in IEEE Transactions on Pattern Analysis and Machine Intelligence

    MSC Class: 68T01 (Primary); 68T07 (Secondary) ACM Class: I.5.0; I.4.0; G.3

  11. arXiv:2103.01299  [pdf, other

    eess.IV cs.CV cs.LG

    Robust 3D U-Net Segmentation of Macular Holes

    Authors: Jonathan Frawley, Chris G. Willcocks, Maged Habib, Caspar Geenen, David H. Steel, Boguslaw Obara

    Abstract: Macular holes are a common eye condition which result in visual impairment. We look at the application of deep convolutional neural networks to the problem of macular hole segmentation. We use the 3D U-Net architecture as a basis and experiment with a number of design variants. Manually annotating and measuring macular holes is time consuming and error prone. Previous automated approaches to macul… ▽ More

    Submitted 7 April, 2021; v1 submitted 1 March, 2021; originally announced March 2021.

  12. arXiv:2010.01942  [pdf, other

    eess.IV cs.CV

    Unsupervised Region-based Anomaly Detection in Brain MRI with Adversarial Image Inpainting

    Authors: Bao Nguyen, Adam Feldman, Sarath Bethapudi, Andrew Jennings, Chris G. Willcocks

    Abstract: Medical segmentation is performed to determine the bounds of regions of interest (ROI) prior to surgery. By allowing the study of growth, structure, and behaviour of the ROI in the planning phase, critical information can be obtained, increasing the likelihood of a successful operation. Usually, segmentations are performed manually or via machine learning methods trained on manual annotations. In… ▽ More

    Submitted 5 October, 2020; originally announced October 2020.

    Comments: 5 pages, 6 figures

    ACM Class: I.5.0; I.4.0

  13. arXiv:2007.02798  [pdf, other

    cs.CV cs.LG

    Gradient Origin Networks

    Authors: Sam Bond-Taylor, Chris G. Willcocks

    Abstract: This paper proposes a new type of generative model that is able to quickly learn a latent representation without an encoder. This is achieved using empirical Bayes to calculate the expectation of the posterior, which is implemented by initialising a latent vector with zeros, then using the gradient of the log-likelihood of the data with respect to this zero vector as new latent points. The approac… ▽ More

    Submitted 24 March, 2021; v1 submitted 6 July, 2020; originally announced July 2020.

    Comments: 16 pages, 17 figures, accepted at ICLR 2021, camera-ready version

    MSC Class: 68T01 (Primary); 68T07 (Secondary) ACM Class: I.5.0; I.4.0; G.3

  14. arXiv:2005.04697  [pdf, other

    eess.IV cs.CV cs.LG

    Segmentation of Macular Edema Datasets with Small Residual 3D U-Net Architectures

    Authors: Jonathan Frawley, Chris G. Willcocks, Maged Habib, Caspar Geenen, David H. Steel, Boguslaw Obara

    Abstract: This paper investigates the application of deep convolutional neural networks with prohibitively small datasets to the problem of macular edema segmentation. In particular, we investigate several different heavily regularized architectures. We find that, contrary to popular belief, neural architectures within this application setting are able to achieve close to human-level performance on unseen t… ▽ More

    Submitted 10 May, 2020; originally announced May 2020.

    Comments: 7 pages, 5 figures

  15. arXiv:2005.02436  [pdf, other

    cs.CV cs.LG eess.IV

    Data Augmentation via Mixed Class Interpolation using Cycle-Consistent Generative Adversarial Networks Applied to Cross-Domain Imagery

    Authors: Hiroshi Sasaki, Chris G. Willcocks, Toby P. Breckon

    Abstract: Machine learning driven object detection and classification within non-visible imagery has an important role in many fields such as night vision, all-weather surveillance and aviation security. However, such applications often suffer due to the limited quantity and variety of non-visible spectral domain imagery, in contrast to the high data availability of visible-band imagery that readily enables… ▽ More

    Submitted 1 January, 2021; v1 submitted 5 May, 2020; originally announced May 2020.

    Comments: 9 pages, 9 figures, accepted at the 25th International Conference on Pattern Recognition (ICPR 2020)

  16. arXiv:1910.04543  [pdf, other

    physics.bio-ph cs.LG

    Learning protein conformational space by enforcing physics with convolutions and latent interpolations

    Authors: Venkata K. Ramaswamy, Chris G. Willcocks, Matteo T. Degiacomi

    Abstract: Determining the different conformational states of a protein and the transition paths between them is key to fully understanding the relationship between biomolecular structure and function. This can be accomplished by sampling protein conformational space with molecular simulation methodologies. Despite advances in computing hardware and sampling techniques, simulations always yield a discretized… ▽ More

    Submitted 25 March, 2020; v1 submitted 10 October, 2019; originally announced October 2019.

    MSC Class: 68T05; 92C05 ACM Class: I.2.6

    Journal ref: Phys. Rev. X 11, 011052 (2021)

  17. TMIXT: A process flow for Transcribing MIXed handwritten and machine-printed Text

    Authors: Fady Medhat, Mahnaz Mohammadi, Sardar Jaf, Chris G. Willcocks, Toby P. Breckon, Peter Matthews, Andrew Stephen McGough, Georgios Theodoropoulos, Boguslaw Obara

    Abstract: Handling large corpuses of documents is of significant importance in many fields, no more so than in the areas of crime investigation and defence, where an organisation may be presented with a large volume of scanned documents which need to be processed in a finite time. However, this problem is exacerbated both by the volume, in terms of scanned documents and the complexity of the pages, which ne… ▽ More

    Submitted 28 April, 2019; originally announced April 2019.

    Comments: big data, unstructured data, Optical Character Recognition (OCR), Handwritten Text Recognition (HTR), machine-printed text recognition, IAM handwriting database, TMIXT

    Journal ref: IEEE International Conference on Big Data (Big Data) 2018