Articles

High-throughput deep learning variant effect prediction with Sequence UNET

Dunham, Alistair S.; Beltrao, Pedro; AlQuraishi, Mohammed N.

Understanding coding mutations is important for many applications in biology and medicine but the vast mutation space makes comprehensive experimental characterisation impossible. Current predictors are often computationally intensive and difficult to scale, including recent deep learning models. We introduce Sequence UNET, a highly scalable deep learning architecture that classifies and predicts variant frequency from sequence alone using multi-scale representations from a fully convolutional compression/expansion architecture. It achieves comparable pathogenicity prediction to recent methods. We demonstrate scalability by analysing 8.3B variants in 904,134 proteins detected through large-scale proteomics. Sequence UNET runs on modest hardware with a simple Python package.

Files

  • thumnail for 13059_2023_Article_2948.pdf 13059_2023_Article_2948.pdf application/pdf 727 KB Download File

Also Published In

More About This Work

Academic Units
Systems Biology
Published Here
March 26, 2025

Notes

Variant effect prediction, Deep learning, Mutation, PSSM, Pathogenicity, Machine learning