TY - JOUR
T1 - Optimal trade-off control in machine learning-based library design, with application to adeno-associated virus (AAV) for gene therapy
AU - Zhu, Danqing
AU - Brookes, David H.
AU - Busia, Akosua
AU - Carneiro, Ana
AU - Fannjiang, Clara
AU - Popova, Galina
AU - Shin, David
AU - Donohue, Kevin C.
AU - Lin, Li F.
AU - Miller, Zachary M.
AU - Williams, Evan R.
AU - Chang, Edward F.
AU - Nowakowski, Tomasz J.
AU - Listgarten, Jennifer
AU - Schaffer, David V.
N1 - Publisher Copyright:
Copyright © 2024 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works. Distributed under a Creative Commons Attribution NonCommercial License 4.0 (CC BY-NC).
PY - 2024/1
Y1 - 2024/1
N2 - Adeno-associated viruses (AAVs) hold tremendous promise as delivery vectors for gene therapies. AAVs have been successfully engineered-for instance, for more efficient and/or cell-specific delivery to numerous tissues-by creating large, diverse starting libraries and selecting for desired properties. However, these starting libraries often contain a high proportion of variants unable to assemble or package their genomes, a prerequisite for any gene delivery goal. Here, we present and showcase a machine learning (ML) method for designing AAV peptide insertion libraries that achieve fivefold higher packaging fitness than the standard NNK library with negligible reduction in diversity. To demonstrate our ML-designed library's utility for downstream engineering goals, we show that it yields approximately 10-fold more successful variants than the NNK library after selection for infection of human brain tissue, leading to a promising glial-specific variant. Moreover, our design approach can be applied to other types of libraries for AAV and beyond.
AB - Adeno-associated viruses (AAVs) hold tremendous promise as delivery vectors for gene therapies. AAVs have been successfully engineered-for instance, for more efficient and/or cell-specific delivery to numerous tissues-by creating large, diverse starting libraries and selecting for desired properties. However, these starting libraries often contain a high proportion of variants unable to assemble or package their genomes, a prerequisite for any gene delivery goal. Here, we present and showcase a machine learning (ML) method for designing AAV peptide insertion libraries that achieve fivefold higher packaging fitness than the standard NNK library with negligible reduction in diversity. To demonstrate our ML-designed library's utility for downstream engineering goals, we show that it yields approximately 10-fold more successful variants than the NNK library after selection for infection of human brain tissue, leading to a promising glial-specific variant. Moreover, our design approach can be applied to other types of libraries for AAV and beyond.
UR - https://www.webofscience.com/wos/woscc/full-record/WOS:001185091400012
UR - https://openalex.org/W4391203107
UR - https://www.scopus.com/pages/publications/85183336311
U2 - 10.1126/sciadv.adj3786
DO - 10.1126/sciadv.adj3786
M3 - Journal Article
C2 - 38266077
SN - 2375-2548
VL - 10
JO - Science Advances
JF - Science Advances
IS - 4
ER -