TY - JOUR
T1 - Polyphony
T2 - an Interactive Transfer Learning Framework for Single-Cell Data Analysis
AU - Cheng, Furui
AU - Keller, Mark S.
AU - Qu, Huamin
AU - Gehlenborg, Nils
AU - Wang, Qianwen
N1 - Publisher Copyright:
© 1995-2012 IEEE.
PY - 2023/1/1
Y1 - 2023/1/1
N2 - Reference-based cell-type annotation can significantly reduce time and effort in single-cell analysis by transferring labels from a previously-annotated dataset to a new dataset. However, label transfer by end-to-end computational methods is challenging due to the entanglement of technical (e.g., from different sequencing batches or techniques) and biological (e.g., from different cellular microenvironments) variations, only the first of which must be removed. To address this issue, we propose Polyphony, an interactive transfer learning (ITL) framework, to complement biologists' knowledge with advanced computational methods. Polyphony is motivated and guided by domain experts' needs for a controllable, interactive, and algorithm-assisted annotation process, identified through interviews with seven biologists. We introduce anchors, i.e., analogous cell populations across datasets, as a paradigm to explain the computational process and collect user feedback for model improvement. We further design a set of visualizations and interactions to empower users to add, delete, or modify anchors, resulting in refined cell type annotations. The effectiveness of this approach is demonstrated through quantitative experiments, two hypothetical use cases, and interviews with two biologists. The results show that our anchor-based ITL method takes advantage of both human and machine intelligence in annotating massive single-cell datasets.
AB - Reference-based cell-type annotation can significantly reduce time and effort in single-cell analysis by transferring labels from a previously-annotated dataset to a new dataset. However, label transfer by end-to-end computational methods is challenging due to the entanglement of technical (e.g., from different sequencing batches or techniques) and biological (e.g., from different cellular microenvironments) variations, only the first of which must be removed. To address this issue, we propose Polyphony, an interactive transfer learning (ITL) framework, to complement biologists' knowledge with advanced computational methods. Polyphony is motivated and guided by domain experts' needs for a controllable, interactive, and algorithm-assisted annotation process, identified through interviews with seven biologists. We introduce anchors, i.e., analogous cell populations across datasets, as a paradigm to explain the computational process and collect user feedback for model improvement. We further design a set of visualizations and interactions to empower users to add, delete, or modify anchors, resulting in refined cell type annotations. The effectiveness of this approach is demonstrated through quantitative experiments, two hypothetical use cases, and interviews with two biologists. The results show that our anchor-based ITL method takes advantage of both human and machine intelligence in annotating massive single-cell datasets.
KW - Human-AI Interaction
KW - Interactive Machine Learning
KW - Single-cell Data Analysis
KW - Transfer Learning
UR - https://www.webofscience.com/wos/woscc/full-record/WOS:000901991800010
UR - https://openalex.org/W4297094730
UR - https://www.scopus.com/pages/publications/85139459007
U2 - 10.1109/TVCG.2022.3209408
DO - 10.1109/TVCG.2022.3209408
M3 - Journal Article
C2 - 36155452
SN - 1077-2626
VL - 29
SP - 591
EP - 601
JO - IEEE Transactions on Visualization and Computer Graphics
JF - IEEE Transactions on Visualization and Computer Graphics
IS - 1
ER -