VQDNA: Unleashing the Power of Vector Quantization for Multi-Species Genomic Sequence Modeling

By Siyuan Li and others
Similar to natural language models, pre-trained genome language models are proposed to capture the underlying intricacies within genomes with unsupervised sequence modeling. They have become essential tools for researchers and practitioners in biology. However, the hand-crafted tokenization policies used in these models may not encode the most discriminative patterns from...
June 2, 2024