Grand biological universe paper revision
组织者
演讲者
时间
2024年05月20日 15:30 至 16:00
地点
理科楼A-304
摘要
Understanding the differences in genome sequences of different organisms is crucial for biological classification and phylogenetic evolution. The k-mer natural vector method encodes sequences into numerical vectors, transforming the problem of sequence comparison in genomic space into vector comparison in high-dimensional Euclidean space. We downloaded all reliable sequences from seven datasets in NCBI and determined the embedding Euclidean dimension and natural metric in genomic space. We proposed the concept of a large biological universe, where the convex hulls formed by the seven datasets are mutually exclusive, and the convex hulls formed by different biological populations within each dataset are also mutually exclusive. This study provides a new perspective for molecular biology and enables accurate comparison of large-scale sequences in real time, revealing the differences in metrics across the universe and standardizing metrics that are not suitable for comprehensive analysis.