Manifold fitting in genome sequence analysing
Organizer
Speaker
(Tsinghua) Tao Zhou
Time
Thursday, August 31, 2023 2:30 PM - 3:00 PM
Venue
Online
Abstract
The manifold hypothesis posits that high-dimensional data typically lie close to a low-dimensional manifold. The genesis of the manifold hypothesis stems from the observation that numerous physical systems possess a limited number of underlying variables that determine their behavior, even when they display intricate and diverse phenomena in high-dimensional spaces. Previously, we used the natural vector method to characterize the genome sequence, which inevitably introduces a high-dimensional Euclidean space when considering k-mer fragments, and we are now interested in if there exists some low-dimensional manifold approximation of natural space. This week I will explain the principle of manifold fitting and report on some primary attempt on its application to computational genomics.