A summary report on the GISAID project
Organizer
Speaker
Mengcen Guan
Time
Thursday, September 21, 2023 8:30 PM - 9:00 PM
Venue
Online
Abstract
Based on the GISAID dataset, we downloaded COVID-19 sequences and completed convex hull analysis and KNN classification analysis. These methods achieve good results when we use the 7-mer natural vector with 1/2^k weights. We also finished statistical analysis: for example, we compared ACGT statistics across sub-types, countries, and genders. Finally, we drew natural graphs of these sub-types and extracted several features from the graphs.
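For readers unfamiliar with the k-mer natural vector, the following is a minimal Python sketch of one common formulation, not the speaker's implementation. For each k-mer it records the count, the mean position, and the normalized second central moment of the positions. The function names `kmer_natural_vector` and `weighted_distance` are hypothetical, and reading "1/2^k weights" as a weighted sum of Euclidean distances over k = 1, ..., 7 is an assumption made for illustration.

```python
from itertools import product
import math


def kmer_natural_vector(seq, k):
    """Counts, mean positions and normalized second central moments
    for all 4**k k-mers over the alphabet ACGT (hypothetical helper)."""
    seq = seq.upper()
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    positions = {kmer: [] for kmer in kmers}
    for i in range(len(seq) - k + 1):
        word = seq[i:i + k]
        if word in positions:  # windows with ambiguous bases (e.g. N) are skipped
            positions[word].append(i + 1)  # 1-based start position
    windows = len(seq) - k + 1  # number of k-mer windows in the sequence
    counts, means, moments = [], [], []
    for kmer in kmers:
        pos = positions[kmer]
        n = len(pos)
        mu = sum(pos) / n if n else 0.0
        d2 = sum((p - mu) ** 2 for p in pos) / (n * windows) if n else 0.0
        counts.append(float(n))
        means.append(mu)
        moments.append(d2)
    return counts + means + moments


def weighted_distance(seq_a, seq_b, max_k=7):
    """Assumed weighting scheme: sum over k of Euclidean distance / 2**k."""
    total = 0.0
    for k in range(1, max_k + 1):
        va = kmer_natural_vector(seq_a, k)
        vb = kmer_natural_vector(seq_b, k)
        total += math.dist(va, vb) / 2 ** k
    return total
```

A distance of this kind could then be passed to a nearest-neighbor classifier, for example by assigning each query sequence the sub-type label of its closest labeled sequences.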