Beijing Institute of Mathematical Sciences and Applications Beijing Institute of Mathematical Sciences and Applications

  • About
    • President
    • Governance
    • Partner Institutions
    • Visit
  • People
    • Management
    • Faculty
    • Postdocs
    • Visiting Scholars
    • Administration
    • Academic Support
  • Research
    • Research Groups
    • Courses
    • Seminars
  • Join Us
    • Faculty
    • Postdocs
    • Students
  • Events
    • Conferences
    • Workshops
    • Forum
  • Life @ BIMSA
    • Accommodation
    • Transportation
    • Facilities
    • Tour
  • News
    • News
    • Announcement
    • Downloads
About
President
Governance
Partner Institutions
Visit
People
Management
Faculty
Postdocs
Visiting Scholars
Administration
Academic Support
Research
Research Groups
Courses
Seminars
Join Us
Faculty
Postdocs
Students
Events
Conferences
Workshops
Forum
Life @ BIMSA
Accommodation
Transportation
Facilities
Tour
News
News
Announcement
Downloads
Qiuzhen College, Tsinghua University
Yau Mathematical Sciences Center, Tsinghua University (YMSC)
Tsinghua Sanya International  Mathematics Forum (TSIMF)
Shanghai Institute for Mathematics and  Interdisciplinary Sciences (SIMIS)
BIMSA > Seminar on Bioinformatics Asymmetric Natural Vector Method for Predicting Ambiguous Non-standard Base Codes and Research on gene regulatory relationships
Asymmetric Natural Vector Method for Predicting Ambiguous Non-standard Base Codes and Research on gene regulatory relationships
Organizer
Stephen S-T. Yau
Speaker
Guoqing Hu
Time
Monday, November 4, 2024 9:00 AM - 9:30 AM
Venue
Online
Abstract
In this report, we introduce a novel approach based on the Asymmetric Natural Vector (ANV) method to address the problem of ambiguity in DNA sequences. We propose using ANV to predict the bases represented by non-standard codes in DNA sequences. Our approach involves developing a deep learning framework to establish a correspondence between DNA sequences (in FASTA format) and natural vectors, which encode relevant sequence properties. By training on a large dataset, we learn the distribution of these ambiguous base codes within the dataset. This method allows us to accurately predict masked or ambiguous bases in genomic fragments. It is particularly applicable to datasets, such as the COVID-19 genome data, which contain numerous non-standard base codes like R, Y, S, W, K, M, B, D, H, and V. By employing our algorithm, we can effectively estimate the corresponding standard bases and assign confidence scores to each prediction, aiding in the resolution of sequencing uncertainties. In addition, we will introduce research on gene regulatory relationships. Our ultimate goal:Given a genome sequence (1) Determine whether it is a regulatory factor (2) If so, which genomes does it have regulatory relationships with (3) Is this regulatory relationship promotion or inhibition.
Speaker Intro
Researcher at Beijing Institute of Mathematical Sciences and Applications
Xi'an Jiaotong University, Bachelor's and Master's in Computational Mathematics.
UIC University, PhD in Computer Science, Research Direction: Nonlinear Filter Control, Supervisor: Stephen S.-T Yau 丘成栋
After graduation, he mainly worked in the field of wireless communications in the United States. He worked as a senior engineer in Lucent, Alcatel-Lucent, Nokia and other companies.
Joined Beijing Institute of Mathematical Sciences and Applications (BIMSA) in Jan. of 2024, currently engaged in research on neural networks, artificial intelligence, big data, machine learning and biomathematics.
Beijing Institute of Mathematical Sciences and Applications
CONTACT

No. 544, Hefangkou Village Huaibei Town, Huairou District Beijing 101408

北京市怀柔区 河防口村544号
北京雁栖湖应用数学研究院 101408

Tel. 010-60661855 Tel. 010-60661855
Email. administration@bimsa.cn

Copyright © Beijing Institute of Mathematical Sciences and Applications

京ICP备2022029550号-1

京公网安备11011602001060 京公网安备11011602001060