
Identification of Fungal Gene Sequence Contamination in Transcriptome Sequence Data of Endangered Molluscs using Bioinformatics

Abstract

The volume of sequence data is growing rapidly as advances in next-generation sequencing (NGS) technology enable the acquisition of large amounts of genome and transcriptome data, and the accuracy and speed of bioinformatic analysis of NGS data remain of great importance. However, sequence databases for mollusks fall short of those for other groups of organisms, and annotation results from BLAST analysis may therefore be inaccurate and unreliable because of potential contamination of mollusk sequence databases with fungal sequences. In this context, we constructed a BLAST database from the unigene sequences of 20 mollusk species and the sequences of 32 fungal species derived from previous studies. To confirm contamination by fungal gene sequences in the unigenes of the 20 endangered species, we performed bioinformatic analysis using BLAST. The analysis revealed that the NGS sequences of mollusks are mixed with fungal sequences. Taken together, our results suggest that it is essential to reconfirm mollusk sequence information before publication.
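
The screening step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' pipeline: it assumes that BLAST has already been run with the standard 12-column tabular output (`-outfmt 6`) against a combined mollusk and fungal database, and that fungal subject IDs carry a hypothetical `fungi|` prefix. The prefix, the sample hits, and the E-value and identity cutoffs are all illustrative assumptions, not values reported in the study.

```python
import csv
import io

# Default column order of BLAST+ tabular output (-outfmt 6).
BLAST6_COLUMNS = [
    "qseqid", "sseqid", "pident", "length", "mismatch", "gapopen",
    "qstart", "qend", "sstart", "send", "evalue", "bitscore",
]


def best_hits(blast6_text):
    """Return {query_id: row}, keeping the highest-bitscore hit per query."""
    best = {}
    reader = csv.reader(io.StringIO(blast6_text), delimiter="\t")
    for fields in reader:
        # Skip blank lines, comment lines, and malformed rows.
        if len(fields) < len(BLAST6_COLUMNS) or fields[0].startswith("#"):
            continue
        row = dict(zip(BLAST6_COLUMNS, fields))
        for key in ("pident", "evalue", "bitscore"):
            row[key] = float(row[key])
        current = best.get(row["qseqid"])
        if current is None or row["bitscore"] > current["bitscore"]:
            best[row["qseqid"]] = row
    return best


def flag_fungal(blast6_text, fungal_prefix="fungi|",
                max_evalue=1e-5, min_pident=90.0):
    """List unigenes whose best hit is a fungal subject passing the cutoffs.

    The prefix and both cutoffs are assumed values chosen for illustration.
    """
    return sorted(
        query for query, row in best_hits(blast6_text).items()
        if row["sseqid"].startswith(fungal_prefix)
        and row["evalue"] <= max_evalue
        and row["pident"] >= min_pident
    )


# Made-up hits for illustration only; these are not data from the study.
sample = "\n".join([
    "unigene1\tfungi|sp01_gene7\t98.5\t300\t4\t0\t1\t300\t10\t309\t1e-150\t540",
    "unigene1\tmollusk|sp01_u9\t85.0\t200\t30\t0\t1\t200\t1\t200\t1e-40\t180",
    "unigene2\tmollusk|sp02_u3\t99.0\t400\t4\t0\t1\t400\t1\t400\t0.0\t730",
    "unigene3\tfungi|sp02_gene1\t72.0\t120\t33\t1\t5\t124\t1\t120\t1e-8\t60",
])

print(flag_fungal(sample))
```

In this toy input only `unigene1` is flagged, because its strongest hit is fungal and passes both cutoffs; `unigene3` also hits a fungal subject best, but at 72% identity it falls below the assumed threshold. A flagged unigene is a candidate for manual re-inspection, not proof of contamination.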

Keywords
NGS, Mollusks, Fungal sequences
