CARD注释结果文件说明

结果简介

微生物抗生素耐药性问题,是在人类为抑制病原微生物生长繁殖,而高频次、大剂量使用抗生素的背景下凸显出来的。微生物通过自身基因突变,或者在环境中由于基因水平转移,得到的这些(突变的)基因,使药物作用的靶位发生变异或药物不能正常地发挥作用,从而获得对特定抗生素的抗性。CARD(Comprehensive Antibiotic Resistance Database): 综合性抗生素抗性数据库,是目前使用最广泛的抗性基因数据库之一,目前包括约4000个抗性基因分类。

我们使用card_rgi分析工具,以CARD数据库作为参考数据库,对组装结果assembly.fasta文件,做细菌的抗生素抗性基因注释,找出基因组上的抗性基因,生成注释结果表格文件*.CARD.txt,推荐使用Excel或WPS软件来查看。

目录结构

CARD/ └── *.CARD.txt

格式说明

制表符分割的文本文档,使用 excel 打开。

文件内容举例如下:

image-20230411160158568

文件内容说明如下:

列数列标题说明
1ORF_IDOpen Reading Frame identifier (internal to RGI)
2ContigSource Sequence
3StartStart co-ordinate of ORF
4StopEnd co-ordinate of ORF
5OrientationStrand of ORF
6Cut_OffRGI Detection Paradigm (Perfect, Strict, Loose)
7Pass_BitscoreStrict detection model bitscore cut-off
8Best_Hit_BitscoreBitscore value of match to top hit in CARD
9Best_Hit_AROARO term of top hit in CARD
10Best_IdentitiesPercent identity of match to top hit in CARD
11AROARO accession of match to top hit in CARD
12Model_typeCARD detection model type
13SNPs_in_Best_Hit_AROMutations observed in the ARO term of top hit in CARD (if applicable)
14Other_SNPsMutations observed in ARO terms of other hits indicated by model id (if applicable)
15Drug ClassARO Categorization
16Resistance MechanismARO Categorization
17AMR Gene FamilyARO Categorization
18Predicted_DNAORF predicted nucleotide sequence
19Predicted_ProteinORF predicted protein sequence
20CARD_Protein_SequenceProtein sequence of top hit in CARD
21Percentage Length of Reference Sequence(length of ORF protein / length of CARD reference protein)
22IDHSP identifier (internal to RGI)
23Model_idCARD detection model id
24NudgedTRUE = Hit nudged from Loose to Strict
25NoteReason for nudge or other notes

重要的信息包含在:第9~11列,第15~17列以及第21列。

目录