VDJ Dominant Contigs AIRR


File: [sample_name]_VDJ_Dominant_Contigs_AIRR.tsv

Contains putative cells only, with the dominant contig for each cell ID-chain combination. DBEC adjustment is applied. The file complies with the AIRR rearrangement schema and includes all mandatory columns plus additional informational columns.

Data columns: cell identifiers, read and molecule counts, full trimmed contig nucleotide and amino acid sequences, framework and CDR region nucleotide and amino acid sequences, V, D, J, and C gene segments, and full-length and productive status.

Refer to docs.airr-community.org/en/stable/datarep/rearrangements.html

This file is output only when the experiment includes an appropriate TCR/BCR assay and the VDJ_Version option is selected.
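As a rough illustration of working with this file, the sketch below reads an AIRR-style TSV with Python's standard csv module and keeps rows that are both productive and from a high-quality cell. The helper names (load_airr_rows, is_true, productive_high_quality) are hypothetical, the demo rows are made up, and the code assumes booleans are written as True/False (it also tolerates T/F).

```python
import csv
import io

# Assumption: boolean columns are serialized as True/False (T/F also accepted).
TRUE_VALUES = {"true", "t"}


def load_airr_rows(handle):
    """Read a tab-separated AIRR-style table into a list of dicts."""
    return list(csv.DictReader(handle, delimiter="\t"))


def is_true(value):
    """Interpret a boolean cell; missing or empty values count as False."""
    return (value or "").strip().lower() in TRUE_VALUES


def productive_high_quality(rows):
    """Keep rows whose contig is productive and whose cell is high quality."""
    return [
        row for row in rows
        if is_true(row.get("productive")) and is_true(row.get("high_quality_cell"))
    ]


# Made-up demo data standing in for [sample_name]_VDJ_Dominant_Contigs_AIRR.tsv,
# reduced to a handful of the documented columns.
demo_tsv = (
    "cell_id\tlocus\tproductive\thigh_quality_cell\tumi_count\n"
    "101\tTRA\tTrue\tTrue\t12\n"
    "101\tTRB\tFalse\tTrue\t3\n"
    "202\tIGH\tTrue\tFalse\t7\n"
)

rows = load_airr_rows(io.StringIO(demo_tsv))
kept = productive_high_quality(rows)
```

For a real file, the io.StringIO object would be replaced by an open file handle, for example open(path, newline="").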

Metric | Definition | Major contributing factors
cell_id | Unique cell ID for the cell represented by this row. Cell index will match between VDJ data and gene/AbSeq expression data tables | Sequencing quality, Library quality
cell_type_experimental | Inferred cell type. Cell type is inferred either from mRNA targeted panel expression data or from relative counts of BCR vs TCR | Sample type, mRNA panel
high_quality_cell | True/False: this cell was designated as high quality, having a B or T type, a productive contig, and sufficient VDJ molecules | Sample quality, Library quality
locus | Type of VDJ sequence: one of TRA, TRB, TRG, TRD, IGH, IGK, and IGL | Cell viability, Library quality
sequence_id | Unique ID for contig formatted as [cell_id]_[locus]_[number] | Sequencing quality, Library quality
consensus_count | Number of reads for this contig | Cell viability, Library quality
umi_count | Number of unique molecules (UMI) for this contig. Previously called "duplicate_count" in an earlier AIRR standard | Cell viability, Library quality
sequence | Assembled nucleotide sequence of contig after trimming | Library quality, VDJ recombination
sequence_length | Length of full contig nucleotide sequence after trimming | Library quality, VDJ recombination
sequence_aa | Amino acid sequence of contig after trimming | Library quality, VDJ recombination
sequence_aa_length | Length of full contig amino acid sequence after trimming | Library quality, VDJ recombination
sequence_alignment | Nucleotide sequence corresponding to VDJ coding region after trimming | VDJ recombination
sequence_alignment_length | Length of nucleotide sequence corresponding to VDJ coding region after trimming | VDJ recombination
sequence_alignment_aa | Amino acid sequence corresponding to VDJ coding region after trimming | VDJ recombination
sequence_alignment_aa_length | Length of amino acid sequence corresponding to VDJ coding region after trimming | VDJ recombination
junction | Junction region nucleotide sequence, where the junction is defined as the CDR3 plus the two flanking conserved codons | VDJ recombination
junction_aa | Amino acid translation of the junction | VDJ recombination
productive | True/False: there are no stop codons in the protein-coding portion of the sequence | VDJ recombination
rev_comp | True/False: the alignment is on the opposite strand (reverse complemented) with respect to the contig sequence. This field is always False for contig sequences from the BD Rhapsody VDJ library | Sequencing quality, Library quality
complete_VDJ | True/False: this cell-chain combination contains some amino acid sequence for each framework (FR1-FR4) region and each CDR (1-3) region | Library quality, VDJ recombination
v_call | V gene segment identified for this contig | VDJ recombination
v_support | Quality of V gene alignment; lower is better | Sequencing quality, Library quality
v_cigar | CIGAR string for the V gene alignment | VDJ recombination
v_sequence_start | Start position of the V gene in the contig sequence (1-based closed interval) | VDJ recombination
v_sequence_end | End position of the V gene in the contig sequence (1-based closed interval) | VDJ recombination
d_call | First or only D gene segment identified for this contig | VDJ recombination
d_support | Quality of D gene alignment; lower is better | Sequencing quality, Library quality
d_cigar | CIGAR string for the D gene alignment | VDJ recombination
d_sequence_start | Start position of the D gene in the contig sequence (1-based closed interval) | VDJ recombination
d_sequence_end | End position of the D gene in the contig sequence (1-based closed interval) | VDJ recombination
j_call | J gene segment identified for this contig | VDJ recombination
j_support | Quality of J gene alignment; lower is better | Sequencing quality, Library quality
j_cigar | CIGAR string for the J gene alignment | VDJ recombination
j_sequence_start | Start position of the J gene in the contig sequence (1-based closed interval) | VDJ recombination
j_sequence_end | End position of the J gene in the contig sequence (1-based closed interval) | VDJ recombination
c_call | C gene segment identified for this contig | VDJ recombination
fwr1 | Nucleotide sequence of the FR1 for the contig | VDJ recombination
fwr1_aa | Amino acid sequence of the FR1 for the contig | VDJ recombination
fwr2 | Nucleotide sequence of the FR2 for the contig | VDJ recombination
fwr2_aa | Amino acid sequence of the FR2 for the contig | VDJ recombination
fwr3 | Nucleotide sequence of the FR3 for the contig | VDJ recombination
fwr3_aa | Amino acid sequence of the FR3 for the contig | VDJ recombination
fwr4 | Nucleotide sequence of the FR4 for the contig | VDJ recombination
fwr4_aa | Amino acid sequence of the FR4 for the contig | VDJ recombination
cdr1 | Nucleotide sequence of the CDR1 for the contig | VDJ recombination
cdr1_aa | Amino acid sequence of the CDR1 for the contig | VDJ recombination
cdr2 | Nucleotide sequence of the CDR2 for the contig | VDJ recombination
cdr2_aa | Amino acid sequence of the CDR2 for the contig | VDJ recombination
cdr3 | Nucleotide sequence of the CDR3 for the contig | VDJ recombination
cdr3_aa | Amino acid sequence of the CDR3 for the contig | VDJ recombination
germline_alignment | Assembled, aligned, full-length inferred germline sequence spanning the same region as the sequence_alignment field | VDJ recombination
germline_alignment_aa | Amino acid translation of the assembled germline sequence | VDJ recombination
v_germline_alignment | Aligned V gene germline sequence spanning the same region as the v_sequence_alignment field and including the same set of corrections and spacers (if any) | VDJ recombination
v_germline_alignment_aa | Amino acid translation of the v_germline_alignment field | VDJ recombination
d_germline_alignment | Aligned D gene germline sequence spanning the same region as the d_sequence_alignment field and including the same set of corrections and spacers (if any) | VDJ recombination
d_germline_alignment_aa | Amino acid translation of the d_germline_alignment field | VDJ recombination
j_germline_alignment | Aligned J gene germline sequence spanning the same region as the j_sequence_alignment field and including the same set of corrections and spacers (if any) | VDJ recombination
j_germline_alignment_aa | Amino acid translation of the j_germline_alignment field | VDJ recombination
v_germline_start | Alignment start position in the V gene reference sequence (1-based closed interval) | VDJ recombination
v_germline_end | Alignment end position in the V gene reference sequence (1-based closed interval) | VDJ recombination
d_germline_start | Alignment start position in the D gene reference sequence for the first or only D gene (1-based closed interval) | VDJ recombination
d_germline_end | Alignment end position in the D gene reference sequence for the first or only D gene (1-based closed interval) | VDJ recombination
j_germline_start | Alignment start position in the J gene reference sequence (1-based closed interval) | VDJ recombination
j_germline_end | Alignment end position in the J gene reference sequence (1-based closed interval) | VDJ recombination
np1_length | Nucleotide sequence length of the combined N/P region between the V gene and first D gene alignment or between the V gene and J gene alignments | VDJ recombination
np2_length | Nucleotide sequence length of the combined N/P region between either the first D gene and J gene alignments or the first D gene and second D gene alignments | VDJ recombination
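
Three conventions from the definitions above lend themselves to a short worked example: the [cell_id]_[locus]_[number] layout of sequence_id, the 1-based closed intervals used by the *_sequence_start and *_sequence_end fields, and the junction being the CDR3 plus one conserved codon on each side. The following is a minimal sketch with hypothetical helper names and made-up sequences, not pipeline code.

```python
def parse_sequence_id(sequence_id):
    """Split '[cell_id]_[locus]_[number]' into its three parts (as strings).

    Splitting from the right keeps any underscores inside cell_id intact.
    """
    cell_id, locus, number = sequence_id.rsplit("_", 2)
    return cell_id, locus, number


def segment(sequence, start, end):
    """Extract a 1-based closed interval [start, end] from a sequence.

    Python slices are 0-based and half-open, so start shifts down by one
    and end is used unchanged.
    """
    return sequence[start - 1:end]


def cdr3_from_junction(junction):
    """Remove the two flanking conserved codons (3 nt each) from a junction."""
    return junction[3:-3]


def cdr3_aa_from_junction_aa(junction_aa):
    """Remove the two flanking conserved residues from a junction translation."""
    return junction_aa[1:-1]


# Made-up values for illustration only.
print(parse_sequence_id("12345_TRB_1"))     # ('12345', 'TRB', '1')
print(segment("AAACCCGGGTTT", 4, 6))        # CCC
print(cdr3_from_junction("TGTGCCAGCTTC"))   # GCCAGC
print(cdr3_aa_from_junction_aa("CASF"))     # AS
```

With real data, segment(row["sequence"], int(row["v_sequence_start"]), int(row["v_sequence_end"])) would return the portion of the contig aligned to the V gene, assuming both coordinates are present for that row.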

VDJ Unfiltered Contigs AIRR


File: [sample_name]_VDJ_Unfiltered_Contigs_AIRR.tsv

Contains all cell IDs and all assembled contigs that were successfully annotated.

The file complies with the AIRR rearrangement schema and includes all mandatory columns plus additional informational columns.

Data columns: cell identifiers, read and molecule counts, full trimmed contig nucleotide and amino acid sequences, framework and CDR region nucleotide and amino acid sequences, V, D, J, and C gene segments, and full-length and productive status.

Refer to docs.airr-community.org/en/stable/datarep/rearrangements.html

Columns shared with the VDJ_Dominant_Contigs_AIRR.tsv file have identical definitions. The columns unique to this file are listed below.

Metric | Definition | Major contributing factors
Dominant | True/False: this contig was selected as the dominant contig for this cell-chain combination. | Library quality, VDJ recombination
Putative_Cell | True/False: this cell index was selected as a putative cell based on the mRNA panel. | Cell viability, mRNA panel
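
The two flag columns make it possible to narrow the unfiltered table to dominant contigs from putative cells. The sketch below shows one way to do that with the standard csv module. The function names and demo rows are hypothetical, and the exact header spelling of the putative-cell column (written here as Putative_Cell) is an assumption to check against the real file. The result is not guaranteed to match the Dominant Contigs file exactly, since that file is described as DBEC-adjusted.

```python
import csv
import io


def is_true(value):
    """Interpret a True/False cell; missing or empty values count as False."""
    return (value or "").strip().lower() in {"true", "t"}


def dominant_putative(rows, putative_column="Putative_Cell"):
    """Keep contigs flagged as dominant that belong to putative cells.

    putative_column is an assumed header name; pass the spelling used in
    the actual file if it differs.
    """
    return [
        row for row in rows
        if is_true(row.get("Dominant")) and is_true(row.get(putative_column))
    ]


# Made-up demo data standing in for [sample_name]_VDJ_Unfiltered_Contigs_AIRR.tsv.
demo_tsv = (
    "sequence_id\tlocus\tDominant\tPutative_Cell\n"
    "101_TRA_1\tTRA\tTrue\tTrue\n"
    "101_TRA_2\tTRA\tFalse\tTrue\n"
    "999_IGH_1\tIGH\tTrue\tFalse\n"
)

rows = list(csv.DictReader(io.StringIO(demo_tsv), delimiter="\t"))
selected = dominant_putative(rows)
selected_ids = [row["sequence_id"] for row in selected]
```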