RS_Long_Amplicon_Analysis Protocol

Use this protocol to determine phased consensus sequences for pooled amplicon data.

 

The protocol includes four main steps:

  1. Coarse clustering: Group reads from different amplicons into different clusters; detect read-read overlaps; build an overlap graph, then cluster the overlap graph to break the graph into the final clusters.

  2. Phasing: Load the reads for each cluster into the Quiver consensus software and find an initial consensus. Recursively split reads from different haplotypes or other PCR products based on high scoring mutations proposed by Quiver.

  3. Consensus:  Generate a final consensus for each haplotype or PCR product using Quiver.

  4. Post-Processing Filters: Detect and remove PCR artifacts. Chimeric sequences are identified using the UCHIME algorithm, and other PCR artifacts are identified by overall consensus quality.

 

Barcode Parameters (Barcode Module v1)

 

Amplicon Parameters (LongAmpliconAnalysis v1)