The present invention provides compositions, methods, and kits for inserting a plurality of synthetic transposons each comprising a different
nucleic acid sequence (i.e., molecular
barcode) in a target
nucleic acid of interest to allow extraction of
contiguity information in the target
nucleic acid. The molecular barcodes are also useful for reducing amplification or sequencing bias and errors, and for guiding accurate
sequence assembly of the target nucleic acid from sequencing reads. The compositions, methods, and kits described herein have many applications, including haplotyping,
genome assembly, sequencing of repetitive regions, detection of structural variations and copy number variations, chromosomal conformation analysis, and
methylation analysis.