Non-supervision classification method for metagenome contigs
A classification method and metagenomics technology, which can be applied in special data processing applications, instruments, electronic digital data processing, etc.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0041] The steps of the present invention are:
[0042] ① Acquisition of contig data; the present invention is applicable to all metagenomic contig data sets, and various metagenomic data can be downloaded from network public databases. For example, the metagenomic data of the human gut can be downloaded from http: / / gutmeta.genomics.org.cn / .
[0043] ②Establishment of eigenvectors;
[0044] (1) The present invention uses the k-mer frequency of the DNA sequence as the classification feature of the contig. The k-mer frequency refers to the frequency of occurrence of a subsequence of k length in the contig sequence. In the present invention, the value of k is 4. Since DNA is composed of four nucleotides, A (adenine), T (thymine), G (guanine), and C (cytosine), the dimension of 4-mer frequency is 256 dimensions.
[0045] (2) Normalize the eigenvector calculated in step (1), by dividing each element in the eigenvector by the maximum value of the element in the eigenvector, namely...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More - R&D
- Intellectual Property
- Life Sciences
- Materials
- Tech Scout
- Unparalleled Data Quality
- Higher Quality Content
- 60% Fewer Hallucinations
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2025 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com



