A system for analysis of forensic DNA sequencing data and applications thereof

CN122201589APending Publication Date: 2026-06-12BGI FORENSIC TECH (SHENZHEN) CO LTD

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Applications(China)
Current Assignee / Owner
BGI FORENSIC TECH (SHENZHEN) CO LTD
Filing Date
2024-12-11
Publication Date
2026-06-12

AI Technical Summary

Technical Problem

Existing DNA sequencing data analysis systems cannot meet the needs of forensic identification in terms of top-level architecture, user interaction, data processing capabilities, and identification technology applications. They also suffer from problems such as limited analyzable genetic markers, lack of support for domestic database formats, and limited functionality of identification modules.

Method used

A forensic DNA sequencing data analysis system was designed, including a sample center module, a task center module, an analysis center module, a report center module, and an identification center module. It supports multiple genetic marker analyses, automatically analyzes and generates forensic identification results, is compatible with domestic database formats, and provides multi-functional identification capabilities.

🎯Benefits of technology

It automates forensic genetic marker analysis, saving manpower and time costs, supports multifunctional forensic identification, is compatible with domestic database formats, and shortens the development cycle.

✦ Generated by Eureka AI based on patent content.

Smart Images

  • Figure CN122201589A_ABST
    Figure CN122201589A_ABST
Patent Text Reader

Abstract

The application discloses a forensic DNA sequencing data analysis system and application thereof. The analysis system comprises a sample center module, a task center module, an analysis center module, a report center module and an identification center module; the sample center module is used for inputting and storing samples to be analyzed and sample information; the task center module is used for creating an analysis task, inputting experimental information, monitoring a sequencing process and checking sequencing data, and sending a start signal to the analysis center module; the analysis center module is used for starting a corresponding bioinformatics analysis process according to the start signal, and is used for checking analysis state information of the analysis task and sample quality control results; the report center module is used for generating a report and allowing manual auditing of the report; and the identification center module is used for forensic identification and generating an identification result. The analysis system can automatically analyze information, save labor and time costs, support common forensic DNA database format export and support expandable forensic identification functions.
Need to check novelty before this filing date? Find Prior Art

Description

Technical Field

[0001] This application relates to the field of forensic DNA sequencing data analysis technology, and in particular to an analysis system for forensic DNA sequencing data and its application. Background Technology

[0002] With the continuous development of society, public security work, especially forensic identification, increasingly relies on DNA testing technology, which places higher demands on next-generation sequencing technology and DNA sequencing data analysis. Current DNA sequencing data analysis systems are primarily based on mainframe analysis workflows, which no longer meet the needs in many aspects, including top-level architecture, user interaction, data processing capabilities, application of identification techniques, and display of analytical results. Mainframe-based analysis systems are constrained by data center requirements, resulting in high maintenance costs; furthermore, they require specialized personnel to build bioinformatics analysis workflows and manage the analysis process according to user needs. These factors significantly hinder the commercial adoption of next-generation sequencing technology.

[0003] To promote the use of next-generation sequencing technology among non-professionals, there are also well-established integrated solutions on the market. These solutions utilize high-performance servers to replace large mainframes that require significant professional management and integrate established bioinformatics analysis workflows, allowing users to easily customize their data analysis. For example, Thermo Fisher Scientific's Converge server and software. Converge software aims to improve the efficiency of forensic and kinship DNA testing laboratories by centralizing data creation, analysis, and storage at a single access point. It includes a built-in case management application to support case, topic, genotyping analysis, and laboratory data management, allowing for configurable user interfaces and report output.

[0004] These integrated solutions provide data from next-generation sequencing to genotyping and customized identification reports, but their application is often limited to self-produced reagent kits, and the identification module has limited functionality. In summary, existing integrated solutions have the following drawbacks and shortcomings:

[0005] 1) The available genetic markers for analysis are limited and do not support analysis of genetic markers other than mitochondrial DNA (mtDNA), single nucleotide polymorphisms (SNPs), and short tandem repeats (STRs).

[0006] 2) Insufficient localization; does not support existing domestic second-generation DNA database format interfaces.

[0007] 3) The identification module has limited functionality, only providing individual identification applications including kinship and parentage identification modules.

[0008] Therefore, developing new integrated solutions for DNA sequencing data analysis is a pressing problem that needs to be solved in the field of forensic identification technology. Summary of the Invention

[0009] The purpose of this application is to provide a new system for analyzing forensic DNA sequencing data and its applications.

[0010] To achieve the above objectives, this application adopts the following technical solution:

[0011] One aspect of this application discloses a forensic DNA sequencing data analysis system, including a sample center module, a task center module, an analysis center module, a report center module, and an identification center module. The sample center module includes functions for inputting and storing samples to be analyzed and their information for other modules to read or access. The task center module includes functions for creating analysis tasks, inputting experimental information, monitoring the sequencing process, and verifying sequencing data. Creating an analysis task includes creating a task number, selecting the required bioinformatics analysis process, and sending a signal to the analysis center module to start the selected bioinformatics analysis process. The analysis center module includes functions for starting the corresponding bioinformatics analysis process according to the start signal sent by the task center module, and for viewing the analysis status information and sample quality control results of the analysis task. The report center module includes functions for reading or accessing the results of the analysis task, generating sample reports for preview or download, and allowing manual review of the sample report content. The identification center module includes functions for performing forensic identification based on the analysis task results or sample reports, generating forensic identification results for preview or download, and allowing manual review of the forensic identification results content.

[0012] Sample information, such as sample ID, is used to associate with other modules so that other modules can access the corresponding sample information. Task ID is also used to associate with other modules so that other modules can access the corresponding task and experimental information; for example, the analysis center module views analysis tasks by associating with the task center module through the task ID. In one implementation of this application, the task center module and the sample center module are associated through sample information; the analysis center module is associated with the task center module through the task ID and also with the sample center module through sample information; the report center module is associated with the sample center module through sample information and with both the task center module and the analysis center module through the task ID; the identification center module is associated with the sample center module through sample information and with the task center module, analysis center module, and report center module through the task ID. In one implementation of this application, the bioinformatics analysis workflow (i.e., bioinformatics analysis process) is indicated by the reagent kit name; therefore, selecting the corresponding bioinformatics analysis workflow simply means selecting the corresponding reagent kit name.

[0013] It should be noted that the forensic DNA sequencing data analysis system of this application can automate information analysis and complete forensic genetic marker analysis without human intervention, saving manpower and time costs. In addition, it supports exporting common forensic DNA database formats, reducing the workload in format conversion and data delivery. Furthermore, the forensic DNA sequencing data analysis system of this application also supports expandable forensic identification functions, and the unified access mode shortens the development cycle in applications such as individual identification, kinship calculation, and species identification.

[0014] It should also be noted that the bioinformatics analysis procedures that this application can perform include genetic marker analyses commonly used in the forensic field, such as mitochondrial DNA analysis, single nucleotide polymorphism analysis, short tandem repeat sequence analysis, microhaplotype analysis, mitochondrial COI gene analysis, and mitochondrial cytochrome B gene analysis. The analysis system of this application supports multifunctional forensic identification, including di / triplast paternity testing, mitochondrial locus difference analysis, individual identification likelihood calculation, complex kinship calculation, pedigree analysis, mixed typing splitting, and ancestry analysis.

[0015] In one implementation of this application, the sample information in the sample center module includes sample number, submitter, commissioning unit, acceptance date, inspection start date, inspection location, and sample information.

[0016] In one implementation of this application, the sample center module further includes a method for classifying and managing sample information to facilitate information retrieval and extraction.

[0017] In one implementation of this application, the information of the tested sample includes the name, gender, age, date of birth, and identification document number of the person from whom the tested sample originates.

[0018] In one implementation of this application, the method for entering the sample to be analyzed and its sample information is manual input and / or table import.

[0019] It should be noted that the analysis system of this application can manually input the sample to be analyzed and its sample information, or it can automatically import or read it; for example, in one implementation of this application, the analysis system of this application interfaces with the sequencer to support automatic data transmission after sequencing, that is, directly obtain the sample to be analyzed and its sample information from the sequencer, and support subsequent sequencing progress monitoring and other functions.

[0020] In one implementation of this application, the experimental information of the task center module includes at least one of the following:

[0021] a. Sequencing instrument ID, which is the unique serial number of the sequencing instrument, is used to indicate the storage location of the data captured by the system;

[0022] b. Chip ID, i.e., the unique serial number of the sequencing chip, is used to indicate the naming method of the data;

[0023] c. Lane ID, which is the channel number of the sequencing chip, is used to indicate the naming method of the data;

[0024] d.DNB ID, which records the experimental batch;

[0025] e. Whether to use double barcodes, i.e., setting the barcode splitting method;

[0026] f.barcode ID1, which is the barcode number added to the sample during the experiment;

[0027] g. barcode ID2, which is the number of the other barcode added to the sample during the experiment; if the barcode rule is a single barcode, it is not necessary to fill in this field; if the barcode rule is a double barcode, it is required.

[0028] h. Reagent kit name, used to indicate the bioinformatics analysis workflow to be invoked.

[0029] In one implementation of this application, the analysis status information of the analysis center module includes the total number of samples, analysis progress, task start time, and experimental information entered by the task center module.

[0030] In one implementation of this application, the analysis center module further includes the ability to control the pause, stop, and re-analysis of the analysis task by clicking the corresponding button, and to enter the information quality control page by clicking the analysis task number, and to view the sample quality control results after the analysis task is completed.

[0031] In one implementation of this application, the information quality control page has a built-in data export button, which is used to export DNA database format files according to the usage situation.

[0032] In one implementation of this application, the sample quality control results include sample number, number of libraries, sequencing method, sequencing read length, number of detected common SNP sites, number of detected common STR sites, number of detected X chromosome STR sites, number of detected Y chromosome STR sites, number of detected mitochondrial sites, number of detected microhaplotypes, average insert length and shortest insert length of mitochondrial COI gene, average insert length and shortest insert length of mitochondrial cytochrome B gene, number of serialized data sequences, GC content, Q30 index, overall alignment rate, low-quality alignment rate, average quality value, insert sequence length, average sequencing depth of the target region, and target region coverage.

[0033] In one implementation of this application, the report center module further includes a method for visually displaying sample reports using statistical charts.

[0034] In one implementation of this application, the identification center module also includes a sub-module for distinguishing different identification functions through mini-program registration, which involves the script path, parameter settings, and runtime environment of the identification mini-program and is configured following the same logic.

[0035] In one implementation of this application, the configuration information of the identification center module's identification applet includes:

[0036] a. Mini Program name, used for name management;

[0037] b. Mini Program version, used for version management;

[0038] c. Basic parameters, i.e., the list of parameters required for the operation;

[0039] d. Program path, i.e., the specific path where the script is uploaded;

[0040] e. The corresponding process, which is the name of the upstream analysis process that can be input;

[0041] f. Process version, i.e., the version of the upstream analysis process that can be input;

[0042] g. Number of parameters, i.e., the number of configurable parameters;

[0043] h. Estimated memory usage, i.e., the server memory expected to be used by a single task;

[0044] i. Estimated number of threads, i.e., the number of server threads expected to be used per task;

[0045] j. Output file, i.e., the directory for outputting the identification results;

[0046] k. Is it necessary to upload the mini-program, i.e., via script upload?

[0047] l. Priority, which determines the running priority when multiple identification tasks are initiated simultaneously.

[0048] In one implementation of this application, the forensic DNA sequencing data analysis system further includes a database for storing or temporarily storing the sample to be analyzed, sample information, analysis tasks, analysis status information, sample quality control results, sample reports, and forensic identification results.

[0049] Another aspect of this application discloses the application of the forensic DNA sequencing data analysis system in animal species identification, plant species identification, or microbial species identification.

[0050] It should be noted that the key feature of the forensic DNA sequencing data analysis system in this application lies in its ability to perform individual identification, kinship calculation, and species identification; therefore, it can not only meet the needs of various functions of forensic identification, but also be used for species identification of animals, plants, or microorganisms. Among these, animal species identification includes human-derived identification, as well as species identification of other animals.

[0051] The beneficial effects of this application are as follows:

[0052] The forensic DNA sequencing data analysis system of this application can perform automated information analysis efficiently, saving labor and time costs; it supports exporting common forensic DNA database formats, facilitating data delivery; and it supports expandable forensic identification functions, better meeting the needs of multifunctional forensic identification. Attached Figure Description

[0053] Figure 1 This is a schematic diagram of the structure of the forensic DNA sequencing data analysis system in the embodiments of this application. Detailed Implementation

[0054] To address the shortcomings and deficiencies of existing integrated solutions for forensic DNA sequencing data, this application develops a novel forensic DNA sequencing data analysis system. This system provides a standard access method that is widely adaptable to various genetic marker analysis workflows, allowing bioinformatics workflows with varying computer resource usage to run synchronously. It is compatible with common domestic forensic DNA database format interfaces, supporting the export of Common Message Format (CMF) and Common DNA Database Exchange Format (DDEM). Furthermore, it offers an expandable identification function module design, adapting not only to conventional kinship and parentage identification modules but also to non-human identification functions such as animal and plant species identification.

[0055] This application presents an automated analysis system for large-scale parallel sequencing data, developed based on a standard bioinformatics analysis workflow architecture. It includes various bioinformatics analysis workflows for commonly used genetic markers in forensic medicine, such as mitochondrial DNA (mtDNA), single nucleotide polymorphisms (SNPs), and short tandem repeats (STRs), as well as micro-haplotypes (MHs), mitochondrial COI genes, and mitochondrial cytochrome B genes (CYTB) for specific applications. The system interfaces with the sequencer, supporting sequencing progress monitoring and automatic data transfer after sequencing. It can automatically achieve a one-stop output from raw sequencing data (FASTQ) to genotyping results and identification reports. The forensic DNA sequencing data analysis system of this application, such as... Figure 1 As shown, it includes a sample center module 11, a task center module 12, an analysis center module 13, a report center module 14, and an identification center module 15.

[0056] 1) Sample Center Module: Used for sample information entry and sample information management.

[0057] The system supports both manual input and table import to manage information such as sample number, submitter, commissioning unit, acceptance date, test start date, test location, examinee's name, gender, age, date of birth, and ID number. This information is recorded in the system's backend database after being bound, and can be retrieved by reporting centers and testing centers when issuing reports.

[0058] Meanwhile, the built-in classification management function of the sample center can classify and manage samples by project number, project name and remarks. Samples can be associated with projects at any stage of analysis and can be retrieved by the associated project information, making it easy for users to extract historical data from massive samples.

[0059] 2) Task Center Module: Used to create analysis tasks, enter experimental information, monitor the sequencing process, and verify sequencing data.

[0060] The task center and sample center are linked by unique sample numbers. Both manual input and table import are supported. When a message is sent, the system captures data and starts the analysis task. The following information needs to be imported:

[0061] a. Sequencing Instrument ID: A unique serial number for the sequencing instrument, indicating the location where the system stores the captured data;

[0062] b. Chip ID: A unique serial number for the sequencing chip, indicating the naming method for the data;

[0063] c. Lane ID: The channel number of the sequencing chip (L01 / L02 / L03 / L04), indicating the naming method of the data;

[0064] d.DNB ID: Records the experimental batch;

[0065] e. Whether to use double barcodes: Sets the barcode splitting method;

[0066] f.barcode ID1: During the experiment, the sample is given the corresponding barcode number;

[0067] barcode ID2: If the barcode rule is a single barcode, this field is not required; if the barcode rule is a double barcode, this field is required.

[0068] h. Reagent kit name: Indicates the bioinformatics analysis workflow to be invoked.

[0069] This system binds the reagent kit to the bioinformatics analysis workflow through a configuration file. The configuration file needs to specify the startup file path of the bioinformatics analysis script to be run, the directory name involved in the operation, and the path of the index file to be called during the analysis process, as shown in Table 1.

[0070] Table 1 Configuration File Requirements Items

[0071] Field Name (Example) Field Explanation RootDir The script list output directory is usually the root directory of the process. WorkDir Intermediate file output directory ResultDir Output directory for results files libpath Process Dependency Module Directory exportpath Process depends on software environment variable settings Analysis Content In principle, the parameter settings for the process steps do not need to be changed. key Whether to generate a script based on field settings rank Script generation order reference Reference Genome Pathway annotation Comment file path bedFile Regional file path

[0072] Bioinformatics analysis workflows break down analysis scripts into multiple steps using AnalysisContent. The order in which these scripts execute must be explicitly specified; later scripts will automatically initiate new tasks after the previous script completes. This section also needs to define the memory and thread usage required for each script's execution. An example is shown below:

[0073] "AnalysisContent":{

[0074] "step1":{

[0075] "name":"step1",

[0076] "type":"example",

[0077] "priorStep":["step0"],

[0078] "nextStep":["step2"],

[0079] "mem":2,

[0080] "thread":2,

[0081] "key":["SNP","STR","MT"],

[0082] "rank":1

[0083] },

[0084] ...

[0085] }

[0086] 3) Analysis Center Module: View the analysis status of tasks and sample quality control results.

[0087] After an analysis task is started, it will be automatically assigned a unique task number by the system. On this function module page, you can view the chip ID, total number of samples, reagent kit name, analysis progress, and task start time. You can stop or re-analyze the task by clicking the corresponding button. Clicking the task number will take you to the information quality control page. After the task is completed, you can view the following quality control information, as shown in Table 2. The information quality control page has a built-in data export button, which can export DNA database format files according to the usage.

[0088] Table 2 Analysis Results Quality Control List

[0089]

[0090]

[0091] 4) Report Center Module: Preview or download sample reports, and manually review the report content.

[0092] After the genetic marker analysis is completed, you can perform operations on the sample report in the report center, such as downloading the report, previewing the report, and regenerating the report. Clicking on the sample number will take you to the genetic marker genotyping visualization interface, which displays the depth distribution of alleles detected for different genetic markers based on a bar chart.

[0093] 5) Forensic Center Module: Performs various forensic examinations on samples.

[0094] Different authentication modules are distinguished through mini-program registration, involving the script path, parameter settings, and runtime environment of the authentication mini-program, all configured following the same logic. Specifically, this includes the following:

[0095] a. Mini Program Name: Name Management;

[0096] b. Mini Program Version: Version Management;

[0097] c. Basic parameters: A list of parameters required to run the program;

[0098] d. Program path: The specific path where the script was uploaded;

[0099] e. Corresponding process: The name of the upstream analysis process that can be entered;

[0100] f. Process Version: The version of the upstream analysis process available for input;

[0101] g. Number of parameters: The number of configurable parameters;

[0102] h. Estimated Memory (GB): The server memory expected to be used by a single task;

[0103] i. Estimated number of threads: The number of server threads expected to be used per task;

[0104] j. Output file: Directory for outputting the identification results;

[0105] k. Is it necessary to upload the mini-program: script upload method;

[0106] l. Priority: Determines the running priority when multiple identification tasks are initiated simultaneously.

[0107] Compared with existing integrated solutions for forensic DNA sequencing data, the forensic DNA sequencing data analysis system proposed in this application has the following advantages:

[0108] 1) It automates information analysis, enabling forensic genetic marker analysis to be completed unattended, saving manpower and time costs;

[0109] 2) It can synchronize the development progress of reagent kits and configure the information analysis process, making project management more standardized;

[0110] 3) Supports exporting common forensic DNA database formats, reducing workload in format conversion and data delivery processes;

[0111] 4) Supports expandable forensic identification functions, and the unified access mode shortens the development cycle in applications such as individual identification, kinship calculation, species identification and more fields in the future.

[0112] The present application will be further described in detail below through specific embodiments. The following embodiments are only for further illustration of the present application and should not be construed as limiting the present application.

[0113] Unless otherwise specified, the technical means used in the embodiments are conventional means well known to those skilled in the art.

[0114] Example

[0115] Following the above method, this example used 14 standard cell lines (9948, male DNA; 9947A, female DNA) with different starting DNA amounts as positive controls, plus 7 real samples for testing. The next-generation sequencing data from the standard cell lines and real samples were obtained from the multiplex amplification system of the FGID Mitochondrial Whole Genome Detection Kit from BGI Genomics Co., Ltd., Shenzhen. FASTQ format data was obtained using the SE400 sequencing mode on the DNBSEQ-G99 platform (BGI Genomics Co., Ltd.), with the specific chip number FT100042236 and lane number L01.

[0116] The entire process involves steps such as entering sample information, creating an analysis task, sequencing off the sequencing machine, automatic start of analysis, reporting, and identification. The "Enter Sample Information" and "Create Analysis Task" steps up to the sequencing off-machine are manually filled in by the experimenter based on records, selecting the "FGID Mitochondrial Whole Genome Detection Kit" to complete task creation. After sequencing off the machine, the analysis system will automatically complete the following steps. After 15 minutes of operation, once the analysis status displays "Analysis Complete," the subtask number can be clicked to access the quality control information page and export the corresponding result file or database format file, as shown in Tables 3 and 4.

[0117] Table 3 shows the progress of the analysis center.

[0118]

[0119] Table 4 Exported File Directory Structure

[0120]

[0121]

[0122] Enter the "Site Difference Analysis" section of the "Identification Center Module," review the completed identification documents, and compare the mitochondrial whole genome sequences of the standard 9948 cell line and the 9947A cell line. The results are shown in Table 5.

[0123] Table 5. Mitochondrial test results

[0124] rCRS sequence start position 9947A-lng-1 9948-lng-1 chrM:73 73A 73G chrM:93 93G 93A chrM:152 152T 152C chrM:195 195C 195C chrM:214 214G 214A chrM:263 263G 263G chrM:309 309.2CC 309.1C chrM:315 315.1C 315.1C chrM:709 709G 709A chrM:750 750G 750G chrM:1393 1393R 1393G chrM:1438 1438G 1438G

[0125] Table 5 shows the sequence differences of all detected individual base sites between the whole mitochondrial genome sequence (16569 bp) of the tested samples and the rCRS* sequence. The results in Table 5 indicate that the standard 9948 cell line and the 9947A cell line did not originate from the same individual.

[0126] This application develops an automated analysis system for forensic DNA sequencing data, supporting common forensic genetic markers such as SNP, STR, mtDNA, MH, COI, and CYTB. The system incorporates multiple identification functions, including paternity testing for di / triple pairs, mitochondrial locus difference analysis, individual likelihood calculation, complex kinship calculation, pedigree analysis, mixed typing, and ancestry analysis. The system also includes built-in report preview, report download, result file download, and dedicated database format file export functions. Furthermore, the system provides a genetic marker allele visualization function, displaying allele depth distribution in bar charts, and allows users to select whether to display or hide the graph.

[0127] In summary, the automated analysis system of this application is a high-performance server equipped with various management software. Besides the functional modules described above, it also includes conventional electronic components such as processors, hard drives, and memory modules found in most analysis systems. By modifying the server's form factor or hardware model, and integrating task management software and bioinformatics workflows, it can analyze forensic DNA next-generation sequencing data. Similarly, minor modifications to the configuration items within the analysis system or direct replacement of the analysis scripts can achieve similar results. Therefore, it is understood that the analysis system of this application, in addition to its use in forensic next-generation sequencing data analysis, is also applicable to applications such as genetic disease testing in hospitals, identification of smuggled animals and plants by customs, and microbial identification, by registering corresponding bioinformatics analysis workflows and identification functions.

[0128] The above description, in conjunction with specific embodiments, provides a further detailed explanation of this application and should not be construed as limiting the specific implementation of this application to these descriptions. Those skilled in the art to which this application pertains can make several simple deductions or substitutions without departing from the concept of this application.

Claims

1. A system for analyzing forensic DNA sequencing data, characterized in that: It includes a sample center module, a task center module, an analysis center module, a report center module, and an identification center module; The sample center module includes a function for inputting and storing the samples to be analyzed and their information, so that other modules can read or call them. The task center module includes functions for creating analysis tasks, entering experimental information, monitoring the sequencing process, and verifying sequencing data. Creating an analysis task includes creating a task number, selecting the required bioinformatics analysis process, and sending a signal to the analysis center module to start the selected bioinformatics analysis process. The analysis center module includes a function to initiate the corresponding bioinformatics analysis process based on the start signal issued by the task center module, and a function to view the analysis status information and sample quality control results of the analysis task. The report center module includes functions for reading or retrieving the results of analysis tasks, generating sample reports for preview or download, and allowing manual review of the sample report content. The identification center module includes functions for conducting forensic identification based on the results of analysis tasks or sample reports, generating forensic identification results for preview or download, and allowing manual review of the content of the forensic identification results.

2. The analysis system according to claim 1, characterized in that: In the sample center module, the sample information includes sample number, submitter, commissioning unit, acceptance date, test start date, test location, and tested sample information; Preferably, the sample center module further includes a method for classifying and managing sample information to facilitate information retrieval and extraction; Preferably, the sample information includes the name, gender, age, date of birth, and identification number of the person from whom the sample was obtained; Preferably, the method for entering the sample to be analyzed and its information is manual input and / or table import.

3. The analysis system according to claim 1, characterized in that: In the task center module, the experimental information includes at least one of the following: a. Sequencing instrument ID, which is the unique serial number of the sequencing instrument, is used to indicate the storage location of the data captured by the system; b. Chip ID, i.e., the unique serial number of the sequencing chip, is used to indicate the naming method of the data; c. Lane ID, which is the channel number of the sequencing chip, is used to indicate the naming method of the data; d.DNB ID, which records the experimental batch; e. Whether to use double barcodes, i.e., setting the barcode splitting method; f.barcode ID1, which is the barcode number added to the sample during the experiment; g. barcode ID2, which is the number of the other barcode added to the sample during the experiment; if the barcode rule is a single barcode, it does not need to be filled in; if the barcode rule is a double barcode, it is required. h. Reagent kit name, used to indicate the bioinformatics analysis workflow to be invoked.

4. The analysis system according to claim 1, characterized in that: In the analysis center module, the analysis status information includes the total number of samples, analysis progress, task start time, and experimental information entered by the task center module.

5. The analysis system according to claim 1, characterized in that: The analysis center module also includes the ability to control the pause, stop, and re-analysis of analysis tasks by clicking the corresponding buttons, and to access the information quality control page by clicking the analysis task number, and to view the sample quality control results after the analysis task is completed; Preferably, the information quality control page has a built-in data export button for exporting DNA database format files according to usage conditions; Preferably, the sample quality control results include sample number, number of libraries, sequencing method, sequencing read length, number of detected common SNP sites, number of detected common STR sites, number of detected X chromosome STR sites, number of detected Y chromosome STR sites, number of detected mitochondrial sites, number of detected microhaplotypes, average insert length and shortest insert length of mitochondrial COI gene, average insert length and shortest insert length of mitochondrial cytochrome B gene, number of serialized data sequences, GC content, Q30 index, overall alignment rate, low-quality alignment rate, average quality value, insert sequence length, average sequencing depth of the target region, and target region coverage.

6. The analysis system according to claim 1, characterized in that: The report center module also includes a method of visually displaying sample reports using statistical charts.

7. The analysis system according to claim 1, characterized in that: The identification center module also includes a sub-module for distinguishing different identification functions through mini-program registration. This sub-module involves the script path, parameter settings, and operating environment of the identification mini-program, and is configured following the same logic.

8. The analysis system according to claim 7, characterized in that: The configuration information of the identification center module, including the identification mini-program, includes: a. Mini Program name, used for name management; b. Mini Program version, used for version management; c. Basic parameters, i.e., the list of parameters required to run the program; d. Program path, i.e., the specific path where the script is uploaded; e. The corresponding process, which is the name of the upstream analysis process that can be input; f. Process version, i.e., the version of the upstream analysis process that can be input; g. Number of parameters, i.e., the number of configurable parameters; h. Estimated memory usage, i.e., the server memory expected to be used by a single task; i. Estimated number of threads, i.e., the number of server threads expected to be used per task; j. Output file, i.e., the directory for outputting the identification results; k. Is it necessary to upload the mini-program, i.e., via script upload? l. Priority, which determines the running priority when multiple identification tasks are initiated simultaneously.

9. The analysis system according to any one of claims 1-8, characterized in that: It also includes a database for storing or temporarily storing samples to be analyzed, sample information, analysis tasks, analysis status information, sample quality control results, sample reports, and forensic identification results.

10. The application of the analytical system according to any one of claims 1-9 in the identification of animal species, plant species, or microbial species.