Method and system for hierarchical extraction of legal elements based on large models and knowledge graphs

By combining large language models and knowledge graphs, the system automatically matches and generates annotated templates, solving the problem of handling multi-layered structured information of crimes in complex legal documents, which is difficult in existing technologies, and achieving efficient identification and analysis of multiple crime groups.

CN121996801BActive Publication Date: 2026-06-12SHANDONG UNIV

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Patents(China)
Current Assignee / Owner
SHANDONG UNIV
Filing Date
2026-04-09
Publication Date
2026-06-12

AI Technical Summary

Technical Problem

Existing technologies struggle to effectively process the multi-layered and multi-dimensional structured information of charges in complex legal documents, especially the identification and analysis of confused charge groups. This results in problems such as high rule writing costs, poor model scalability, and inconsistent outputs.

Method used

By employing a method based on large language models and knowledge graphs, the system acquires case information, automatically matches and confuses crime groups, reads the four elements and element metadata, generates annotated templates, and calls the large language model for reasoning, thereby achieving the merging and templated extraction of multiple crime elements.

Benefits of technology

It enables automatic merging and template-based extraction of multiple crime elements, improving the accuracy and consistency of crime identification, reducing the cost of manual intervention, and supporting unified analysis and expansion of multiple crime groups.

✦ Generated by Eureka AI based on patent content.

Smart Images

  • Figure CN121996801B_ABST
    Figure CN121996801B_ABST
Patent Text Reader

Abstract

The application relates to the technical field of artificial intelligence and judicial big data, and particularly provides a legal element hierarchical extraction method and system based on a large model and a knowledge graph. The method comprises the following steps: acquiring basic information and factual text of a case; automatically matching a confused charge group Gi according to the basic information and the factual text; reading four requirements and element metadata of the confused charge group Gi through a knowledge graph KG; performing multi-charge element merging and unified field modeling to generate an annotated confused charge group template; constructing a prompt word and calling a large language model LLM to perform reasoning, so as to extract case elements of the confused charge group template; and according to the case elements, analyzing LLM output, performing verification, and generating and storing a confused charge group element table. The method realizes automatic merging, automatic annotation and template extraction of a multi-charge requirement group.
Need to check novelty before this filing date? Find Prior Art

Description

Technical Field

[0001] This invention relates to the fields of artificial intelligence and judicial big data technology, and in particular to a method and system for hierarchical extraction of legal elements based on large models and knowledge graphs. Background Technology

[0002] With the advancement of online court judgments and judicial transparency, a vast number of criminal judgments have been generated in judicial practice. These documents are mostly written in natural language, with loose structures, long lengths, and scattered information. Structured processing of these judgments has become a fundamental task in the application of judicial big data.

[0003] In criminal law theory and judicial practice, the determination of a crime typically revolves around the four elements of criminal law: the object of the crime, namely the legally protected interests and the victims; the objective aspect, including the nature of the act, the harmful act, the harmful result, the constitutive pattern, the characteristics of the act, and the stage of the act; the subject of the crime, including the type of subject, the age of criminal responsibility, and the capacity for criminal responsibility; and the subjective aspect, including the form of culpability, the purpose of the crime, the element of knowledge, the motive, and the mental state. In specific cases, many types of crimes overlap and intersect in these elements, especially some typical and easily confused crimes, such as fraud and contract fraud; embezzlement and misappropriation of funds. These crimes share commonalities in terms of harmful act, constitutive pattern, the existence of a contractual relationship, the object of the crime, and the purpose of the crime, but differ in key constitutive elements. An inappropriate characterization will directly affect the severity of the sentence and adversely affect the credibility of the judiciary.

[0004] In the existing technology, (1) the traditional extraction method based on rules and keywords: through manually set rules, regular expressions, keyword dictionaries, etc., fields such as defendant information, crime name, amount involved, and time are extracted from documents; for complex elements such as behavior description and subjective malice, it usually relies on keyword triggering and human experience for preliminary judgment. For example, in "Research and Implementation of Relationship Extraction Technology for Knowledge Elements in Legal Texts", the approach of using rules and dictionaries as the basis for entity recognition and combining semi-supervised template matching to iteratively extract relationships is adopted. However, this method has the following disadvantages: ① The cost of rule writing and maintenance is high, and it relies heavily on the experience of domain experts. Once the legal provisions or trial standards change, large-scale adjustments are required; ② It has limited ability to identify complex sentence structures and implicit elements (such as "knowing" and "illegal possession purpose"), and it is difficult to handle long-distance dependencies and implicit expressions; ③ It is difficult to naturally express the four elements of criminal law and the multi-level and multi-dimensional structured information such as the confused crime groups; ④ It generally only extracts around a certain established crime and cannot perform element filling and comparative analysis on multiple confused crime groups in the same case in parallel.

[0005] (2) Sequence labeling and classification methods based on deep learning: CRF, BiLSTM, BERT and other models are used to perform entity recognition, element classification and sentence-level label prediction on legal documents. For example, the research on the construction technology of criminal law knowledge graph proposes the JLB-BiLSTM-CRF model, which uses BERT to enhance representation and perform entity recognition; the research on the generative named entity recognition framework for Chinese legal domain explores the use of a sequence-to-sequence generative framework for legal entity recognition. This approach has the following problems: ① Existing models mostly adopt a flat label system, which is difficult to directly carry the hierarchical structure of four elements-sub-elements-specific crimes; ② The model output is usually a label ID or a brief category, which requires one or more rounds of rule and normalization processing to map to a unified and comparable set of elements; ③ When facing multiple easily confused crimes, it is often necessary to design labels or models separately for each crime or a small range of crimes, which is not conducive to comparison and transfer under a unified framework; ④ The corresponding labeling cost is high. Once a new crime or a new element system is introduced, it is often necessary to re-label and retrain, which limits the scalability.

[0006] (3) Extraction method based on large language model + simple JSON template: In recent years, attempts have gradually emerged in practice to extract elements from legal texts using general large language models (such as GPT-type models). The usual practice is to write explanatory prompts for the model and attach a simple JSON template. This method has the following shortcomings: ① The field values ​​lack strong constraints, and the model is prone to generating a large number of synonymous, ambiguous or non-standardized expressions, which are difficult to directly map to a fixed set of legal elements; ② In order to map these free texts to a finite and stable set of elements, a large number of rules, mapping tables and manual post-processing are required; ③ Existing JSON templates are mostly based on the perspective of a single case or a single crime, and have not built a unified comparison template structure for the confused crime group.

[0007] (4) Preliminary attempts at knowledge graph + template: Some studies have constructed knowledge graphs in the field of criminal law that are of the type of crime-constituent elements-legal provisions-judicial interpretation. For example, "From Graph to Word Bag: Introducing DomainKnowledge to Confusing Charge Prediction" uses a graph structure to show the common elements and distinguishing features between different crimes, and then automatically filters them to form a bag of words, and uses the bag of words to supervise the model's attention to improve the accuracy of predicting confusing crimes. This type of approach has achieved certain results in the task of predicting confusing crimes, but it still has the following limitations: ① Insufficient structured output capability: The method remains at the bag of words level and fails to generate a structured template that conforms to the thinking habits of legal professionals. First, the constituent elements of law are a system with time sequence and logical connection, rather than a simple set of keywords. Key elements in judgment documents (such as "knowingly") are often reflected through continuous contexts, and may not have corresponding words. Keywords are disconnected from facts. Second, the method itself contains a pre-judgment, relying on specific keywords to distinguish crimes, which is actually making a qualitative judgment in advance. For example, it's impossible to explain whether key elements of contract fraud versus fraud, such as "during the signing / performance of the contract," are met. ② Fundamental Misalignment: Technology-Driven vs. Problem-Driven Approaches applying readily available NLP technologies (knowledge graphs, attention mechanisms) to legal scenarios instead of addressing pain points in judicial practice. Legal judgment is not keyword matching; simplifying medical diagnosis to symptom keyword retrieval is technically feasible but lacks professional interpretability. Summary of the Invention

[0008] In view of this, the present invention provides a hierarchical extraction method and system for legal elements based on large models and knowledge graphs, which can realize automatic merging, automatic annotation and template extraction of multiple crime elements groups.

[0009] In a first aspect, the present invention provides a hierarchical extraction method for legal elements based on large models and knowledge graphs, the method comprising:

[0010] Step 1: Obtain basic information and factual text of the case;

[0011] Step 2: Based on the basic information and factual text, automatically match the confusing charge group Gi;

[0012] Step 3: Read the four elements and metadata of the confusing crime group Gi through the knowledge graph KG;

[0013] Step 4: Merge multiple crime elements and model unified fields to generate annotated templates for confusing crime groups;

[0014] Step 5: Construct cue words and call the Large Language Model (LLM) for reasoning to extract case elements from the confused charge group template;

[0015] Step 6: Based on the case elements, parse the LLM output, perform verification, and generate and store the element table of the confused crime group.

[0016] Optionally, the basic information in step 1 includes the case number (case_no), the charge (charge), the facts of the crime (FD), and the court's opinion (Hold_that).

[0017] Optionally, step 2 includes:

[0018] Based on the case charge or the previous coarse-grained classification results, the corresponding confusing charge group Gi is automatically determined by the mapping relationship of [charge → confusing charge group]. If no confusing charge group is matched, the process ends directly or enters other processing paths.

[0019] Optionally, step 3 includes:

[0020] Using the knowledge graph KG, we read the four elements and element metadata corresponding to each crime in the confused crime group Gi. The read content includes the candidate value set content and annotation note information; the element metadata includes the nature of the behavior, the harmful behavior, the characteristics of the behavior, the knowledge element, and the standard of completion.

[0021] The knowledge graph KG used is organized as follows:

[0022] The root node represents the criminal law knowledge graph; the child nodes represent specific crimes.

[0023] Under the crime node, it is further broken down into:

[0024] The object of a crime, its objective aspect; the subject of a crime, its subjective aspect; the standard for completion of a crime.

[0025] Record at the leaf nodes of each element:

[0026] The content is the set of candidate values ​​for the current element under the current charge; the note is the annotation information for the content of the current element, which is used to supplement the legal connotation and applicable boundaries briefly expressed in the content, and is used to generate the description field in the confusing charge group template.

[0027] Optionally, step 4 includes:

[0028] The crime element structure is recursively merged. Through multi-crime element merging and field alignment algorithms, the nature of the behavior, harmful behavior, behavior characteristics, known elements, and completed standards are horizontally aligned and deduplicated. For each field, the candidate value set of each crime is retained to form a unified, multi-level element metadata structure.

[0029] A JSON template is generated based on the element metadata, and annotation information is automatically added to each field in the template to obtain an annotated confusion charge group template for large language model inference.

[0030] Optionally, step 5 includes:

[0031] The descriptive text, the annotated template for the confusing charge group, and the case fact FD are taken as input and submitted to the large language model. The request is to populate the elements for each charge in the confusing charge group according to the template structure, and to limit the field values ​​to the set of candidate values ​​given in the template annotation.

[0032] Optionally, step 6 includes:

[0033] Using regular expressions, extract JSON paragraphs or strip markers from the LLM output text, and call the JSON parsing function to convert them into a dictionary structure; perform field integrity checks and candidate value validity checks on the parsing results; organize the results that pass the checks into a confusing crime group element table and save it for subsequent analysis and modeling.

[0034] Optionally, step 6 may be followed by downstream applications;

[0035] The elements table of confused charges will be provided to the given charge / characterization auxiliary analysis module, the similar case retrieval module, and the conviction prediction or sentencing prediction model, respectively, and will be used to determine the charge requirements that a case meets, to search by element similarity, and as feature input.

[0036] Secondly, this invention provides a hierarchical legal element extraction system based on a large model and knowledge graph, the system comprising:

[0037] The data preprocessing module is used to obtain basic information and factual text of the case;

[0038] The automatic matching module for confusing charge groups is used to automatically match the confusing charge group Gi based on basic information and factual text;

[0039] The knowledge graph management and field metadata extraction module is used to store the criminal law knowledge graph KG and read the four elements and element metadata of the confused crime group Gi through the knowledge graph KG;

[0040] The module for merging multiple crime elements and confusing crime groups is used for merging multiple crime elements and unifying field modeling, generating annotated confusing crime group templates;

[0041] The Large Language Model (LLM) prompt construction and reasoning module is used to construct prompt words and call the Large Language Model (LLM) for reasoning to extract case elements from the confused crime group template.

[0042] The parsing output and legality verification module is used to parse the LLM output based on case elements, perform verification, and generate and store the element table of the confusing crime group.

[0043] The technical solution provided by this invention includes the following steps: acquiring basic case information and factual text; automatically matching a confused charge group Gi based on the basic information and factual text; reading the four elements and element metadata of the confused charge group Gi through a knowledge graph (KG); merging multiple charge elements and modeling unified fields to generate an annotated confused charge group template; constructing prompt words and calling a large language model (LLM) for reasoning to extract case elements from the confused charge group template; parsing the LLM output based on the case elements, performing verification, generating and storing a confused charge group element table. This method achieves automatic merging, automatic annotation, and templated extraction of multiple charge element groups. Attached Figure Description

[0044] To more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings used in the embodiments will be briefly introduced below. Obviously, the drawings described below are only some embodiments of the present invention. For those skilled in the art, other drawings can be obtained based on these drawings without creative effort.

[0045] Figure 1 A flowchart illustrating the hierarchical extraction method of legal elements based on large models and knowledge graphs provided in this embodiment of the invention;

[0046] Figure 2 A schematic diagram of the confusing crime group and its element structure in a knowledge graph, provided for embodiments of the present invention;

[0047] Figure 3 This is a schematic diagram of a hierarchical legal element extraction system based on a large model and knowledge graph, provided in an embodiment of the present invention. Detailed Implementation

[0048] To make the objectives, technical solutions, and advantages of the embodiments of the present invention clearer, the technical solutions of the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings. Obviously, the described embodiments are only some embodiments of the present invention, not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those skilled in the art without creative effort are within the scope of protection of the present invention.

[0049] The terminology used in the embodiments of this invention is for the purpose of describing particular embodiments only and is not intended to limit the invention. The singular forms “a,” “the,” and “the” used in the embodiments of this invention are also intended to include the plural forms unless the context clearly indicates otherwise.

[0050] It should be understood that the term "and / or" used herein is merely a correlative relationship describing related objects, indicating that there can be three relationships. For example, A and / or B can represent: A exists alone, A and B exist simultaneously, and B exists alone. Additionally, the character " / " in this text generally indicates that the related objects before and after are in an "or" relationship.

[0051] Depending on the context, the word "if" as used herein can be interpreted as "when" or "while" or "in response to determining" or "in response to detecting". Similarly, depending on the context, the phrase "if determined" or "if detected (stated condition or event)" can be interpreted as "when determined" or "in response to determining" or "when detecting (stated condition or event)" or "in response to detecting (stated condition or event)".

[0052] The present invention provides a method for hierarchical extraction of legal elements based on a large model and a knowledge graph, as Figure 1 and Figure 2 shown, the method includes:

[0053] Step 1: Obtain the basic information and factual text of the case.

[0054] In an embodiment of the present invention, the basic information in Step 1 includes the case number case_no, the case charge charge, the case criminal facts FD, and the court's opinion Hold_that of the case.

[0055] preprocessed_data={

[0056] "case_no": "2020 Liao 0422 Xing Chu X Hao",

[0057] "charge": "Fraud",

[0058] "FD": "Upon trial, it was found that in July 2019, Peng XX1, the father of the victim Peng XX, found the defendant Zhang XX and asked Zhang XX to help introduce a girlfriend for Peng XX. The defendant Zhang XX then falsely claimed that he had a niece named 'Li XX' and communicated with Peng XX in the name of 'Li XX' via WeChat. From July 2019 to January 2020, during the communication between the defendant Zhang XX and Peng XX, Zhang XX repeatedly fabricated reasons such as seeing a doctor and buying medicine, and demanded Peng XX to transfer money to him. During this period, Peng XX transferred a total of 41,921.00 yuan to Zhang XX via WeChat transfer.",

[0059] "Hold_that": "This court holds that the defendant, Zhang, with the intent of illegal possession, defrauded others of a substantial amount of property by fabricating facts. The facts of the crime are clear, and the evidence is conclusive and sufficient. His actions constitute the crime of fraud, and the charges brought by the public prosecutor are established and supported by this court. The defendant, Zhang, was summoned to the case and truthfully confessed to the crime after being apprehended, which constitutes a confession. Therefore, this court accepts the public prosecutor's opinion that he has a confession and can be given a lighter punishment according to law. The defendant, Zhang, voluntarily and truthfully confessed to his crimes, admitted the charged facts, and is willing to accept punishment. The sentencing recommendation of the public prosecutor is appropriate. In accordance with Articles 266, 47, 52, 61, 64, and 67 Paragraph 3 of the Criminal Law of the People's Republic of China, Article 15 of the Criminal Procedure Law of the People's Republic of China, and Article 1 of the Interpretation of the Supreme People's Court and the Supreme People's Procuratorate on Several Issues Concerning the Specific Application of Law in Handling Criminal Cases of Fraud, the judgment is as follows:"

[0060] }

[0061] Step 2: Based on the basic information and factual text, automatically match the confusing charge group Gi.

[0062] matched_groups=["G1"], G1:{fraud, contract fraud}.

[0063] In this embodiment of the invention, step 2 includes:

[0064] Based on the case charge (e.g., labeled as "fraud") or the previous coarse-grained classification results, the corresponding confused charge group Gi is automatically determined according to the mapping relationship of "charge → confused charge group". For example, G1={fraud, contract fraud}, G2={traffic accident, dangerous driving} or G3={fraud, embezzlement}. If no confused charge group is matched, the process ends directly or enters other processing paths.

[0065] Generate a template for elements with confusing charges and automatic annotations. Based on the element metadata: organize a set of candidate values ​​for each field; summarize the note information into a brief description; generate an element template with "unified field + charge column" and add "optional value + description" annotations after the field to constrain subsequent extraction.

[0066] Step 3: Read the four elements and metadata of the confusion charge group Gi through the knowledge graph KG.

[0067] In this embodiment of the invention, step 3 includes:

[0068] Using the knowledge graph KG, we read the four elements and element metadata corresponding to each crime in the confused crime group Gi. The read content includes the candidate value set content and annotation note information; the element metadata includes the nature of the behavior, the harmful behavior, the characteristics of the behavior, the knowledge element, and the standard of completion.

[0069] The knowledge graph KG used is organized as follows:

[0070] The root node is a criminal law knowledge graph; the child nodes are specific crimes, such as traffic accident crime, fraud, contract fraud, etc.

[0071] Under the crime node, it is further broken down into:

[0072] The object of a crime includes the objective aspect (including the nature of the act, the harmful act, the harmful result, the constitutive pattern, the characteristics of the act, the stage of the act, etc.), the subject of the crime (subject type, age of criminal responsibility, capacity for criminal responsibility, etc.), and the subjective aspect (form of culpability, criminal purpose, element of knowledge, etc.); the standard for completion of the crime.

[0073] Record at the leaf nodes of each element:

[0074] The content is the set of candidate values ​​for the current element under the current crime; the note is the annotation information for the content of the current element, which is used to supplement the legal connotation and applicable boundaries of the brief expression in the content. It is used to generate the explanatory field in the template of confusing crime groups, which helps to improve the understandability and consistency of the extraction results. For example, "the victim disposes of the property he has the right to dispose of to the other party" refers to judging whether there is a "disposal act". The crime of fraud requires the existence of a "disposal act".

[0075] {

[0076] "Fraud": {

[0077] "Object of the crime": [{"content": ["ownership of public and private property"],"note": ["......"]}],

[0078] "Objective aspects": {

[0079] "Behavioral Essence": {"content": ["Deceived to obtain"],"note": ["......"]},

[0080] "Harmful behavior": ["Fabricating facts...concealing the truth"],

[0081] "Harmful Consequences": ["Risk of loss to public and private property......"],

[0082] "Constitutional Pattern": [[".The perpetrator commits deceptive acts......the victim suffers property loss"]],

[0083] "Behavioral characteristics": {"content": ["Victim's disposal of property......"], "note":["......"]},

[0084] "Stage of Behavior": ["The victim controls property...within their domain"]

[0085] },

[0086] "Subject of the crime": [{"content": "natural person","note": ["......"]}],

[0087] "Subjective aspect": {

[0088] "Form of culpability": ["Direct intent"],

[0089] "Criminal intent": ["illegally appropriating another person's property"]

[0090] "Knowingly" element: ["Knowingly engaging in deceptive conduct...disposing of property"]

[0091] },

[0092] "Standard for Completion of the Crime": {"content": ["The actor controlled the property...loss"],"note":["......"]},

[0093] "Sentencing Factors": {"content": ["relatively large amount", "huge amount", "especially huge amount", "principal offender", "accomplice", "coerced accomplice", "instigator", "criminal preparation", "criminal abandonment", "attempted crime", "legitimate defense", "excessive defense"......], "note": ["......"]}

[0094] },

[0095] "Contract fraud": {

[0096] "Object of the Crime": [{"content": ["Public and private property ownership......contract management order"],"note":["......"]}],

[0097] "Objective aspects": {

[0098] "Behavioral Essence": {"content": ["Victim's disposal of property......", "Defrauding by using a contract"],"note": ["......"]},

[0099] "Harmful Acts": ["Fabricating facts...receiving stolen goods and fleeing"],

[0100] "Harmful Consequences": ["Property loss risk...disruption of economic order"],

[0101] "Constitutional Pattern": [[".The perpetrator deceives...the victim suffers financial loss"]],

[0102] "Behavioral Characteristics": {"content": ["Contractual Relevance"],"note": ["......"]},

[0103] "Behavioral Stage": {"content": ["During the signing / performance of the contract"], "note": ["......"]}

[0104] },

[0105] "Subject of the crime": [{"content": ["natural person......unit"],"note": ["......"]}],

[0106] "Subjective aspect": {

[0107] "Form of culpability": ["Direct intent"],

[0108] "Criminal Purpose": ["Illegally appropriating the property of the other party to the contract"]

[0109] "Knowingly" element: {"content": ["Knowingly lacking the ability to fulfill obligations...absconding"],"note":["......"]}

[0110] },

[0111] "Standard for Completion of the Case": {"content": ["The victim's disposal of property based on the contract...losses"], "note":["......"]},

[0112] "Sentencing Factors": {"content": ["relatively large amount", "huge amount", "especially huge amount", "principal offender", "accomplice", "coerced accomplice", "instigator", "criminal preparation", "criminal abandonment", "attempted crime", "legitimate defense", "excessive defense"......], "note": ["......"]}

[0113] }}}.

[0114] Step 4: Merge multiple crime elements and model unified fields to generate annotated templates for confusing crime groups.

[0115] In this embodiment of the invention, step 4 includes:

[0116] The crime element structure is recursively merged. Through multi-crime element merging and field alignment algorithms, the nature of the behavior, harmful behavior, behavior characteristics, known elements, and completed standards are horizontally aligned and deduplicated. For each field, the candidate value set of each crime is retained to form a unified, multi-level element metadata structure.

[0117] A JSON template is generated based on the element metadata, and annotation information (such as optional values ​​and descriptions) is automatically appended to each field in the template to obtain an annotated confusion charge group template for large language model inference.

[0118] This invention designs the element template for the confusing crime group as a comparison table with unified element fields and columns by crime, and introduces an automatic annotation mechanism on this basis to form the element template for the confusing crime group and the automatic annotation structure.

[0119] The overall structure can be summarized as follows: Rows: representing unified element fields, such as the nature of the behavior, harmful behavior, behavioral characteristics, known elements, and completion standards; Columns: representing various crimes within the confused crime group, such as fraud and contract fraud; Cell content: the set of candidate values ​​for a specific crime under a specific element field, along with a brief description. After extracting specific cases, the cell will mark the values ​​actually selected for that case.

[0120] In JSON format, it can be represented as the following example structure:

[0121] {

[0122] "Confusing Charges" group: ["Fraud", "Contract Fraud"]

[0123] Case Element Table: {

[0124] "Fraud": {

[0125] "Object of Crime": [

[0126] {

[0127] "content": "Ownership of public and private property"

[0128] "note": [

[0129] "

Public and Private Property Ownership

[0130] "The concept of 'ownership of public and private property' does not completely overlap with the concept of ownership in civil law; it places greater emphasis on protecting existing property interests from illegal infringement." ]

[0132] } ]

[0134] "Objective aspects": {

[0135] "The essence of behavior": {

[0136] "content": [

[0137] "To obtain through deception"

[0138] ],

[0139] "note": [

[0140] "

Deceived Payment

[0142] }

[0143] "Harmful Behavior": [

[0144] "Fictional facts"

[0145] "Concealing the truth"

[0146] ],

[0147] "Harmful Consequences": [

[0148] "Public and private property is at risk of being damaged or lost."

[0149] ],

[0150] "Compositional Pattern": [ [

[0152] "The perpetrator committed a deceptive act,"

[0153] "The victim develops or maintains a mistaken understanding"

[0154] "The victim disposed of property based on a mistake"

[0155] "The actor obtains property"

[0156] "The victim suffered property loss" ]

[0158] ],

[0159] "Behavioral characteristics": {

[0160] "content": [

[0161] "The victim disposes of property that they possess and have the right to dispose of to the other party."

[0162] "The victim was aware of the actual existence of the property that was disposed of." ]

[0164] "note": [

[0165] "

Behavioral Characteristics

[0166] [Behavioral Characteristics] Even if there is an act of "disposition," a second step of judgment is needed: whether there is "awareness of disposition," that is, whether the victim is aware that they are disposing of a real property. The victim must be aware of the type, name, and other physical characteristics of the property being disposed of. In cases of quantity discrepancies, if the property is disposed of as a whole (sold by weight), it is presumed that the victim was aware of the property's actual existence. Therefore, fraud is characterized by "awareness of disposition." ]

[0168] },

[0169] "Behavioral Stages": [

[0170] "The victim had de facto physical control over the property."

[0171] "Although the victim had no physical control, according to general societal norms, the property was still within their domain of control." ]

[0173] },

[0174] "Subject of the crime": [

[0175] {

[0176] "content": "natural person"

[0177] "note": [

[0178] "A natural person must simultaneously meet the following conditions: 1) be at least 16 years old; 2) possess full criminal responsibility, be mentally sound and able to recognize and control their actions. The person must possess both the ability to recognize and control their actions to be criminally responsible; intermittent mental patients, when mentally sound, are considered to have full criminal responsibility." ]

[0180] } ]

[0182] "Subjective aspect": {

[0183] "Forms of culpability": [

[0184] "Direct intent" ]

[0186] "Criminal Purpose": [

[0187] "Illegally appropriating other people's property" ]

[0189] "Known elements": [

[0190] "Knowingly engaging in deceptive behavior",

[0191] "Knowing that the victim would dispose of the property based on a mistaken understanding" ]

[0193] }

[0194] "Completion Standard": {

[0195] "content": [

[0196] "The perpetrator actually controls the property"

[0197] "The victim suffered property loss" ]

[0199] "note": [

[0200] "

Completed Case Standard

[0202] }

[0203] Sentencing factors: {

[0204] "content": [

[0205] "Relatively large amount", "huge amount", "particularly huge amount", "principal offender", "accomplice", "coerced accomplice", "instigator", "criminal preparation", "criminal abandonment", "attempted crime", "legitimate defense", "excessive defense", "emergency avoidance", "excessive avoidance", "juvenile delinquency", "elderly delinquency", "criminal offense", "criminal offense by a mentally ill person with limited capacity for civil conduct", "criminal offense by a deaf-mute or blind person", "surrender", "confession", "general meritorious service", "special meritorious service", "recidivist", "prisoner", "first-time offender", "occasional offender", "plead guilty and accept punishment", "voluntarily plead guilty in court", "forgiveness", "active compensation and reaching forgiveness", "active compensation and reaching a settlement"

[0206] "note": [

[0207] "

A relatively large amount

A huge amount

An especially huge amount

Juvenile delinquency

Elderly delinquency

[0208] "Contract fraud": { ...}

[0209] }

[0210] }

[0211] In this embodiment of the invention, as shown in Table 1, the table is presented in tabular form: the header is the crime within the confused crime group, the left side is the unified element field, and each cell displays the optional values, descriptions, and other information; in practical applications, the specific element values ​​selected by the model for the case can be indicated by underlines, bolding, or other marking methods.

[0212] Table 1. Templates and Automatic Annotations for Elements of the Confusing Charges Group

[0213]

[0214] The above templates are automatically generated by the multi-crime element merging and field metadata extraction module and the automatic annotation template generation module, eliminating the need for manual writing of each item. They can also be automatically updated as the knowledge graph and the configuration of the confused crime groups change, ensuring the maintainability and scalability of the system.

[0215] Metadata after merging the crimes of fraud and contract fraud;

[0216] {"Merged metadata":{

[0217] "Object of Crime":{

[0218] "Candidate value":[{

[0219] "value": ["ownership of public and private property"],

[0220] "source_charges":["fraud","contract fraud"],

[0221] "note": ["

Public and Private Property Ownership

Public and Private Property Ownership

[0222] {

[0223] "value":["Economic Order (Contract Management Order)"],

[0224] "source_charges":["Contract fraud"],

[0225] "note": ["

Economic Order (Contract Management Order)

Economic Order (Contract Management Order)

Economic Order (Contract Management Order)

[0226] "Objective aspects":{

[0227] "The essence of behavior":{

[0228] "Candidate value":[{

[0229] "value": ["obtained through deception"],

[0230] "source_charges":["fraud"],

[0231] "note": ["

Deceiving to Obtain

[0232] {

[0233] "value": ["defrauded by using a contract"],

[0234] "source_charges":["Contract fraud"],

[0235] "note": ["

Contract Fraud

Contract Fraud

[0236] "Harmful behavior":{

[0237] "Candidate value":[{

[0238] "value":["fictional facts","concealing the truth"]

[0239] "source_charges":["fraud","contract fraud"]}

[0240] {

[0241] "value":["fictitious entity","empty promise","receive stolen goods and abscond"]

[0242] "source_charges":["Contract fraud"]}]},

[0243] "Behavioral characteristics":{

[0244] "Candidate value":[{

[0245] "value": ["The victim disposes of property that they have the right to dispose of to the other party," "The victim is aware of the actual existence of the property being disposed of"],

[0246] "source_charges":["fraud","contract fraud"],

[0247] "note": ["

The victim disposes of property they have the right to dispose of to the other party

The victim is aware of the actual existence of the disposed property

[0248] {

[0249] "value":["Contractual relevance"],

[0250] "source_charges":["Contract fraud"],

[0251] "note":["

Contractual Relevance

Contractual Relevance

[0252] "Subject of the crime":{

[0253] "Candidate value":[{

[0254] "value":["natural person"],

[0255] "source_charges":["fraud","contract fraud"],

[0256] "note": ["A natural person must simultaneously meet the following conditions: 1) be at least 16 years old; 2) possess full criminal responsibility and be mentally sound enough to recognize and control their actions. The person must possess both the ability to recognize and control their actions to be criminally responsible; intermittent mentally ill persons, when mentally sound, are considered to have full criminal responsibility."]

[0257] {

[0258] "value":["unit"],

[0259] "source_charges":["Contract fraud"],

[0260] "note":["The three elements constituting a crime by an entity are: 1) the act is carried out in the name of the entity; 2) the act is used to obtain illegal profits for the entity; 3) the illegal gains belong to the entity. However, acts of using the entity's name for personal gain during affiliated or contracted operations are not considered crimes committed by an entity."]}]}

[0261] "Subjective aspect":{

[0262] "Form of culpability":{

[0263] "Candidate value":[{

[0264] "value":["directly intentional"],

[0265] "source_charges":["fraud","contract fraud"]}]},

[0266] "Criminal Purpose":{

[0267] "Candidate value":[{

[0268] "value":["illegally appropriating other people's property"]

[0269] "source_charges":["fraud"]},

[0270] {

[0271] "value": ["illegally possessing the other party's property under the contract"],

[0272] "source_charges":["Contract fraud"]}]},

[0273] "Known elements":{

[0274] "Candidate value":[{

[0275] "value": ["knowingly engaging in deceptive behavior", "knowingly allowing the victim to dispose of their property based on a mistaken understanding"]

[0276] "source_charges":["fraud"]},

[0277] {

[0278] "value": ["Knowingly lacking the ability or intention to fulfill the contract", "Knowingly intending to abscond after receiving money"]

[0279] "source_charges":["Contract fraud"],

[0280] "note":["

Knowledge Element

Knowledge Element

[0281] "Completion Standard":{

[0282] "Candidate value":[{

[0283] "value":["Property actually controlled by the perpetrator", "Property loss suffered by the victim"]

[0284] "source_charges":["fraud"],

[0285] "note":["

Completed Case Standard

[0286] {

[0287] "value": ["Victim disposes of property based on contract", "Perpetrator actually controls property", "Victim suffers property loss"]

[0288] "source_charges":["Contract fraud"],

[0289] "note":["

Completed Standard

[0290] Sentencing factors: {

[0291] "Candidate value": [{

[0292] "value": ["relatively large amount", "huge amount", "especially huge amount", "principal offender", "accomplice", "coerced accomplice", "instigator", "criminal preparation", "criminal abandonment", "attempted crime", "legitimate defense", "excessive defense", "emergency avoidance", "excessive avoidance", "juvenile delinquency", "criminal offense by the elderly", "criminal offense by a mentally ill person with limited capacity for civil conduct", "criminal offense by a deaf-mute or blind person", "surrender", "confession", "general meritorious service", "special meritorious service", "recidivist", "prison record", "first offense", "occasional offense", "plead guilty and accept punishment", "voluntarily plead guilty in court", "forgiveness", "actively compensate and reach forgiveness", "actively compensate and reach a settlement"]

[0293] "source_charges": ["fraud", "contract fraud"],

[0294] "note": ["

A relatively large amount

A huge amount

An especially huge amount

Juvenile delinquency

Elderly delinquency

[0295] }}.

[0296] Generate annotated templates for obfuscated charge groups;

[0297] {

[0298] "Object of Crime": {

[0299] "value": "",

[0300] "FD": "",

[0301] "Note": {

[0302] Optional values: ["Public and private property ownership", "Economic order (contract management order)"],

[0303] "Explanation": "

Public and Private Property Ownership

Public and Private Property Ownership

Economic Order (Contract Management Order)

Economic Order (Contract Management Order)

[0304] "Objective aspects": {

[0305] "The essence of behavior": {

[0306] "value": "",

[0307] "FD": "",

[0308] "Note": {

[0309] Optional values: ["obtained through deception", "obtained through contract fraud"],

[0310] "Explanation": "

Deceptive Acquisition

Deceptive Acquisition Through Contracts

Deceptive Acquisition Through Contracts

[0311] "Behavioral characteristics": {

[0312] "value": "",

[0313] "FD": "",

[0314] "Note": {

[0315] "Optional Values": ["The victim disposes of property that they have the right to dispose of to the other party, and the victim is aware of the actual existence of the disposed property", "Contractual Relevance"]

[0316] "Explanation": "[The victim disposes of property they have the right to dispose of to the other party] The first step is to determine whether there is an 'act of disposal,' that is, the victim transfers property they have the right to dispose of to the perpetrator. In summary, fraud involves an 'act of disposal.' [The victim is aware of the actual existence of the disposed property] Even if an 'act of disposal' exists, a second step is needed: whether there is 'awareness of disposal,' that is, the victim is aware that they are disposing of a certain existing property. The victim must be aware of the type, name, and other physical characteristics of the disposed property. In cases of quantity error, if the property is disposed of entirely (sold by weight), it is presumed that the victim is aware of the actual existence of the property. In summary, fraud involves 'awareness of disposal.' [Contractual Relevance] The deceptive act is inseparable from the conclusion or performance of the contract, and can be achieved through contract terms, fictitious parties, etc. [Contractual Relevance] Compared with ordinary fraud, there is an additional object of 'disruption of contract management order,' reflecting a focus on protecting market transaction security and the principle of good faith."

[0317] "Subject of the crime":{

[0318] "Candidate value":[{

[0319] "value": "",

[0320] "FD": "",

[0321] "Note": {

[0322] Optional values: ["natural person", "organization"],

[0323] "Explanation": "

Natural Person

Entity

[0324] "Subjective aspect": {

[0325] "Forms of culpability": {

[0326] "value": "",

[0327] "FD": "",

[0328] "Note": {

[0329] Optional value: ["Directly intentional"],

[0330] Note: Both the crime of fraud and the crime of contract fraud require direct intent.

[0331] "Criminal Purpose": {

[0332] "value": "",

[0333] "FD": "",

[0334] "Note": {

[0335] Optional values: ["illegally possessing another person's property", "illegally possessing the property of the other party to a contract"]

[0336] "Explanation": "

Illegally appropriating another's property

Illegally appropriating the property of the other party to a contract

[0337] "Known element": {

[0338] "value": "",

[0339] "FD": "",

[0340] "Note": {

[0341] "Optional values": ["Knowing that he or she was committing a deceptive act, knowing that the victim would dispose of the property based on a mistaken understanding", "Knowing that he or she had no ability or intention to perform the contract, knowing that he or she would abscond after receiving the money or property"]

[0342] "Explanation": "[Knowing that one's actions are deceptive, and knowing that the victim will dispose of property based on a mistaken understanding] The perpetrator of fraud must be aware of the deceptive nature of their actions and the resulting mistaken disposal of property by the victim. [Knowing that one has no ability or intention to perform the contract, and knowing that one will abscond after receiving the money] In contract fraud, the perpetrator must know beforehand that they have no genuine intention or ability to perform the contract, or have the intent to abscond after receiving the money, thus distinguishing it from civil breach of contract."

[0343] "Completion Standard": {

[0344] "value": "",

[0345] "FD": "",

[0346] "Note": {

[0347] Optional values: ["The perpetrator actually controls the property, and the victim suffers property loss", "The victim disposes of the property based on a contract, the perpetrator actually controls the property, and the victim suffers property loss"]

[0348] "Explanation": "

Completed Standard

Completed Standard

[0349] Sentencing factors: {

[0350] "value": "",

[0351] "FD&Hold_that": "",

[0352] "Note": {

[0353] Optional values: ["relatively large amount", "huge amount", "especially huge amount", "principal offender", "accomplice", "coerced offender", "instigator", "criminal preparation", "criminal abandonment", "criminal attempt", "legitimate defense", "excessive defense", "emergency avoidance", "excessive avoidance", "juvenile delinquency", "elderly delinquency", "criminal offense by a mentally ill person with limited capacity for civil conduct", "criminal offense by a deaf-mute or blind person", "surrender", "confession", "general meritorious service", "special meritorious service", "recidivist", "prisoner", "first-time offender", "occasional offender", "plead guilty and accept punishment", "voluntarily plead guilty in court", "forgiveness", "actively compensate and reach forgiveness", "actively compensate and reach a settlement"]

[0354] "Explanation": "

Relatively large amount

Huge amount

Particularly huge amount

Juvenile delinquency

Elderly delinquency

[0355] }

[0356] Step 5: Construct cue words and call the Large Language Model (LLM) for reasoning to extract case elements from the confused charge template.

[0357] In this embodiment of the invention, step 5 includes:

[0358] The descriptive text, the annotated template for the confusing charge group, and the case fact FD are taken as input and submitted to the large language model. The request is to populate the elements for each charge in the confusing charge group according to the template structure, and to limit the field values ​​to the set of candidate values ​​given in the template annotation.

[0359] Template-based case element extraction and task instructions;

[0360] The FD and Hold_that fields must respectively reference the original text of the facts of the crime (FD) and the court's opinion (Hold_that): ① Precise citation of corresponding candidate values: Enclose the FD and Hold_that segments in double quotes (e.g., "He repeatedly fabricated reasons such as seeing a doctor and buying medicine, and asked Peng to transfer money to him," and "The defendant Zhang was summoned to the case and was able to truthfully confess the facts of the crime after being brought to justice, which constitutes a confession. Therefore, the public prosecutor's opinion that he has a confession is accepted by this court"); ② No direct evidence: Write "No direct evidence," val The explanations in the ue reference notes require understanding of optional values ​​and that they must be strictly selected from "Notes → Optional Values": For example, the behavioral characteristic value must be one of the following: ① "The victim disposes of property that he has the right to dispose of to the other party", "The victim is aware of the actual existence of the disposed property"; ② "Contractual relevance" elements of the crime (object of the crime, objective aspect, subject of the crime, subjective aspect, standard of completion) are based solely on FD analysis and do not involve Hold_that; only the "Sentencing Factors" part requires a combination of FD and Hold_that.

[0361] Large model compliant output:

[0362] {

[0363] "Object of Crime": {

[0364] "value": ["ownership of public and private property"],

[0365] "FD": "The defendant, Zhang, falsely claimed to have a niece named 'Li,' and used this identity to communicate with the victim, Peng, via WeChat. He fabricated reasons such as seeking medical treatment and buying medicine, and defrauded Peng of 41,921.00 yuan transferred via WeChat."

[0366] "Note": {}}

[0367] "Objective aspects": {

[0368] "The essence of behavior": {

[0369] "value": ["obtained through deception"],

[0370] "FD": "Zhang fabricated the identity of 'Li' and details of seeing a doctor and buying medicine, causing the victim, Peng, to have a mistaken understanding, and based on this mistake, Peng disposed of property through WeChat transfers."

[0371] "Note": {}}

[0372] "Behavioral characteristics": {

[0373] "value": ["The victim disposes of property that they possessed and had the right to dispose of to the other party"]

[0374] "FD": "The victim, Peng, voluntarily transferred funds from his bank account to the defendant Zhang's account via WeChat, thus completing the disposal and transfer of property."

[0375] "Note": {}}}

[0376] "Subject of the crime": {

[0377] "value": ["natural person"],

[0378] "FD": "“Defendant Zhang Moumou”,

[0379] "Note": {

[0380] Optional values: ["natural person", "organization"],

[0381] "Explanation": "

Natural Person

Entity

[0382] "Subjective aspect": {

[0383] "Forms of culpability": {

[0384] "value": ["directly intentional"],

[0385] "FD": "“He falsely claimed to have a niece named 'Li Moumou' and “repeatedly fabricated reasons such as seeing a doctor and buying medicine, requesting Peng Mou to transfer money to him”,

[0386] "Note": {

[0387] Optional value: ["Directly intentional"],

[0388] Note: Both the crime of fraud and the crime of contract fraud require direct intent.

[0389] "Criminal Purpose": {

[0390] "value": ["illegally appropriating other people's property"],

[0391] "FD": "No direct evidence"

[0392] "Note": {

[0393] Optional values: ["illegally possessing another person's property", "illegally possessing the property of the other party to a contract"]

[0394] "Explanation": "

Illegally appropriating another's property

Illegally appropriating the property of the other party to a contract

[0395] "Known element": {

[0396] "value": ["knowingly engaging in deceptive behavior", "knowingly allowing the victim to dispose of their property based on a mistaken understanding"]

[0397] "FD": "“false claim” or "fictional"

[0398] "Note": {

[0399] "Optional values": ["Knowing that he or she was committing a deceptive act, knowing that the victim would dispose of the property based on a mistaken understanding", "Knowing that he or she had no ability or intention to perform the contract, knowing that he or she would abscond after receiving the money or property"]

[0400] "Explanation": "[Knowing that one's actions are deceptive, and knowing that the victim will dispose of property based on a mistaken understanding] The perpetrator of fraud must be aware of the deceptive nature of their actions and the resulting mistaken disposal of property by the victim. [Knowing that one has no ability or intention to perform the contract, and knowing that one will abscond after receiving the money] In contract fraud, the perpetrator must know beforehand that they have no genuine intention or ability to perform the contract, or have the intent to abscond after receiving the money, thus distinguishing it from civil breach of contract."

[0401] "Completion Standard": {

[0402] "value": ["The perpetrator actually controls the property, and the victim suffers property loss"]

[0403] "FD": "The defendant, Zhang, has actually received and controlled the transfer of 41,921.00 yuan from the victim, Peng, and the victim has suffered actual property loss."

[0404] "Note": {}},

[0405] Sentencing factors: {

[0406] "value": ["substantial amount", "confess", "plead guilty and accept punishment"],

[0407] "FD&Hold_that": "Peng transferred a total of 41,921.00 yuan to Zhang via WeChat. (FD) and 'defrauded others of property, the amount is relatively large.' (Hold_that), [Analysis] The amount involved, 41,921 yuan, meets the 'relatively large amount' standard (≥6,000 yuan) for fraud and the 'relatively large amount' standard (≥20,000 yuan) for individual crimes in contract fraud.", "The defendant Zhang was summoned to the case. (FD) and after being brought to justice, he was able to truthfully confess the facts of the crime, which constitutes a confession. (Hold_that)", "The defendant Zhang voluntarily and truthfully confessed his crimes, admitted the criminal facts charged, and was willing to accept punishment. (Hold_that)",

[0408] "Note": {

[0409] Optional values: ["relatively large amount", "huge amount", "especially huge amount", "principal offender", "accomplice", "coerced offender", "instigator", "criminal preparation", "criminal abandonment", "attempted crime", "legitimate defense", "excessive defense", "emergency avoidance", "excessive avoidance", "juvenile delinquency", "elderly delinquency", "criminal offense", "criminal offense by a mentally ill person with limited capacity for civil conduct", "criminal offense by a deaf-mute or blind person", "surrender", "confession", "general meritorious service", "special meritorious service", "recidivist", "prison record", "first offense", "occasional offense", "plead guilty and accept punishment", "voluntarily plead guilty in court", "forgiveness", "actively compensate and reach forgiveness", "actively compensate and reach a settlement"]

[0410] "Explanation": "

Relatively large amount

Relatively huge amount

Especially huge amount

Crimes committed by minors

Crimes committed by the elderly

[0411] Step 6: According to the case elements, analyze the output of the LLM, perform verification, and generate and store the element table of the confusion charge group.

[0412] In the embodiment of the present invention, step 6 includes:

[0413] Use regular expressions to extract JSON paragraphs or strip markup symbols from the output text of the LLM, call the JSON parsing function to convert it into a dictionary structure; perform field integrity check and candidate value legality verification on the parsing result; organize the results passing the verification into an element table of the confusion charge group and save it for subsequent analysis and modeling.

[0414] Result parsing and legality verification, extract the JSON structure from the model output; check whether the fields are complete, whether the field values belong to the preset candidate set, and whether the data format meets the requirements; discard or re-extract the output that does not meet the requirements.

[0415] Generate and store the element table of the confusion charge group, organize the results passing the verification into a unified "element table of the confusion charge group", and list the element filling situations from the perspectives of the crime of fraud and the crime of contract fraud respectively; store the element table together with information such as the case number for subsequent query and analysis.

[0416] Generate and store the element table of the G1 confusion charge group;

[0417] {

[0418] "case_no": "2020 Liao 0422 Xing Chu X Hao",

[0419] "charge": "crime of fraud",

[0420] "confusion_group": ["fraud", "contract fraud"],

[0421] "elements": {

[0422] "Fraud": {

[0423] "Object of Crime": {

[0424] "value": ["ownership of public and private property"],

[0425] "FD": "Peng paid Zhang a total of 41,921.00 yuan via WeChat transfer."

[0426] "Matching Criteria": "FD directly indicates that the victim's property was damaged, which fully meets the object definition of fraud involving infringement of public and private property ownership."

[0427] "Subject of the crime": {

[0428] "value": ["natural person"],

[0429] "FD": "“Defendant Zhang Moumou”,

[0430] "Matching Criteria": "FD throughout refers to the clearly defined individual 'Defendant Zhang Moumou,' meeting the requirement of a natural person subject."

[0431] "Objective aspects": {

[0432] "The essence of behavior": {

[0433] "value": ["obtained through deception"],

[0434] "FD": "“He falsely claimed to have a niece named 'Li Moumou' and “repeatedly fabricated reasons such as seeing a doctor and buying medicine, requesting Peng Mou to transfer money to him”,

[0435] "Matching Criteria": "FD fully demonstrates the typical fraud chain of 'fabricating facts → causing the victim to make a mistake → disposing of property based on the mistake.'"

[0436] "Subjective aspect": {

[0437] "Forms of culpability": {

[0438] "value": ["directly intentional"],

[0439] "FD": "“Falsely claimed to have a niece named 'Li Moumou'”, and “repeatedly fabricated reasons for seeking medical treatment and purchasing medicine”.

[0440] "Matching Criteria": "The proactive and continuous 'falsehood' and 'fabrication' in FD (Financial Deception) indicate that the perpetrator held a hopeful or indifferent attitude towards the fraudulent act and its consequences, which meets the characteristics of direct intent."

[0441] "Criminal Purpose": {

[0442] "value": ["illegally appropriating other people's property"],

[0443] "FD": No direct evidence.

[0444] "Matching Basis": "FD describes a pattern of demanding money under fabricated reasons, strongly suggesting an 'intent to illegally possess,' though not explicitly stated in writing. This intent has been judicially determined by Hold_that."

[0445] "Known element": {

[0446] "value": ["knowingly engaging in deceptive behavior"],

[0447] "FD": "“false claim” or "fictional"

[0448] "Matching Criteria": "The descriptions of behavior such as 'falsely claiming' and 'fabricating' in the FD itself indicate that the perpetrator was aware of the deceptive nature of their actions."

[0449] "Completion Standard": {

[0450] "value": ["Property actually controlled by the perpetrator", "Property loss suffered by the victim"]

[0451] "FD": "Peng paid Zhang a total of 41,921.00 yuan via WeChat transfer."

[0452] "Matching Basis": "FD clearly stated that the funds had been 'paid' to the defendant, indicating that he had actual control of the property, the victim suffered property loss, and the crime was thus completed."

[0453] Sentencing factors: {

[0454] "value": ["substantial amount", "confession", "plea for guilt and acceptance of punishment"],

[0455] "FD&Hold_that": "“Peng paid Zhang a total of 41,921.00 yuan via WeChat transfer.” (FD) and “Defrauding others of a substantial amount of money.” (Hold_that); “The defendant Zhang was summoned to the station.” (FD) and “After being apprehended, he truthfully confessed to the crime, which constitutes a confession.” (Hold_that); “The defendant Zhang voluntarily and truthfully confessed to his crimes, admitted the charged facts, and was willing to accept punishment.” (Hold_that)

[0456] "Matching Basis": "All circumstances are based on a combination of the factual basis of FD and the judicial determination of Hold_that. The amount deemed 'relatively large' (41,921 yuan) was determined in FD and characterized by Hold_that."

[0457] "Contract fraud": {

[0458] "Object of Crime": {

[0459] "value": ["Public and private property ownership", "Economic order (contract management order)"],

[0460] "FD": "Peng paid Zhang a total of 41,921.00 yuan via WeChat transfer."

[0461] "Matching Basis": "FD only reflects an infringement of property rights, without any factual description involving 'contract' or 'economic order,' therefore it does not meet the complex object requirement of the crime of contract fraud."

[0462] "Subject of the crime": {

[0463] "value": ["natural person"],

[0464] "FD": "“Defendant Zhang Moumou”,

[0465] "Matching Criteria": "FD indicates that the perpetrator is a natural person, which meets the requirement of a natural person as the subject of the crime of contract fraud."

[0466] "Objective aspects": {

[0467] "The essence of behavior": {

[0468] "value": ["defrauded by using a contract"],

[0469] "FD": No direct evidence.

[0470] "Matching Criteria": "FD describes deception and money transfers between individuals based on social relationships, without mentioning the signing, performance, or exploitation of any form of contract, and is unrelated to the core behavior of 'exploiting contracts.'"

[0471] "Subjective aspect": {

[0472] "Forms of culpability": {

[0473] "value": ["directly intentional"],

[0474] "FD": "“Falsely claimed to have a niece named 'Li Moumou'”, and “repeatedly fabricated reasons for seeking medical treatment and purchasing medicine”.

[0475] "Matching Criteria": "The conduct can be presumed to be intentional, but this intent is not specifically aimed at 'using the contract' to commit fraud."

[0476] "Criminal Purpose": {

[0477] "value": ["illegally possessing the property of the other party to the contract"],

[0478] "FD": No direct evidence.

[0479] "Matching Criteria": "Neither FD nor Hold_that indicates the existence of a specific 'contractual counterparty,' suggesting that the criminal intent is directed at an unspecified individual, rather than a contractual party."

[0480] "Known element": {

[0481] "value": ["Knowing that there is no ability or intention to perform the contract"]

[0482] "FD": No direct evidence.

[0483] "Matching Basis": "FD did not involve any premises or commitments related to 'performance,' therefore it is impossible to determine the actor's level of awareness regarding this."

[0484] "Completion Standard": {

[0485] "value": ["Victim disposes of property based on contract", "Perpetrator actually controls property", "Victim suffers property loss"]

[0486] "FD": "Peng paid Zhang a total of 41,921.00 yuan via WeChat transfer."

[0487] "Matching Basis": "FD only satisfies the two points of 'control of property' and 'property loss,' while the core requirement of 'disposal of property based on contract' lacks factual basis."

[0488] Sentencing factors: {

[0489] "value": ["substantial amount", "confession", "plea for guilt and acceptance of punishment"],

[0490] "FD&Hold_that": "“Peng paid Zhang a total of 41,921.00 yuan via WeChat transfer.” (FD) and “fraudulently obtained property from others, amounting to a substantial sum.” (Hold_that)

[0491] "Matching Criteria": "The amount (41,921 yuan) meets the threshold for 'relatively large amount' (≥20,000 yuan for individuals) in the crime of contract fraud, but this circumstance is shared by both crimes and is not unique to contract fraud, and there are no other circumstances specific to this crime (such as corporate crime). 'The defendant Zhang was summoned to the case.' (FD) and 'After being apprehended, he was able to truthfully confess to the crime, which constitutes a confession.' (Hold_that); 'The defendant Zhang voluntarily and truthfully confessed to his crimes, admitted the charged crimes, and was willing to accept punishment.' (Hold_that)."

[0492] In this embodiment of the invention, downstream applications are also included after step 6;

[0493] The elements table of confused charges will be provided to the given charge / characterization auxiliary analysis module, the similar case retrieval module, and the conviction prediction or sentencing prediction model, respectively, and will be used to determine the charge requirements that a case meets, to search by element similarity, and as feature input.

[0494] This invention provides a hierarchical extraction system for legal elements based on large models and knowledge graphs, such as... Figure 3 As shown, the system includes:

[0495] The data preprocessing module is used to obtain basic information and factual text of the case; read case records from the original judgment documents or preprocessed datasets; extract information such as case number (case_no), charge, and criminal facts (FD); and filter out samples that are obviously too short or marked as "investigate further" in the case details section, making them unsuitable for extraction.

[0496] The automatic matching module for confusing charge groups is used to automatically match confusing charge groups Gi based on basic information and factual text; maintain the mapping relationship between [charge → confusing charge group], for example: fraud / contract fraud → confusing charge group G1; traffic accident / dangerous driving → confusing charge group G2, etc.; automatically determine the set of confusing charge groups to which the current case belongs or needs to be compared and analyzed based on the charge information of the case or based on the pre-trained coarse-grained charge prediction results; and support the situation where one case corresponds to multiple confusing charge groups.

[0497] The knowledge graph management and field metadata extraction module is used to store the criminal law knowledge graph KG, and to read the four elements and element metadata of the confused crime group Gi through the knowledge graph KG; it stores the criminal law knowledge graph KG, records the four elements structure and detailed elements of each crime (such as the nature of the behavior, harmful behavior, behavior characteristics, knowledge element, completion standard, etc.), as well as content / note information for extracting constraints and annotations; it maintains a predefined set of confused crime groups, such as {fraud, contract fraud}, {traffic accident, dangerous driving}, etc.; and it provides an interface to read the definition of the corresponding crime element by confused crime group.

[0498] The module for merging and confusing crime groups with multiple crime elements is used to merge multiple crime elements and unify field modeling, generating annotated templates for confusing crime groups. For a specific confusing crime group, it reads the four elements and detailed element structures of each crime in the current group from the knowledge graph. Through a merging algorithm, it aligns and unifies fields such as "behavioral nature, harmful behavior, and behavioral characteristics," and merges and removes duplicates of candidate values ​​by crime under each field. It adopts a recursive merging strategy for data structures such as list / dict / content / note to form a unified, multi-level element metadata structure.

[0499] In this embodiment of the invention, an automatic annotation template generation module is also included, which is used to: summarize the candidate value set of each field based on the merged element metadata; summarize the criminal facts or the court's opinion corresponding to the candidate values ​​of each field; summarize note information and form a brief explanatory text; automatically generate a multi-level JSON template with the core structure of "unified field + crime column", and write the following in the form of notes after the fields: "[Optional Values]", listing the candidate value set; "[Explanation]", giving a brief explanation formed by summarizing multiple notes.

[0500] The Large Language Model (LLM) prompt construction and reasoning module is used to construct prompt words and call the LLM for reasoning to extract case elements from the confused crime group template; combine explanatory text, confused crime group element template (including automatic annotations) and case crime facts FD to form prompt information for reasoning; call the LLM to generate JSON structured output that conforms to the template constraints; and require the model to select only the candidate value set given by the template when taking values ​​for each field.

[0501] The parsing output and legality verification module is used to parse the LLM output based on case elements, perform verification, generate and store the confusing crime group element table, extract JSON strings from model output (such as text containing code blocks) and parse them; check whether the fields are complete, whether the field values ​​belong to the preset candidate set, and whether the data format meets the requirements; and convert the results that pass the verification into a unified confusing crime group element table structure.

[0502] In this embodiment of the invention, a result storage and downstream application module is also included to write the element table of the confused charge group corresponding to each case into a JSONL file or a relevant database row by row; and to provide a data interface for downstream systems such as conviction prediction, similar case retrieval, and sentencing recommendations to call and display.

[0503] This invention introduces an analytical perspective of confusing crime groups, treating a group of easily confused crimes (such as fraud and contract fraud) as a whole analytical unit. For a single case, it can simultaneously extract the constituent elements of each crime within the group, thereby supporting comparative analysis from the perspective of multiple crimes in a single case.

[0504] By utilizing knowledge graphs to automatically generate unified element templates for multiple crimes, the four elements and their detailed elements of each crime are modeled in the criminal law knowledge graph. Through multi-crime element merging and field alignment algorithms, a multi-level template framework with "unified field + crime column" as the core structure is automatically generated to achieve consistent modeling across crimes.

[0505] By leveraging the strong constraints of the large language model's output, post-processing is significantly simplified. Candidate values ​​and explanatory information from the knowledge graph are automatically written into template annotations. This not only improves semantic reasoning performance but also effectively suppresses the generation of illusions in the large model, guiding it to select from a predefined candidate set. By constraining the output space from the source, post-processing can be simplified to JSON parsing and validity verification, rather than relying on a large number of normalization rules.

[0506] An automatic matching and annotation mechanism for confusing charge groups is introduced. Based on the charge or preliminary prediction results of a case, it automatically matches possible confusing charge groups. At the same time, the content / note in the knowledge graph is automatically converted into field annotations of "optional values ​​+ legal explanations", forming an automatic annotation template for large language models, realizing the linkage between knowledge graph, template and model reasoning.

[0507] A group element table of confusing charges is formed that can directly serve the distinction of conviction and downstream modeling. Through the above method, a group element table of confusing charges with a unified structure and horizontal comparability is generated for each case, providing a high-quality structured feature foundation for the correction of conviction and characterization errors, case retrieval, judicial policy research and data-driven conviction / sentencing prediction models.

[0508] The key technical aspects to be protected by this invention include, but are not limited to, the following:

[0509] 1. Definition and organization method of confusing crime groups: It proposes the idea of ​​extracting and comparing elements of several easily confused crimes as an analytical unit, and managing them uniformly in the system in the form of confusing crime groups.

[0510] 2. A multi-crime “four-element-factor” modeling method based on knowledge graph: In the knowledge graph, the four elements and their detailed elements, including sentencing factors, of each crime are uniformly modeled, and candidate values ​​and explanatory information are stored in the form of content / note for subsequent template generation and extraction constraints. This framework adopts an open design. The elements listed in this invention are only examples, and the system can be flexibly expanded and refined according to actual needs.

[0511] 3. Automatic matching mechanism for confusing charge groups: Starting from the charge information or charge prediction results of the case, the system automatically locates the confusing charge groups that should be included in the analysis, without the need for manual specification of the comparison charge set for each case.

[0512] 4. Multi-crime element merging and alignment algorithm: For knowledge graph fragments corresponding to confused crime groups, recursively merge and deduplicate at the dict, list, content / note levels to generate a unified element structure and realize the modeling method of "unified field + multi-crime candidate value set".

[0513] 5. The "Unified Field + Crime Name Column" Confused Crime Name Group Element Template Structure: Through field alignment and merging, an element comparison template is formed with unified fields as rows and crime names as columns, so that the same case can complete the element filling in from multiple crime name perspectives at the same time.

[0514] 6. Automatic annotation template generation mechanism: Automatically converts the content / not information in the knowledge graph into annotations (including a list of optional values ​​and brief descriptions) after the fields in the template, forming a constrained template for the extraction model, so that the model takes into account both standardized values ​​and legal meaning when it is generated.

[0515] 7. Automatic Template Generation and Constraint-Based Prompt Method: Based on the configuration of the confused crime group and the knowledge graph content, the method automatically generates element templates and embeds the templates (including candidate values, the corresponding criminal facts or the court's opinion and explanation) into the reasoning input. It clearly constrains the output structure and value range. The four elements of criminal law (object of crime, objective aspect, subject of crime, and subjective aspect) and the standard of completed crime are all based on semantic reasoning of criminal facts. Sentencing factors are extracted based on criminal facts and the court's opinion. This method performs excellently in the extraction of common conviction and sentencing factors, with an accuracy rate of over 99% in the extraction of sentencing factors. It also has good scalability and can effectively adapt to further expansion and refinement of the element system.

[0516] 8. Lightweight post-processing mechanism based on constraint output: Structured results can be generated by parsing JSON and validating the validity of candidate values, without relying on complex text normalization and multi-layer rule systems.

[0517] 9. Application interface design of "Confused Crime Group Element Table" in conviction differentiation and downstream modeling: The extracted results are provided to conviction prediction, case retrieval, sentencing suggestions and other systems in the form of "Confused Crime Group Element Table" as feature input and visualization basis.

[0518] 10. The end-to-end processing flow driven by the above template: the complete process design and implementation from case data input, automatic matching of confused crime groups, element merging and annotation template generation, to model extraction, result verification and storage.

[0519] Compared with the prior art, the present invention has the following advantages:

[0520] 1. Introducing the perspective of confusing crime groups to achieve parallel extraction of elements of multiple crimes in one case: This invention no longer focuses on extracting elements around a single crime, but rather on confusing crime groups in judicial practice, generating element filling results for each crime group in the same case at the same time; it is conducive to directly comparing the differences in the constituent elements of easily confused crimes such as "fraud - contract fraud" and "traffic accident - dangerous driving" at the structured level.

[0521] 2. Knowledge Graph-Driven Unified Element Framework for Multiple Crimes: The framework unifies the four elements and their detailed components through a criminal law knowledge graph, and constructs an element template of "unified field + crime column" through a merging algorithm. This enables comparative analysis of the nature of the behavior, harmful behavior, subjective aspects, sentencing circumstances, etc., of each crime under the same field dimension, which conforms to the legal system of conviction and sentencing, and facilitates interpretability analysis and tracing of the root causes by judges, prosecutors, lawyers and other judicial practitioners.

[0522] 3. Automatic matching of confusing charge groups, improving the system's automation level: The system can automatically match the corresponding confusing charge groups based on the case's charge or coarse-grained prediction results, without the need for manual comparison of each case; suitable for large-scale batch processing and case handling assistance scenarios.

[0523] 4. Automatic merging and annotation of multiple crime elements, automatic template generation, facilitating expansion and maintenance: The element template is automatically generated by the knowledge graph and merging algorithm, and the fields and candidate values ​​can be automatically adjusted as the knowledge graph is updated; the automatic annotation template converts content / not information into a structure of "optional value + legal explanation", which makes it easier for the model to accurately understand the meaning of each element during extraction; it is easy to extend to other confusing crime groups or other areas of legal application.

[0524] 5. The generated output is constrained by the template, and the post-processing workload is small: The template clearly lists the candidate value set and brief description of each field, guiding the model to select within the predetermined value range; the post-processing mainly involves JSON parsing and candidate value validity verification, without the need for large-scale natural language normalization and complex rule matching, which helps to reduce the overall system complexity.

[0525] 6. The output results can directly serve the determination of guilt and downstream modeling: The output "Confused Crime Group Element Table" is a structured and comparable data representation that can be directly used as feature input for models such as: conviction prediction and conviction anomaly warning; case retrieval and case cluster analysis; research on crime boundaries and scope of application; and sentencing recommendations.

[0526] The technical solution provided by this invention includes the following steps: acquiring basic case information and factual text; automatically matching a confused charge group Gi based on the basic information and factual text; reading the four elements and element metadata of the confused charge group Gi through a knowledge graph (KG); merging multiple charge elements and modeling unified fields to generate an annotated confused charge group template; constructing prompt words and calling a large language model (LLM) for reasoning to extract case elements from the confused charge group template; parsing the LLM output based on the case elements, performing verification, generating and storing a confused charge group element table. This method achieves automatic merging, automatic annotation, and templated extraction of multiple charge element groups.

[0527] The above description is only a preferred embodiment of the present invention and is not intended to limit the present invention. Any modifications, equivalent substitutions, improvements, etc., made within the spirit and principles of the present invention should be included within the scope of protection of the present invention.

Claims

1. A hierarchical extraction method for legal elements based on large models and knowledge graphs, characterized in that, The method includes: Step 1: Obtain basic information and factual text of the case; Step 2: Based on the basic information and factual text, automatically match the confusing charge group Gi; Step 3: Read the four elements and metadata of the confusing crime group Gi through the knowledge graph KG; Step 4: Merge multiple crime elements and model unified fields to generate annotated templates for confusing crime groups; Step 5: Construct cue words and call the Large Language Model (LLM) for reasoning to extract case elements from the confused charge group template; Step 6: Based on the case elements, parse the LLM output, perform verification, and generate and store the element table of the confusing crime group; Step 3 includes: Using the knowledge graph KG, we read the four elements and element metadata corresponding to each crime in the confused crime group Gi. The read content includes the candidate value set content and annotation note information; the element metadata includes the nature of the behavior, the harmful behavior, the characteristics of the behavior, the knowledge element, and the standard of completion. The knowledge graph KG used is organized as follows: The root node represents the criminal law knowledge graph; the child nodes represent specific crimes. Under the crime node, it is further broken down into: The object of a crime, its objective aspect; the subject of a crime, its subjective aspect; the standard for completion of a crime. Record at the leaf nodes of each element: The content is the set of candidate values ​​for the current element under the current charge; the note is the annotation information for the content of the current element, which is used to supplement the legal connotation and applicable boundaries briefly expressed in the content, and is used to generate the description field in the confusing charge group template. Step 4 includes: The crime element structure is recursively merged. Through multi-crime element merging and field alignment algorithms, the nature of the behavior, harmful behavior, behavior characteristics, known elements, and completed standards are horizontally aligned and deduplicated. For each field, the candidate value set of each crime is retained to form a unified, multi-level element metadata structure. A JSON template is generated based on the element metadata, and annotation information is automatically added to each field in the template to obtain an annotated confusion charge group template for large language model inference.

2. The method according to claim 1, characterized in that, The basic information in step 1 includes the case number (case_no), the charge (charge), the facts of the crime (FD), and the court's opinion (Hold_that).

3. The method according to claim 2, characterized in that, Step 2 includes: Based on the case charge or the previous coarse-grained classification results, the corresponding confusing charge group Gi is automatically determined by the mapping relationship of [charge → confusing charge group]. If no confusing charge group is matched, the process ends directly or enters other processing paths.

4. The method according to claim 1, characterized in that, Step 5 includes: The descriptive text, the annotated template for the confusing charge group, and the case fact FD are taken as input and submitted to the large language model. The request is to populate the elements for each charge in the confusing charge group according to the template structure, and to limit the field values ​​to the set of candidate values ​​given in the template annotation.

5. The method according to claim 4, characterized in that, Step 6 includes: Using regular expressions, extract JSON paragraphs or strip markers from the LLM output text, and call the JSON parsing function to convert them into a dictionary structure; perform field integrity checks and candidate value validity checks on the parsing results; organize the results that pass the checks into a confusing crime group element table and save it for subsequent analysis and modeling.

6. The method according to claim 5, characterized in that, Step 6 is followed by downstream applications; The elements table of confused charges will be provided to the given charge / characterization auxiliary analysis module, the similar case retrieval module, and the conviction prediction or sentencing prediction model, respectively, and will be used to determine the charge requirements that a case meets, to search by element similarity, and as feature input.

7. A hierarchical legal element extraction system based on large models and knowledge graphs, characterized in that, The system is used to implement the hierarchical extraction method of legal elements based on large models and knowledge graphs as described in claim 1. The system includes: The data preprocessing module is used to obtain basic information and factual text of the case; The automatic matching module for confusing charge groups is used to automatically match the confusing charge group Gi based on basic information and factual text; The knowledge graph management and field metadata extraction module is used to store the criminal law knowledge graph KG and read the four elements and element metadata of the confused crime group Gi through the knowledge graph KG; The module for merging multiple crime elements and confusing crime groups is used for merging multiple crime elements and unifying field modeling, generating annotated confusing crime group templates; The Large Language Model (LLM) prompt construction and reasoning module is used to construct prompt words and call the Large Language Model (LLM) for reasoning to extract case elements from the confused crime group template. The parsing output and legality verification module is used to parse the LLM output based on case elements, perform verification, and generate and store the element table of the confusing crime group.