Dialogue topic partitioning method and system based on context correlation
A context and correlation technology, applied in semantic analysis, special data processing applications, instruments, etc., can solve the problems that the method of detecting topic transfer has not yet appeared, and achieve the effect of stable reliability, strong reliability, and high test accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0055] This embodiment provides a method for segmenting dialogue topics based on contextual correlation, including the following steps:
[0056] Step 1: Collect multiple rounds of dialogue data, and randomly sample them to obtain a training data set;
[0057] Step 2: Carry out vectorization processing on the training data set to obtain the corresponding corpus vector space of the training data set;
[0058] Step 3: organizing the corpus vector space into a sequence of sentences;
[0059] Step 4: Calculate the correlation between adjacent sentences;
[0060] Step 5: Identify the topic boundaries of multiple rounds of dialogue data based on the correlation between adjacent sentences to form a topic segmentation model.
[0061] Optionally, step 6 is also included: testing the topic segmentation model with a verification data set. The verification data set is obtained by randomly sampling the collected dialogue data of multiple rounds.
[0062] Optionally, step 7 is also inclu...
Embodiment 2
[0114] The purpose of this embodiment is to provide a dialogue topic segmentation system based on context information.
[0115] In order to achieve the above object, the present invention adopts the following technical scheme:
[0116] A dialogue topic segmentation system based on context information, comprising a processor and a computer-readable storage medium, the processor is used to implement instructions; the computer-readable storage medium is used to store multiple instructions, and the instructions are suitable for being loaded by the processor and Perform the following processing:
[0117] Step 1: Collect multiple rounds of dialogue data, and randomly sample them to obtain a training data set;
[0118] Step 2: Carry out vectorization processing on the training data set to obtain the corresponding corpus vector space of the training data set;
[0119] Step 3: organizing the corpus vector space into a sequence of sentences;
[0120] Step 4: Calculate the correlation...
Embodiment 3
[0123] The purpose of this embodiment is to provide a computer-readable storage medium.
[0124] In order to achieve the above object, the present invention adopts the following technical scheme:
[0125] A computer-readable storage medium, on which a computer program is stored for dialogue topic segmentation based on context information, the program performs the following steps when executed by a processor:
[0126] Step 1: Collect multiple rounds of dialogue data, and randomly sample them to obtain a training data set;
[0127] Step 2: Carry out vectorization processing on the training data set to obtain the corresponding corpus vector space of the training data set;
[0128] Step 3: organizing the corpus vector space into a sequence of sentences;
[0129] Step 4: Calculate the correlation between adjacent sentences;
[0130] Step 5: Identify the topic boundary of multiple rounds of dialogue data according to the correlation between adjacent sentences, form a topic segmen...
PUM

Abstract
Description
Claims
Application Information

- Generate Ideas
- Intellectual Property
- Life Sciences
- Materials
- Tech Scout
- Unparalleled Data Quality
- Higher Quality Content
- 60% Fewer Hallucinations
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2025 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com