A data processing method and device based on a question answering platform
A question-and-answer platform data processing technique, applied in the field of data processing, that addresses low data mining efficiency and accuracy by quantifying similarity, eliminating noise, and improving processing efficiency.
Active Publication Date: 2019-09-03
BEIJING SOGOU TECHNOLOGY DEVELOPMENT CO LTD
AI Technical Summary
Problems solved by technology
[0005] The technical problem to be solved by the embodiments of the present invention is to provide a data processing method based on a question-and-answer platform that addresses the low efficiency and accuracy of data mining.
Embodiment 2
[0043] On the basis of the above embodiments, this embodiment further discusses the data mining process of the question answering platform.
[0044] Referring to Figure 2, a flow chart of the steps of an optional embodiment of a data processing method based on a question answering platform of the present invention is shown, which may specifically include the following steps:
[0045] Step 201, the questions and the answer data corresponding to each question are obtained from the question-and-answer platform.
[0046] Step 202, feature extraction is performed on the text of each question and its corresponding answer data.
[0047] Step 203, the question and answer data are classified according to the extracted features, with each item assigned to one of the preset categories.
[0048] Questions and their corresponding answer data are obtained from the question-and-answer platform and stored in a data structure in which questions and answers correspond...
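Steps 201 to 203 can be sketched as follows. This is a minimal illustration only, not the patent's method: the source does not state which features or classifier are used, so the bag-of-words features, the `PRESET_CATEGORIES` table, its seed keywords, and the function names are all assumptions made for this example.

```python
import re
from collections import Counter

# Hypothetical preset categories with seed keywords (not taken from the patent).
PRESET_CATEGORIES = {
    "automotive": {"car", "vehicle", "battery", "charging"},
    "health": {"doctor", "symptom", "medicine", "fever"},
}


def extract_features(text):
    """Step 202 (sketch): lowercase bag-of-words term counts for one text."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def classify(question, answers):
    """Step 203 (sketch): choose the category whose seed keywords occur most often."""
    features = extract_features(question)
    for answer in answers:
        features.update(extract_features(answer))
    scores = {
        category: sum(features[word] for word in keywords)
        for category, keywords in PRESET_CATEGORIES.items()
    }
    best = max(scores, key=scores.get)
    # Fall back to a catch-all label when no seed keyword matched at all.
    return best if scores[best] > 0 else "uncategorized"


def classify_all(qa_data):
    """Step 201 is assumed to have produced qa_data: question -> list of answers."""
    return {question: classify(question, answers) for question, answers in qa_data.items()}
```

A call such as `classify_all({"How long does charging an electric vehicle take?": ["About eight hours for a full battery."]})` maps the question to the assumed `automotive` category. A real system would likely replace the keyword table with a trained classifier.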
Embodiment 3
[0085] On the basis of the above embodiments, this embodiment also provides a data processing device based on a question answering platform.
[0086] Referring to Figure 4, a structural block diagram of an embodiment of a data processing device based on a question-and-answer platform of the present invention is shown, which may specifically include the following modules:
[0087] An analysis module 401, configured to perform text analysis on each item of answer data obtained from the question-and-answer platform to determine the similarity between the items of answer data;
[0088] A clustering module 402, configured to cluster the questions corresponding to the answer data by that similarity, using the correspondence between questions and answer data recorded by the question-and-answer platform, so as to obtain the question clusters;
[0089] A generating module 403, configured to perform text analysis on each question in each question cluster and extract the relevant word p...
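The three modules can be pictured as a small pipeline. The sketch below is an assumption-laden illustration, not the device described in the patent: it uses Jaccard similarity over token sets for module 401, a union-find merge with a hypothetical `threshold` for module 402, and all pairwise keyword combinations for module 403. The stopword list, function names, and the one-answer-per-question simplification are also assumptions.

```python
import re
from itertools import combinations

# Hypothetical stopword list used to reduce texts to keywords.
STOPWORDS = {"the", "a", "an", "is", "of", "what", "how", "can", "and", "per", "about"}


def keywords(text):
    """Lowercase word tokens with stopwords removed."""
    return set(re.findall(r"[a-z0-9]+", text.lower())) - STOPWORDS


def answer_similarity(answer_a, answer_b):
    """Module 401 (sketch): Jaccard similarity between two answers' keyword sets."""
    a, b = keywords(answer_a), keywords(answer_b)
    return len(a & b) / len(a | b) if (a or b) else 0.0


def cluster_questions(qa, threshold=0.3):
    """Module 402 (sketch): merge questions whose answers are similar enough.

    qa maps each question to a single answer text (a simplification).
    """
    parent = {q: q for q in qa}

    def find(q):
        while parent[q] != q:
            parent[q] = parent[parent[q]]  # path halving
            q = parent[q]
        return q

    for q1, q2 in combinations(qa, 2):
        if answer_similarity(qa[q1], qa[q2]) >= threshold:
            parent[find(q1)] = find(q2)

    clusters = {}
    for q in qa:
        clusters.setdefault(find(q), []).append(q)
    return list(clusters.values())


def related_word_pairs(cluster):
    """Module 403 (sketch): pair up keywords drawn from questions in one cluster."""
    words = set()
    for question in cluster:
        words |= keywords(question)
    return list(combinations(sorted(words), 2))
```

Clustering on answers rather than on question wording means that two questions phrased very differently can still land in the same cluster when their answers overlap, which is the behavior the modules above describe.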
Abstract
The invention provides a data processing method and apparatus based on a question-and-answer platform to solve the problem of low efficiency and precision in data mining. The method comprises: performing text analysis on answer data acquired from the question-and-answer platform and determining the similarity of the answer data; clustering the questions corresponding to the answer data by that similarity, using the correspondences between questions and answer data recorded by the question-and-answer platform, to obtain question clusters; and performing text analysis on the questions in each question cluster and extracting related word pairs composed of keywords of the questions in that cluster, wherein keywords in the same question cluster are correlated. According to the invention, similarity analysis of the answer data removes noise from the answer data, reduces the processing of irrelevant data, and effectively quantifies the similarity of the questions, thereby improving data processing efficiency and precision.
Description
Technical field
[0001] The present invention relates to the technical field of data processing, in particular to a data processing method based on a question answering platform and a data processing device based on a question answering platform.
Background
[0002] The question-and-answer platform provides a communication platform for users. Users can receive help from experts and other netizens on the platform, and can in turn try to provide effective help to other users. The content of the question-and-answer platform is generated by users, and valuable information of many kinds can be obtained from it through statistics and mining.
[0003] When related entities are mined from a Q&A platform, related questions on the same topic are usually extracted first, and related entity information is then mined from the answers that different users give to the same question. For example, on the topic of electric vehicles, for question 1 on the qu...