Question and answer content extraction method and system in programming environment, electronic equipment and medium

A technology for extracting question-and-answer content in a programming environment, applied in the Internet field. It addresses the lack of summary extraction methods for the software engineering field, and achieves the effects of reducing costs, improving development efficiency, and reducing browsing time.

Active Publication Date: 2021-07-06
NANJING UNIV
15 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

However, current summary extraction work targets the general domain, and no researchers have proposed techniques or methods for summary extraction in the field of software engineering.



Examples


Embodiment 1

[0029] Embodiment 1: The present invention proposes a system for extracting question-and-answer content in a programming environment, including:

[0030] A data processing module, configured to preprocess the input online question-and-answer text data, remove useless information, and perform word segmentation;

[0031] An entity recognition module, configured to perform software-engineering-domain entity recognition on the text processed by the data processing module;

[0032] A document reading module, configured to input the text recognized by the entity recognition module into a neural network for document reading;

[0033] A summary extraction module, configured to use another neural network to extract the key content of the question-and-answer text.
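
The four modules listed above form a linear pipeline. The following Python sketch illustrates only that composition. The class name `ExtractionPipeline`, the field names, the intermediate data types, and the toy stand-in functions are all hypothetical and do not come from the patent; in particular, the two neural networks are replaced by trivial rules so that the example runs.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

Tokens = List[str]
Tagged = List[Tuple[str, str]]  # (token, entity tag) pairs; an assumed format


@dataclass
class ExtractionPipeline:
    """Chains the four modules in the order the text describes."""
    preprocess: Callable[[str], Tokens]                  # data processing module
    recognize_entities: Callable[[Tokens], Tagged]       # entity recognition module
    read_document: Callable[[Tagged], List[float]]       # document reading module
    extract_summary: Callable[[Tokens, List[float]], Tokens]  # summary extraction module

    def run(self, raw_text: str) -> Tokens:
        tokens = self.preprocess(raw_text)
        tagged = self.recognize_entities(tokens)
        scores = self.read_document(tagged)
        return self.extract_summary(tokens, scores)


# Toy stand-ins (assumptions, NOT the patent's models).
def toy_preprocess(text: str) -> Tokens:
    return text.split()


def toy_recognize(tokens: Tokens) -> Tagged:
    # Pretend any dotted token is an API entity.
    return [(t, "API" if "." in t else "O") for t in tokens]


def toy_read(tagged: Tagged) -> List[float]:
    # Stand-in for the first neural network: one score per token.
    return [1.0 if tag == "API" else 0.0 for _, tag in tagged]


def toy_extract(tokens: Tokens, scores: List[float]) -> Tokens:
    # Stand-in for the second neural network: keep high-scoring tokens.
    return [t for t, s in zip(tokens, scores) if s > 0.5]


pipeline = ExtractionPipeline(toy_preprocess, toy_recognize, toy_read, toy_extract)
```

Calling `pipeline.run(...)` on a raw string passes it through the four stages in order; swapping any stand-in for a real model would not change the surrounding structure.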

[0034] Preferably, the specific execution of the data processing module includes: initial state; processing code segments in the question-and-answer text; processin...

Embodiment 2

[0037] Embodiment 2: The present invention also proposes a method for extracting question-and-answer content in a programming environment. The overall framework of the invention is shown in Figure 1, and the method includes the following 4 steps:

[0038] Step 1: For Q&A text from the web, first clear all tags that hold code snippets; because the code snippets in a Q&A post appear inside these tags, clearing the tag content also clears the code segments. Then delete all remaining HTML tags. Next, replace each URL in the text with "@u@", replace each emoticon such as ":)" with "@e@", and replace each "@" mention of another user with "@a@". Finally, use the nltk word segmentation tool to segment the text. The word segmentation needs to treat an API name as a whole. For example, os.path.join(path) needs to be separat...
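
The preprocessing in Step 1 can be sketched in Python as below. This is an illustrative sketch under several assumptions, not the patented implementation: the code-bearing tag is assumed to be `<code>` (the tag name is missing from the text above), the emoticon pattern covers only a few common forms, and a standard-library regular expression stands in for the nltk tokenizer. The way a dotted API name is kept whole and the parentheses are split off is also an assumption, since the original example is truncated. All function and constant names are hypothetical.

```python
import re
from typing import List

# Assumption: code snippets live in <code>...</code> elements.
CODE_BLOCK = re.compile(r"<code\b[^>]*>.*?</code\s*>", re.S | re.I)
ANY_TAG = re.compile(r"<[^>]+>")
URL = re.compile(r"https?://\S+")
# Small illustrative emoticon set such as :) ;) :-( :D
EMOTICON = re.compile(r"(?<!\w)[:;]-?[)(DP](?!\w)")
# "@name" mentions; the guards skip the @u@ / @e@ placeholders.
MENTION = re.compile(r"(?<![\w@])@\w+\b(?!@)")
# Placeholders, dotted API names, plain words, then single symbols.
TOKEN = re.compile(
    r"@[uea]@|[A-Za-z_]\w*(?:\.[A-Za-z_]\w*)+|\w+|[^\w\s]"
)


def preprocess(html_text: str) -> List[str]:
    """Clean one Q&A post and split it into tokens."""
    text = CODE_BLOCK.sub(" ", html_text)   # drop code tags and their content
    text = ANY_TAG.sub(" ", text)           # drop all remaining HTML tags
    text = URL.sub("@u@", text)             # URLs -> @u@
    text = EMOTICON.sub("@e@", text)        # emoticons -> @e@
    text = MENTION.sub("@a@", text)         # user mentions -> @a@
    return TOKEN.findall(text)              # stand-in for nltk segmentation
```

For instance, `preprocess("<p>Use os.path.join(path) thanks @bob</p>")` keeps `os.path.join` as a single token and maps the mention to `@a@`. If nltk is available, the same `TOKEN` pattern could be passed to `nltk.tokenize.RegexpTokenizer` instead of calling `findall` directly.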

Embodiment 3

[0048] Embodiment 3: The present invention also proposes an electronic device, including a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein the steps of the method are implemented when the processor executes the program.



Abstract

The invention discloses a method and system for extracting question-and-answer content in a programming environment, together with an electronic device and a medium. The system comprises: a data processing module, which preprocesses the input online question-and-answer text data, removes useless information, and performs word segmentation; an entity recognition module, which performs software-engineering-domain entity recognition on the text processed by the data processing module; a document reading module, which inputs the text recognized by the entity recognition module into a neural network for document reading; and a summary extraction module, which uses another neural network to extract the key content of the question-and-answer text. The key content of technical questions and answers can thus be extracted, which shortens developers' browsing time and improves development efficiency in the programming field.

Description

Technical field

[0001] The invention relates to a method, system, electronic device, and medium for extracting question-and-answer content in a programming environment, and belongs to the technical field of the Internet.

Background

[0002] Software development is flexible and challenging work, and developers need strong learning and problem-solving abilities. In the programming field, besides consulting reference books when they encounter problems, developers often seek help on the Internet, ask other developers who have met similar problems, and learn from other people's solutions to avoid duplicated effort and improve development efficiency. As a result, software question-and-answer communities have become increasingly active, and are committed to giving developers a platform for helping each other and recording questions.

[0003] There are more and more active developers on technical question-and-answer platforms. They ask questions...


Application Information

Patent Type & Authority: Applications (China)
IPC (8): G06F8/70, G06F16/33, G06F16/332, G06F16/34, G06F40/211, G06F40/295
CPC: G06F8/70, G06F16/3329, G06F16/3344, G06F16/345, G06F16/3335, G06F40/295, G06F40/211, Y02D10/00
Inventor: 陈林, 赵恒辉, 李言辉
Owner: NANJING UNIV