Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

NLP-based research data standardization method and system

A standard and data technology, applied in the field of data processing, can solve the problems of high staff requirements, error-prone, and heavy workload.

Active Publication Date: 2021-08-31
立信(重庆)数据科技股份有限公司
View PDF12 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The manual processing is identified and judged by the staff. The staff need to remember all the standardized research fields, indicators, and standard descriptions of the index options, and then analyze the research data and index descriptions and standard descriptions of a single project to select the most suitable one. Associating with the standard description not only has very high requirements on the staff but also has a huge workload, and it is easy to make mistakes
The machine processing method is automatically associated by the system through semantic recognition. Although the efficiency is guaranteed, errors are prone to occur when a certain item or indicator has multiple standard descriptions with similar semantics

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • NLP-based research data standardization method and system
  • NLP-based research data standardization method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0041] Such as figure 1 As shown, an NLP-based survey data standardization method, it should be noted that the implementation of this method relies on an NLP-based survey data standardization system, such as figure 2 As shown, the system includes standard library, entry unit, recognition unit and confirmation unit. Wherein, the input unit and the confirmation unit are integrated in the working end, and the identification unit and the standard library are integrated in the server. In this embodiment, the working end is a PC loaded with corresponding programs, and the server is a cloud server.

[0042] Standard descriptions of various research fields are stored in the standard library. Standard descriptions include research field standard descriptions, index standard descriptions, and index option standard descriptions; the input unit is used to enter survey data and description information. The description information includes field descriptions, index descriptions, and Index...

Embodiment 2

[0052] Different from Embodiment 1, the NLP-based survey data standardization system of this embodiment also includes a checking unit and a statistical unit. Wherein, the checking unit is integrated in the second working end, the second working end is a PC loaded with corresponding programs, and the statistical unit is integrated in the server.

[0053] The description of each standard description is also stored in the standard library; the confirmation unit is also used to send an unassociated signal to the verification unit, and the unassociated signal includes description information and the corresponding unassociated successful standard description; the verification unit is used to receive the unassociated After the signal, display the corresponding description information, the unlinked successful standard description and the description of the standard description; the checking unit is also used for inputting modification information or missing information after receiving ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention belongs to the field of data processing, and particularly relates to an NLP-based research data standardization method and system. The method comprises the following steps: a standard library manufacturing step: creating a standard library, formulating standard descriptions of various research fields and storing the standard descriptions in the standard library, the standard descriptions comprising research field standard descriptions, index standard descriptions and index option standard descriptions; a data input step: inputting survey data and description information of the project, the description information including field description, index description and index option description; an information identification step: carrying out word segmentation on the input description information, then carrying out vectorization to obtain word vectors, carrying out identification on the word vectors and a standard library by utilizing LSTM, and selecting a preset number of standard descriptions with the highest similarity from the calculated similarities; and an association confirmation step: selecting the most accurate one from the identified standard descriptions, and associating the most accurate one with the corresponding description information. According to the method, the processing accuracy can be ensured while the processing efficiency is ensured.

Description

technical field [0001] The invention belongs to the field of data processing, and in particular relates to an NLP-based survey data standardization method and system. Background technique [0002] Market research is the use of scientific methods to purposefully and systematically collect, record, and organize information and materials related to marketing, analyze market conditions, understand the current situation and development trends of the market, and provide objective and correct information for market forecasts and marketing decisions. data process. With the continuous development and changes of the times, the scope of market research is no longer limited to marketing companies, but has expanded to government departments, enterprises and institutions, etc. [0003] However, in order for market research to play a role, the volume of research data must pass the test. If the volume of research data is too small, there will be a problem that the research is not comprehen...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/25G06Q30/02G06F40/30G06N3/04
CPCG06F16/258G06Q30/0201G06F40/30G06N3/044
Inventor 周羽张吴黄海波邓莎
Owner 立信(重庆)数据科技股份有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products