Method and system for mining and searching unstructured text data in financial field
A technology for unstructured text data, applied in the fields of data processing and finance. It addresses the sparseness and poor readability of mined results, the failure to make full use of the relational network, and the inability to express results intuitively, so that the results become intuitively readable and more useful.
Examples
Embodiment 1
[0079] Those skilled in the art can implement the present invention as a method for mining unstructured text data in the field of finance and economics. In this embodiment, the following steps are performed:
[0080] S1, collecting data: crawling data in designated financial fields from the Internet;
[0081] S2, cleaning the data: removing CSS fields or paragraph tags left over from the crawling process, then storing the cleaned data in the database;
[0082] S3, preprocessing the data: reading the data stored in the database in step S2, performing word segmentation and named entity recognition on the sentences in the text, and storing the processed information in the database;
[0083] S4, mining association relationships: mining the association relationships between named entities;
[0084] S5, building an association graph: using the mined association relationships to construct an association graph, using named entities as vert...
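Steps S2, S4, and S5 can be illustrated with a minimal Python sketch. It rests on assumptions that the text above does not confirm: cleaning is done with simple regular expressions, named entity recognition (S3) has already produced one list of entities per sentence, and two entities are treated as associated when they appear in the same sentence. The function names `clean_text` and `build_association_graph` and the `min_count` threshold are hypothetical, and the embodiment's actual association measure may differ.

```python
import re
from collections import Counter
from html import unescape
from itertools import combinations

# Assumed cleaning rules: leftover <style> blocks and any remaining tags.
STYLE_RE = re.compile(r"<style\b[^>]*>.*?</style>", re.IGNORECASE | re.DOTALL)
TAG_RE = re.compile(r"<[^>]+>")


def clean_text(raw: str) -> str:
    """S2 (sketch): strip leftover CSS blocks and tags, collapse whitespace."""
    no_style = STYLE_RE.sub(" ", raw)
    no_tags = TAG_RE.sub(" ", no_style)
    return " ".join(unescape(no_tags).split())


def build_association_graph(sentences_entities, min_count=1):
    """S4-S5 (sketch): link entities that co-occur in a sentence.

    sentences_entities: iterable of entity lists, one list per sentence.
    Returns an adjacency dict {entity: {neighbor: co-occurrence count}}.
    Co-occurrence is an assumed stand-in for the association relationship.
    """
    pair_counts = Counter()
    for entities in sentences_entities:
        # Deduplicate and sort so each unordered pair has one canonical key.
        for a, b in combinations(sorted(set(entities)), 2):
            pair_counts[(a, b)] += 1

    graph = {}
    for (a, b), count in pair_counts.items():
        if count >= min_count:
            # Named entities are vertices; edges are stored in both directions.
            graph.setdefault(a, {})[b] = count
            graph.setdefault(b, {})[a] = count
    return graph


if __name__ == "__main__":
    html = '<p class="x">Bank A buys <b>Bank B</b></p><style>p{color:red}</style>'
    print(clean_text(html))
    tagged = [["Bank A", "Bank B"], ["Bank A", "Bank B", "Fund C"], ["Fund C"]]
    print(build_association_graph(tagged))
```

Storing the graph as a dictionary of dictionaries keeps the sketch free of dependencies; a graph library could be substituted if complex network analysis is required later.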
Embodiment 2
[0090] Those skilled in the art can implement the present invention as a method for mining unstructured text data in the field of finance and economics. In this embodiment, on the basis of Embodiment 1, a six-degree association network is constructed in step S5, and the steps are as follows:
[0091] That is, given a center point, a six-degree association network centered on that point is generated. First, initialize a central node set and add the center node to it, and initialize a candidate node set. Search the association network and add the nodes directly connected to the center point to the candidate node set as first-degree nodes. Then merge the central node set and the candidate node set into a new central node set, find the nodes in the association network that are connected to nodes in the central node set but are not themselves in it, and add them to the candidate node set. And so on until a six-degree network is generated, or until all nodes are already in th...
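The expansion described in this embodiment amounts to a breadth-first traversal limited to six rounds. The following Python sketch shows one way to write it, assuming the association network is available as an adjacency dictionary that maps each node to its neighbors. The function name `k_degree_network` and the `max_degree` parameter are hypothetical names chosen for illustration.

```python
def k_degree_network(graph, center, max_degree=6):
    """Return {node: degree} for nodes within max_degree hops of center.

    graph: adjacency mapping {node: iterable of neighbor nodes}.
    The loop mirrors the described procedure: collect candidates adjacent
    to the central node set, merge them in, and repeat.
    """
    degree_of = {center: 0}
    central = {center}  # central node set, initialized with the center node

    for degree in range(1, max_degree + 1):
        # Candidate set: nodes connected to the central set but not yet in it.
        candidates = {
            neighbor
            for node in central
            for neighbor in graph.get(node, ())
            if neighbor not in central
        }
        if not candidates:
            break  # every reachable node is already in the central set
        for node in candidates:
            degree_of[node] = degree
        central |= candidates  # merge into the new central node set

    return degree_of


if __name__ == "__main__":
    # A chain a-b-c-d-e-f-g-h: "h" is seven hops from "a" and is excluded.
    names = "abcdefgh"
    chain = {n: set() for n in names}
    for left, right in zip(names, names[1:]):
        chain[left].add(right)
        chain[right].add(left)
    print(k_degree_network(chain, "a"))
```

Rescanning the whole central set on every round follows the wording of the procedure; tracking only the newest candidates as a frontier would give the same result with less work on large networks.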
Embodiment 3
[0094] Those skilled in the art can implement the present invention as a mining system for unstructured text data in the field of finance and economics. In this embodiment, the system includes a data acquisition module, a data cleaning module, a data preprocessing module, an association mining module, an association graph building module, and a complex network analysis module;
[0095] The data acquisition module is used to crawl data from the specified financial field of the Internet;
[0096] The data cleaning module is used to remove CSS fields or paragraph tags left over from the crawling process, and then store the cleaned data in the database;
[0097] The data preprocessing module is used to read the data stored in the database, perform word segmentation processing and named entity recognition processing on the sentences in the text of the acquired data, and store the processed information in the database;
[0098] The association mining module is used to use the preprocessed...