Spark-Streaming text similarity analysis-based data processing method and device
A text similarity analysis technology, applied in the field of information processing, that achieves more accurate text similarity results
Examples
Embodiment 1
[0036] Figure 1 is a schematic flowchart of a data processing method based on Spark-Streaming text similarity analysis in an embodiment of the present invention. As shown in Figure 1, the method includes:
[0037] Step 110: dynamically obtain a real-time text database using Spark-Streaming;
[0038] Step 120: obtain first text information from the real-time text database, the first text information including first text length information, first text word order information, first text keyword information, and first text grammar information;
[0039] Step 130: obtain second text information from the real-time text database, the second text information including second text length information, second text word order information, second text keyword information, and second text grammar information;
[0040] Step 140: obtain text length similarity information according to the first text length information and the second text length informat...
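Steps 120 to 140 can be illustrated with a minimal Python sketch. The source text is truncated before it states the length-similarity formula, so the ratio used here, 1 - |l1 - l2| / (l1 + l2), is an assumption chosen for illustration and not the patent's formula. The `TextInfo` class, the whitespace tokenizer, and the frequency-based keyword selection are likewise hypothetical; grammar information is left out because it would require a parser.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass
class TextInfo:
    """Hypothetical container for the per-text features named in steps 120/130."""
    length: int        # text length information (token count in this sketch)
    word_order: dict   # word order information: token -> first position
    keywords: set      # keyword information (most frequent tokens in this sketch)


def extract_text_info(text: str, top_k: int = 3) -> TextInfo:
    """Build a TextInfo from raw text using a naive whitespace tokenizer (assumption)."""
    tokens = text.lower().split()
    order = {}
    for pos, tok in enumerate(tokens):
        order.setdefault(tok, pos)  # keep only the first occurrence of each token
    keywords = {tok for tok, _ in Counter(tokens).most_common(top_k)}
    return TextInfo(length=len(tokens), word_order=order, keywords=keywords)


def length_similarity(first: TextInfo, second: TextInfo) -> float:
    """Assumed formula: 1 - |l1 - l2| / (l1 + l2); 1.0 means equal lengths."""
    total = first.length + second.length
    if total == 0:
        return 1.0  # two empty texts are treated as identical in length
    return 1.0 - abs(first.length - second.length) / total
```

In this sketch, two texts of four and three tokens would score 1 - 1/7; any bounded, symmetric measure of length difference could be substituted.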
Embodiment 2
[0083] Based on the same inventive concept as the data processing method based on Spark-Streaming text similarity analysis in the foregoing embodiment, the present invention also provides a data processing device based on Spark-Streaming text similarity analysis which, as shown in Figure 2, includes:
[0084] A first obtaining unit 11, configured to dynamically obtain the real-time text database using Spark-Streaming;
[0085] A second obtaining unit 12, configured to obtain the first text information from the real-time text database, the first text information including the first text length information, the first text word order information, the first text keyword information, and the first text grammar information;
[0086] A third obtaining unit 13, configured to obtain the second text information from the real-time text database, the second text information including the s...
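The unit structure above can be pictured as a small composition of callables. The sketch below is only an illustration under stated assumptions: the class name `SimilarityDevice`, the callable signatures, and the choice to pair consecutive texts as "first" and "second" are all hypothetical, since the source does not say how the two texts are selected from the database.

```python
from typing import Callable, Iterable, List, Tuple


class SimilarityDevice:
    """Hypothetical composition of obtaining units 11 to 13 as plain callables."""

    def __init__(
        self,
        obtain_database: Callable[[], Iterable[str]],  # stands in for unit 11
        obtain_text_info: Callable[[str], dict],       # stands in for units 12 and 13
    ):
        self.obtain_database = obtain_database
        self.obtain_text_info = obtain_text_info

    def run(self) -> List[Tuple[dict, dict]]:
        """Return (first, second) text-information pairs for later comparison."""
        texts = list(self.obtain_database())
        infos = [self.obtain_text_info(t) for t in texts]
        # Assumption: consecutive texts are paired as first/second text information.
        return list(zip(infos, infos[1:]))
```

Keeping each unit as an injected callable means a streaming source could replace the in-memory list without changing the rest of the device.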
Embodiment 3
[0124] Based on the same inventive concept as the data processing method based on Spark-Streaming text similarity analysis in the foregoing embodiment, the present invention also provides a data processing device based on Spark-Streaming text similarity analysis, on which a computer program is stored. When the program is executed by a processor, the steps of any one of the above-mentioned data processing methods are realized.
[0125] Figure 3 shows a bus architecture (represented by bus 300). Bus 300 may include any number of interconnected buses and bridges, and links together various circuits, including one or more processors represented by processor 302 and memory represented by memory 304. The bus 300 may also link together various other circuits, such as peripherals, voltage regulators, and power management circuits, which are well known in the art and thus will not be further described herein. The bus interface 306 provides an in...