Method and device for matching texts
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Applications(China)
- Current Assignee / Owner
- ALIBABA CLOUD COMPUTING LTD
- Publication Date
- 2012-04-11
- Estimated Expiration
- Not applicable · inactive patent
Abstract
Description
Technical field
[0001] This application relates to the field of data processing, and in particular to a method and device for matching texts in large volumes of data.
Background art
[0002] Existing text comparison generally relies on exhaustive calculation and matching. To determine the degree of correlation between texts, every acquired text must be processed, yielding a similarity for each pair of texts. Each similarity calculation therefore runs over all of the text data, the amount of computation is very large, and the running time is on the order of O(N^2). As the number of texts N increases, the calculation time becomes very long.
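To make the cost described in [0002] concrete, the following is a minimal sketch of exhaustive pairwise comparison. The source does not say which similarity measure the existing methods use; Jaccard similarity over whitespace-separated words is assumed here purely for illustration, and the function names `jaccard`, `all_pairs_similarity`, and `pair_count` are hypothetical.

```python
from itertools import combinations


def jaccard(a: set, b: set) -> float:
    """Jaccard similarity of two token sets (an assumed measure, not from the source)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def all_pairs_similarity(texts):
    """Compare every text with every other text: N * (N - 1) / 2 comparisons."""
    # Tokenize once per text; splitting on whitespace is an illustrative choice.
    tokens = [set(t.lower().split()) for t in texts]
    scores = {}
    for i, j in combinations(range(len(tokens)), 2):
        scores[(i, j)] = jaccard(tokens[i], tokens[j])
    return scores


def pair_count(n: int) -> int:
    """Number of unordered pairs among n texts, which grows as O(n^2)."""
    return n * (n - 1) // 2
```

Under these assumptions, 1,000 texts require 499,500 comparisons and 10,000 texts require 49,995,000, so a tenfold increase in N costs roughly a hundredfold increase in work.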
[0003] Comparison on this scale has a great impact on the system performance of the equipment: it puts great pressure on the system's I/O communication, data storage, and network data transmission, resulting in sl...