Method and device for extracting keywords
Keyword and lexical analysis technology, applied in special data processing applications, instruments, and electrical digital data processing, that addresses problems such as low keyword accuracy and poor keyword extraction results for text.
Examples
Embodiment 1
[0027] An embodiment of the present invention provides a method for extracting keywords. As shown in Figure 1, the method includes:
[0028] Step 101: obtaining the word set produced by lexical analysis and preprocessing of the text;
[0029] Optionally, the text is segmented into words and tagged with parts of speech. For example, the sentence "Materialism: everyone who admits that existence, that is, matter, is the first nature and the origin, while thinking is the second nature, derived from and attached to the existence of matter, is materialism." is segmented and tagged as: materialism/n -/w everything/d admits/v exists/v exists/p is/v matter/n is/v primary/n ,/w is/v primitive/n ,/w and/c thinking/n is/v secondary/n ,/w is/v is derived from/v comes out/v attaches/v to/p matter/n exists/v ’s/u is/d is/v materialism/n ./w, where n represents a noun, w represents a punctuation mark, d represents an adverb, v represents a verb, and p represents a preposition.
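The tagged output described above can be turned into a word set with a few lines of code. The following is a minimal sketch, not the method claimed here: the function names `parse_tagged` and `build_word_set` are hypothetical, the shortened input string is illustrative, and keeping only nouns and verbs is an assumed preprocessing rule, since the preprocessing details are not given in this excerpt.

```python
from typing import Iterable, List, Set, Tuple

# Assumption: only nouns (n) and verbs (v) are kept as candidate words.
# The source does not state which parts of speech survive preprocessing.
KEPT_TAGS: Set[str] = {"n", "v"}


def parse_tagged(text: str) -> List[Tuple[str, str]]:
    """Split a 'word/tag word/tag ...' string into (word, tag) pairs."""
    pairs = []
    for token in text.split():
        # rpartition splits on the last '/', so the tag is always the suffix.
        word, sep, tag = token.rpartition("/")
        if sep and word and tag:
            pairs.append((word, tag))
    return pairs


def build_word_set(pairs: Iterable[Tuple[str, str]],
                   kept_tags: Set[str] = KEPT_TAGS) -> List[str]:
    """Filter by tag and de-duplicate, preserving first-seen order."""
    seen = set()
    words = []
    for word, tag in pairs:
        if tag in kept_tags and word not in seen:
            seen.add(word)
            words.append(word)
    return words


# Shortened, illustrative version of the tagged sentence above.
tagged = ("materialism/n -/w admits/v matter/n is/v primary/n ,/w "
          "and/c thinking/n is/v secondary/n ./w")
word_set = build_word_set(parse_tagged(tagged))
print(word_set)
```

Punctuation (w) and conjunction (c) tokens are dropped by the tag filter, and the repeated "is" appears only once in the result. A different tag whitelist can be passed through `kept_tags` if the preprocessing rule differs.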
[0030] Optionally, d...
Embodiment 2
[0047] An embodiment of the present invention provides a method for extracting keywords. As shown in Figure 2, the method includes:
[0048] Step 201: obtaining a word set produced by lexical analysis and preprocessing of the text;
[0049] Optionally, the text is segmented into words and tagged with parts of speech. For example, the sentence "Materialism: everyone who admits that existence, that is, matter, is the first nature and the origin, while thinking is the second nature, derived from and attached to the existence of matter, is materialism." is segmented and tagged as: materialism/n -/w everything/d admits/v exists/v exists/p is/v matter/n is/v primary/n ,/w is/v primitive/n ,/w and/c thinking/n is/v secondary/n ,/w is/v is derived from/v comes out/v attaches/v to/p matter/n exists/v ’s/u is/d is/v materialism/n ./w, where n represents a noun, w represents a punctuation mark, d represents an adverb, v represents a verb, and p represents a preposition.
[0050] Optionally, dif...
Embodiment 3
[0099] An embodiment of the present invention provides a device for extracting keywords. As shown in Figure 5, the device includes: an acquisition unit 501, a first processing unit 502, a second processing unit 503, and a keyword determination unit 504;
[0100] The acquisition unit 501 is configured to acquire a word set after lexical analysis and preprocessing of the text;
[0101] Optionally, the text is segmented into words and tagged with parts of speech. For example, the sentence "Materialism: everyone who admits that existence, that is, matter, is the first nature and the origin, while thinking is the second nature, derived from and attached to the existence of matter, is materialism." is segmented and tagged as: materialism/n -/w everything/d admits/v exists/v exists/p is/v matter/n is/v primary/n ,/w is/v primitive/n ,/w and/c thinking/n is/v secondary/n ,/w is/v is derived from/v comes out/v attaches/v to/p matter/n exists/v ’s/u is/d is/v materialism/n ./w, where n r...
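The four-unit structure of the device can be pictured as a small composition of components. The sketch below is an illustration under stated assumptions only: the class name `KeywordExtractionDevice` and its field names are hypothetical, the units are assumed to run in sequence (this excerpt does not say how they are connected), and the behaviour of units 502, 503, and 504 is not described here, so they are injected as placeholder callables rather than implemented.

```python
from dataclasses import dataclass
from typing import Any, Callable, List


@dataclass
class KeywordExtractionDevice:
    """Hypothetical container mirroring units 501-504 of the device."""
    acquisition_unit: Callable[[str], List[str]]            # unit 501
    first_processing_unit: Callable[[List[str]], Any]       # unit 502
    second_processing_unit: Callable[[Any], Any]            # unit 503
    keyword_determination_unit: Callable[[Any], List[str]]  # unit 504

    def extract(self, text: str) -> List[str]:
        # Assumption: each unit consumes the previous unit's output.
        word_set = self.acquisition_unit(text)
        intermediate = self.first_processing_unit(word_set)
        refined = self.second_processing_unit(intermediate)
        return self.keyword_determination_unit(refined)


# Placeholder stand-ins, used only to show how the units plug together.
device = KeywordExtractionDevice(
    acquisition_unit=lambda text: text.split(),
    first_processing_unit=lambda words: words,
    second_processing_unit=lambda words: words,
    keyword_determination_unit=lambda words: words[:2],
)
keywords = device.extract("materialism matter thinking")
print(keywords)
```

Passing the units in as callables keeps the wiring separate from the unit logic, so each placeholder can be swapped for a real implementation without changing `extract`.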