
Method and device for generating organization name and abbreviation, and computer-readable storage medium

The technology generates organization name abbreviations and is applied in the field of natural language processing. It addresses problems such as the difficulty of ensuring that generated abbreviations are correct, and it improves recall and accuracy.

Active Publication Date: 2021-06-08
BEIJING MININGLAMP SOFTWARE SYST CO LTD
Cites: 7; Cited by: 0

AI Technical Summary

Problems solved by technology

[0005] 2. It is difficult to guarantee the correctness of an organization name abbreviation dictionary generated by combining words. For example, "Minglue Software" is not an abbreviation of "Beijing Minglue Software System Co., Ltd."



Examples


Embodiment 1

[0064] Embodiment 1: Organization name abbreviation generation method 1

[0065] As shown in Figure 1, a method for generating an institution name abbreviation according to an embodiment of the present invention includes the following steps:

[0066] Step 101: Obtain a dictionary of place names, a dictionary of institutional terms, a dictionary of industry terms, and a text corpus;

[0067] It should be noted that this application divides the words used in the full name of an institution into four categories: place-name nouns, institution proper names, industry nouns, and institution-nature nouns. Place-name nouns identify the place-name information in the full name of the institution; the institution proper name identifies the proper noun of the organization in the full name; industry nouns identify the nouns that reflect the industry to which ...
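The four-way classification above can be illustrated with a small dictionary-driven segmenter. This is only a sketch under stated assumptions, not the segmentation procedure of the application: it uses greedy longest-match lookup, and it assumes that any characters not covered by the three dictionaries form the institution proper name. The function name segment_full_name, the label strings, and the tiny dictionaries are hypothetical.

```python
from typing import Iterable, List, Tuple


def segment_full_name(
    full_name: str,
    place_dict: Iterable[str],
    industry_dict: Iterable[str],
    nature_dict: Iterable[str],
) -> List[Tuple[str, str]]:
    """Split a full institution name into (word, label) pairs.

    Assumption: greedy longest-match against the three dictionaries;
    characters that match no dictionary entry are grouped together and
    labelled "proper" (institution proper name).
    """
    lexicon = {}
    for label, words in (
        ("place", place_dict),
        ("industry", industry_dict),
        ("nature", nature_dict),
    ):
        for word in words:
            lexicon[word] = label
    max_len = max((len(w) for w in lexicon), default=1)

    segments: List[Tuple[str, str]] = []
    unknown = ""  # characters not found in any dictionary so far
    i = 0
    while i < len(full_name):
        match = None
        # Try the longest possible dictionary word starting at position i.
        for n in range(min(max_len, len(full_name) - i), 0, -1):
            piece = full_name[i:i + n]
            if piece in lexicon:
                match = piece
                break
        if match is not None:
            if unknown:
                segments.append((unknown, "proper"))
                unknown = ""
            segments.append((match, lexicon[match]))
            i += len(match)
        else:
            unknown += full_name[i]
            i += 1
    if unknown:
        segments.append((unknown, "proper"))
    return segments


# Toy dictionaries, for illustration only.
result = segment_full_name(
    "北京明略软件系统有限公司",
    place_dict={"北京"},
    industry_dict={"软件", "系统"},
    nature_dict={"有限公司", "公司"},
)
print(result)
```

With these toy dictionaries, the leftover string "明略" is the only part labelled as a proper name. A production system would likely need a proper word segmenter and much larger dictionaries.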

Embodiment 2

[0167] Embodiment 2: Organization name abbreviation generation method 2

[0168] As shown in Figure 2, a method for generating an institution name abbreviation according to an embodiment of the present invention includes the following steps:

[0169] Step 201: Obtain the full name of the institution and a text corpus, and retrieve from the text corpus the texts that contain the full name of the institution;

[0170] In an exemplary embodiment, the text corpus includes a news corpus and a Wikipedia corpus.

[0171] In an example of this embodiment, the text corpus is built by crawling news and downloading the text data of Wikipedia (these data are updated regularly), and the data in the text corpus are indexed with retrieval software to facilitate subsequent searches.
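The retrieval software is not named in the text, so the following is only a minimal in-memory stand-in for Step 201, assuming a character-bigram inverted index. The class name BigramIndex and the sample sentences are hypothetical; a real deployment would presumably use a full-text search engine.

```python
from collections import defaultdict
from typing import Dict, List, Set


class BigramIndex:
    """Toy inverted index over character bigrams (illustrative assumption)."""

    def __init__(self) -> None:
        self.docs: List[str] = []
        self.postings: Dict[str, Set[int]] = defaultdict(set)

    def add(self, text: str) -> int:
        """Store a text and index each of its character bigrams."""
        doc_id = len(self.docs)
        self.docs.append(text)
        for k in range(len(text) - 1):
            self.postings[text[k:k + 2]].add(doc_id)
        return doc_id

    def search(self, query: str) -> List[int]:
        """Return ids of texts that contain `query` as a substring."""
        if len(query) < 2:
            candidates = set(range(len(self.docs)))
        else:
            grams = [query[k:k + 2] for k in range(len(query) - 1)]
            # Only texts containing every bigram can contain the query.
            candidates = set.intersection(
                *(self.postings.get(g, set()) for g in grams)
            )
        # Bigram overlap is necessary but not sufficient, so verify.
        return sorted(d for d in candidates if query in self.docs[d])


index = BigramIndex()
index.add("明略数据是北京明略软件系统有限公司的简称")  # invented sample text
index.add("阿里巴巴发布财报")                          # invented sample text
print(index.search("北京明略软件系统有限公司"))
```

The final substring check keeps the result exact even though the bigram intersection alone could return false positives.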

[0172] Step 202: In the retrieved text, extract strings of I to J adjacent Chinese characters as candidate character strings, wherein I and J are preset natural numbe...
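One reading of Step 202 is that every run of consecutive Chinese characters in a retrieved text contributes all of its substrings of length I to J. The sketch below follows that reading. It is an assumption, since the rest of the step is truncated here; the function name extract_candidates and the choice of the CJK Unified Ideographs range U+4E00 to U+9FFF are likewise illustrative.

```python
import re
from collections import Counter

# Runs of consecutive characters from the basic CJK Unified Ideographs block.
HAN_RUN = re.compile(r"[\u4e00-\u9fff]+")


def extract_candidates(text: str, i: int, j: int) -> Counter:
    """Count every substring of i..j adjacent Chinese characters in `text`."""
    counts: Counter = Counter()
    for run in HAN_RUN.findall(text):
        for n in range(i, j + 1):
            for start in range(len(run) - n + 1):
                counts[run[start:start + n]] += 1
    return counts


# Invented sample text; the Latin word splits it into two Chinese runs.
candidates = extract_candidates("明略数据(Mininglamp)发布", 2, 3)
print(sorted(candidates))
```

Because substrings never cross a non-Chinese character, punctuation and Latin text act as natural boundaries for candidates.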

Embodiment 3

[0187] Embodiment 3: Computer-readable storage medium

[0188] An embodiment of the present invention also provides a computer-readable storage medium storing one or more programs, and the one or more programs can be executed by one or more processors to implement the steps of the method for generating an institution name abbreviation described in any of the above items.



Abstract

This application discloses a method and device for generating an organization name abbreviation, and a computer-readable storage medium. The method includes: obtaining a dictionary of place names, a dictionary of institutional terms, a dictionary of industry terms, and a text corpus; segmenting the full name of the institution using the dictionaries to obtain the corresponding place names, institution nouns, industry nouns, and institution proper names; combining the place names, institution nouns, industry nouns, and institution proper names to obtain candidate institution abbreviations; and searching the text corpus with each candidate abbreviation. If the m retrieved texts contain a co-occurrence of the candidate abbreviation and the full name of the institution, the candidate abbreviation is taken as an abbreviation of the institution, where m is a natural number. By segmenting the full name of the institution and then combining and correlating the resulting nouns, this application can accurately and effectively generate reasonable abbreviations of the institution name.
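The combine-then-verify idea in the abstract can be sketched as follows. Several details are assumptions, because the abstract does not specify them: candidates are order-preserving combinations of the segments that include the proper name, at most m texts are retrieved per candidate, and a text counts as a co-occurrence only if the candidate appears outside the full name itself. All function names and the toy corpus are hypothetical.

```python
from itertools import combinations
from typing import List, Tuple


def candidate_abbreviations(segments: List[Tuple[str, str]]) -> List[str]:
    """Join order-preserving subsets of segments that keep the proper name."""
    words = [w for w, _ in segments]
    labels = [label for _, label in segments]
    out: List[str] = []
    for r in range(1, len(words)):  # r < len(words) excludes the full name
        for idxs in combinations(range(len(words)), r):
            if not any(labels[k] == "proper" for k in idxs):
                continue
            cand = "".join(words[k] for k in idxs)
            if cand not in out:
                out.append(cand)
    return out


def co_occurs(text: str, candidate: str, full_name: str) -> bool:
    """True if the text has the full name and, separately, the candidate."""
    if full_name not in text:
        return False
    # Mask the full name so a candidate inside it is not counted.
    return candidate in text.replace(full_name, "\0")


def is_abbreviation(candidate: str, full_name: str,
                    corpus: List[str], m: int) -> bool:
    """Retrieve up to m texts containing the candidate; accept on co-occurrence."""
    retrieved = [t for t in corpus if candidate in t][:m]
    return any(co_occurs(t, candidate, full_name) for t in retrieved)


full_name = "北京明略软件系统有限公司"
segments = [("北京", "place"), ("明略", "proper"), ("软件", "industry"),
            ("系统", "industry"), ("有限公司", "nature")]
corpus = [  # invented sentences, for illustration only
    "北京明略软件系统有限公司(简称明略)今日发布新产品",
    "明略软件是一个虚构的例子",
    "北京明略软件系统有限公司成立",
]
cands = candidate_abbreviations(segments)
accepted = [c for c in cands if is_abbreviation(c, full_name, corpus, m=3)]
print(accepted)
```

In this toy corpus the candidate "明略软件" is generated by combination but rejected by the co-occurrence check, which mirrors the motivation that word combination alone cannot guarantee a correct abbreviation.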

Description

Technical field

[0001] The present application relates to, but is not limited to, the technical field of natural language processing (NLP), and in particular to a method and device for generating an organization name and abbreviation, and a computer-readable storage medium.

Background

[0002] Almost every institution name has one or more abbreviations. For example, the abbreviations of Alibaba Network Technology Co., Ltd. include Alibaba Group, Alibaba, and Ali; the abbreviations of Beijing Minglue Software System Co., Ltd. include Minglue Data, Minglue Company, Minglue, etc. Besides shortening the institution name, an abbreviation usually also reflects the industry to which the institution belongs and the uniqueness of the institution.

[0003] Because abbreviations of institution names are so varied, they are difficult to summarize with simple rules. In the field of NLP, there are st...


Application Information

Patent Type & Authority: Patents (China)
IPC (8): G06F16/33, G06F40/289
CPC: G06F16/3344, G06F40/289
Inventors: 陈奇宁, 牟小峰
Owner: BEIJING MININGLAMP SOFTWARE SYST CO LTD