Unlock instant, AI-driven research and patent intelligence for your innovation.

Organization name abbreviation generation method and device and computer readable storage medium

A technology of organization name and abbreviation, applied in the field of natural language processing, can solve problems such as difficulty in ensuring correctness, and achieve the effect of improving recall and accuracy

Active Publication Date: 2019-08-06
BEIJING MININGLAMP SOFTWARE SYST CO LTD
View PDF7 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] 2. It is difficult to guarantee the correctness of the organization name abbreviation dictionary generated based on the combination of words, for example: "Minglue Software" is not the abbreviation of "Beijing Minglue Software System Co., Ltd."

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Organization name abbreviation generation method and device and computer readable storage medium
  • Organization name abbreviation generation method and device and computer readable storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0064] Embodiment 1 Organization name abbreviation generation method 1

[0065] Such as figure 1 As shown, a method for generating an institution name abbreviation according to an embodiment of the present invention includes the following steps:

[0066] Step 101: Obtain a dictionary of place names, a dictionary of institutional terms, a dictionary of industry terms, and a text corpus;

[0067] It should be noted that this application divides the words used in the full name of the institution into the following four categories: nouns of geographical names, proper names of institutions, nouns of industry and nouns of institutional nature, among which nouns of geographical names are used to identify the information of place names in the full name of institutions; The proper name of the organization is used to identify the proper noun of the organization name in the full name of the organization; the industry noun is used to identify the noun that reflects the industry to which ...

Embodiment 2

[0167] Embodiment 2 Organization name abbreviation generation method 2

[0168] Such as figure 2 As shown, a method for generating an institution name abbreviation according to an embodiment of the present invention includes the following steps:

[0169] Step 201: Obtain the full name of the institution and a text corpus, and search the text containing the full name of the institution in the text corpus;

[0170] In an exemplary embodiment, the text corpus includes a news corpus and a Wikipedia corpus.

[0171] In an example of this embodiment, the text corpus is built by crawling the news corpus and downloading the text data of Wikipedia (these data will be updated regularly), and the data in the text corpus is indexed by using retrieval software to facilitate subsequent search.

[0172] Step 202: In the retrieved text, extract the character strings of I to J characters adjacent to Chinese characters as candidate character strings, wherein I and J are preset natural numbe...

Embodiment 3

[0187] Embodiment three: computer-readable storage medium

[0188] An embodiment of the present invention also provides a computer-readable storage medium, where one or more programs are stored in the computer-readable storage medium, and the one or more programs can be executed by one or more processors to implement the following: Steps in the method for creating an institution name abbreviation described in any of the above items.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an organization name abbreviation generation method and device and a computer readable storage medium. The method comprises the steps of acquiring a place name noun dictionary,an organization property noun dictionary, an industry noun dictionary and a text corpus; based on the place name noun dictionary, the organization property noun dictionary and the industry noun dictionary, performing word segmentation on the organization name full names to obtain corresponding place name nouns, organization property nouns, industry nouns and organization special names; combiningthe place name nouns, the organization property nouns, the industry nouns and the organization special names to obtain candidate organization names abbreviation; and searching in a text corpus by using the candidate organization name abbreviation, and if the m retrieved texts contain co-occurrence of the candidate organization name abbreviation and the organization name full name, taking the candidate organization name abbreviation as the organization name abbreviation, and m being a natural number. According to the method and the device, the full name of the organization name is subjected toword segmentation, and the nouns after word segmentation are combined and associated for retrieval, so that the reasonable organization name abbreviation can be accurately and effectively generated.

Description

technical field [0001] The present application relates to but not limited to the technical field of natural language processing (Natural Language Processing, NLP), and in particular relates to a method and device for generating an organization name and abbreviation, and a computer-readable storage medium. Background technique [0002] Each institution name basically has one or more institution name abbreviations. For example, the abbreviation of Alibaba Network Technology Co., Ltd. is Alibaba Group, Alibaba or Ali; the abbreviation of Beijing Minglue Software System Co., Ltd. is Minglue Data, Minglue Company, Minglue, etc. In addition to simplifying the name of the institution, the abbreviation of the institution name usually also reflects the industry to which the institution belongs and the uniqueness of the institution. [0003] Due to the variety of abbreviations of institution names, it is difficult to summarize them with simple rules. In the field of NLP, there are st...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/33G06F17/27
CPCG06F16/3344G06F40/289
Inventor 陈奇宁牟小峰
Owner BEIJING MININGLAMP SOFTWARE SYST CO LTD