Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

795 results about "Punctuation" patented technology

Punctuation (formerly sometimes called pointing) is the use of spacing, conventional signs and certain typographical devices as aids to the understanding and correct reading of written text whether read silently or aloud. Another description is, "It is the practice action or system of inserting points or other small marks into texts in order to aid interpretation; division of text into sentences, clauses, etc., by means of such marks."

Intelligent error correcting system and method in network searching process

InactiveCN101206673AMeets preferencesSolve the problem of pinyin error correctionSpecial data processing applicationsLinguistic modelAlgorithm
The invention relates to an intelligent error correction system of key words in the process of searching networks and a method thereof. On an Internet platform, firstly, a related linguistic model and a corresponding dictionary as well as a data index database are established through the training of related data information; secondly, a text is inputted, a Pinyin error correction part calculates the mistakes of Pinyin and characters, the error correction of characters is calculated by a fuzzy match; finally, all results are filtered according to the degree of association, a plurality of results are sorted to get the proximal results. The polyphone mistakes and character types as well as word types mistakes inputted by a user are corrected by means of the sound-character conversion and fuzzy error correction technical methods to correct the character replace mistakes, the unwanted character or the leakage of character mistakes, the character position mistakes, etc. in the input process. Moreover, the basic functions are expanded on the basis such as the English-Chinese and punctuations mixing error correction, the fuzzy match technique, the related prompt technique and the enhanced intelligence error correction.
Owner:北京当当网信息技术有限公司

Method and apparatus for processing text and character data

An apparatus and method for processing text or character data are disclosed. A text processing system receives a character input string and determines whether to apply character processing. A non-English language such as Italian can be entered into a processing system such as a computer using a standard English based keyboard such that additional keys for providing accents or other grammatical and punctuation symbols or characters not existing in English are not required. In one mode, text is automatically accented or punctuated without requiring user intervention. In another mode, a user is provided with a list of accent or punctuation choices so that the user may select the optimum accent or punctuation. Text processing of an input may be activated by a text sequence including a possible vowel accent or apostrophe error, and may continue as an input method editor loop in response to repeated actuations of the key associated with the first activation event. When an activator event input is detected, a rules based system is utilized to select a correctly accented and punctuated character. A list of alternative accents and punctuations is optionally displayed, and a user may toggle through the list using the activator event to select a desired character. The display provides information for a level of certainty of a selected character or word.
Owner:CLOANTO CORP

System, plug-in, and method for improving text composition by modifying character prominence according to assigned character information measures

A computer implemented system, plug-in application and method for composing a formatted text input to improve legibility, readability and / or print economy while preserving the format of the text input and satisfying any user selected aesthetic constraints. This is accomplished by reading in blocks of text input having defined characters including letters and punctuation in a given input format. A language unit such as a lexical or sub-lexical unit, a subset of punctuation or another defined unit for a particular language is examined and an information measure (IM) is assigned to each character in the language unit indicating the predictability of that character to differentiate the language unit from other language units. Typically, multiple different IMs are assigned to each character and combined to form a combined IM (CIM). The process is repeated for at least a plurality of language units and typically until all the text input in the block has been analyzed and information measures assigned to all of the characters. An adjustment to a physical feature is determined for each character in the plurality of units to modify the visual prominence of that character according to the values of the assigned information measures and a permitted range of physical variation for the block. The adjustments are applied to each character to compose the text input consistent with the input format.
Owner:LANGUAGE TECH

Chinese domain term recognition method based on mutual information and conditional random field model

The invention discloses a Chinese domain term recognition method based on mutual information and a conditional random field model. The Chinese domain term recognition method includes the following steps: (1) gathering domain text corpus and marking all the punctuations, spaces, numbers, ASSCII (American Standard Code for Information Interchange) characters and characters except Chinese characters in the corpus; (2) setting character strings and computing the mutual information values of the character strings, (3) computing the left comentropy and the right comentropy of every character string, (4) defining character string evaluation function, setting evaluation function threshold, computing the evaluation function values of every character string, determining that every character string is a word, comparing in sequence the evaluation function value of the former character with the evaluation function value of the latter character in the character string and segmenting character meaning character strings one by one, (5) utilizing conditional random fields to train a conditional random field model and recognizing domain terms with the conditional random field model. When the Chinese domain term recognition method is used to recognize terms, the data sparsity of legitimate terms is overcome, the amount of calculation of conditional random fields is reduced, and the accuracy of the Chinese domain term recognition is improved.
Owner:SHANGHAI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products