Method and system for incremental collection of forum replies
A forum collection method, applied in fields such as instruments, network data retrieval, and data processing applications, that addresses missed or failed searches and avoids repeated collection.
Active Publication Date: 2014-07-16
NEW FOUNDER HLDG DEV LLC +2
5 Cites, 0 Cited by
AI Technical Summary
Problems solved by technology
[0003] In view of the deficiencies in the prior art, the technical problem to be solved by the present invention is to provide a method and system for incremental collection of forum replies that can quickly, accurately, and completely collect all the main-post and reply information of a post. This overcomes two defects: existing search engines miss or cannot find information when searching the page-turning reply information of a post, and existing forum collection systems collect only the first page of a post and not its reply information.
Examples
Example 1
[0080] The URL of a post on the international forum channel of the Qiangguo Community of People’s Daily Online is:
[0084] The starting page number nFirstPostPageIndex is 1, the page-turning base nPageUsBaseNum is 1, and N_PerPage is 20.
[0085] According to the page-turning link rules, the first and third parts of the page-turning URL are extracted. They are "/postDetail.do?id=91384467&view=1&pageNo=" and "&boardId=6" respectively.
[0086] According to the above information, assuming that the post already has 210 replies when it is collected for the first time, splicing yields 10 page-turning URLs, which are:
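The spliced URLs are not reproduced in this extract. As a minimal sketch of the splicing step, assuming that N_PerPage is the number of replies per page, that the first page is reached through the post URL itself, and that the i-th page-turning URL carries the page parameter nFirstPostPageIndex + i × nPageUsBaseNum, the construction could look like the following. The function name `build_page_urls` and its parameter names are hypothetical, and the host part of the URL is left out because it is not given above.

```python
import math


def build_page_urls(prefix, suffix, reply_count, per_page,
                    first_page_index, page_base):
    """Splice page-turning URLs as prefix + page parameter + suffix."""
    # Assumption: per_page replies fit on one page.
    total_pages = math.ceil(reply_count / per_page)
    # Assumption: the first page is the post URL itself, so only the
    # pages after it need a spliced page-turning URL.
    return [
        f"{prefix}{first_page_index + i * page_base}{suffix}"
        for i in range(1, total_pages)
    ]


# Values from Example 1: 210 replies, 20 per page, start 1, base 1.
urls = build_page_urls(
    prefix="/postDetail.do?id=91384467&view=1&pageNo=",
    suffix="&boardId=6",
    reply_count=210,
    per_page=20,
    first_page_index=1,
    page_base=1,
)
print(len(urls))  # 10, matching the count stated in [0086]
```

Under these assumptions the page parameter runs from 2 to 11. Whether the forum numbers its pages this way is not confirmed by the text; only the count of 10 is.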
[0096] The starting page number is 0, and the page-turning base is 30. According to the page-turning link rules, the first part of the page-turning URL extracted is:
[0097] /f?z=919731090&ct=335544320&lm=0&sc=0&rn=30&tn=baiduPostBrowser&word=%B6%B7%C6%C6%B2%D4%F1%B7&pn=
The third part has no content.
[0098] N_PerPage is 30. According to the above information, assuming that the post already has 210 replies when it is collected for the first time, splicing yields 6 page-turning URLs, which are:
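The URL counts stated in both examples are consistent with the arithmetic below. This is an inference from the stated figures, not a formula quoted from the patent text: R (the reply count at first collection) and K (the number of page-turning URLs) are shorthand introduced here, and the subtraction of 1 assumes that the first page needs no spliced URL.

```latex
\[
  K = \left\lceil \frac{R}{N_{\mathrm{PerPage}}} \right\rceil - 1
\]
\[
  \text{Example 1: } K = \left\lceil \tfrac{210}{20} \right\rceil - 1 = 11 - 1 = 10,
  \qquad
  \text{second example: } K = \left\lceil \tfrac{210}{30} \right\rceil - 1 = 7 - 1 = 6.
\]
```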
Abstract
The invention discloses a method and system for incremental collection of forum replies, belonging to the technical field of network information collection. The method periodically checks all forum list pages to be collected for newly added posts and for posts with new replies, and extracts the new reply information from posts with new replies. The system includes a judging device (11), which periodically checks all forum list pages to be collected for new posts and posts with new replies, and an extraction device (12), which extracts new reply information from posts with new replies. The invention can quickly, accurately, and completely collect all the main-post and reply information of a post, thereby solving the problem that existing search engines miss or fail to find reply information when searching the page-turning replies of a post.
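The visible text does not say how a post with new replies is recognized or where collection resumes. As a rough sketch only, assuming the list page exposes a reply count per post and that a fixed number of replies fits on each page, the periodic check could compare the listed counts with the counts stored from the previous run. The function `plan_incremental_fetch`, the dictionaries, and the post identifiers are all hypothetical.

```python
def plan_incremental_fetch(stored_counts, listed_counts, per_page):
    """Map each post needing collection to the first 1-based page to fetch."""
    plan = {}
    for post_id, reply_count in listed_counts.items():
        if post_id not in stored_counts:
            # Newly added post: collect it from its first page.
            plan[post_id] = 1
        elif reply_count > stored_counts[post_id]:
            # Post with new replies: resume at the page holding the first
            # reply that has not been collected yet (assumes per_page
            # replies on every page).
            plan[post_id] = stored_counts[post_id] // per_page + 1
        # Unchanged posts are skipped, which avoids repeated collection.
    return plan


# Hypothetical counts: "a" gained replies, "b" is unchanged, "c" is new.
stored = {"a": 210, "b": 40}
listed = {"a": 250, "b": 40, "c": 3}
plan = plan_incremental_fetch(stored, listed, per_page=20)
print(plan)  # {'a': 11, 'c': 1}
```

With 210 replies already stored and 20 replies per page, reply 211 falls on page 11, so only pages 11 onward would be fetched for that post.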
Description
Technical field
[0001] The invention belongs to the technical field of network information collection, and in particular relates to a method and system for incremental collection of forum replies.
Background
[0002] With the emergence of the Internet, and especially the widespread establishment of online forums and online communities, people all over the world can freely express and exchange opinions. There are more than one million online forums in China, 80% of websites have their own independent forums, and the number of people who regularly visit online forums has exceeded 100 million. Unlike other media, online forums spread information quickly and widely. A topic that attracts attention may draw tens of thousands of replies and discussions from netizens in a short period of time, with the reply information running to hundreds or thousands of pages. At this time, the user not only wants to view the speeches of the i...