Image processing apparatus and image processing method
The image processing apparatus manages OCR function restrictions independently, addressing information leakage concerns while preserving usability by allowing dependent functions to utilize character recognition results.
Patent Information
- Authority / Receiving Office
- JP · JP
- Patent Type
- Patents
- Current Assignee / Owner
- SHARP KK
- Filing Date
- 2022-05-31
- Publication Date
- 2026-06-22
Smart Images

Figure 0007877067000001 
Figure 0007877067000002 
Figure 0007877067000003
Abstract
Description
Technical Field
[0001] The present disclosure relates to an image processing apparatus and the like.
Background Art
[0002] There is known a technique for restricting an image processing apparatus from executing a predetermined process on a user from the viewpoint of security or the like.
[0003] For example, Patent Document 1 describes a technique that can suitably set whether to convert an original paper document into reusable vector data before acquiring vector data from the paper document.
Prior Art Documents
Patent Documents
[0004]
Patent Document 1
Summary of the Invention
Problems to be Solved by the Invention
[0005] An object of the present disclosure is to provide a technique that can appropriately restrict a user from using a predetermined function, for example.
Means for Solving the Problems
[0006] The image processing apparatus of the present disclosure includes an acquisition unit that acquires an input image, a character recognition unit that performs a character recognition process of recognizing characters included in the input image based on a first setting, an image processing unit that executes image processing on the input image based on a result of the character recognition process, an output unit that outputs an output file using at least one of the result of the image processing and the result of the character recognition process, and a control unit that restricts use of the result of the character recognition process in the output unit when a second setting is made regardless of the first setting.
[0007] The image processing method disclosed herein performs character recognition processing to recognize characters contained in an image based on a first setting, performs image processing on the input image based on the result of the character recognition processing, outputs an output file using the result of the image processing if a second setting is set to restrict the character recognition processing regardless of the first setting, and outputs an output file using the result of the image processing and the result of the character recognition processing if the second setting to restrict the character recognition processing is not set. [Effects of the Invention]
[0008] According to this disclosure, for example, it is possible to provide a technology that can appropriately restrict a user's use of a predetermined function. [Brief explanation of the drawing]
[0009] [Figure 1] This is a diagram illustrating the system overview in the first embodiment. [Figure 2] This figure illustrates the hardware configuration of the image forming apparatus 10 in the first embodiment. [Figure 3] This diagram illustrates the hardware configuration of the terminal device 20 in the first embodiment. [Figure 4] This is a diagram illustrating the software configuration in the first embodiment. [Figure 5] This figure illustrates (a) an example of the data structure of the job setting table and (b) an example of the data structure of the system settings in the first embodiment. [Figure 6] This diagram schematically illustrates the screen transitions in the first embodiment. [Figure 7] This is a flowchart illustrating the processing (main processing) in the first embodiment. [Figure 8] This is a flowchart illustrating the processing (job setting processing) in the first embodiment. [Figure 9] This is a flowchart illustrating the processing (job execution processing) in the first embodiment. [Figure 10]It is a diagram showing an example of a display screen (system setting screen) in the first embodiment. [Figure 11] It is a diagram showing an example of a display screen (job setting screen) in the first embodiment. [Figure 12] It is a diagram showing an example of a display screen (system setting screen) in the second embodiment. [Figure 13] It is a flowchart for explaining the process (job execution process) in the third embodiment. [Figure 14] It is a diagram showing an example of a display screen (system setting screen) in the third embodiment. [Figure 15] It is a flowchart for explaining the process (job execution process) in the fourth embodiment. [Figure 16] It is a diagram showing an example of a display screen (system setting screen) in the fourth embodiment. [Figure 17] It is a diagram for explaining the software configuration in the fifth embodiment. [Figure 18] It is a flowchart for explaining the process (job execution process) in the fifth embodiment. [Figure 19] It is a flowchart for explaining the process (job execution process) in the sixth embodiment.
Modes for Carrying Out the Invention
[0010] Hereinafter, modes for carrying out the present disclosure will be described with reference to the drawings. Note that the embodiments shown below are one of the embodiments providing the present disclosure, and the content of the present disclosure is not limitedly interpreted based on the following description.
[0011] In a general image processing apparatus, a function (for example, an OCR function) that executes character recognition processing on an acquired document image and outputs the result of the character recognition is known. Here, the ON / OFF of the OCR function can be freely set by the user on the setting screen (for example, the scan mode setting screen) at the time of job execution.
[0012] However, when the user uses the OCR function, for example, a file embedded with the text that is the result of character recognition is output. With such a file, by using the search function or the copy function, the user can easily extract the information as text. To prevent information leakage, there has been a desire to prohibit the OCR function in the image processing apparatus.
[0013] However, if the OCR function is prohibited, functions premised on the OCR function (for example, the top-bottom determination function, the automatic file name setting function, the business card scanning function, the translation function, etc.) will also become unusable, which may lead to a decline in usability.
[0014] As described above, among the processes (functions) executed in the image processing apparatus, there are cases where they are used by the user and cases where they are used as internal processes of the apparatus. For such specific processes (specific functions) used in both cases, while restricting the use by the user, they can be used as internal processes of the apparatus, thereby realizing, for example, ensuring security and maintaining the usability for the user in the following embodiments.
[0015] Thereby, for example, as an example of an image processing apparatus, by adding a setting regarding the permission / prohibition of the OCR function to the system setting screen of an MFP (Multifunction Peripheral / Printer / Product), it becomes possible to prohibit the OCR function as the MFP system, and as a result, it becomes possible to prevent information leakage in advance.
[0016] Also, for example, even if the OCR function is prohibited as the MFP system, for functions premised on the OCR function (such as the top-bottom determination function, the automatic file name setting function, the business card scanning function, the translation function, etc.), it is possible to make them usable in a state where the functions are partially restricted, thereby making it possible to achieve both security assurance and usability maintenance.
[0017] [1. First Embodiment] [1.1 System as a Whole] Figure 1 is a diagram illustrating the overview of System 1. System 1 has an image forming apparatus 10 connected to the network as an example of an image processing device. Here, the image forming apparatus is a device commonly known as a printer, multifunction printer (MFP), copier, or facsimile.
[0018] Furthermore, terminal devices 20 and 30 may be connected to the network NW. Terminal device 20 is a device capable of submitting jobs to the image forming apparatus 10 and displaying system setting screens. Terminal device 20 may be, for example, a device connected to the network NW (for example, a computer, tablet, smartphone, etc.). In Figure 1, terminal device 20 is shown as a computer and terminal device 30 as a smartphone as examples. In this specification, terminal device 20 will be used as an example for explanation.
[0019] Furthermore, although the terminal device 20 (terminal device 30) is described as being connected to the same network NW as the image forming apparatus 10, it may be connected to a different network. For example, the terminal device 20 may be connected to the image forming apparatus 10 via an external network, such as the Internet.
[0020] [1.2 Hardware Configuration] The hardware configuration of each device will be described below with reference to the figures. Figure 2 shows an example of the hardware configuration of the image forming apparatus 10, and Figure 3 shows an example of the hardware configuration of the terminal device 20.
[0021] [1.2.1 Image forming apparatus] As shown in Figure 2, the image forming apparatus 10 includes a control unit 100, a storage device consisting of a storage unit 110, a ROM (Read Only Memory) 120, and a RAM (Random Access Memory) 130, a display unit 140, an operation unit 150, an image forming unit 160, an image reading unit 162, and a communication unit 170.
[0022] The control unit 100 is a functional unit for controlling the entire image forming apparatus 10. The control unit 100 realizes various functions by reading and executing various programs stored in the storage 110 and ROM 120, and may be realized by one or more control devices / arithmetic units (CPU (Central Processing Unit), SoC (System on a Chip)).
[0023] Storage 110 is a non-volatile storage device capable of storing programs and data. For example, it may consist of storage devices such as HDDs (Hard Disk Drives) or SSDs (Solid State Drives). Alternatively, storage 110 may be configured as an externally connectable USB memory device. Furthermore, storage 110 may be, for example, a storage area located in the cloud.
[0024] ROM120 is a non-volatile memory that can retain programs and data even when the power is turned off.
[0025] RAM130 is the main memory primarily used by the control unit 100 during processing. RAM130 is a rewritable memory that temporarily holds data including programs read from storage 110 and ROM 120, as well as execution results.
[0026] The display unit 140 is a display device capable of displaying various information and execution screens. The display unit 140 may be, for example, a liquid crystal display (LCD), an electroluminescent (OLED) display, or an electrophoretic display. The display unit 140 also includes an interface to which a display device can be connected. For example, it may consist of an external display device connected via HDMI® (High-Definition Multimedia Interface), DVI (Digital Visual Interface), or DisplayPort.
[0027] The operation unit 150 is an operating device that allows user input. For example, it may be a touch panel integrated with the display unit 140, or an operating device such as operation buttons. The operation unit 150 may also include an interface to which other operating devices can be connected. For example, the image forming apparatus 10 may connect different operating devices (operating devices having touch panels).
[0028] The communication unit 170 is a communication interface for communicating with other devices. For example, it may be a network interface capable of providing wired or wireless connections. In this embodiment, it is possible to communicate with other devices via a network NW.
[0029] The image forming unit 160 forms an image, for example, on recording paper. The image forming unit 160 includes, for example, an image carrier, forms a toner image on the image carrier, and forms an image by transferring the image on the image carrier onto the recording paper. The image forming unit 160 may be configured as an image forming device such as a printer. Alternatively, the image forming unit 160 may form an image electronically as an image file.
[0030] The image reading unit 162 reads the original document (image) and outputs it as image data. The image reading unit 262 may be, for example, a scanner, or a reading device that utilizes a CCD (Charge Coupled Device) or CIS (Contact Image Sensor).
[0031] [1.2.2 Terminal Devices] As shown in Figure 3, the terminal device 20 includes a control unit 200, a storage device 210, a ROM 220, and a RAM 230, a display unit 240, an operation unit 250, and a communication unit 260.
[0032] These configurations are similar in basic form to the configuration of the image forming apparatus 10 described above. For example, the control unit 200 is a functional unit for controlling the entire terminal device 20. The control unit 200 realizes various functions by reading and executing various programs stored in the storage 210 and ROM 220, and may be realized by one or more control devices / arithmetic units (CPU (Central Processing Unit), SoC (System on a Chip)).
[0033] Storage 210 is a non-volatile storage device capable of storing programs and data. For example, it may consist of storage devices such as HDDs (Hard Disk Drives) or SSDs (Solid State Drives). Alternatively, Storage 210 may consist of externally connectable USB memory, CD-ROM drives, or BD drives. Furthermore, Storage 110 may be, for example, a storage area located in the cloud.
[0034] [1.3 Software Configuration] The software configuration will be explained with reference to Figure 4. Figure 4 is a diagram illustrating the software configuration between the image forming apparatus 10 and the terminal device 20.
[0035] The control unit 100 of the image forming apparatus 10 functions as a job execution unit 1010, a character recognition unit 1012, an image processing unit 1014, an image output unit 1016, a system setting unit 1030, and a user interface provision unit (UI provision unit) 1040 by executing a program (application) stored in the storage unit 110 or ROM 120.
[0036] Furthermore, the storage 110 of the image forming apparatus 10 stores a job setting table 1110 and a system setting table 1120.
[0037] The job execution unit 1010 executes a job selected by the user. The job may be, for example, a copy job that implements a copying function, a print job that prints based on print data sent from the terminal device 20, a scan job that reads a document and outputs it as image data (for example, if the image reading unit 162 is a scanner device), or a facsimile job that sends and receives image data via facsimile communication. Depending on the job to be executed, the job execution unit 1010 performs character recognition processing, image processing, and file output processing.
[0038] Furthermore, the job execution unit 1010 displays screens for the user to execute jobs. For example, the job execution unit 1010 displays a job mode screen for the user to submit a job, a job settings screen for the user to configure settings during job execution, and a job execution screen displayed when the user executes a job.
[0039] The character recognition unit 1012 performs character recognition. For example, by performing character recognition processing on an image read by the image reading unit 162, it is possible to output character data. The character recognition unit 1012 may perform character recognition by, for example, recognizing character patterns. The character recognition processing implemented by the character recognition unit 1012 is, for example, called OCR (Optical Character Recognition) processing, and includes layout analysis processing, line or character extraction processing, character pattern matching processing, etc.
[0040] The image processing unit 1014 performs various processing on the image. In addition to general image processing such as tone conversion processing to change the gradation of an image and sharpening processing to sharpen an image, the image processing unit 1014 can implement various other functions.
[0041] Furthermore, the image processing unit 1014 may perform specific processing using the results of the processing performed by the character recognition unit 1012 or the character data output by the character recognition unit 1012. For example, the image processing unit 1014 may perform a process to determine the orientation of an image (up / down determination process) using the orientation of the characters determined by the character recognition unit 1012 during character recognition. Alternatively, for example, the image processing unit 1014 may perform an automatic file name setting process to set a file name based on the character data output by the character recognition unit 1012.
[0042] Furthermore, the image processing unit 1014 may implement a process that extracts a specific pattern from the input image and outputs multiple images. For example, the image processing unit 1014 may implement a process that detects a business card image from the input image and outputs it as business card data (business card scanning process). The image processing unit 1014 may also recognize the company name, name, job title, address, telephone number, etc. written on the business card from the character data output by the character recognition unit 1012 and output it as destination information.
[0043] The image output unit 1016 outputs a predetermined image. For example, the character recognition unit 1012 performs character recognition processing on the image (input image) read by the image reading unit 162. The image output unit 1016 can output an image (output image) by combining the image (input image) read by the image reading unit 162 and the character data recognized by the character recognition unit 1012.
[0044] Furthermore, the image output unit 1016 can output images as files. For example, the image output unit 1016 can output image data as an image file or as a document file. The image output unit 1016 may also temporarily output image data and output that image to another device. For example, the image output unit 1016 can print the image on recording paper by outputting the image to the image forming unit 160. Alternatively, the image output unit 1016 may output the image to another device via facsimile communication through the image forming unit 160 or the communication unit 170.
[0045] Furthermore, the image output unit 1016 can output the image processed by the image processing unit 1014 in a file of a predetermined format. The image output unit 1016 can output files in various formats, such as PDF (Portable Document Format) files, document files (e.g., DOC format files, DOCX format files, ODT format files, etc.), spreadsheet files (e.g., XLS format files, XLSX format files, ODS format files, etc.), and image files (e.g., TIFF format, JPEG format, etc.). In this way, the image output unit 1016 functions as an output unit (file output unit).
[0046] The system configuration unit 1030 performs system configuration, which is the overall system configuration of the image forming apparatus 10. Here, system configuration refers to settings that are commonly used when the image forming apparatus 10 is operating, and are also used when executing jobs. The system configuration unit 1030 is generally configured or updated by a specific user (for example, an administrator user). The system configuration unit 1030 stores the system configuration as a system configuration table 1120 in the storage area 110.
[0047] Furthermore, the system setting unit 1030 can change system settings via an external terminal device 20 through the UI provision unit 1040. The UI provision unit 1040, for example, provides a web interface to the connected terminal device 20, enabling changes to the system settings.
[0048] For example, when the control unit 200 of the terminal device 20 executes the UI display unit 2010, it displays the UI provided by the UI provision unit 1040. For example, the UI display unit 2010 is implemented in a web browser on the terminal device 20. Alternatively, the UI display unit 2010 may be a dedicated application executed on the terminal device 20.
[0049] The UI display unit 2010 can access the system setting unit 1030 via the UI provision unit 1040. The user can then change the system settings of the image forming apparatus 10 via the terminal device 20 by changing the system settings displayed on the UI display unit 2010.
[0050] In the following embodiments, the system settings screen is described as being displayed on the image forming apparatus 10, but it may also be displayed on an external device such as a terminal device 20.
[0051] The job settings table 1110 stores the job settings. The job settings include the setting values that the job execution unit 1010 refers to when executing a job. The user can change the setting values before executing a job. Figure 5(a) shows an example of the job settings table 1110.
[0052] For example, the job setting table 1110 can store the job mode (e.g., "e-mail scan"), the file format to be output when the job is executed (e.g., "PDF"), whether to perform character recognition processing (use OCR function) (e.g., "ON"), the resolution for scanning the image (or the resolution of the output file) (e.g., "400dpi"), the compression ratio of the output file (e.g., "medium"), whether to encrypt the output file (e.g., "OFF"), and the method of outputting the file (e.g., "OFF" for outputting a file page by page).
[0053] The job execution unit 1010 executes a job by referring to the job settings stored in the job settings table 1110 when executing a job.
[0054] The system setting table 1120 stores common system settings for the image forming apparatus 10. Figure 5(b) shows an example of the system setting table 1120.
[0055] For example, the system settings table 1120 stores a setting value (e.g., "OFF") for the setting item "Prohibit sending from PC-Fax". The system settings table 1120 can also store common settings among users. For example, it stores a setting that indicates whether or not a user can use the OCR function when executing a job (e.g., "Disable OCR" or "OFF").
[0056] Here, the setting of whether or not the user can use the OCR function in the image forming apparatus 10 is stored in the system settings (system settings table 1120) and the job settings (job settings table 1110). Regarding the use of the OCR function, the setting value set in the system settings takes precedence over the job settings in principle. Therefore, if the OCR function is unavailable in the system settings (OCR prohibited "ON"), it is preferable that the OCR function cannot be enabled in the job settings.
[0057] In this embodiment, "inability to use the OCR function" means that the user cannot freely use the results of the character recognition process on the original document. Therefore, it does not include the image forming apparatus 10 (control unit 100) prohibiting the execution of the character recognition process. Generally, when a user uses the OCR function, the image forming apparatus 10 outputs character data from the original document (image) in a format that the user can use. In this embodiment, if the OCR function is unavailable, no character data will be output.
[0058] Furthermore, the inability to use the OCR function may include cases where text data is output with restrictions. For example, when the OCR function is unavailable, the control unit 100 may embed the text data in a file and output it, but may impose restrictions on copying or editing the text data.
[0059] [1.4 Screen transition] The screen transitions in this embodiment will be explained with reference to Figure 6. The image forming apparatus 10 can switch between, for example, the home screen (D10), the system settings screen (D12), the job mode screen (D14), the job settings screen (D16), and the job execution screen (D18).
[0060] First, the image forming apparatus 10 displays the home screen (D10). The home screen (D10) is the screen displayed, for example, when in menu mode (home mode). From the home screen, the user can switch to each mode by selecting an item displayed on the home screen or submitting a job.
[0061] When the user selects a system setting from the home screen (D10), the image forming apparatus 10 transitions from menu mode to system setting mode. In system setting mode, the system setting unit 1030 displays the system setting screen (D12). The system setting screen (D12) is a screen that allows the user to change system settings, which are common settings for the operation of the image forming apparatus 10, and configures the functions of the image forming apparatus. The system setting screen may be displayed on the image forming apparatus 10, or it may be displayed on an external terminal device 20 that accesses the image forming apparatus 10. The system setting mode is a mode executed by, for example, an administrator. Therefore, the system setting screen (D12) is displayed by, for example, an administrator.
[0062] After the system settings are changed or registered by the administrator on the system settings screen (D12), the image forming apparatus 10 transitions to home mode and the home screen (D10) is displayed.
[0063] When a user selects a job mode on the home screen (D10), the image forming apparatus 10 executes the selected job mode. Once a job mode is executed, the control unit 100 (job execution unit 1010) displays a job mode screen (D14) corresponding to the executed job. For example, job mode screens for executing job modes include a copy screen for executing copy jobs, a scan screen for executing scan jobs, and a fax screen for executing fax communication jobs.
[0064] Here, the user selects a job setting if they want to change the settings in job mode. Once the user selects a job setting, the image forming apparatus 10 displays the job setting screen (D16). On the job setting screen, the user can change the settings for the job to be executed. The user can also select the functions to be used in the job. For example, the user can choose to use the OCR function.
[0065] After the user has configured the job settings on the job settings screen, the image forming apparatus 10 transitions to the job mode screen (D14). When the user instructs the image forming apparatus 10 to execute the job, the image forming apparatus 10 displays the job execution screen (D18) and executes the job. Once the image forming apparatus 10 has completed the job execution, it transitions back to the job mode screen (D14). When the job mode is exited, the image forming apparatus 10 transitions to home mode and displays the home screen (D10).
[0066] Note that the settings for job configuration may be displayed not only on the job configuration screen (D16) but also on the job mode screen (D14). For example, frequently used settings (such as selecting the file output format or the resolution for scanning documents) may be displayed on the job mode screen. Alternatively, frequently used settings may be displayed on the job mode screen, while less frequently used settings may be displayed on the job configuration screen.
[0067] Thus, the image forming apparatus 10 of this embodiment has two setting screens. Specifically, the image forming apparatus 10 has a first setting screen, which is a job setting screen mainly used by the user, and a second setting screen, which is a system setting screen mainly used by the system administrator.
[0068] Each screen described in Figure 6 may be displayed on the image forming apparatus 10 or on other devices. For example, terminal devices 20 and 30 may display each screen via a web browser. Alternatively, terminal devices 20 and 30 may run applications such as an operation application or a settings application, and display each screen within those applications.
[0069] [1.5 Processing Flow] [1.5.1 Main Processing] Next, the main processing flow in this embodiment will be explained with reference to Figure 7. The control unit 100 displays the home screen in the image forming apparatus 10 (step S10). Next, the control unit 100 prompts the user to select a process (step S12). If the user selects to execute a job (step S12; job execution), the job execution unit 1010 starts executing the job.
[0070] The job execution unit 1010 displays the job execution screen (step S14). Then, if necessary, the job execution unit 1010 executes the job setting process (step S16). After the user has set up the job, the job execution unit 1010 executes the job based on the job setting (step S18). The control unit 100 does not need to execute the job setting process if the user does not perform a job setting operation.
[0071] Furthermore, when system settings are selected on the home screen (step S12; system settings), the control unit 100 switches its operating mode to system settings mode. In system settings mode, the system settings unit 1030 displays the system settings screen on the display unit 140 (step S20). The system settings unit 1030 changes the setting values of the system settings that have been changed by the user (step S30). The system settings screen may also be displayed on an external device such as the terminal device 20 via the UI provision unit 1040.
[0072] In Figure 7, we will focus on the use of the OCR function among the system settings. The control unit 100 determines whether or not the use of the OCR function is prohibited in the system settings (step S32).
[0073] When the use of the OCR function is set to be prohibited, the control unit 100 (system setting unit 1030) sets the use of the OCR function to be prohibited (step S32; Yes → step S36). Also, when the use of the OCR function is set to be permitted, the control unit 100 (system setting unit 1030) sets the use of the OCR function to be permitted (step S32; No → step S34).
[0074] Thus, the control unit 100 that performs the process shown in Figure 7 can also perform the settings in system setting mode, just as it does in job mode.
[0075] [1.5.2 Job Setup Process] Figure 8 is a flowchart illustrating the flow of the job setting process executed by the control unit 100. The job setting process is the process executed in step S16 of Figure 7, and is performed in job mode.
[0076] The control unit 100 prohibits the selection of the OCR function (makes it unselectable) when the use of the OCR function is prohibited in the system settings (for example, "Disable OCR" is set to "ON") (Step S52; Yes → Step S54).
[0077] To prohibit the selection of the OCR function, for example, the control unit 100 displays on the display unit 140 that the operation button for enabling the OCR function cannot be selected. To display that it cannot be selected, for example, the control unit 100 may gray out the operation button to prevent selection, or it may hide the operation button to prevent selection. The control unit 100 may also display an error message such as "Cannot be selected" if the operation button is selected.
[0078] Next, when the user selects the operation button to enable the OCR function (step S56; Yes), the control unit 100 switches the OCR function on or off (step S58). That is, when the OCR function is "OFF" (disabled), the control unit 100 can switch the OCR function to "ON" (enabled) when the user selects the operation button. Also, when the OCR function is already "ON" (enabled), the control unit 100 can switch the OCR function to "OFF" (disabled) when the user selects the operation button. In this way, for example, the operation button indicating the OCR function is a toggle button, making it easy to switch the OCR function on or off. Note that the operation button may be a button display, or it may be a checkbox, radio button, or pull-down menu, or other selection methods.
[0079] Furthermore, if the OCR function is set to "disabled" in the system settings, the control unit 100 will prohibit (make unselectable) the selection of the OCR function in step S54. When the OCR function is disabled, the user cannot select the operation button that enables the use of the OCR function.
[0080] Next, if the user selects another setting item, the control unit 100 changes the job settings according to the selected item (step S60; Yes → step S62).
[0081] Thus, the job setting process allows the control unit 100 (job execution unit 1010) to configure the settings for job execution based on user selection.
[0082] [1.5.3 Job Execution Process] The job execution process will be explained with reference to Figure 9. The job execution process is performed by the control unit 100 (job execution unit 1010).
[0083] First, the control unit 100 (job execution unit 1010) executes the document acquisition process (step S102). When the control unit 100 executes the document acquisition process, it reads the document from the image reading unit 162, for example, and acquires it as an image. The control unit 100 may also acquire images from other devices or from external memory (for example, a USB memory) via the communication unit 170.
[0084] Next, the control unit 100 determines whether the use of the OCR function is "prohibited" (step S104). If the use of the OCR function is not "prohibited" (step S104; No), the control unit 100 performs character recognition processing (step S120). Specifically, the character recognition unit 1012 performs character recognition processing on the image acquired in step S102 and outputs character data based on the recognized characters.
[0085] The control unit 100 (image processing unit 1014) executes the configured image processing (step S122). For example, in this figure, if orientation detection is enabled, the image processing unit 1014 determines the orientation of the document based on the results of the character recognition processing. Then, the control unit 100 determines the tilt of the document's orientation as needed and rotates it by the determined angle to output an image with the correct orientation of the document (output image with orientation correction of the input image) or an image with the tilt corrected (output image with tilt correction of the input image).
[0086] Then, the image output unit 1016 outputs a file containing the character data output by the character recognition process (step S124). For example, the image output unit 1016 outputs an image and a file with embedded character data (e.g., a document file, a PDF file). In other words, the control unit 100 (image output unit 1016) embeds the recognized characters into the output file.
[0087] Thus, when the use of the OCR function is not prohibited, the control unit 100 outputs a file containing the output image and text data. The control unit 100 (image output unit 1016) may also output a file based on text data different from the image. The file output by the control unit 100 (image output unit 1016) may be, for example, a document file usable with word processing software or spreadsheet software (e.g., DOCX file, XLSX file, PPTX file, etc.). The output file may also be a file that can be displayed while maintaining the layout on other terminal devices (e.g., a PDF file). Furthermore, the image output unit 1016 may output both a file based on the output image (image file) and a file based on text data (text file, HTML file, etc.).
[0088] Returning to step S104, when the use of the OCR function is "prohibited," the control unit 100 performs character recognition processing only in image processing. In other words, when the use of the OCR function is prohibited, the user cannot use the results of the character recognition processing. From the user's perspective, since no character data is output, they will feel that the OCR function is not being used.
[0089] For example, when the control unit 100 (image processing unit 1014) performs image processing, if it uses the results of character recognition, it performs character recognition processing. For example, Figure 9 shows that when the orientation determination function is enabled (step S106; Yes), the control unit 100 (image processing unit 1014) performs character recognition processing (step S108).
[0090] In other words, the control unit 100 (character recognition unit 1012) recognizes characters contained in the image and outputs information about the recognized characters. This information about the recognized characters includes, for example, the orientation of the characters. The control unit 100 (image processing unit 1014) determines the orientation of the original document from the orientation of the characters determined by the character recognition process and performs a process to correct the orientation of the original document so that it is in the correct direction (step S110).
[0091] Then, since the use of the OCR function is prohibited, the control unit 100 (image output unit 1016) outputs a file that contains an output image but does not contain any text data (step S112).
[0092] Then, the control unit 100 executes a file transmission process (step S114). The file transmission process involves sending the file output in step S112 or step S124 to another device or outputting it to a recording medium. The control unit 100 may also send the file as an email attachment or send the file to a cloud storage area during the file transmission process. Furthermore, the control unit 100 may post the file to a social networking service (SNS).
[0093] [1.6 Example of Operation] An example of operation will be explained using an example of the display screen. Figure 10 shows the display screen W100, which is an example of a system settings screen. In the display screen W100, setting items are displayed in area R100. These setting items are, for example, the setting items corresponding to the group selected in the group area R104.
[0094] Area R100 displays one or more setting items. For example, under setting item R102, it is possible to set "Disable OCR". By setting "Disable OCR" to "ON", the user will not be able to use the OCR function.
[0095] When the registration button B100 is selected, the items set in area R100 are newly stored in the system settings table 1120. Also, when the update button B102 is selected, the contents stored in the system settings table 1120 are updated.
[0096] Figure 11 shows the display screen W110, which is an example of a job setting screen. In area R110, the user can select the file format of the output file. For example, the user can select output formats such as "TIFF," "XPS," and "JPEG" for image files, "PDF," "Text," and "RTF" for document files, and "DOCX," "XLSX," and "PPTX" for other files.
[0097] Furthermore, when selecting a PDF file, the user may choose from multiple types. For example, the user may be able to select formats such as "PDF / A-1a," "PDF / A-1b," "PDF / A-2," and "PDF / X" as the type of PDF file.
[0098] Furthermore, the control unit 100 may encrypt files depending on the file, or it may set a compression ratio. The control unit 100 may also password-protect files.
[0099] Area R112 may allow the user to select how to send the output file. For example, in area R112 of display screen W110, "E-mail" is selected. The control unit 100 (image output unit 1016) attaches the output file to an email and sends it to the specified recipient. The recipient of the email may be selected in advance or after the job has been executed (for example, after selecting the start button).
[0100] Furthermore, area R114 displays a button (checkbox) to select whether to use the OCR function. For example, if the user turns the OCR function "ON" in area R114, the control unit 100 outputs a file containing text data (for example, a PDF file with embedded text data).
[0101] Furthermore, if the system settings prohibit the use of the OCR function, the user will not be able to select the OCR function operation button for area R114.
[0102] [1.7 Effects] Thus, according to this embodiment, the availability of the OCR function can be set in both the system settings screen and the job settings screen. Furthermore, by disabling the OCR function in the system settings screen, the use of the OCR function by users can be restricted. In other words, administrators can uniformly prohibit users from using the OCR function in the system settings screen in order to ensure security.
[0103] Furthermore, in this embodiment, the use of the OCR function and the execution of character recognition processing by the image forming apparatus 10 are managed separately. Therefore, the user cannot use the OCR function, but the image forming apparatus 10 can perform character recognition processing, which is not particularly relevant to security. In other words, by performing character recognition processing, the image forming apparatus 10 can use the recognized character data and the results of the character recognition processing in other processes. This allows the image forming apparatus 10 to ensure the security of the files and image data it creates while maintaining ease of use.
[0104] [2. Second Embodiment] A second embodiment will now be described. In the second embodiment, the system settings screen allows the image forming apparatus 10 to configure the functions that can be used when the user's use of the OCR function is prohibited. Note that this embodiment will only describe the differences from the first embodiment, and explanations of similar aspects regarding the functional configuration and processing will be omitted.
[0105] Figure 12 shows an example of a system setting screen in the second embodiment. The display screen W200 in Figure 12 allows the user to select whether or not to use the OCR function in area R200.
[0106] Furthermore, when the use of the OCR function is prohibited in region R200, the system settings screen allows the user to select whether or not to allow the image processing unit 1014 to perform processing in region R202.
[0107] For example, in region R202, the following functions can be set as processes to be executed by the control unit 100 (for example, the image processing unit 1014): "up / down detection function," "automatic file name detection function," "business card scanning function," and "translation function."
[0108] For example, if the "Orientation Determination Function" is set to "ON," the image processing unit 1014 will perform orientation determination using the results of the character recognition process. In this case, the user cannot output a file containing the recognized character data, but the orientation determination function can be used in image processing by utilizing the results of the character recognition process.
[0109] Thus, according to this embodiment, even when the use of the OCR function is restricted, the control unit 100 (image processing unit 1014) can execute processes that utilize the results of character recognition processing without restriction. Furthermore, according to this embodiment, the control unit 100 can select an executable process from among multiple processes.
[0110] [3. Third Embodiment] A third embodiment will now be described. In the third embodiment, when the use of the OCR function is "prohibited" in the system settings screen, the image output unit 1016 outputs the file with attributes that prohibit editing and copying of text (an attribute to prohibit editing, an attribute to prohibit copying) added when outputting the file. Note that this embodiment will only describe the differences from the first embodiment, and explanations of similar aspects regarding the functional configuration and processing will be omitted.
[0111] Figure 13 is a flowchart that replaces Figure 9 of the first embodiment. Processes similar to those in the first embodiment are denoted by the same reference numerals, and their descriptions are omitted.
[0112] If the use of the OCR function is set to be prohibited (step S104; Yes), the control unit 100 (character recognition unit 1012) performs character recognition processing on the image (step S108).
[0113] Then, the control unit 100 (image output unit 1016) outputs a file containing character data (step S302). At this point, the control unit 100 applies protection settings to the character data in the output file (step S304).
[0114] Protection settings refer to settings that add security-related attributes to text data. Examples of protection settings include "prohibit editing of text data," "prohibit copying of text data," and "prohibit searching of text data."
[0115] Here, the control unit 100 may uniformly set protection settings for character data, or it may set them individually on the system settings screen. For example, Figure 14 shows an example of a system settings screen. Display screen W300 in Figure 14 is an area where the use of the OCR function is set in area R300. Here, when the use of the OCR function is prohibited (restricted) by the user in area R300, further protection settings can be made for the OCR function in area R302. For example, in display screen W300, the protection settings can be set to "prohibit editing of strings" and "prohibit copying of strings". Furthermore, the protection setting "prohibit output of strings" may also be set. In addition, multiple protection settings may be set, or only one may be selected.
[0116] Thus, according to this embodiment, when the use of the OCR function is prohibited for the user, it is possible to set up a system that prevents the user from using the text data contained in the output file by outputting the text data and applying protection settings to the text data.
[0117] [4. Fourth Embodiment] A fourth embodiment will now be described. The fourth embodiment is an embodiment in which the user selects a format that requires character recognition processing as the output file. Note that this embodiment will only describe the differences from the first embodiment, and explanations of similar aspects regarding the functional configuration and processing will be omitted.
[0118] Figure 15 is a flowchart that replaces Figure 9 of the first embodiment. Processes similar to those in the first embodiment are denoted by the same reference numerals, and their descriptions are omitted.
[0119] If the system settings prohibit the use of the OCR function (step S104; Yes), it is determined whether the format of the output file selected by the user is a format that requires character recognition processing (step S402).
[0120] Here, a format that requires character recognition processing refers to an output file that contains content based on character data. For example, this includes text files consisting only of character data, or document files used by word processing software (such as DOCX files).
[0121] Since output files in a format based on character data always contain character data, the control unit 100 cannot output them without using the OCR function. Therefore, when an output file in a format requiring the use of the OCR function is selected by the user (step S402; Yes), the control unit 100 determines whether or not outputting the output file is prohibited (step S404). Information on whether output files are prohibited is stored, for example, as a system setting in the system setting table 1120.
[0122] If the use of the OCR function is prohibited and file output is prohibited, the control unit 100 performs error processing (step S404; Yes → step S406). For example, the control unit 100 displays an error message "Cannot output file" on the screen and disables the selection of that format.
[0123] If file output is not prohibited (step S404; No), the control unit 100 determines whether a target format is set (step S408). For example, the target format is stored in the system settings (system settings table 1120). If a target format is set, the control unit 100 selects the target format set in the system settings instead of the format selected by the user (step S408; Yes → step S410).
[0124] If the target format for conversion is not set, the control unit 100 selects a pre-set fixed format (step S408; No → step S412). The fixed format may be a format pre-set by the image forming apparatus 10.
[0125] Then, the control unit 100 (image output unit 1016) outputs the output file in the selected format (step S414).
[0126] Furthermore, if the use of the OCR function is not prohibited (step S104; No), the image output unit 1016 uses the results of the character recognition process to output an output file in a format selected by the user (step S420).
[0127] Furthermore, if the use of the OCR function is prohibited in the system settings beforehand, it may be necessary to prevent the selection of files requiring character recognition processing as output files. In this case, the processing in steps S402 to S412 does not need to be executed.
[0128] Figure 16 shows an example of a system settings screen. In Figure 16, area W400 is the area where you configure whether or not to use the OCR function within area R400.
[0129] Furthermore, area R402 is where you select the alternative format to be converted to when a format requiring character recognition processing is selected. For example, PDF, TIFF, and JPEG can be selected as alternative formats in area R402.
[0130] Additionally, area R402 may be configured to allow the selection of an error termination option. When an error termination option is selected, the control unit 100 executes error processing and does not output a file (for example, S404:Yes → S406 in Figure 15).
[0131] Furthermore, in the job setting screen, the control unit 100 may display only the output formats. For example, in the job setting screen shown in Figure 11, area R110 displays multiple output formats. The formats displayed in area R110 may be limited to, for example, only the output formats. Alternatively, formats that cannot be selected from those displayed in area R110 may be grayed out.
[0132] Thus, in this embodiment, even when the OCR function is unavailable, the user can select and output a file in a format that requires character recognition processing. When the OCR function is unavailable, the control unit 100 may apply protection settings to the output file. Alternatively, the control unit 100 may prevent the selection of output files that cannot be protected. For example, when the OCR function is unavailable, the control unit 100 may prevent the selection of text files, but allow the selection of DOCX format files that can be protected.
[0133] [5. Fifth Embodiment] A fifth embodiment will now be described. The fifth embodiment is an embodiment in which optional processing is performed. Note that this embodiment will only describe the differences from the first embodiment, and will omit explanations of similar aspects regarding the functional configuration and processing.
[0134] Figure 17 is a diagram illustrating the software configuration of the fifth embodiment, and replaces Figure 4 of the first embodiment. Compared with the first embodiment, this embodiment has an additional option processing unit 1050.
[0135] The option processing unit 1050 provides optional functions to the image forming apparatus 10. The option processing unit 1050 may be pre-installed in the image forming apparatus 10 or added later. For example, in this embodiment, a translation function can be provided to the user by executing the processing of the translation processing unit 1052 as an example of the option processing unit 1050.
[0136] The translation processing unit 1052 performs the process of translating characters (strings) into a predetermined language. For example, based on the string (character data) output by the character recognition unit 1012, it translates and outputs it into other set languages (e.g., Japanese to English, Japanese to Chinese, English to Chinese, etc.).
[0137] The translation processing unit 1052 may, for example, embed the translated string (character data) into the image output by the image output unit 1016. Alternatively, it may output an output file in which the images of the characters in the language included in the image output by the image output unit 1016 have been replaced with images of the characters in the translated language. Furthermore, the translation processing unit 1052 may output a separate text file or document file based on the translated character data.
[0138] Figure 18 is a flowchart that replaces Figure 9 of the first embodiment. Processes similar to those in the first embodiment are denoted by the same reference numerals, and their descriptions are omitted.
[0139] If the use of the OCR function is not prohibited (step S104; No), the character recognition unit 1012 performs character recognition processing (step S512). The translation processing unit 1052 performs translation processing based on the character data resulting from character recognition (step S514). Then, the image output unit 1016 outputs an output file containing the translated character data (step S516). The image output unit 1016 may also include the character data recognized by the character recognition unit 1012 in addition to the translated character data.
[0140] Furthermore, when the use of OCR recognition is prohibited (step S104; Yes), the control unit 100 determines whether the translation function using character recognition processing is enabled (step S502). Here, when the translation function using character recognition processing is enabled (step S502; Yes), the character recognition unit 1012 executes the character recognition process (step S504).
[0141] Then, the translation processing unit 1052 performs translation processing based on the character data which is the result of the character recognition process (step S506).
[0142] The image output unit 1016 outputs an output file that does not contain character data (step S508). Here, the following methods are possible for the image output unit 1016 to output a file that does not contain character data.
[0143] (1) The image output unit 1016 outputs the image without any text data. For example, the image output unit 1016 replaces the images of the untranslated text included in the image data by the image processing unit 1014 with the images of the translated text and outputs the image as an output file. Alternatively, the image output unit 1016 may output the image as an output file that includes both the images of the untranslated text and the images of the translated text.
[0144] (2) The image output unit 1016 outputs a file containing the translated text data, but with protection settings applied. The image output unit 1016 protects the translated text data, outputting an output file in which the translated text data cannot be edited, cannot be copied, etc.
[0145] (3) The image output unit 1016 outputs a separate output file based on the translated text data. For example, it outputs a translated text file or document file separately from the image-based output file. The image output unit 106 may also set protection settings for the translated document file.
[0146] Furthermore, by setting the scope of the protection settings, the control unit 100 can output files in a flexible format. For example, the control unit 100 may set protection settings only for the text data before translation, or only for the text data after translation.
[0147] [6. Sixth Embodiment] A sixth embodiment will now be described. The sixth embodiment is one in which the OCR function is permitted for a specific user even when the OCR function is disabled in the system. Note that this embodiment will only describe the differences from the first embodiment, and will omit explanations of similar aspects regarding the functional configuration and processing.
[0148] Figure 19 is a flowchart that replaces Figure 9 of the first embodiment. Processes similar to those in the first embodiment are denoted by the same reference numerals, and their descriptions are omitted.
[0149] The control unit 100 enables the OCR function for authorized users even when the OCR function is prohibited in the system settings (step S104; Yes) (step S602; Yes).
[0150] Here, users permitted to use the OCR function may be predetermined users such as administrators, or they may be set individually for each user, or they may be set individually for each group (for example, the administrator group). Furthermore, users permitted to use the OCR function may also be authenticated users. Additionally, users may be granted permission to use the OCR function by entering a password when using it.
[0151] [7. Seventh Embodiment] A seventh embodiment will now be described. The embodiments described above described the case where the OCR function is a specific function. That is, if the use of a specific function is prohibited in the system settings screen, the user will not be able to use that function, but the device can still execute processing corresponding to that function internally.
[0152] In this embodiment, we will describe how a specific function can be applied to other functions. For example, we will describe the color output function as a specific function. The color output function is a function in which the image forming apparatus 10 outputs color printed materials or color files.
[0153] For example, in step S104 of Figure 9, it is determined whether the color output function is prohibited instead of the OCR function. If the color output function is prohibited, the control unit 100 executes processing related to color output.
[0154] For example, when a process using a color image (e.g., region separation process, image sharpening process) is enabled, the control unit 100 executes the process based on the color image. Since the color output function is disabled, the control unit 100 outputs a monochrome file (a grayscale file, or a black and white binarized file). Furthermore, in the case of printing, the control unit 100 performs the printing process based on the monochrome image data.
[0155] Furthermore, if the use of the color output function is not prohibited, the control unit 100 executes a process that uses a color image based on the color image instead of steps S120 to S122. Then, the control unit 100 outputs a color image file. Also, in the case of printing, the control unit 100 performs printing based on the color image data.
[0156] [8. Variation] This disclosure is not limited to the embodiments described above, and various modifications are possible. In other words, embodiments obtained by combining technical means that are appropriately modified within the scope of the gist of this disclosure are also included in the technical scope of this disclosure.
[0157] Furthermore, although the embodiments described above are explained separately for the sake of explanation, they can be combined and implemented to the extent possible. In addition, we intend to obtain rights to any of the technologies described in this specification through amendments or divisional applications.
[0158] Furthermore, in each embodiment, the program that operates in each device is a program that controls the CPU and other components (a program that makes the computer function) in order to realize the functions of the embodiments described above. The information handled by these devices is temporarily stored in a temporary storage device (for example, RAM) during processing, and then stored in various ROMs or HDDs, and read, modified, and written by the CPU as needed.
[0159] Here, the recording medium for storing the program may be any of the following: semiconductor media (e.g., ROM or non-volatile memory card), optical recording medium or magneto-optical recording medium (e.g., DVD (Digital Versatile Disc), CD (Compact Disc), BD (Blu-ray® Disc), etc.), magnetic recording medium (e.g., magnetic tape, flexible disk, etc.).
[0160] Furthermore, when distributing the program to the market, it can be stored on a portable recording medium and distributed, or transferred to a server computer connected via a network such as the Internet. In this case, the storage device of the server device is, of course, also included in this disclosure.
[0161] Furthermore, the data mentioned above may not be stored within the device itself, but rather stored on an external device and retrieved as needed. For example, the data may be stored on a NAS (Network Attached Storage) or on the cloud.
[0162] Furthermore, the scope of this disclosure is not limited to the configurations explicitly described in the specification, but also includes combinations of the technologies disclosed herein. While the configurations for which patent protection is sought are described in the attached claims, there is no intention to exclude them from the technical scope simply because they are not described in the claims.
[0163] Furthermore, the phrases "in the case of..." and "when..." in the above-mentioned specification are explained as examples only, and do not represent a configuration limited to those described. Even for configurations other than those described, we disclose information that would be obvious to a person skilled in the art, and we intend to acquire rights to such information.
[0164] Furthermore, the descriptions of the processes and data flows described in the specification are not limited to the order in which they are described. For example, configurations in which parts of the process are deleted or the order is rearranged are also disclosed, and the company intends to acquire rights to them. [Explanation of symbols]
[0165] 1 System 10 Image forming apparatus 100 Control Unit 110 storage 120 ROM 130 RAM 140 Display section 150 Operation section 160 Image forming unit 162 Image reading unit 170 Communications Department 20 Terminal devices 200 Control Unit 210 storage 220 ROM 230 RAM 240 Display section 250 Operation section 260 Communications Department
Claims
1. An acquisition unit that acquires the input image, A character recognition unit performs character recognition processing to recognize characters contained in the input image based on the first setting, An image processing unit that performs image processing on an input image based on the results of the character recognition process, An output unit that outputs an output file using at least one of the results of the image processing and the results of the character recognition processing, Regardless of the first setting, when the second setting is made, the output unit includes a control unit that restricts the use of the results of the character recognition processing, An image processing apparatus comprising, The first setting can be configured on the first setting screen for configuring the character recognition process in the character recognition unit. Unlike the first settings screen, the second setting is performed on a second settings screen for configuring the functions of the image processing device. Image processing device.
2. The second settings screen includes an item for setting whether or not to restrict the character recognition process in the second settings. The image processing apparatus according to claim 1.
3. The second settings screen further includes an item for configuring image processing that can utilize the results of the character recognition process. The image processing apparatus according to claim 2.
4. The image processing unit performs at least one of the following image processing operations: tilt correction of the input image or orientation correction of the input image. The image processing apparatus according to claim 1.
5. The output unit is, It is possible to output an output file containing the results of the character recognition process to the input image. When the second setting described above is enabled, an output file is output that does not contain the results of the character recognition process. The image processing apparatus according to claim 1.
6. The output unit is, It is possible to output an output file containing the results of the character recognition process to the input image. When the second setting described above is enabled, an output file containing the results of the character recognition process, to which the copy protection attribute has been added, is output. The image processing apparatus according to claim 1.
7. Regardless of the first setting, the control unit prohibits the execution of the character recognition process when the second setting is enabled. The image processing apparatus according to any one of claims 1 to 6.
8. Based on the first setting, character recognition processing is performed to recognize the characters contained in the image. Based on the results of the character recognition process, image processing is performed on the input image. Regardless of the first setting, if a second setting restricting the character recognition process is enabled, the output file will be output using the results of the image processing; if the second setting restricting the character recognition process is disabled, the output file will be output using the results of the image processing and the results of the character recognition process. An image processing method in an image processing device, The first setting can be configured on the first settings screen for configuring the character recognition process. Unlike the first settings screen, the second setting is performed on a second settings screen for configuring the functions of the image processing device. Image processing methods.