US20120195504A1 - Computing device and method for isolating and cutting out figures in design patent document - Google Patents
Computing device and method for isolating and cutting out figures in design patent document Download PDFInfo
- Publication number
- US20120195504A1 US20120195504A1 US13/339,181 US201113339181A US2012195504A1 US 20120195504 A1 US20120195504 A1 US 20120195504A1 US 201113339181 A US201113339181 A US 201113339181A US 2012195504 A1 US2012195504 A1 US 2012195504A1
- Authority
- US
- United States
- Prior art keywords
- black
- pixels
- histogram
- figures
- white
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/10—Segmentation; Edge detection
- G06T7/11—Region-based segmentation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20068—Projection on vertical or horizontal image axis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
- G06T2207/30176—Document
Definitions
- Embodiments of the present disclosure generally relate to data analysis technology, and more particularly to a computing device and a method for isolating and cutting out figures in a design patent document.
- FIG. 1 is a block diagram of one embodiment of a computing device.
- FIG. 2A is a schematic diagram of one embodiment of a black-and-white image.
- FIG. 2B is a first histogram based on pixel information of each row in the black-and-white image in FIG. 2A .
- FIG. 2C is a schematic diagram of a multiplicity of blocks laid out and partitioned according to blank rows.
- FIG. 2D is a schematic diagram of a block selected from FIG. 2C .
- FIG. 2E is a second histogram based on pixel information of each line in the selected block in FIG. 2D .
- FIG. 2F is a schematic diagram of a multiplicity of areas laid out and partitioned according to blank lines.
- FIG. 2G is a schematic diagram of a figure area selected from FIG. 2F .
- FIG. 3 is a flowchart of one embodiment of a method for isolating and cutting out figures in a design patent document.
- module refers to logic embodied in hardware or firmware, or to a collection of software instructions, written in a programming language, such as, Java, C, or assembly.
- One or more software instructions in the modules may be embedded in firmware, such as in an EPROM.
- the modules described herein may be implemented as either software and/or hardware modules and may be stored in any type of non-transitory computer-readable medium or other storage device.
- Some non-limiting examples of non-transitory computer-readable media include CDs, DVDs, BLU-RAY, flash memory, and hard disk drives.
- FIG. 1 is a block diagram of one embodiment of a computing device 1 .
- the computing device 1 includes a cutting unit 10 for isolating and cutting out figures in a design patent document, a storage unit 20 , and a processor 30 .
- the computing device 1 is electrically connected to a display device 2 .
- the display device 3 displays the design patent document and the figures cut out from the design patent document.
- the cutting unit 10 may include one or more function modules (a list is given in FIG. 1 ).
- the one or more function modules may comprise computerized code in the form of one or more programs that are stored in the storage unit 20 , and executed by the processor 30 to provide the functions of the cutting unit 10 .
- the storage unit 20 may be a cache or a dedicated memory, such as an EPROM or a flash memory.
- the cutting unit 10 includes a reading module 101 , an orientation module 102 , a conversion module 103 , a creation module 104 , a dividing module 105 , a selection module 106 , and a display module 107 .
- the reading module 101 is operable to read a figure section of a design patent document.
- the figure section of the design patent document includes one or more figures of the design patent document.
- the orientation module 102 is operable to orientate all of the figures in the figure section in the same direction (have the same display aspect).
- the orientation module 102 rotates at least once a figure in the figure section by ninety degrees clockwise in response to the figure is displayed on the figure section in a wrong orientation.
- the wrong orientation of the figure may be defined as the figure being in landscape view instead of portrait view when the patent document is written because the width of the figure is greater than the height of the figure.
- the conversion module 103 is operable to convert each page of the figure section into a black-and-white image.
- the conversion module 103 first determines whether the page of the figure section is in color. If the page of the figure section is in color, the conversion module 103 converts the page into a grayscale image.
- the grayscale image has 256 different shades of gray, where the pixel values can range from 0 to 255.
- the conversion module 103 converts the grayscale image into a black-and-white image by reference to a predetermined pixel value. In the grayscale image, the areas in which the pixel values are more than the predetermined pixel value are converted into white areas, and the areas in which the pixel values are less than the predetermined pixel value are converted into black areas.
- FIG. 2A is a schematic diagram of one embodiment of the black-and-white image.
- the creation module 104 is operable to create a first histogram based on information of the black pixels and the white pixels in the black-and-white image.
- the X-axis or horizontal axis represents the series of numbered rows from the bottom to the top of the black-and-white image
- the Y-axis or vertical axis represents the quantity of the black pixels in each row of the black-and-white image.
- FIG. 2B shows a first histogram based on pixel information of each row in the black-and-white image in FIG. 2A .
- the dividing module 105 is operable to divide the black-and-white image into a multiplicity of blocks which contain words or figures, according to the information of the white pixels in the first histogram.
- the rows which only have white pixels are regarded as blank rows, and it is the blank rows which divide the black-and-white image into the multiplicity of blocks.
- FIG. 2C is a schematic diagram of the multiplicity of blocks laid out and partitioned according to the blank rows.
- the selection module 106 is operable to select the blocks which have a height value greater than a predetermined height value.
- FIG. 2D is a schematic diagram of such a block, selected from FIG. 2C .
- the creation module 104 is further operable to create a second histogram based on information of the black pixels and the white pixels in each selected block.
- the X-axis or horizontal axis represents the series of numbered lines from the left to the right of the selected block
- the Y-axis or vertical axis represents the quantity of the black pixels in each line of the selected block.
- FIG. 2E shows a second histogram based on the pixel information of each line in the selected block in FIG. 2D .
- the dividing module 105 is further operable to divide the selected block into a multiplicity of areas which contain words or figures, according to the information of the white pixels in the second histogram.
- the lines which only have white pixels are regarded as white or blank lines, and it is the blank lines which divide the selected block into the multiplicity of areas.
- FIG. 2F is a schematic diagram of the multiplicity of areas laid out and partitioned according to the blank lines.
- the selection module 106 is further operable to select the areas which have a width value greater than a predetermined width value, to maintain the integrity of figure areas that include the figures of the design patent.
- FIG. 2G is a schematic diagram of a figure area selected from FIG. 2F .
- the display module 107 is operable to display the selected figure areas on the display device 3 .
- FIG. 3 is a flowchart of one embodiment of a method for isolating and cutting out figures in a design patent document. Depending on the embodiment, additional steps may be added, others removed, and the ordering of the steps may be changed.
- step S 10 the reading module 101 reads a figure section of the design patent document.
- the figure section of the design patent document includes one or more figures.
- step S 12 the orientation module 102 provides the same orientation to all of the figures in the figure section.
- step S 14 the conversion module 103 converts each page of the figure section into a black-and-white image.
- step S 16 the creation module 104 creates a first histogram based on information of the black pixels and the white pixels in the black-and-white image.
- step S 18 the dividing module 105 divides the black-and-white image into a multiplicity of blocks which contain words or figures, according to the information of the white pixels in the first histogram.
- step S 20 the selection module 106 selects the blocks which have a height value greater than a predetermined height value.
- step S 22 the creation module 104 creates a second histogram based on information of the black pixels and the white pixels in each selected block.
- step S 24 the dividing module 105 divides the selected block into a multiplicity of areas which contain words or figures, according to the information of the white pixels in the second histogram.
- step S 26 the selection module 106 selects the areas which have a width value greater than a predetermined width value, to maintain the integrity of figure areas that include the figures of the design patent.
- step S 28 the display module 107 displays the selected figure areas on the display device 3 .
Abstract
Description
- 1. Technical Field
- Embodiments of the present disclosure generally relate to data analysis technology, and more particularly to a computing device and a method for isolating and cutting out figures in a design patent document.
- 2. Description of Related Art
- When users read a design patent document, the whole document is displayed, but the figures of the design patent cannot be displayed separately. Users cannot read the figures to understand intuitively the characteristics of the design patent. Furthermore, when users want to search design patent documents that satisfy certain conditions, a search result is obtained based on one or more keywords, but keyword searches cannot be applied to the figures.
-
FIG. 1 is a block diagram of one embodiment of a computing device. -
FIG. 2A is a schematic diagram of one embodiment of a black-and-white image. -
FIG. 2B is a first histogram based on pixel information of each row in the black-and-white image inFIG. 2A . -
FIG. 2C is a schematic diagram of a multiplicity of blocks laid out and partitioned according to blank rows. -
FIG. 2D is a schematic diagram of a block selected fromFIG. 2C . -
FIG. 2E is a second histogram based on pixel information of each line in the selected block inFIG. 2D . -
FIG. 2F is a schematic diagram of a multiplicity of areas laid out and partitioned according to blank lines. -
FIG. 2G is a schematic diagram of a figure area selected fromFIG. 2F . -
FIG. 3 is a flowchart of one embodiment of a method for isolating and cutting out figures in a design patent document. - The application is illustrated by way of examples and not by way of limitation in the figures of the accompanying drawings in which like references indicate similar elements. It should be noted that references to “an” or “one” embodiment in this disclosure are not necessarily to the same embodiment, and such references mean at least one.
- In general, the word “module”, as used herein, refers to logic embodied in hardware or firmware, or to a collection of software instructions, written in a programming language, such as, Java, C, or assembly. One or more software instructions in the modules may be embedded in firmware, such as in an EPROM. The modules described herein may be implemented as either software and/or hardware modules and may be stored in any type of non-transitory computer-readable medium or other storage device. Some non-limiting examples of non-transitory computer-readable media include CDs, DVDs, BLU-RAY, flash memory, and hard disk drives.
-
FIG. 1 is a block diagram of one embodiment of acomputing device 1. In one embodiment, thecomputing device 1 includes acutting unit 10 for isolating and cutting out figures in a design patent document, astorage unit 20, and aprocessor 30. Thecomputing device 1 is electrically connected to a display device 2. The display device 3 displays the design patent document and the figures cut out from the design patent document. - In one embodiment, the
cutting unit 10 may include one or more function modules (a list is given inFIG. 1 ). The one or more function modules may comprise computerized code in the form of one or more programs that are stored in thestorage unit 20, and executed by theprocessor 30 to provide the functions of thecutting unit 10. Thestorage unit 20 may be a cache or a dedicated memory, such as an EPROM or a flash memory. - In one embodiment, the
cutting unit 10 includes areading module 101, anorientation module 102, aconversion module 103, acreation module 104, adividing module 105, aselection module 106, and adisplay module 107. - The
reading module 101 is operable to read a figure section of a design patent document. The figure section of the design patent document includes one or more figures of the design patent document. - The
orientation module 102 is operable to orientate all of the figures in the figure section in the same direction (have the same display aspect). In one embodiment, theorientation module 102 rotates at least once a figure in the figure section by ninety degrees clockwise in response to the figure is displayed on the figure section in a wrong orientation. The wrong orientation of the figure may be defined as the figure being in landscape view instead of portrait view when the patent document is written because the width of the figure is greater than the height of the figure. - The
conversion module 103 is operable to convert each page of the figure section into a black-and-white image. Theconversion module 103 first determines whether the page of the figure section is in color. If the page of the figure section is in color, theconversion module 103 converts the page into a grayscale image. The grayscale image has 256 different shades of gray, where the pixel values can range from 0 to 255. Theconversion module 103 converts the grayscale image into a black-and-white image by reference to a predetermined pixel value. In the grayscale image, the areas in which the pixel values are more than the predetermined pixel value are converted into white areas, and the areas in which the pixel values are less than the predetermined pixel value are converted into black areas. A pixel value of 255 denotes a blank or white area, and a pixel value of 0 denotes a black area (hereinafter, pixels with the value of 255 are regarded as white pixels, and pixels with the value of 0 are regarded as black pixels).FIG. 2A is a schematic diagram of one embodiment of the black-and-white image. - The
creation module 104 is operable to create a first histogram based on information of the black pixels and the white pixels in the black-and-white image. In the first histogram, the X-axis or horizontal axis represents the series of numbered rows from the bottom to the top of the black-and-white image, and the Y-axis or vertical axis represents the quantity of the black pixels in each row of the black-and-white image.FIG. 2B shows a first histogram based on pixel information of each row in the black-and-white image inFIG. 2A . - The dividing
module 105 is operable to divide the black-and-white image into a multiplicity of blocks which contain words or figures, according to the information of the white pixels in the first histogram. The rows which only have white pixels are regarded as blank rows, and it is the blank rows which divide the black-and-white image into the multiplicity of blocks.FIG. 2C is a schematic diagram of the multiplicity of blocks laid out and partitioned according to the blank rows. - The
selection module 106 is operable to select the blocks which have a height value greater than a predetermined height value.FIG. 2D is a schematic diagram of such a block, selected fromFIG. 2C . - The
creation module 104 is further operable to create a second histogram based on information of the black pixels and the white pixels in each selected block. In the second histogram, the X-axis or horizontal axis represents the series of numbered lines from the left to the right of the selected block, and the Y-axis or vertical axis represents the quantity of the black pixels in each line of the selected block.FIG. 2E shows a second histogram based on the pixel information of each line in the selected block inFIG. 2D . - The
dividing module 105 is further operable to divide the selected block into a multiplicity of areas which contain words or figures, according to the information of the white pixels in the second histogram. The lines which only have white pixels are regarded as white or blank lines, and it is the blank lines which divide the selected block into the multiplicity of areas.FIG. 2F is a schematic diagram of the multiplicity of areas laid out and partitioned according to the blank lines. - The
selection module 106 is further operable to select the areas which have a width value greater than a predetermined width value, to maintain the integrity of figure areas that include the figures of the design patent.FIG. 2G is a schematic diagram of a figure area selected fromFIG. 2F . - The
display module 107 is operable to display the selected figure areas on the display device 3. -
FIG. 3 is a flowchart of one embodiment of a method for isolating and cutting out figures in a design patent document. Depending on the embodiment, additional steps may be added, others removed, and the ordering of the steps may be changed. - In step S10, the
reading module 101 reads a figure section of the design patent document. The figure section of the design patent document includes one or more figures. - In step S12, the
orientation module 102 provides the same orientation to all of the figures in the figure section. - In step S14, the
conversion module 103 converts each page of the figure section into a black-and-white image. - In step S16, the
creation module 104 creates a first histogram based on information of the black pixels and the white pixels in the black-and-white image. - In step S18, the
dividing module 105 divides the black-and-white image into a multiplicity of blocks which contain words or figures, according to the information of the white pixels in the first histogram. - In step S20, the
selection module 106 selects the blocks which have a height value greater than a predetermined height value. - In step S22, the
creation module 104 creates a second histogram based on information of the black pixels and the white pixels in each selected block. - In step S24, the
dividing module 105 divides the selected block into a multiplicity of areas which contain words or figures, according to the information of the white pixels in the second histogram. - In step S26, the
selection module 106 selects the areas which have a width value greater than a predetermined width value, to maintain the integrity of figure areas that include the figures of the design patent. - In step S28, the
display module 107 displays the selected figure areas on the display device 3. - Although certain inventive embodiments of the present disclosure have been specifically described, the present disclosure is not to be construed as being limited thereto. Various changes or modifications may be made to the present disclosure without departing from the scope and spirit of the present disclosure.
Claims (12)
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201110028759.7 | 2011-01-27 | ||
CN201110028759 | 2011-01-27 | ||
CN2011100287597A CN102622724A (en) | 2011-01-27 | 2011-01-27 | Appearance patent image cutting method and system |
Publications (2)
Publication Number | Publication Date |
---|---|
US20120195504A1 true US20120195504A1 (en) | 2012-08-02 |
US8787668B2 US8787668B2 (en) | 2014-07-22 |
Family
ID=46562629
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US13/339,181 Expired - Fee Related US8787668B2 (en) | 2011-01-27 | 2011-12-28 | Computing device and method for isolating and cutting out figures in design patent document |
Country Status (2)
Country | Link |
---|---|
US (1) | US8787668B2 (en) |
CN (1) | CN102622724A (en) |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5859921A (en) * | 1995-05-10 | 1999-01-12 | Mitsubishi Denki Kabushiki Kaisha | Apparatus for processing an image of a face |
US20020037102A1 (en) * | 2000-07-12 | 2002-03-28 | Yukari Toda | Image processing apparatus, image processing method, and program and storage medium therefor |
US20050100219A1 (en) * | 2003-11-10 | 2005-05-12 | Kathrin Berkner | Features for retrieval and similarity matching of documents from the JPEG 2000-compressed domain |
US20090324026A1 (en) * | 2008-06-27 | 2009-12-31 | Palo Alto Research Center Incorporated | System and method for finding a picture image in an image collection using localized two-dimensional visual fingerprints |
US20100054585A1 (en) * | 2008-09-03 | 2010-03-04 | Jean-Pierre Guillou | Text localization for image and video OCR |
US20110007970A1 (en) * | 2009-07-10 | 2011-01-13 | Palo Alto Research Center Incorporated | System and method for segmenting text lines in documents |
US20110007366A1 (en) * | 2009-07-10 | 2011-01-13 | Palo Alto Research Center Incorporated | System and method for classifying connected groups of foreground pixels in scanned document images according to the type of marking |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP4053188B2 (en) * | 1999-07-06 | 2008-02-27 | 富士通株式会社 | Pattern cutting device and pattern recognition device |
JP4477468B2 (en) * | 2004-10-15 | 2010-06-09 | 富士通株式会社 | Device part image retrieval device for assembly drawings |
CN100354876C (en) * | 2005-04-13 | 2007-12-12 | 佳能株式会社 | Method and equipment for intensifying character line image and storage medium |
JP4572248B2 (en) * | 2008-06-23 | 2010-11-04 | シャープ株式会社 | Image processing apparatus, image forming apparatus, image processing method, control program, and recording medium |
CN101751568B (en) * | 2008-12-12 | 2012-07-18 | 汉王科技股份有限公司 | ID No. locating and recognizing method |
-
2011
- 2011-01-27 CN CN2011100287597A patent/CN102622724A/en active Pending
- 2011-12-28 US US13/339,181 patent/US8787668B2/en not_active Expired - Fee Related
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5859921A (en) * | 1995-05-10 | 1999-01-12 | Mitsubishi Denki Kabushiki Kaisha | Apparatus for processing an image of a face |
US20020037102A1 (en) * | 2000-07-12 | 2002-03-28 | Yukari Toda | Image processing apparatus, image processing method, and program and storage medium therefor |
US20070154091A1 (en) * | 2000-07-12 | 2007-07-05 | Canon Kabushiki Kaisha | Image Processing Apparatus, Image Processing Method, And Program And Storage Medium Therefor |
US20050100219A1 (en) * | 2003-11-10 | 2005-05-12 | Kathrin Berkner | Features for retrieval and similarity matching of documents from the JPEG 2000-compressed domain |
US20090324026A1 (en) * | 2008-06-27 | 2009-12-31 | Palo Alto Research Center Incorporated | System and method for finding a picture image in an image collection using localized two-dimensional visual fingerprints |
US20100054585A1 (en) * | 2008-09-03 | 2010-03-04 | Jean-Pierre Guillou | Text localization for image and video OCR |
US20110007970A1 (en) * | 2009-07-10 | 2011-01-13 | Palo Alto Research Center Incorporated | System and method for segmenting text lines in documents |
US20110007366A1 (en) * | 2009-07-10 | 2011-01-13 | Palo Alto Research Center Incorporated | System and method for classifying connected groups of foreground pixels in scanned document images according to the type of marking |
Also Published As
Publication number | Publication date |
---|---|
US8787668B2 (en) | 2014-07-22 |
CN102622724A (en) | 2012-08-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10409895B2 (en) | Optimizing a document based on dynamically updating content | |
JP4918776B2 (en) | Electronic document comparison program, electronic document comparison device, and electronic document comparison method | |
US8660351B2 (en) | Auto-cropping images using saliency maps | |
JP5321109B2 (en) | Information processing apparatus and information processing program | |
KR20140023596A (en) | Apparatus, method and computer readable recording medium for editting the image automatically by analyzing an image | |
US20150149883A1 (en) | Image processing device, image processing device control method, program, and information storage medium | |
US9129383B2 (en) | Character string detection device, image processing device, character string detection method, control program and storage medium | |
US20220270232A1 (en) | Apparatus for adjusting parameter related to defect detection for image processing for image processing, method for information processing, and program | |
US8803998B2 (en) | Image optimization system and method for optimizing images | |
US8787668B2 (en) | Computing device and method for isolating and cutting out figures in design patent document | |
US20120013623A1 (en) | Host computer and method for adjusting display parameters | |
US20110191721A1 (en) | Method and apparatus for displaying additional information of content | |
US8483542B2 (en) | Image processing device and method | |
US9607398B2 (en) | Image processing apparatus and method of controlling the same | |
US8704850B2 (en) | Two-dimensional object packing | |
US20120192054A1 (en) | Computing device and method for cutting out summary diagram of patent document | |
US8615522B2 (en) | Computing device, storage medium and method for outputting dimension data using the computing device | |
US9424234B2 (en) | Methods and apparatus for automatically creating variations of original content based on searching for alternative content objects based on characteristics of the original content | |
US20120229857A1 (en) | Moving labels in graphical output to avoid overprinting | |
US20130321427A1 (en) | Computing device and method for generating drawing codes of pictures | |
EP3765976B1 (en) | Service-backed contextual document embedding | |
US20120121172A1 (en) | In-Image Accessibility Indication | |
US8643651B2 (en) | Type-setting method for a text image file | |
US20120183222A1 (en) | Computing device and method for automatically typesetting patent images | |
US8861894B2 (en) | Methods and apparatus for edge-aware pixel data generation |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: HONG FU JIN PRECISION INDUSTRY (SHENZHEN) CO., LTD Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:XIAO, WEI-QING;LEE, CHUNG-I;YEH, CHIEN-FA;SIGNING DATES FROM 20111215 TO 20111225;REEL/FRAME:027454/0739 Owner name: HON HAI PRECISION INDUSTRY CO., LTD., TAIWAN Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:XIAO, WEI-QING;LEE, CHUNG-I;YEH, CHIEN-FA;SIGNING DATES FROM 20111215 TO 20111225;REEL/FRAME:027454/0739 |
|
FEPP | Fee payment procedure |
Free format text: MAINTENANCE FEE REMINDER MAILED (ORIGINAL EVENT CODE: REM.) |
|
LAPS | Lapse for failure to pay maintenance fees |
Free format text: PATENT EXPIRED FOR FAILURE TO PAY MAINTENANCE FEES (ORIGINAL EVENT CODE: EXP.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
STCH | Information on status: patent discontinuation |
Free format text: PATENT EXPIRED DUE TO NONPAYMENT OF MAINTENANCE FEES UNDER 37 CFR 1.362 |
|
FP | Expired due to failure to pay maintenance fee |
Effective date: 20180722 |