[Bugs] fix crop without padding and recog metainfo delete unuse info #1526

Harold-lkk · 2022-11-15T02:45:42Z

Delete the unused `category` entry from `metainfo` in the recognition JSON:

    "metainfo": {
        "dataset_type": "TextRecogDataset",
        "task_name": "textrecog",
        "category": [
            {
                "id": 0,
                "name": "text"
            }
        ]
    }
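
For annotation files generated before this change, the same cleanup can be applied by hand. The following is a minimal sketch, assuming the JSON has already been loaded into a plain dict with the layout shown above; `strip_category` is a hypothetical helper, not part of MMOCR.

```python
import copy


def strip_category(annotation: dict) -> dict:
    """Return a copy of ``annotation`` whose metainfo has no ``category`` key."""
    cleaned = copy.deepcopy(annotation)  # leave the caller's dict untouched
    # pop() with a default is a no-op when the key is already absent
    cleaned.get("metainfo", {}).pop("category", None)
    return cleaned


old = {
    "metainfo": {
        "dataset_type": "TextRecogDataset",
        "task_name": "textrecog",
        "category": [{"id": 0, "name": "text"}],
    }
}
new = strip_category(old)
print(sorted(new["metainfo"]))
```

The helper copies the input first, so the original annotation can still be compared against the cleaned one.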

`TextRecogCropConverter` now defaults to no padding when cropping images.

Harold-lkk · 2022-11-15T02:46:32Z

Merge this before #1506.

xinke-wang · 2022-11-15T03:05:25Z

A question here: in the 0.x version, the pad ratios were set to 0.4 and 0.2 by default. I am not sure whether this matters for test performance. Personally, I think they should be set to 0 as in this PR, but someone should test the model to see if it works as expected.

mmocr/mmocr/datasets/pipelines/crop.py

Lines 87 to 97 in 26bc471

    def crop_img(src_img, box, long_edge_pad_ratio=0.4, short_edge_pad_ratio=0.2):
        """Crop text region with their bounding box.

        Args:
            src_img (np.array): The original image.
            box (list[float | int]): Points of quadrangle.
            long_edge_pad_ratio (float): Box pad ratio for long edge
                corresponding to font size.
            short_edge_pad_ratio (float): Box pad ratio for short edge
                corresponding to font size.
        """

Harold-lkk · 2022-11-15T03:31:09Z

I have tested it. A different image shape does affect performance.
In the 0.x version, the cropped image was only used by ocr.py, because of the poor performance of text detection.
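
To make the shape difference concrete, here is a minimal sketch of how the two pad ratios could enlarge the crop window. It assumes, following the docstring's "corresponding to font size", that each padding is the ratio times the shorter side of the box, and it returns the window coordinates instead of slicing an image. `padded_crop_window` is a hypothetical name for illustration, not the actual `crop_img`.

```python
def padded_crop_window(img_w, img_h, box,
                       long_edge_pad_ratio=0.0, short_edge_pad_ratio=0.0):
    """Return (left, top, right, bottom) of the crop window for a quadrangle.

    ``box`` is [x1, y1, x2, y2, x3, y3, x4, y4]. Treating the shorter box
    side as the font size is an assumption made for this sketch.
    """
    assert len(box) == 8
    # Clamp the quadrangle points to the image bounds.
    xs = [min(max(x, 0), img_w) for x in box[0::2]]
    ys = [min(max(y, 0), img_h) for y in box[1::2]]
    box_w = max(xs) - min(xs)
    box_h = max(ys) - min(ys)
    font_size = min(box_w, box_h)

    # The long-edge ratio is applied along the longer side of the box.
    if box_h < box_w:
        pad_x = long_edge_pad_ratio * font_size
        pad_y = short_edge_pad_ratio * font_size
    else:
        pad_x = short_edge_pad_ratio * font_size
        pad_y = long_edge_pad_ratio * font_size

    left = min(max(int(min(xs) - pad_x), 0), img_w)
    top = min(max(int(min(ys) - pad_y), 0), img_h)
    right = min(max(int(max(xs) + pad_x), 0), img_w)
    bottom = min(max(int(max(ys) + pad_y), 0), img_h)
    return left, top, right, bottom


box = [10, 10, 110, 10, 110, 30, 10, 30]  # a 100 x 20 horizontal text box
no_pad = padded_crop_window(200, 100, box)              # (10, 10, 110, 30)
old_pad = padded_crop_window(200, 100, box, 0.4, 0.2)   # (2, 6, 118, 34)
print(no_pad, old_pad)
```

With the 0.x ratios the same box yields a 116 x 28 crop instead of 100 x 20, so a recognizer sees a differently shaped input.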

xinke-wang · 2022-11-15T03:50:11Z

OK. It is worth noting that some converters, such as the totaltext converter, also call crop_img with the default settings. I am not sure whether this affects totaltext's performance.

mmocr/tools/data/textrecog/totaltext_converter.py

Line 302 in 26bc471

    dst_img = crop_img(image, anno['bbox'])

Harold-lkk · 2022-11-15T08:07:29Z

Maybe it's a bug. I checked the other data converters, and they all call crop_img(image, anno['box'], 0, 0).
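
One way to audit the remaining converters is to scan their source for `crop_img` calls that leave a pad ratio to its default. The sketch below uses the standard `ast` module. `find_default_padding_calls` is a hypothetical helper; it assumes the four-parameter signature quoted earlier in this thread and does not handle `*args` or `**kwargs` unpacking.

```python
import ast


def find_default_padding_calls(source: str, func_name: str = "crop_img"):
    """Return line numbers of ``func_name`` calls that omit a pad ratio."""
    hits = []
    for node in ast.walk(ast.parse(source)):
        if not isinstance(node, ast.Call):
            continue
        callee = node.func
        # Handle both plain names (crop_img) and attributes (module.crop_img).
        if isinstance(callee, ast.Name):
            name = callee.id
        else:
            name = getattr(callee, "attr", None)
        if name != func_name:
            continue
        # Fewer than four arguments in total means a default is in use.
        if len(node.args) + len(node.keywords) < 4:
            hits.append(node.lineno)
    return sorted(hits)


sample = (
    "a = crop_img(image, anno['bbox'])\n"
    "b = crop_img(image, anno['bbox'], 0, 0)\n"
)
print(find_default_padding_calls(sample))
```

Only the first call in `sample` is reported, since the second passes both ratios explicitly.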

gaotongxiao · 2022-11-15T11:48:23Z

@Harold-lkk We need a follow-up PR to avoid BC-breaking changes.

fix crop without padding and recog metainfo delete unuse info

4478ef2

Harold-lkk requested a review from xinke-wang November 15, 2022 02:45

mm-assistant bot assigned gaotongxiao Nov 15, 2022

gaotongxiao approved these changes Nov 15, 2022

xinke-wang approved these changes Nov 15, 2022

gaotongxiao added the breaking label Nov 15, 2022

gaotongxiao approved these changes Nov 16, 2022

gaotongxiao merged commit 00254f0 into open-mmlab:dev-1.x Nov 16, 2022
