| Feature | Description |
| --- | --- |
| images | JSON field containing a list of images and their metadata (size, ID, name). |
| annotations | JSON field containing a list of object instance annotations. Each annotation contains a series of fields, including the category ID and segmentation mask of the object. |
| annotations -> segmentations | Contains the polygon coordinates of the segmentation mask for the specific class instance (table, list, text, etc.). |
| annotations -> bbox | Contains the bounding box coordinates for the specific class instance (table, list, text, etc.). |
| annotations -> is_crowd | Indicates whether the class instance is a single object (is_crowd=0) or multiple objects (is_crowd=1). This dataset contains only single objects, so this field is always set to 0. |
| annotations -> category_id | The class label for the current class instance. It indicates what the current bbox/segmentation mask encapsulates (table, list, text, etc.). |
| categories | JSON field containing a list of classes and their metadata (ID, name). This dataset has 5 categories (with corresponding "ids"): text ("1"), title ("2"), list ("3"), table ("4"), figure ("5"). |
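
As a rough illustration of how these fields relate, the sketch below parses a small hand-written sample and counts annotations per category name by joining `annotations -> category_id` against `categories`. The sample values, the `image_id`, `file_name`, `width`, and `height` keys, the integer IDs, and the helper name `count_instances_per_category` are assumptions made for the example; check them against the actual annotation file before relying on them.

```python
import json
from collections import Counter

# Hypothetical sample that mirrors the top-level fields described above.
# Key names other than images/annotations/categories/category_id/bbox,
# and all values, are assumptions made for illustration only.
sample_json = """
{
  "images": [
    {"id": 1, "file_name": "page_1.jpg", "width": 612, "height": 792}
  ],
  "annotations": [
    {"id": 10, "image_id": 1, "category_id": 1, "bbox": [50, 60, 500, 120]},
    {"id": 11, "image_id": 1, "category_id": 4, "bbox": [50, 200, 500, 300]},
    {"id": 12, "image_id": 1, "category_id": 1, "bbox": [50, 520, 500, 100]}
  ],
  "categories": [
    {"id": 1, "name": "text"},
    {"id": 2, "name": "title"},
    {"id": 3, "name": "list"},
    {"id": 4, "name": "table"},
    {"id": 5, "name": "figure"}
  ]
}
"""


def count_instances_per_category(dataset: dict) -> dict:
    """Return a mapping of category name to number of annotations."""
    # Build an ID -> name lookup from the categories list.
    id_to_name = {cat["id"]: cat["name"] for cat in dataset["categories"]}
    # Resolve each annotation's category_id to its name and count them.
    counts = Counter(id_to_name[ann["category_id"]] for ann in dataset["annotations"])
    return dict(counts)


dataset = json.loads(sample_json)
counts = count_instances_per_category(dataset)
print(counts)
```

For a real annotation file, the same function should work on the result of `json.load` applied to the open file, provided the key names match those assumed here.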