Dataset Management Framework, a Python library and a CLI tool to build, analyze and manage Computer Vision datasets.
A framework and CLI tool to build, transform, and analyze datasets.
VOC dataset ---> Annotation tool
+ /
COCO dataset -----> Datumaro ---> dataset ------> Model training
+ \
CVAT annotations ---> Publication, statistics etc.
Dataset reading, writing, conversion in any direction.
classification
)image_info
, instances
, person_keypoints
,captions
, labels
, panoptic
, stuff
)segmentation
, detection
,3D raw
/ velodyne points
)classification
, person re-identification
,landmarks
)classification
)classification
, detection
, segmentation
, action_classification
, person_layout
)bboxes
, masks
)bboxes
)Other formats and documentation for them can be found here.
Dataset building
occluded
annotations from imagestrain
, val
, and test
:
Dataset quality checking
Dataset comparison
Dataset statistics (image mean and std, annotation statistics)
Model integration
Check
the design document
for a full list of features.
Check
the user manual
for usage instructions.
Feel free to
open an Issue, if you
think something needs to be changed. You are welcome to participate in
development, instructions are available in our
contribution guide.
The OpenVINO™ telemetry library
is used to collect basic information about Datumaro usage.
To enable/disable telemetry data collection please see the
guide.