Custom Metrics
Sextant supports custom metric algorithms. Write your metric's algorithm in Python, upload it to a GitHub repository, and enter the repository URL into Sextant.

Available Types of Indicators

Sextant supports two types of indicators, Float and Curve, and users can set the valid range of each.


Float

The output is a floating-point value, which is displayed on the front end in "name=value" form, e.g. mAP=0.75.


Curve

The output is two one-dimensional arrays named x and y. They are displayed on the front end as a curve, with the values of x and y giving the horizontal and vertical coordinates of the points on the curve.
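As a rough illustration of the two indicator types, the sketch below builds one Float value and one Curve value as plain Python objects. The helper names make_float and make_curve are hypothetical. Representing a curve as a dict with keys "x" and "y" is an assumption based on the description above, not a confirmed Sextant format.

```python
def make_float(value):
    """Return a Float indicator value as a plain Python float."""
    return float(value)


def make_curve(x, y):
    """Return a Curve indicator value: two equal-length 1-D lists named x and y.

    The {"x": ..., "y": ...} layout is an assumption made for this sketch.
    """
    x, y = list(x), list(y)
    if len(x) != len(y):
        raise ValueError("x and y must have the same length")
    return {"x": [float(v) for v in x], "y": [float(v) for v in y]}


# One Float indicator and one Curve indicator (illustrative numbers only).
result = {
    "mAP": make_float(0.75),
    "pr": make_curve([0.0, 0.5, 1.0], [1.0, 0.8, 0.4]),
}
```

Converting everything to built-in floats and lists keeps the result easy to serialize, which is likely why the original example calls .tolist() on NumPy values.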

Valid Range

The valid range of a metric can be set to either data or dataset.

For Data

Each evaluated data item returns its own value, such as the average IoU of that item.

For Dataset

Each evaluation returns a single value for the whole dataset, such as the mAP of that evaluation.
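The difference between the two ranges can be sketched with a small IoU example. The sketch assumes boxes given as (xmin, ymin, xmax, ymax) tuples that are already matched into (ground truth, prediction) pairs. That box format and the helper names are hypothetical, and a plain average stands in for a real dataset-level metric such as mAP.

```python
def box_iou(a, b):
    """IoU of two axis-aligned boxes given as (xmin, ymin, xmax, ymax)."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def mean_iou(pairs):
    """Average IoU over already-matched (ground truth, prediction) box pairs."""
    if not pairs:
        return 0.0
    return sum(box_iou(gt, pred) for gt, pred in pairs) / len(pairs)


# Two images with hypothetical matched box pairs.
image_1 = [((0, 0, 2, 2), (0, 0, 2, 2))]  # identical boxes
image_2 = [((0, 0, 2, 2), (1, 0, 3, 2))]  # prediction shifted by half a box

# "For Data": one value per evaluated data item.
per_data = [mean_iou(image_1), mean_iou(image_2)]

# "For Dataset": one value for the whole evaluation.
per_dataset = sum(per_data) / len(per_data)
```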


The algorithm of your metrics needs to comply with the following rules:
  1. If your code needs additional dependency packages, create requirements.txt in the root directory and list the dependencies in it. GPU operations of deep learning frameworks such as torch and tensorflow are not supported.
  2. Create an __init__.py file in the root path of the GitHub repo, so that the repo can be imported as a Python package that exposes a class called Evaluator once it is cloned locally.
  3. The Python package must contain one and only one class named Evaluator.
  4. The Evaluator must have a method that returns the annotation scores of a single image: evaluate_one_data(input_source: dict, input_target: dict) -> dict. Please refer to Graviti's docs for the input_source and input_target values.
  5. The Evaluator must have a method that returns the overall annotation scores: get_result() -> dict.
  6. The return values of the above two methods must comply with the structure shown under Code Examples below. (Currently, only two types of data are supported: float and curve.)
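Before uploading, it can help to check locally that a class exposes the two required methods with the expected parameters. The sketch below is one possible check and is not part of Sextant. The check_evaluator helper, the stand-in Evaluator class, and its "match" metric are all hypothetical.

```python
import inspect


class Evaluator:
    """Stand-in evaluator used only to exercise the check below."""

    def __init__(self):
        self.scores = []

    def evaluate_one_data(self, input_source: dict, input_target: dict) -> dict:
        # Hypothetical metric: 1.0 when source and target are identical.
        score = 1.0 if input_source == input_target else 0.0
        self.scores.append(score)
        return {"overall": {"match": score}}

    def get_result(self) -> dict:
        mean = sum(self.scores) / len(self.scores) if self.scores else 0.0
        return {"overall": {"match": mean}}


def check_evaluator(cls):
    """Return a list of problems found; an empty list means the checks passed."""
    problems = []
    expected = {
        "evaluate_one_data": ["self", "input_source", "input_target"],
        "get_result": ["self"],
    }
    for name, params in expected.items():
        method = getattr(cls, name, None)
        if not callable(method):
            problems.append(f"missing method: {name}")
            continue
        actual = list(inspect.signature(method).parameters)
        if actual != params:
            problems.append(f"{name} has parameters {actual}, expected {params}")
    return problems


problems = check_evaluator(Evaluator)
```

The check only inspects names and parameters. It does not validate the returned dicts against the required structure.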

Code Examples

Example of Directory Structure

    -- Evaluator.py
    -- __init__.py
    -- requirements.txt

__init__.py Example

    from .Evaluator import Evaluator
    __all__ = ["Evaluator"]
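To see why the root-level package file matters, the sketch below imports a directory as a Python package and retrieves its Evaluator class. How Sextant actually loads a repository is not described here, so this is only an assumption about one workable approach. The load_evaluator_class helper and the package name user_metric are hypothetical, and the temporary directory merely mimics a cloned repo.

```python
import importlib.util
import sys
import tempfile
from pathlib import Path


def load_evaluator_class(repo_dir, package_name="user_metric"):
    """Import the directory repo_dir as a package and return its Evaluator class."""
    init_file = Path(repo_dir) / "__init__.py"
    spec = importlib.util.spec_from_file_location(
        package_name, init_file, submodule_search_locations=[str(repo_dir)]
    )
    module = importlib.util.module_from_spec(spec)
    # Register the package first so the relative import inside it can resolve.
    sys.modules[package_name] = module
    spec.loader.exec_module(module)
    return module.Evaluator


# Build a throwaway directory that mimics a cloned repo.
with tempfile.TemporaryDirectory() as tmp:
    repo = Path(tmp)
    (repo / "__init__.py").write_text(
        'from .Evaluator import Evaluator\n__all__ = ["Evaluator"]\n'
    )
    (repo / "Evaluator.py").write_text(
        "class Evaluator:\n"
        "    def evaluate_one_data(self, input_source, input_target):\n"
        "        return {}\n"
        "    def get_result(self):\n"
        "        return {}\n"
    )
    evaluator_cls = load_evaluator_class(repo)
    loaded_name = evaluator_cls.__name__
```

Without the relative import in the package file, module.Evaluator would not exist and the last step would raise AttributeError.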

requirements.txt Example

    numpy==1.21.0

Evaluator.py Example

    import numpy as np


    class Evaluator:
        def __init__(self):
            # You can initialize your model here
            pass

        def evaluate_one_data(self, input_source: dict, input_target: dict) -> dict:
            """Do the evaluation job.

            :param input_source: Ground truth boxes in one image
            :param input_target: Target boxes in the same image
            :return: A dict containing evaluation on one image and each category within it.
            """

        def get_result(self) -> dict:
            """Overall evaluation.

            A dict containing overall evaluation on all images and all categories.
            'overall': {
                'mAP': 0.123,  # Float type
                'pr': {  # Curve type
            'categories': {
                '{your_category}': {
                    'mAP': np.mean([1, 2, 3]).tolist(),