# Create a Dataset by Filtering

Create a new dataset by filtering the data in an existing dataset. The new dataset does not duplicate the data, which saves storage space. While filtering, you can also view data details, label distribution, and other information online with the visualization widgets.

* Select **Create by Filter** from the drop-down box on the TensorBay Dataset List page.

![](https://2993186011-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2F-MGbJTODB-ncDvFhokcx%2Fuploads%2FoxILmQZHwkKSyRtfkSAu%2Ffilter.jpg?alt=media\&token=5aa0c4a2-bb9c-4c3b-9e20-bb304ca30d72)

## Filter Data <a href="#id-1" id="id-1"></a>

* Open the **Data List** page, and select the dataset version you want to manage from the drop-down box on the left.

![](https://2993186011-files.gitbook.io/~/files/v0/b/gitbook-legacy-files/o/assets%2F-MGbJTODB-ncDvFhokcx%2F-MiV9i2vXdNE2ULctyvl%2F-MiVCXbqwlX8cdYwIBG8%2F659defcaa65de971cad50a92ceef8eb.png?alt=media\&token=390598bb-95d0-43ec-927e-70e1a15c3358)

* Confirm the version to be managed. You can filter data by data name, segment, and annotation. The filtered data is shown in the **Data List** on the right.

![](https://2993186011-files.gitbook.io/~/files/v0/b/gitbook-legacy-files/o/assets%2F-MGbJTODB-ncDvFhokcx%2F-MiV9i2vXdNE2ULctyvl%2F-MiVCj_u5wy9xYy40K_e%2F2f2da19dcd9e5a29c59e315a3d12feb.png?alt=media\&token=1433b681-e910-45e2-815f-4ac9a925321e)
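The filtering step above can be pictured with a minimal Python sketch. It is only an illustration of combining the three criteria (data name, segment, annotation); the `DataItem` class, the `filter_items` function, and the sample values are hypothetical and are not part of TensorBay or its SDK.

```python
from dataclasses import dataclass, field


@dataclass
class DataItem:
    """Hypothetical record for one file in a dataset version."""
    name: str
    segment: str
    annotation_types: set = field(default_factory=set)


def filter_items(items, name_contains=None, segment=None, annotation_type=None):
    """Return the items that satisfy every criterion that was supplied."""
    result = []
    for item in items:
        # Skip items whose name does not contain the requested text.
        if name_contains is not None and name_contains not in item.name:
            continue
        # Skip items that belong to a different segment.
        if segment is not None and item.segment != segment:
            continue
        # Skip items that lack the requested annotation type.
        if annotation_type is not None and annotation_type not in item.annotation_types:
            continue
        result.append(item)
    return result


# Made-up sample data, for illustration only.
items = [
    DataItem("cat_001.jpg", "train", {"BOX2D"}),
    DataItem("cat_002.jpg", "test", set()),
    DataItem("dog_001.jpg", "train", {"BOX2D", "CLASSIFICATION"}),
]

matched = filter_items(items, name_contains="cat", segment="train")
print([item.name for item in matched])
```

Criteria that are left as `None` are ignored, so a call with no criteria returns every item, much like an empty filter panel shows the full data list.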

## View Data Details <a href="#id-2" id="id-2"></a>

* On the **Data List** page, you can also preview the data online and view its annotation status with the visualization widget on the right. For details, see the visualization widgets page.

{% content-ref url="../visualization/visualization-widgets" %}
[visualization-widgets](https://docs.graviti.com/guide/tensorbay/visualization/visualization-widgets)
{% endcontent-ref %}

![](https://2993186011-files.gitbook.io/~/files/v0/b/gitbook-legacy-files/o/assets%2F-MGbJTODB-ncDvFhokcx%2F-MiV9i2vXdNE2ULctyvl%2F-MiVCteHZYMnOxss71Ls%2F104154462caabda99129b4a42eb2da0.png?alt=media\&token=8b20c99c-0966-4b79-ad26-f52051305621)

## View Label Distribution <a href="#id-3" id="id-3"></a>

* View the distribution of labels and annotations on the right.

![](https://2993186011-files.gitbook.io/~/files/v0/b/gitbook-legacy-files/o/assets%2F-MGbJTODB-ncDvFhokcx%2F-MiVDjLYiauac5glf4Cs%2F-MiVE72GRpkmnOY-sOH_%2Fa25299dd00c5f6ca883b8dd7acb602d.png?alt=media\&token=e42e770a-1768-4807-b824-430f99b192f6)

## Create a Dataset by Filtered Results <a href="#id-4" id="id-4"></a>

* Select **Create a new dataset based on this result**.

![](https://2993186011-files.gitbook.io/~/files/v0/b/gitbook-legacy-files/o/assets%2F-MGbJTODB-ncDvFhokcx%2F-MiVDjLYiauac5glf4Cs%2F-MiVEvI_hTsVMUfC67pj%2Fe30ac547f733feda74fe17aa1350e57.png?alt=media\&token=4a87553f-7f64-4ff9-b79d-1fe22d50520b)

* Fill in the dataset name, select the storage location, and set the visibility (public or private). Select **Create** to complete the dataset creation.

![](https://2993186011-files.gitbook.io/~/files/v0/b/gitbook-legacy-files/o/assets%2F-MGbJTODB-ncDvFhokcx%2F-MiVG2NkP3ws4g9lq4U8%2F-MiVGK5YVWQNKfVioLpb%2F270dd92ec4d1e56d6f638e2fed5f590.png?alt=media\&token=eaa3c26a-10ec-4550-869c-b3628de3cee8)

* When the dataset is created, you can jump to the Details page of the newly created dataset.

## Advanced Search

For fusion and normal datasets, you can search by filters (segment name, annotation type, and with or without annotation) or run a customized advanced search. For an advanced search, provide a GitHub URL that points to a file; that file is then used to filter the data.

* On the Dataset Details page, click **Manage Data** and **View Data** to enter the Data List page.
* Click **Advanced Search**.
* A pop-up window for uploading the GitHub repo link appears. Copy the link of the file you want to upload into the window and click **Search** to start filtering.\
  Note: The link must use the HTTPS protocol, which means it must begin with "https", and the address and revision must be separated by ":".
* After the advanced search completes, the search results are automatically saved in **Search Records**.

![](https://2993186011-files.gitbook.io/~/files/v0/b/gitbook-legacy-files/o/assets%2F-MGbJTODB-ncDvFhokcx%2F-MiVG2NkP3ws4g9lq4U8%2F-MiVGhRPlZEetM-z9kKN%2F8d7e4027d3040936e76eef512c0cdc4.png?alt=media\&token=c3e7f9e9-c63a-4e11-b915-9f0868ce6cbe)
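The link rule in the note above (begins with "https", address and revision separated by ":") can be checked before pasting a link. The following is a minimal sketch under stated assumptions: `split_repo_link` is a hypothetical helper, the example link is made up, and treating the last ":" after the scheme as the separator is an interpretation of the note, not documented TensorBay behavior.

```python
def split_repo_link(link):
    """Split 'https://<address>:<revision>' into (address, revision).

    Assumption: the last ':' after the scheme separates address and revision.
    """
    scheme = "https://"
    if not link.startswith(scheme):
        raise ValueError("link must begin with https")
    rest = link[len(scheme):]
    # rpartition returns an empty separator when ':' is absent.
    address, sep, revision = rest.rpartition(":")
    if not sep or not address or not revision:
        raise ValueError("address and revision must be separated by ':'")
    return scheme + address, revision


# Hypothetical link, for illustration only.
address, revision = split_repo_link("https://github.com/example/repo/blob/filter.py:main")
print(address)
print(revision)
```

A link without the ":" separator, or one that starts with "http://", raises `ValueError` in this sketch, which mirrors the two conditions stated in the note.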

