A Benchmark for UAV-View Natural Language Guided Tracking
We propose UAVNLT, a new benchmark for the UAV-view natural language guided tracking task. UAVNLT consists of videos captured by UAV cameras of vehicles on city roads in four cities. For each video, the vehicles' bounding boxes, trajectories, and natural language descriptions are carefully annotated. Compared with existing datasets, which are annotated only with bounding boxes, the natural language sentences in our dataset make it better suited to applications where humans are part of the system: language is not only friendlier for human-computer interaction, but can also overcome the low uniqueness of appearance features for tracking. We evaluate several existing methods on our new benchmark and find that their performance is unsatisfactory. To pave the way for future work, we propose a baseline method suitable for this task, achieving state-of-the-art performance. We believe our new dataset and proposed baseline method will be helpful in many fields, such as smart cities, smart transportation, and vehicle management.
Coming soon.