AVA-Style Dataset Creation Pipeline

A unified pipeline for creating high-quality, multi-annotator spatio-temporal action localization datasets in AVA format.

Project Status

✅ Prototype Complete

Overview

This system implements a hybrid-cloud pipeline that combines powerful local computation for video pre-processing with scalable cloud-based annotation and quality control. Transform raw video files into model-ready training datasets through an automated workflow.

Key Features

Hybrid Architecture: Local GPU processing + cloud-based annotation
Multi-Annotator Support: Collaborative annotation with quality control
AVA Format Output: Industry-standard dataset format
Automated Tracking: RF-DETR + KalmanSORT integration
Web-Based Interface: Streamlit dashboard for management

System Architecture

graph TD
    subgraph "Phase 1: Local Pre-Processing (Your GPU Machine)"
        A[Input: Raw Videos in a folder] --> B[main_pipeline.py];
        B --> C{Intermediate Files};
        C --> D["Output: <br/>- dense_proposals.pkl<br/>- frames.zip<br/>- tracking_jsons.zip"];
    end

    subgraph "Phase 2: Cloud-Based Annotation & QC (AWS/Local Docker)"
        U["Admin User"] <--> S{Unified Streamlit UI};
        S -- Manages --> J[FastAPI Backend];
        J -- API Calls --> CVAT["CVAT Server"];
        CVAT -- Webhook --> L[Webhook Listener];
        L -- Triggers --> J;
        J -- Reads/Writes --> DB[("PostgreSQL Database")];
    end

    D -- "Admin Uploads Files to..." --> S;
    S -- "Generates Final Dataset" --> CSV["Output: final_ava_dataset.csv"];

Prerequisites

Python 3.8+
Docker and Docker Compose
CUDA-capable GPU (for local processing)
Git

Installation & Setup

1. CVAT Setup (Critical Configuration Required)

Several prerequisites need to be done including adding a Docker override file and database schema.

Clone and Configure CVAT

# Clone the official CVAT repository
git clone https://github.com/cvat-ai/cvat
cd cvat

Create Docker Override File

⚠️ IMPORTANT: Create docker-compose.override.yml in the cvat/ directory:

# docker-compose.override.yml
services:
  cvat_server:
    environment:
      SMOKESCREEN_OPTS: --unsafe-allow-private-ranges
  cvat_worker_webhooks:
    environment:
      SMOKESCREEN_OPTS: --unsafe-allow-private-ranges

Add Database Schema

You also need to add cvat/initdb/schema.sql with the following content. This way we're using the same database that CVAT is using:

CREATE TABLE IF NOT EXISTS projects (
    project_id INTEGER PRIMARY KEY,
    name VARCHAR(255) NOT NULL,
    created_at TIMESTAMP WITH TIME ZONE DEFAULT CURRENT_TIMESTAMP
);

CREATE TABLE IF NOT EXISTS tasks (
    task_id INTEGER PRIMARY KEY,
    project_id INTEGER REFERENCES projects(project_id),
    name VARCHAR(255) NOT NULL,
    status VARCHAR(50),
    assignee VARCHAR(255),
    retrieved_at TIMESTAMP WITH TIME ZONE,
    qc_status VARCHAR(50) DEFAULT 'pending'
);

CREATE TABLE IF NOT EXISTS annotations (
    annotation_id SERIAL PRIMARY KEY,
    task_id INTEGER REFERENCES tasks(task_id),
    job_id INTEGER NOT NULL,
    track_id INTEGER NOT NULL,
    frame INTEGER NOT NULL,
    xtl REAL,
    ytl REAL,
    xbr REAL,
    ybr REAL,
    outside BOOLEAN,
    attributes JSONB,
    annotator VARCHAR(255)
);

Start CVAT Containers

docker-compose up -d

Initialize CVAT

Now we need to create the superuser etc.

Create Admin Account:

docker exec -it cvat_server /bin/bash

# From inside the container's shell, run the following command 
# and follow the prompts to set a username, email, and password
python3 manage.py createsuperuser

exit

Setup Organization:
- Login to CVAT UI at http://localhost:8080
- Create annotator accounts (e.g., annotator1, annotator2) from admin panel
- Add annotator accounts as members via Django Admin panel at http://localhost:8080/admin

2. Python Environment Setup

# Create and activate virtual environment
python -m venv .venv
source .venv/bin/activate  # On Windows: .venv\Scripts\activate

# Install dependencies
pip install streamlit requests opencv-python tqdm psycopg2-binary pandas flask uvicorn python-multipart

Usage

Phase 1: Local Video Pre-Processing

1. Prepare Input Videos

We need the MP4 videos to be in the exact format: videos_files_name.zip/ containing .mp4 files and place this in the uploads directory at proposal_generation_pipeline.

# Create uploads directory
mkdir proposal_generation_pipeline/uploads

# Place your raw videos in a ZIP file
# Example: proposal_generation_pipeline/uploads/raw_cctv_batch1.zip containing .mp4 files

2. Run Pre-Processing Pipeline

python proposal_generation_pipeline/orchestrator.py --zip_file_name "(absolute path)raw_cctv_batch1.zip"

Pipeline Components:

rename_resize.py: Video format standardization
clip_video.py: 15-second segment creation
person_tracker.py: RF-DETR + KalmanSORT tracking
create_proposals_from_tracks.py: Proposal consolidation

Outputs (in outputs/ directory):

dense_proposals.pkl: Consolidated tracking data
frames.zip: Extracted video frames
tracking_jsons.zip: Individual tracking files

Upload to Admin UI (Unified UI)

Start the services:

uvicorn metrics_logging.test:app --reload

streamlit run backup_files/ui.py

Now we create the XML files in the pre-proposals part. We upload the files frames.zip and dense_proposals.pkl and get the XMLs and frames.

In the task generator page, we assign the annotator names and the CVAT admin details and port, then click upload. This should take some time as seen in the terminal. The project should now show in the CVAT admin account and in the jobs section of the individual annotator accounts.

Phase 2: Annotation & Quality Control

1. Configure CVAT Webhook

Navigate to your CVAT project
Go to Actions → Setup Webhooks
Create webhook with:
- Target URL: http://host.docker.internal:5001/webhook
- Events: Check "Job updated"
- Active: ✅ Enabled

2. Start Services

Terminal 1 - Webhook Listener:

python processing_pipeline/webhook_listener.py

Terminal 2 - Post-Annotation Processing:

# Run post_annotation_service.py (select the correct project here) 
# each time to retrieve the completed work
python post_annotation_service.py

In the QC part of the admin UI, the completed tasks should now show and are pending for approval.

You can approve single tasks here or run the IAA and kappa scores for approval by admin, then click "Generate Dataset" to get the AVA_dataset.csv.

3. Access the Dashboard

Open browser to http://localhost:8501
Upload the generated files (dense_proposals.pkl, frames.zip)
Create and manage annotation projects
Monitor annotation progress and quality control

Project Structure(High level for this readme only)

ava-pipeline/
├── proposal_generation_pipeline/
│   ├── orchestrator.py         # Phase 1 orchestrator
│   ├── uploads/               # Input video files
│   └── outputs/              # Generated pipeline outputs
├── processing_pipeline/
│   └── webhook_listener.py    # CVAT webhook handler
├── backup_files/
│   └── ui.py                 # Streamlit dashboard
├── metrics_logging/
│   └── test.py              # FastAPI backend
├── post_annotation_service.py # Annotation retrieval
├── .venv/                    # Python virtual environment
└── README.md                # This file

Output Format

The final dataset is generated as AVA_dataset.csv in AVA format:

Spatio-temporal action localization data
Multi-annotator consensus
Quality control validation
Model-ready training format

Troubleshooting

Common Issues

CVAT Webhook Not Working:
- Verify docker-compose.override.yml is created correctly
- Check webhook listener is running on port 5001
- Ensure webhook URL uses host.docker.internal
Database Connection Errors:
- Confirm PostgreSQL container is running
- Verify database and tables are created
- Check connection credentials
GPU Memory Issues:
- Reduce batch size in tracking configuration
- Monitor GPU memory usage during processing

Support

For issues and contributions:

Create an issue in the project repository
Check logs in respective service terminals
Verify all containers are running with docker ps

License

[Specify your license here]

Citation

If you use this pipeline in your research, please cite:

@software{ava_pipeline,
  title={AVA-Style Dataset Creation Pipeline},
  author={[Your Name/Organization]},
  year={2024},
  url={[Repository URL]}
}

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
.github/workflows		.github/workflows
.idea		.idea
__pycache__		__pycache__
cvat		cvat
data		data
final_Depolyement_setup		final_Depolyement_setup
flow_images		flow_images
metrics_logging		metrics_logging
processing_pipeline		processing_pipeline
proposal_generation_pipeline		proposal_generation_pipeline
.env		.env
.gitattributes		.gitattributes
.gitignore		.gitignore
Dockerfile.Backend		Dockerfile.Backend
Dockerfile.Frontend		Dockerfile.Frontend
Readme.md		Readme.md
commands_for_production_dep.md		commands_for_production_dep.md
deployment_plan.md		deployment_plan.md
docker-compose.prod.yml		docker-compose.prod.yml
docker-compose.yml		docker-compose.yml
files.sh		files.sh
our_works.md		our_works.md
post_annotation_plan.md		post_annotation_plan.md
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

AVA-Style Dataset Creation Pipeline

Project Status

Overview

Key Features

System Architecture

Prerequisites

Installation & Setup

1. CVAT Setup (Critical Configuration Required)

Clone and Configure CVAT

Create Docker Override File

Add Database Schema

Start CVAT Containers

Initialize CVAT

2. Python Environment Setup

Usage

Phase 1: Local Video Pre-Processing

1. Prepare Input Videos

2. Run Pre-Processing Pipeline

Upload to Admin UI (Unified UI)

Phase 2: Annotation & Quality Control

1. Configure CVAT Webhook

2. Start Services

3. Access the Dashboard

Project Structure(High level for this readme only)

Output Format

Troubleshooting

Common Issues

Support

License

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages