Marketplace · 85+ models

Computer Vision

See what your business sees

Object detection, image classification, segmentation, OCR, facial recognition, video analytics, and pose estimation models — production-ready on Yobitel GPU Cloud.

Built by Yobitel · On AWS Marketplace

Yobitel CVAT AMI

Production-grade data annotation, ready in minutes

A pre-configured Amazon Machine Image of CVAT v1.11.0 on Ubuntu 24.04. Days of DevOps disappear — subscribe on AWS Marketplace, launch an EC2 instance, and your annotation team is labelling within minutes. Data stays inside your own AWS account; no per-seat licensing.

AI-assisted labelling out of the box

SAM cuts polygon labelling time by up to 70%. DEXTR draws boundaries from 4 clicks. SORT + SiamMask track objects across video frames automatically.

Every annotation type

Bounding boxes, polygons, polylines, points, cuboids, masks, keypoints, skeletons — for image and video datasets.

Training-ready exports

Direct export to YOLO, COCO, and Pascal VOC. No post-processing scripts. Go from labels to training in one step.

Native S3, your account

IAM-role authentication, direct read/write to your S3 buckets. Data never leaves your AWS environment.

Extensible by design

Built on open-source CVAT — modify label schemas, drop in proprietary models via Nuclio, customize the Docker Compose stack.

Pay only for compute

Flat per-hour AWS Marketplace pricing scales with usage, not with team size or task volume.
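
The three export formats named under "Training-ready exports" differ mainly in how they encode a bounding box. The sketch below is illustrative only and is not part of CVAT or the Yobitel AMI: it shows the kind of conversion a hand-written post-processing script would perform, using the standard conventions for each format (COCO: pixel x_min, y_min, width, height; YOLO: centre and size normalised to the image; Pascal VOC: pixel corner coordinates). The function names are hypothetical.

```python
from typing import Tuple

Box = Tuple[float, float, float, float]


def coco_to_yolo(box: Box, img_w: int, img_h: int) -> Box:
    """Convert a COCO box (x_min, y_min, width, height in pixels)
    to a YOLO box (x_center, y_center, width, height, all in 0..1)."""
    x_min, y_min, w, h = box
    return (
        (x_min + w / 2) / img_w,  # normalised centre x
        (y_min + h / 2) / img_h,  # normalised centre y
        w / img_w,                # normalised width
        h / img_h,                # normalised height
    )


def coco_to_voc(box: Box) -> Box:
    """Convert a COCO box to Pascal VOC corners
    (x_min, y_min, x_max, y_max in pixels)."""
    x_min, y_min, w, h = box
    return (x_min, y_min, x_min + w, y_min + h)


if __name__ == "__main__":
    coco_box = (100.0, 200.0, 50.0, 80.0)  # made-up example box
    print(coco_to_yolo(coco_box, img_w=1000, img_h=800))
    print(coco_to_voc(coco_box))
```

Exporting directly in the target format avoids maintaining this kind of glue code per dataset.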

Read the deployment guide · Talk to a Yobitel engineer

At a glance

Deploy in 3 steps: Subscribe · Launch · Annotate

Up to 70% faster polygon labelling with SAM

No per-seat fees · flat hourly pricing

Your data stays in your own AWS account

Stack & sizing

Base OS: Ubuntu 24.04
CVAT version: v1.11.0
Stack: Docker · PostgreSQL · Redis · Nginx
Recommended start: t3.large
For AI-assist workloads: m5.xlarge / m5.2xlarge
Supported instance types: 16 EC2 families

Why teams self-host

Compliance-grade by default

Healthcare

HIPAA

Clinical imaging data can't sit on third-party SaaS — CVAT AMI keeps PHI inside your AWS account.

Finance

GDPR

Annotate financial documents and PII under EU and regional data-protection rules without exporting data to a vendor.

Defence & Gov

Data sovereignty

Mission and intelligence imagery stays inside controlled infrastructure — no SaaS egress.

Enterprise

Internal InfoSec

When InfoSec blocks third-party data processing, self-host CVAT on your own EC2 in minutes.

The computer vision model most teams reach for first.

All categories

YOLOv8

Ultralytics · 3.2M – 68M params · AGPL-3.0

Real-time object detection across 5 model sizes; benchmark leader on COCO.

Deploy on Yobitel · Compare on InferenceBench

Spec sheet

Family: Ultralytics
Parameters: 3.2M – 68M
License: AGPL-3.0
Status: Live
Best for: Real-time object detection
Sits in: Computer Vision

Pricing and routing rank are visible on InferenceBench. Variants and quantisations appear in the Yobibyte deploy console.

The rest of the lineup

5 more in Computer Vision. All deployable in one click.

Browse all 85+

Model · Family · Params · License · Deploy

Segment Anything 2: Promptable image and video segmentation with zero-shot generalization.

Five lines to your first computer vision call.

Every model in this category is reachable from the same Yobitel SDK. Swap the model name; the rest of the call shape stays identical. Authenticated via your workspace key.

Get an API key · SDK on GitHub

computer-vision-quickstart.py

Python · TypeScript · cURL

from yobitel import Inference

# YOLOv8 — real-time object detection
client = Inference(model="ultralytics/yolov8")

detections = client.detect(
    image="warehouse_camera.jpg",
    confidence_threshold=0.5,
)

for d in detections:
    print(f"{d.label:12s} {d.confidence:.2f}  box={d.box}")
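
A common next step after the loop above is to aggregate detections rather than print them. The sketch below is a minimal example under stated assumptions: it does not call the Yobitel SDK, and `Detection` is a hypothetical stand-in that only mirrors the `label`, `confidence`, and `box` fields used in the quickstart. The sample values are made up.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Iterable, Tuple


@dataclass
class Detection:
    """Hypothetical stand-in for one SDK result (not the real SDK type)."""
    label: str
    confidence: float
    box: Tuple[float, float, float, float]


def count_by_label(
    detections: Iterable[Detection], min_confidence: float = 0.5
) -> Counter:
    """Count detections per label, keeping only those at or above
    the given confidence threshold."""
    return Counter(
        d.label for d in detections if d.confidence >= min_confidence
    )


if __name__ == "__main__":
    # Made-up sample data standing in for a real API response.
    sample = [
        Detection("pallet", 0.91, (10, 20, 110, 140)),
        Detection("pallet", 0.62, (200, 40, 300, 150)),
        Detection("forklift", 0.88, (320, 60, 520, 260)),
        Detection("person", 0.31, (50, 50, 90, 170)),
    ]
    for label, count in count_by_label(sample).most_common():
        print(f"{label:12s} {count}")
```

Filtering on the client side like this is useful when one response feeds several consumers that each need a different threshold.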

Where teams ship this

Real computer vision. In production.

Four use cases that customers run today. Pick a model from the lineup above, deploy on Yobibyte, plug it into the surrounding stack. Done.

01 · Visual inspection in manufacturing
02 · Retail analytics and shelf monitoring
03 · Medical imaging triage
04 · Autonomous systems and robotics

Frameworks

Bring what your team already knows

PyTorch · TensorRT · ONNX Runtime · Triton Inference Server

Yobitel handles the serving layer (GPU scheduling, KV cache, autoscaling, request batching) so your team focuses on the model and the product.

Learn about Yobibyte

Explore the rest

Other categories in the marketplace

NLP & Language (120+)
Text generation, translation, sentiment, summarization

Generative AI (95+)
Image gen, text gen, code gen, multimodal

Data Analytics (60+)
Predictive analytics, forecasting, anomaly detection

Automation & RPA (45+)
Process automation, workflow AI, document processing

Industry-Specific (50+)
Vertical-specific models by industry

Speech & Audio (30+)
ASR, TTS, speaker diarization, audio classification

Recommendation (25+)
Recommender systems, personalization, content matching

From the Yobitel blog

Deeper reads on Computer Vision

All posts

Yobitel.com

CVAT on AWS — how Yobitel's pre-built AMI makes AI data annotation effortless at scale

The full deployment story: what's pre-installed, AI-assist with SAM and DEXTR, S3 integration, and 16 supported instance types.

Read on yobitel.com

Yobitel.com

Yobitel CVAT Image & Video Annotation Solutions

How Yobitel CVAT handles image and video datasets end-to-end — from raw frames to training-ready exports.

Read on yobitel.com

Yobitel.com

Yobitel Multi-Model Text-to-Image Inference Server

The companion serving stack — multiple generative vision models behind one inference endpoint, GPU-optimised.

Read on yobitel.com

Don't see what you need?

Bring your own model or fine-tune one of ours. Yobitel engineers can sit with your team and ship the right stack.

Start Building · Contact Sales
